• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

计算机工程与科学 ›› 2026, Vol. 48 ›› Issue (5): 779-792.doi: 10.3969/j.issn.1007-130X.2026.05.003

• 高性能计算 • 上一篇    下一篇

面向分布式文件系统的元数据预取策略研究综述

王振飞,顿龙祥,鲍梓良,杨芮嘉,李桂秋


  

  1. (郑州大学计算机与人工智能学院,河南 郑州 450001)
  • 收稿日期:2024-11-17 修回日期:2025-01-11 出版日期:2026-05-25 发布日期:2026-05-21
  • 基金资助:
    国家重点研发计划(2023YFB4502704);国家级大学生创新创业训练计划(202410459121)

A survey on metadata prefetching strategies for distributed file system

WANG Zhenfei,DUN Longxiang,BAO Ziliang,YANG Ruijia,LI Guiqiu   

  1. (School of Computer Science and Artificial Intelligence,Zhengzhou University,Zhengzhou 450001,China)
  • Received:2024-11-17 Revised:2025-01-11 Online:2026-05-25 Published:2026-05-21

摘要: 在分布式文件系统中处理大规模数据时,元数据管理是关键挑战。元数据操作占文件系统操作的大部分,因此提升元数据服务性能至关重要。传统元数据访问方式存在网络延迟和服务器负载问题,导致效率低下。为了解决这些问题,研究基于DFS的元数据预取策略,包括基于访问模式、缓存机制和预测模型的预取。这些策略通过提前缓存即将使用的元数据,降低延迟,提高I/O效率。然而,预取策略面临预测准确性、缓存管理、数据一致性和安全性挑战。未来的发展方向包括基于深度学习和智能化算法的预取策略,以及自适应和动态调整的预取策略。这些策略将有助于提高元数据管理的效率和准确性,从而在大数据时代满足日益增长的存储需求,使得元数据预取策略在其中发挥至关重要的作用。


关键词: 分布式文件系统, 元数据, 元数据管理, 预取策略

Abstract: When handling large-scale data in distributed file system (DFS), metadata management poses a critical challenge. Metadata operations account for the majority of file system operations, so enhancing the performance of metadata services is of utmost importance. Traditional metadata access methods suffer from issues such as network latency and server load, resulting in inefficiency. To address these problems, research has been conducted on DFS-based metadata prefetching strategies, including prefetching based on access patterns, caching mechanisms, and prediction models. These strategies reduce latency and improve I/O efficiency by proactively caching metadata that is about to be used. However, prefetching strategies face challenges related to prediction accuracy, cache management, data consistency, and security. Future development directions include prefetching strategies based on deep learning and intelligent algorithms, as well as the adaptive and dynamically adjusted prefetching strategies. These strategies will contribute to enhancing the efficiency and accuracy of metadata management, thereby meeting the ever-increasing storage demands in the era of big data, with metadata prefetching strategies playing a crucial role in this process.


Key words: distributed file system (DFS), metadata, metadata management, prefetching strategy