• Journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

J4 ›› 2013, Vol. 35 ›› Issue (12): 8-13.

• Articles •

Optimization of small file access on object-based parallel file systems

ZHOU Enqiang, DONG Yong, ZHANG Wei, LU Yutong

  1. (State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, China)
  • Received: 2013-08-11 Revised: 2013-10-15 Online: 2013-12-25 Published: 2013-12-25
  • Supported by:

    National Natural Science Foundation of China (61120106005); National 863 Program of China (2012AA01A301)

Abstract:

Object-based parallel file systems are widely used in high-performance computing. They deliver excellent performance on large files, but their small-file access is limited. Based on an analysis of the overhead of small-file access on a typical object-based parallel file system, a server-side cache dedicated to small files is proposed, combined with prefetching based on the naming correlation between objects, to raise the small-file cache hit rate and reduce small-file read latency under the object storage architecture. The method is implemented and evaluated on Lustre, a parallel file system popular in HPC systems. Experiments on a real system show that it significantly improves small-file access performance, with the total cost of a file archiving operation reduced by up to 88%.

Key words: parallel file system; cache; object storage; prefetching
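
The idea summarized in the abstract, a server-side cache reserved for small files plus prefetching of objects related by name, can be illustrated with a minimal sketch. This is not the paper's Lustre implementation. The abstract does not state the naming-correlation rule, so the sketch assumes that objects with consecutive numeric IDs are related. The names `SmallFileCache`, `SMALL_FILE_LIMIT`, and `PREFETCH_WINDOW`, the LRU eviction policy, and the in-memory `store` dictionary standing in for the object storage backend are all hypothetical.

```python
from collections import OrderedDict

SMALL_FILE_LIMIT = 64 * 1024  # hypothetical size threshold for a "small" file
PREFETCH_WINDOW = 4           # hypothetical number of related objects to prefetch


class SmallFileCache:
    """Toy server-side cache that admits only small objects (LRU eviction)."""

    def __init__(self, store, capacity=128):
        self.store = store          # stand-in backend: object id (int) -> bytes
        self.capacity = capacity    # maximum number of cached objects
        self.cache = OrderedDict()  # insertion order doubles as LRU order
        self.hits = 0
        self.misses = 0

    def _insert(self, oid, data):
        # Large objects bypass the cache entirely.
        if len(data) > SMALL_FILE_LIMIT:
            return
        self.cache[oid] = data
        self.cache.move_to_end(oid)
        while len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict the least recently used

    def _related(self, oid):
        # Assumed correlation rule: the next few consecutive object ids.
        return [oid + k for k in range(1, PREFETCH_WINDOW + 1)]

    def read(self, oid):
        if oid in self.cache:
            self.hits += 1
            self.cache.move_to_end(oid)
            return self.cache[oid]
        self.misses += 1
        data = self.store[oid]      # slow path: fetch from the backend
        self._insert(oid, data)
        # On a miss, also pull in related objects that exist and are not cached.
        for rid in self._related(oid):
            if rid in self.store and rid not in self.cache:
                self._insert(rid, self.store[rid])
        return data
```

Under these assumptions, reading ten 100-byte objects with IDs 0 to 9 in order produces two misses (IDs 0 and 5) and eight hits, which is the kind of hit-rate gain a sequential workload such as file archiving would benefit from.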