• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2014, Vol. 36 ›› Issue (01): 12-18.

• 论文 • 上一篇    下一篇

基于可变步长的访存延迟测量模型的研究与实现

毛席龙,杨安,吕高锋,林琦,程辉   

  1. (国防科学技术大学计算机学院,湖南 长沙 410073)
  • 收稿日期:2012-04-23 修回日期:2012-06-29 出版日期:2014-01-25 发布日期:2014-01-25
  • 基金资助:

    国家973计划资助项目(2009CB320503)

Research and implementation of memory latency measurement model based on variable stride    

MAO Xilong,YANG An,L Gaofeng,LIN Qi,CHENG Hui   

  1. (College of Computer,National University of Defense Technology,Changsha 410073,China)
  • Received:2012-04-23 Revised:2012-06-29 Online:2014-01-25 Published:2014-01-25

摘要:

评测访存延迟对于优化应用访存模式和数据放置有重要的指导意义,然而数据Cache、多线程、数据预取等技术却严重干扰了访存延迟测量的精度。设计并实现了基于可变步长的访存延迟测量模型,在一块空间内根据用户指定的步长创建访问序列环,循环访问这个序列得出平均时间,即为访存延迟。最后对Intel的通用处理器和飞腾处理器在不同数据大小、步长、线程数等情况下的访存延迟进行了测量比较,该模型能够显示存储层次并精确显示测量延迟。

关键词: 内存延迟, 可变步长, 测量方法, SMT, 多核处理器, 飞腾处理器

Abstract:

Evaluating the memory access latency has important significance for optimizing application patterns and data placement. However, cache, multithreading, data prefetching and other techniques have serious interference with the accuracy of measurement of memory access latency. A measurement model based on variable strides is designed and implemented. According to userspecified strides, we create a sequence ring in a space, and circularly access this ring to obtain the average time as the memory access latency.  Finally, we measure an Intel common processor and FT processor’s memory latency by different data size, stride and thread, and make the data contrast with each other. This model can display the memory hierarchy and display memory latency precisely.

Key words: Memory latency;variable stride;measurement;SMT;multicore processor;FT Processor