• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2011, Vol. 33 ›› Issue (9): 42-47.

• 论文 • 上一篇    下一篇

基于安腾微处理器的程序性能优化与分析

迟利华,刘杰   

  1. (国防科学技术大学计算机学院,湖南 长沙 410073)
  • 收稿日期:2009-03-10 出版日期:2011-09-25 发布日期:2011-09-25
  • 作者简介:迟利华(1970),女,山东威海人,博士,副研究员,研究方向为并行算法和大规模科学与工程数值模拟并行应用软件。刘杰(1969),男,江苏赣榆人,博士,副研究员,研究方向为大型科学与工程计算中并行算法、并行应用软件和性能评价等方面的研究。
  • 基金资助:

    国家自然科学基金资助项目(60673150);国家863计划资助项目(2008AA01Z137)

Code Performance Optimization and Analysis Based on Itaniuam Microprocessors

CHI Lihua,LIU Jie   

  1. (School of Computer Science,National University of Defense Technology,Changsha 410073,China)
  • Received:2009-03-10 Online:2011-09-25 Published:2011-09-25

摘要:

高性能计算越来越广泛地应用到科学和工程的各个领域,但实际应用程序获得的性能并未随着机器峰值性能的提高而同比例提高,应用程序只能发挥峰值性能的5%~10%左右,而且两者的差距在扩大,程序性能优化作为解决该问题的方法之一得到了学术界的广泛关注。本文基于安腾微处理器,总结了程序优化的通用方法,给出了程序优化与分析的一般步骤。根据优化与分析步骤,首先对四个程序进行了详细的性能分析,找到性能瓶颈和重点子程序;然后分别根据四个程序的特点,采用基于Cache和指令流水线的优化技术,对程序进行了性能优化;最后给出了性能优化测试结果,分别得到8%~33%的性能提高,取得了良好的优化效果。

关键词: 安腾微处理器, 程序性能优化, 程序性能分析

Abstract:

High performance computing is widely used in science and engineering to solve large scale computation problems. But the sustained performances achieved for the real applications do not increase as fast as the peak performances do. In fact, the sustained performance is a only about 5~10% of the peak performance, and the gap between the sustained performance and the peak performance is widening. Code performance optimization, which is one of the effective ways to solve this problem, draws the attentions of the research community. Based on Itanium microprocessors, this paper summarizes the general methods for code performance optimization and gives the common steps for code performance optimization and analysis. According to the steps, the performances for four codes are analysed in detail to find the performance bottlenecks and the key subroutine codes. Then four codes are optimized in the Itanium microprocesspor, using the code optimization techniques based on cache and instruction pipeline. Finally, the test results for the four performance optimization codes show that the performances are increased by 8~33% respectively.

Key words: itaniuam microprocessor;code performance optimization;code performance analysis