• Official journal of the China Computer Federation (CCF)
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (12): 2117-2127.

• High Performance Computing •

Bowtie 2-NUMA: Gene sequence alignment application with NUMA architecture adaptability

WANG Qiang, SUN Yan-jie, QI Xing-yun, XU Jia-qing

  1. (College of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China)
  • Received: 2023-03-29 Revised: 2023-10-17 Accepted: 2024-12-25 Online: 2024-12-25 Published: 2024-12-23
  • Supported by:
    National Key Research and Development Program of China (2021YFB0300101)

Abstract: Bowtie 2, one of the most widely used tools for second-generation sequencing data in bioinformatics, is computationally intensive. Adapting it to the architecture of multi-core platforms to improve parallel efficiency has become a pressing problem. This paper first analyzes the diversity of non-uniform memory access (NUMA) architectures and the structural bottlenecks that Bowtie 2 encounters on them: memory access congestion and a low last level cache (LLC) hit rate. Then, based on the behavior of Bowtie 2 on different NUMA platforms, it optimizes three aspects, namely index replication, memory allocation, and data partitioning, and proposes Bowtie 2-NUMA. Finally, experiments show that Bowtie 2-NUMA adapts its optimizations to the architecture of each computing platform and thereby improves parallel efficiency.

Key words: non-uniform memory access (NUMA), Bowtie 2, adaptive optimization, computer architecture, memory bandwidth, last level cache (LLC)