基于公共云的 HPC 集群实现及自动伸缩闲时计算研究

计算机工程与科学

基于公共云的 HPC 集群实现及自动伸缩闲时计算研究

田永军，何万青，孙相征，余洋

（阿里云计算有限公司,浙江杭州 310024）

收稿日期:2018-10-17 修回日期:2018-12-21 出版日期:2019-07-25 发布日期:2019-07-25

HPC cluster and low cost auto

scaling model based on public cloud

TIAN Yongjun,HE Wanqing,SUN Xiangzheng,YU Yang

（Alibaba Cloud Computing Co.Ltd.,Hangzhou 310024,China）

Received:2018-10-17 Revised:2018-12-21 Online:2019-07-25 Published:2019-07-25

摘要/Abstract

摘要：

对于HPC用户来说，计算成本是迁云所考虑的重要因素之一，阿里云上提供的抢占式实例，是一种按需实例，旨在降低使用公共云计算资源成本，抢占式实例市场价格是波动的，通常远低于正常的按需实例，甚至达到正常按需实例的一折。抢占式实例一般会在创建时为用户保留一段最短时间，过后有可能会被释放，所以一般适用于无状态的应用场景。提出在公共云上的自动伸缩策略，其面向通用的HPC集群调度器，基于用户的应用软件类型、提交作业规律以及用户对性能和成本等多方面需求,自动在云上部署扩容计算资源，控制成本。对用户来说，可以做到”only pay for what you want and what you use”。基于公共云上丰富的资源规格类型和售卖方式，利用自动伸缩服务，抢占式实例，断点续算等技术可以配置低成本的公共云上HPC自动伸缩方案：用户提交作业的同时可以指定成本上限，自动伸缩服务自动在低于此成本的前提下寻找和扩容抢占式计算资源，同时利用断点续算功能保证作业在计算资源切换的时候可以继续运算。最后，通过 LAMMPS 和 GROMACS 两个高性能应用实例验证了该策略的可行性和有效性。

关键词: 高性能计算, 公共云, 自动伸缩, 断点续算, 闲时计算伸缩模型

Abstract:

For many HPC users, computing cost is one important factor for whether moving workloads to the public cloud. Alibaba cloud provides “preemptible instance”. It is an on-demand instance to reduce the cost of using public cloud computing resources. The market price of “preemptible instance” fluctuates and it can be as low as 10% of “pay as you go instance”. And “preemptible instance” cannot be kept as long as users’ requirement, and be released due to datacenter scheduler or some other reasons, so it can be used in some stateless scenarios. On the public cloud, based on users’ application types, job submission patterns, performance requirements, timing and cost, we propose an auto scaling strategy on the public cloud for general HPC cluster schedulers, which can automatically deploy computing resources and control cost. HPC users only pay for what they want and what they use. Due to abundant resource types and resource rent models, and taking advantages of auto scaling service, “preemptible instance” and application checkpoint/restart, we can supply a low cost auto scaling model. When users submit jobs, they can set their expectation cost, and the auto scaling service will find the “preemptible instance” under this cost setting, and use checkpoint/restart technique to keep job running during computing resource exchanging. Finally, we verify the feasibility and effectiveness of our solution through LAMMPS and GROMACS applications.

Key words: high performance computing, public cloud, auto scaling, checkpoint/restart, low cost scaling model

田永军，何万青，孙相征，余洋. 基于公共云的 HPC 集群实现及自动伸缩闲时计算研究[J]. 计算机工程与科学.

TIAN Yongjun,HE Wanqing,SUN Xiangzheng,YU Yang.

HPC cluster and low cost auto

scaling model based on public cloud

[J]. Computer Engineering & Science.

[1]	孙岩, 张建民, 黎渊, 孙舜禹. 面向高性能计算的互连网络拥塞控制分析与评估[J]. 计算机工程与科学, 2024, 46(02): 209-216.
[2]	张云泉, 邓力, 袁良, 袁国兴. 2023年中国高性能计算机发展现状分析[J]. 计算机工程与科学, 2023, 45(12): 2091-2098.
[3]	施得君, 李宏亮, 胡舒凯. 基于Clos网络的高阶路由器结构[J]. 计算机工程与科学, 2023, 45(12): 2099-2112.
[4]	张天阳, 池成悦, 郭武, 高亦沁, 文敏华, 韦建文. 校级异地超算集群管理的关键技术研究与实践[J]. 计算机工程与科学, 2023, 45(12): 2135-2145.
[5]	肖调杰, 周峰, 郑翾宇, 刘剑, 陈琳, 刘杰, 易明宽, 陈旭光, 龚春叶, 杨博, 甘新标, 李胜国, 左克, . 大规模三维频率域电磁积分方程法数值模拟[J]. 计算机工程与科学, 2023, 45(11): 1901-1910.
[6]	朱文龙, 江嘉治, 黄聃, 肖侬. ParM:基于国产处理器的异构并行编程模型[J]. 计算机工程与科学, 2023, 45(09): 1521-1531.
[7]	吴铁彬, 过锋, 王谛. 面向E级计算的高性能处理器核心运算架构研究进展[J]. 计算机工程与科学, 2023, 45(05): 761-771.
[8]	陈奉贤. 基于NR-Transformer的集群作业运行时间预测[J]. 计算机工程与科学, 2022, 44(07): 1181-1190.
[9]	曹继军. 面向HPC和DC的可重构光互连网络体系结构综述[J]. 计算机工程与科学, 2022, 44(06): 951-963.
[10]	袁国兴, 张云泉, 袁良. 2021年中国高性能计算机发展现状分析[J]. 计算机工程与科学, 2021, 43(12): 2091-2097.
[11]	袁远, 李世杰, 邢建英, 蒋句平. E级高性能计算机系统中监控分系统的挑战与设计[J]. 计算机工程与科学, 2021, 43(08): 1366-1375.
[12]	吴君楠, 欧洋, 李琰. 基于LAMP的高性能计算用户组织架构管理系统设计与实现[J]. 计算机工程与科学, 2021, 43(02): 235-241.
[13]	刘杰, 龚春叶, 杨博, 郭晓威, 甘新标, 李胜国, 李超, 陈旭光, 肖调杰, 穆利安, 宋敏, 赵冬勇, 鞠羽中. YH-ACT：热工流体力学并行应用程序[J]. 计算机工程与科学, 2021, 43(01): 58-69.
[14]	袁国兴, 张云泉, 袁良. 2020年中国高性能计算机发展现状分析[J]. 计算机工程与科学, 2020, 42(12): 2103-2108.
[15]	李哲, 谭郁松, 李宝, 余杰. 面向HPC的函数计算冷启动优化[J]. 计算机工程与科学, 2020, 42(11): 1973-1980.