Scalability and efficiency challenges for the exascale supercomputing system: practice of a parallel supporting environment on the Sunway exascale prototype system

[Chinese title: 面对E级超算系统的可扩展性和效率挑战: 神威E级原型系统并行支撑环境的实践]

Authors: Xiaobin HE; Xin CHEN; Heng GUO; Xin LIU; Dexun CHEN; Yuling YANG; Jie GAO; Yunlong FENG; Longde CHEN; Xiaona DIAO; Zuoning CHEN

Affiliation: National Research Center of Parallel Computer Engineering and Technology, Beijing 100190, China

Publication: Frontiers of Information Technology & Electronic Engineering

Year, volume, issue: 2023, Vol. 24, No. 1

Pages: 41-58

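Subject classification: 0711 [Science: Systems Science]; 08 [Engineering]; 0835 [Engineering: Software Engineering]; 081201 [Engineering: Computer Systems Architecture]; 0812 [Engineering: Computer Science and Technology (engineering or science degrees may be conferred)]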

Funding: Project supported by the Key R&D Program of Zhejiang Province, China (No. 2022C01250) and the National Key R&D Program of China (No. 2019YFA0709402).

Keywords: Parallel computing; Sunway; Ultra-large-scale; Supercomputer

Abstract: With the continuous improvement of supercomputer performance and the integration of artificial intelligence with traditional scientific computing, the scale of applications is gradually increasing from millions to tens of millions of computing cores, which makes it very challenging to achieve high scalability and efficiency for parallel applications on super-large-scale systems. Taking the Sunway exascale prototype system as an example, in this paper we first analyze the challenges of high scalability and high efficiency for parallel applications in the exascale era. To overcome these challenges, we highlight the optimization technologies used in the parallel supporting environment software on the Sunway exascale prototype system, including the parallel operating system, input/output (I/O) optimization technology, ultra-large-scale parallel debugging technology, 10-million-core parallel algorithm, and mixed-precision method. The parallel operating system and I/O optimization technology mainly support large-scale system scaling, while the ultra-large-scale parallel debugging technology, 10-million-core parallel algorithm, and mixed-precision method mainly enhance the efficiency of large-scale applications. Finally, the contributions to various applications running on the Sunway exascale prototype system are introduced, verifying the effectiveness of the parallel supporting environment design.
