APPLE-MASNUM: Accelerating parallel processing for lightweight expansion of MASNUM on a single multi-GPU node
Document type: Journal article
| Authors | Lou, Qi [6]; Wu, Changmao [6]; Dong, Changming [5]; Feng, Xingru [4]; Xia, Yuanyuan [6]; Liu, Li [3]; Xu, Zhengwei [9]; Gao, Xu [1,2]; Sun, Meng [7,8]; Yin, Xunqiang [7,8] |
| Journal | OCEAN MODELLING |
| Publication date | 2025-07-01 |
| Volume | 196; Pages: 9 |
| Keywords | MASNUM Lightweight Heterogeneous computing Parallel MPI |
| ISSN | 1463-5003 |
| DOI | 10.1016/j.ocemod.2025.102557 |
| Corresponding authors | Wu, Changmao (changmao@iscas.ac.cn); Dong, Changming (cmdong@nuist.edu.cn) |
| Abstract | The Marine Science and Numerical Modeling (MASNUM) system, developed for ocean wave forecasting, plays an important role in marine disaster prevention and maritime activities. However, its application is hampered by its demand for large computing resources. To overcome this barrier, we have implemented accelerating parallel processing for lightweight expansion of MASNUM (APPLE-MASNUM) on a single compute node with multiple GPUs. First, the mathematical-physics equations of the MASNUM system are thoroughly analyzed to pinpoint the primary computational bottlenecks. MASNUM is then transformed from a multi-process MPI program into a preliminary GPU-compatible algorithm. Subsequently, an optimization strategy for two-dimensional four-point stencil computations is proposed. Following this, an optimization method for overlapping computation with communication is introduced. Finally, a refined data layout scheme tailored for GPUs is designed and implemented. Three numerical experiments with five-day wave forecasts demonstrate that, compared to single-core MASNUM, the framework achieves speedups of 49.29-fold, 62.58-fold, and 65.74-fold, respectively. This considerable performance boost highlights the efficiency of the lightweight APPLE-MASNUM framework. This work represents the first implementation and optimization of the MASNUM model on a GPU-based heterogeneous platform. |
| WOS keywords | MODEL |
| Funding | National Key R&D Program of China [2023YFC3008202]; National Supercomputing Center in Zhengzhou |
| WOS research areas | Meteorology & Atmospheric Sciences; Oceanography |
| Language | English |
| WOS accession number | WOS:001517308400001 |
| Publisher | ELSEVIER SCI LTD |
| Source URL | [http://ir.qdio.ac.cn/handle/337002/202430] |
| Collection | Institute of Oceanology_Key Laboratory of Ocean Circulation and Waves |
| Corresponding authors | Wu, Changmao; Dong, Changming |
| Author affiliations | 1. Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou, Peoples R China; 2. Natl Supercomp Ctr Zhengzhou, Zhengzhou, Peoples R China; 3. Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi, Peoples R China; 4. Chinese Acad Sci, Inst Oceanol, CAS Key Lab Ocean Circulat & Waves, Qingdao, Peoples R China; 5. Nanjing Univ Informat Sci & Technol, Sch Marine Sci, Nanjing, Peoples R China; 6. Chinese Acad Sci, Inst Software, Beijing, Peoples R China; 7. Minist Nat Resources, Inst Oceanog 1, Qingdao, Peoples R China; 8. Minist Nat Resources, Key Lab Marine Sci & Numer Modeling MASNUM, Qingdao, Peoples R China; 9. Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang, Peoples R China |
| Recommended citation (GB/T 7714) | Lou, Qi,Wu, Changmao,Dong, Changming,et al. APPLE-MASNUM: Accelerating parallel processing for lightweight expansion of MASNUM on a single multi-GPU node[J]. OCEAN MODELLING,2025,196:9. |
| APA | Lou, Qi.,Wu, Changmao.,Dong, Changming.,Feng, Xingru.,Xia, Yuanyuan.,...&Yin, Xunqiang.(2025).APPLE-MASNUM: Accelerating parallel processing for lightweight expansion of MASNUM on a single multi-GPU node.OCEAN MODELLING,196,9. |
| MLA | Lou, Qi,et al."APPLE-MASNUM: Accelerating parallel processing for lightweight expansion of MASNUM on a single multi-GPU node".OCEAN MODELLING 196(2025):9. |
Ingest method: OAI harvesting
Source: Institute of Oceanology

