タイトル | Implementation of a Message Passing Interface into a Cloud-Resolving Model for Massively Parallel Computing |
本文(外部サイト) | http://hdl.handle.net/2060/20040171260 |
著者(英) | Shie, Chung-Lin; Zeng, Xi-Ping; Lang, Steve; Juang, Hann-Ming Henry; Tao, Wei-Kuo; Simpson, Joanne |
著者所属(英) | NASA Goddard Space Flight Center |
発行日 | 2004-06-02 |
言語 | eng |
内容記述 | The capability for massively parallel programming (MPP) using a message passing interface (MPI) has been implemented into a three-dimensional version of the Goddard Cumulus Ensemble (GCE) model. The design for the MPP with MPI uses the concept of maintaining similar code structure between the whole domain as well as the portions after decomposition. Hence the model follows the same integration for single and multiple tasks (CPUs). Also, it provides for minimal changes to the original code, so it is easily modified and/or managed by the model developers and users who have little knowledge of MPP. The entire model domain could be sliced into one- or two-dimensional decomposition with a halo regime, which is overlaid on partial domains. The halo regime requires that no data be fetched across tasks during the computational stage, but it must be updated before the next computational stage through data exchange via MPI. For reproducible purposes, transposing data among tasks is required for spectral transform (Fast Fourier Transform, FFT), which is used in the anelastic version of the model for solving the pressure equation. The performance of the MPI-implemented codes (i.e., the compressible and anelastic versions) was tested on three different computing platforms. The major results are: 1) both versions have speedups of about 99% up to 256 tasks but not for 512 tasks; 2) the anelastic version has better speedup and efficiency because it requires more computations than that of the compressible version; 3) equal or approximately-equal numbers of slices between the x- and y- directions provide the fastest integration due to fewer data exchanges; and 4) one-dimensional slices in the x-direction result in the slowest integration due to the need for more memory relocation for computation. |
NASA分類 | Meteorology and Climatology |
権利 | Copyright, Distribution under U.S. Government purpose rights |
|