摘要
We first introduced the multicore specific optimization modules of two common MPI implementations - MPICH2 and OpenMPI, and then tested their performance on one multicore computer. By enabling and disabling these modules, we provided their performance, including bandwidth and latency, under different circumstances. Finally, we analyzed the two MPI implementations and discussed the choice of MPI implementations and possible improvements.
- 出版日期2012
- 单位清华大学