Parallel Performance Studies for a Parabolic Test Problem on the Cluster tara
dc.contributor.author | Muscedere, Michael | |
dc.contributor.author | Raim, Andrew M. | |
dc.contributor.author | Gobbert, Matthias K. | |
dc.date.accessioned | 2018-10-25T13:31:12Z | |
dc.date.available | 2018-10-25T13:31:12Z | |
dc.date.issued | 2010 | |
dc.description.abstract | The performance of parallel computer code depends on the intricate interplay of processors, the architecture of the computer nodes, their interconnect network, the numerical algorithm, and its implementation. The solution of large, sparse, highly structured of equations of linear equations by an iterative linear solver that requires communication between the parallel processes at every iteration is an instructive test of this interplay. This note considers a parabolic test problem given by a time-dependent, scalar, linear reaction-diffusion equation in three dimensions, whose time-stepping requires the solution of such a system of linear equations at every timestep. The results presented here show excellent performance on the cluster tara with up to 512 parallel processes when using 64 compute nodes. The results support the scheduling policy implemented, since they confirm that it is beneficial to use all eight cores of the two quad-core processors on each node simultaneously, giving us in-effect a computer that can run jobs efficiently with up to 656 parallel processes when using all 82 compute nodes. The cluster tara is an IBM server x iDataPlex purchased in 2009 by the UMBC High Performance Computing Facility (www.umbc.edu/hpcf). It is an 86-node distributed-memory cluster comprised of 82 compute, 2 develop, 1 user and 1 management nodes. Each node features two quad-core Intel Nehalem X5550 processors (2.66 GHz, 8 MB cache), 24 GB memory, and a 120 GB local hard drive. All nodes and the 160 TB central storage are connected by an InfiniBand (QDR) interconnect network. | en_US |
dc.description.sponsorship | The hardware used in the computational studies is part of the UMBC High Performance Computing Facility (HPCF). The facility is supported by the U.S. National Science Foundation through the MRI program (grant no. CNS–0821258) and the SCREMS program (grant no. DMS–0821311), with additional substantial support from the University of Maryland, Baltimore County (UMBC). See www.umbc.edu/hpcf for more information on HPCF and the projects using its resources. Andrew Raim additionally acknowledges financial support as HPCF RA. | en_US |
dc.description.uri | https://userpages.umbc.edu/~gobbert/papers/MuscedereRaimGobbert2010.pdf | en_US |
dc.format.extent | 17 pages | en_US |
dc.genre | technical report | en_US |
dc.identifier | doi:10.13016/M28S4JS9V | |
dc.identifier.uri | http://hdl.handle.net/11603/11680 | |
dc.language.iso | en_US | en_US |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Mathematics Department Collection | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.relation.ispartofseries | HPCF Technical Report;HPCF–2010–4 | |
dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
dc.subject | Parallel Performance | en_US |
dc.subject | Parabolic Test Problem | en_US |
dc.subject | Cluster tara | en_US |
dc.subject | UMBC High Performance Computing Facility (HPCF) | en_US |
dc.title | Parallel Performance Studies for a Parabolic Test Problem on the Cluster tara | en_US |
dc.type | Text | en_US |