Data Entry: Please note that the research database will be replaced by UNIverse by the end of October 2023. Please enter your data into the system https://universe-intern.unibas.ch. Thanks

Login for users with Unibas email account...

Login for registered users without Unibas email account...

 
Two-level Dynamic Load Balancing for High Performance Scientific Applications
ConferencePaper (Artikel, die in Tagungsbänden erschienen sind)
 
ID 4517961
Author(s) Mohammed, Ali; Cavelan, Aurélien; Ciorba, Florina M.; Cabezon, Ruben; Banicescu, Ioana
Author(s) at UniBasel Mohammed, Ali Omar Abdelazim
Cavelan, Aurélien
Ciorba, Florina M.
Cabezon, Ruben
Year 2020
Title Two-level Dynamic Load Balancing for High Performance Scientific Applications
Editor(s) Biros, George; Meier Yang, Ulrike
Book title (Conference Proceedings) Proceedings of the 2020 SIAM Conference on Parallel Processing for Scientific Computing
Place of Conference Seatle, WA, USA
Year of Conference 2020
Publisher SIAM
Pages 69-80
ISSN/ISBN 978-1-61197-613-7
Keywords two-level dynamic load balancing, computationally-intensive applications, high performance computing, self-scheduling, MPI+OpenMP
Abstract Scientific applications are often complex, irregular, and computationally-intensive. To accommodate their ever-increasing computational demands, the high-performance computing (HPC) systems have become larger and more complex, offering increased hardware parallelism at multiple levels (e.g., nodes, cores per node, threads per core). Scientific applications need to exploit all multilevel hardware parallelism to harness the available computational power. The performance of applications executing on such HPC systems may adversely be affected by load imbalance at multiple levels, caused by problem, algorithmic, and systemic characteristics. Existing dynamic load balancing methods do not simultaneously address load imbalance at multiple software parallelism levels. This work investigates the joint impact of load imbalance on the performance of three scientific applications at the thread and process levels. We jointly apply and evaluate selected dynamic loop self-scheduling (DLS) techniques to both levels. This approach is generic and applicable to any multiprocess-multithreaded computationally-intensive application. We conduct an exhaustive set of experiments to assess and compare the combination of six DLS techniques at the thread level and eleven at the process level. The results show that improved overall application performance, by up to 21%, can only be achieved by jointly addressing load imbalance at both software parallelism levels. We offer insights into the performance of the selected DLS techniques and discuss the interplay of dynamic load balancing at the thread and process levels.
URL https://doi.org/10.1137/1.9781611976137.7
edoc-URL https://edoc.unibas.ch/73178/
Full Text on edoc No
Digital Object Identifier DOI 10.1137/1.9781611976137.7
 
   

MCSS v5.8 PRO. 0.379 sec, queries - 0.000 sec ©Universität Basel  |  Impressum   |    
02/05/2024