Large improvements in application throughput of long-running multi-component applications using batch grids

Sundari M, Sivagama ; Vadhiyar, Sathish S. ; Nanjundiah, Ravi S. (2011) Large improvements in application throughput of long-running multi-component applications using batch grids Concurrency and Computation: Practice and Experience, 24 (15). pp. 1775-1791. ISSN 1532-0626

Full text not available from this repository.

Official URL: http://doi.org/10.1002/cpe.1878

Related URL: http://dx.doi.org/10.1002/cpe.1878

Abstract

Computational grids with multiple batch systems (batch grids) can be powerful infrastructures for executing long-running multi-component parallel applications. In this paper, we evaluate the potential improvements in throughput of long-running multi-component applications when the different components of the applications are executed on multiple batch systems of batch grids. We compare the multiple batch executions with executions of the components on a single batch system without increasing the number of processors used for executions. We perform our analysis with a foremost long-running multi-component application for climate modeling, the Community Climate System Model (CCSM). We have built a robust simulator that models the characteristics of both the multi-component application and the batch systems. By conducting large number of simulations with different workload characteristics and queuing policies of the systems, processor allocations to components of the application, distributions of the components to the batch systems and inter-cluster bandwidths, we show that multiple batch executions lead to 55% average increase in throughput over single batch executions for long-running CCSM. We also conducted real experiments with a practical middleware infrastructure and showed that multi-site executions lead to effective utilization of batch systems for executions of CCSM and give higher simulation throughput than single-site executions. Copyright © 2011 John Wiley & Sons, Ltd.

Item Type:Article
Source:Copyright of this article belongs to ResearchGate GmbH.
ID Code:120451
Deposited On:30 Jun 2021 06:58
Last Modified:30 Jun 2021 06:58

Repository Staff Only: item control page