A scalable parallel debugging library with pluggable communication protocols

Jin, Chao, Abramson, David, Dinh, Minh Ngo, Gontarek, Andrew, Moench, Robert and DeRose, Luiz (2012). A scalable parallel debugging library with pluggable communication protocols. In: Pavan Balaji, Rajkumar Buyya, Shikharesh Majumdar and Suraj Pandey, Proceedings, 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2012, Ottawa, ON, (252-259). 13 - 16 May 2012. doi:10.1109/CCGrid.2012.13


Author Jin, Chao
Abramson, David
Dinh, Minh Ngo
Gontarek, Andrew
Moench, Robert
DeRose, Luiz
Title of paper A scalable parallel debugging library with pluggable communication protocols
Conference name 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2012
Conference location Ottawa, ON
Conference dates 13 - 16 May 2012
Proceedings title Proceedings, 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
Place of Publication Los Alamitos, Calif
Publisher IEEE Communications Society
Publication Year 2012
Year available 2012
Sub-type Fully published paper
DOI 10.1109/CCGrid.2012.13
ISBN 9780769546919
Editor Pavan Balaji
Rajkumar Buyya
Shikharesh Majumdar
Suraj Pandey
Start page 252
End page 259
Total pages 8
Language eng
Abstract/Summary Parallel debugging faces challenges in both scalability and efficiency. A number of advanced methods have been invented to improve the efficiency of parallel debugging. As system scale increases, these methods rely heavily on a scalable communication protocol to remain usable in large-scale distributed environments. This paper describes a debugging middleware that provides fundamental debugging functions and supports multiple communication protocols. Its pluggable architecture allows users to select appropriate communication protocols as plug-ins for debugging on different platforms. It is intended for use by various advanced debugging technologies across different computing platforms. The performance of this debugging middleware is examined on a Cray XE supercomputer with 21,760 CPU cores.
Subjects 1703 Computational Theory and Mathematics
2604 Applied Mathematics
Keyword Parallel debugger
Q-Index Code E1
Q-Index Status Provisional Code
Institutional Status Non-UQ

 
Citation counts: Cited 4 times in Scopus
Created: Tue, 22 Oct 2013, 01:03:52 EST by Ms Diana Cassidy on behalf of Research Computing Centre