CPC 2006 CPC 2006

Call for Participation
Current List of
		      Participants
Program (List of Talks)
Registration Form
Accomodation
Travel Information
Links

Funded by the Vicerreitoría de Investigación of the University of A Coruña and the Ministry of Education and Science of Spain

Universidade de A Coruña Ministerio de
		      Educación y Ciencia



Portable Checkpointing of MPI Applications

Gabriel Rodríguez, María J. Martín, Patricia González, Juan Touriño and Ramón Doallo
Computer Architecture Group, University of A Coruña, Spain


CPPC (Controller/Precompiler for Portable Checkpointing) is a checkpoint-based fault tolerance tool focused on the execution of parallel applications over heterogeneous clusters or Grids. In addition to using portable code, protocols and data storage formats, compile-time analysis of the application to be checkpointed is needed in order to avoid restart inconsistencies, without falling back to traditional approaches like process coordination or message-logging, which typically introduce unscalable overheads. These solutions are implemented into the CPPC library, used by the CPPC source-to-source preprocessor to automatically transform a parallel application into a fault tolerant counterpart. Experimental results to assess the checkpointing tool are also provided.


Back to the Workshop Program 

Please contact our webadmin with any comments or changes.