The FTMPS-Project: Design and Implementation of Fault-Tolerance Techniques for Massively Parallel Systems
Cited 0 times in Web of Science
Cited 2 times in Scopus
- Authors
- Issue Date
- 1994-04
- Publisher
- Springer Verlag
- Citation
- Lecture Notes in Computer Science, Vol. 797/1994 (1994) 401-406
- Abstract
- The FTMPS-project provides a solution to the need for fault-tolerance
in large systems. A complete fault-tolerance approach is developed
and being implemented. The built-in hardware error-detection features
combined with software error-detection techniques provide a high
coverage of transient as well as permanent failures. Combined with the
diagnosis software, the necessary information for the OSS (statistics and
visualisation) and the possible reconfiguration is collected. Backward error
recovery based on checkpointing and rollback is implemented.
- ISSN
- 0302-9743 (print)
1611-3349 (online)
- Language
- English
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.