S-Space College of Engineering/Engineering Practice School (공과대학/대학원) Program in Technology, Management, Economics and Policy (협동과정-기술·경영·경제·정책전공) Others_협동과정-기술·경영·경제·정책전공
The FTMPS-Project: Design and Implementation of Fault-Tolerance Techniques for Massively Parallel Systems
- Vounckx, Johan; Deconinck, G.; Lauwereins, Rudy; Viehover, G.; Wagner, R.; Madeira, H.; Silva, J.G.; Balbach, F.; Altmann, Jorn; Bieker, B.; Willeke, H.
- Issue Date
- Springer Verlag
- Lecture Notes in Computer Science, Vol. 797/1994 (1994) 401-406
- The FTMPS-project provides a solution to the need for faulttolerance
in large systems . A complete fault-tolerance approach is developed
and being implemented . The built-in hardware error-detection features
combined with software error-detection techniques provide a high
coverage of transient as well as perananent failures . Combined with the
diagnosis software, the necessary information for the OSS (statistics and
visualisation) and the possibly reconfigm-ation is collected . Backward error
recovery based on checkpointing and rollback, is implemented
- 0302-9743 (print)
- Files in This Item: There are no files associated with this item.