Publications

Detailed Information

The FTMPS-Project: Design and Implementation of Fault-Tolerance Techniques for Massively Parallel Systems

Cited 0 time in Web of Science Cited 2 time in Scopus
Authors

Vounckx, Johan; Deconinck, G.; Lauwereins, Rudy; Viehover, G.; Wagner, R.; Madeira, H.; Silva, J.G.; Balbach, F.; Altmann, Jorn; Bieker, B.; Willeke, H.

Issue Date
1994-04
Publisher
Springer Verlag
Citation
Lecture Notes in Computer Science, Vol. 797/1994 (1994) 401-406
Abstract
The FTMPS-project provides a solution to the need for faulttolerance
in large systems . A complete fault-tolerance approach is developed
and being implemented . The built-in hardware error-detection features
combined with software error-detection techniques provide a high
coverage of transient as well as perananent failures . Combined with the
diagnosis software, the necessary information for the OSS (statistics and
visualisation) and the possibly reconfigm-ation is collected . Backward error
recovery based on checkpointing and rollback, is implemented
ISSN
0302-9743 (print)
1611-3349 (online)
Language
English
URI
https://hdl.handle.net/10371/6894
DOI
https://doi.org/10.1007/3-540-57981-8_151
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share