Symptom-based Fault Detection in Modern Computer Systems
dc.contributor.author | Becker, Thomas | |
dc.contributor.author | Rudolf, Nico | |
dc.contributor.author | Yang, Dai | |
dc.contributor.author | Karl, Wolfgang | |
dc.date.accessioned | 2020-08-25T09:05:20Z | |
dc.date.available | 2020-08-25T09:05:20Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Miniaturization and the increasing number of components, which get steadily more complex, lead to a rising failure rate in modern computer systems. Especially soft hardware errors are a major problem because they are usually temporary and therefore hard to detect. As classical fault-tolerance methods are very costly and reduce system efficiency, light-weight methods are needed to increase system reliability. A method that copes with this requirement is symptom-based fault detection. In this work, we evaluate the ability to detect different faults with symptom-based fault detection by using hardware performance counters. As the knowledge of a fault occurrence is usually not enough, we also evaluate the possibility to make conclusions about which fault occurred. For the evaluation, we used the fault-injection library FINJ and manually manipulated loops. The results show that symptom-based fault detection enables the system to detect faulty application behavior, however fine-grained conclusions about the causing fault are hardly possible. | en |
dc.identifier.pissn | 0177-0454 | |
dc.identifier.uri | https://dl.gi.de/handle/20.500.12116/33864 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V., Fachgruppe PARS | |
dc.relation.ispartof | PARS-Mitteilungen: Vol. 35, Nr. 1 | |
dc.title | Symptom-based Fault Detection in Modern Computer Systems | en |
dc.type | Text/Journal Article | |
gi.citation.endPage | 50 | |
gi.citation.publisherPlace | Berlin | |
gi.citation.startPage | 39 |
Dateien
Originalbündel
1 - 1 von 1