Stochastic models for fault tolerance [electronic resource] : restart, rejuvenation and checkpointing / Katinka Wolter.

Bibliographic Details
Online Access: Full Text (via Springer)
Main Author: Wolter, Katinka
Format: Electronic eBook
Language: English
Published: Berlin ; London : Springer, ©2010.
Subjects: Fault-tolerant computing ; Stochastic models

MARC

LEADER 00000cam a2200000xa 4500
001 b6361261
006 m o d
007 cr |||||||||||
008 100909s2010 gw a ob 001 0 eng d
005 20240418143642.2
019 |a 654384288  |a 701051523  |a 771442762 
020 |a 9783642112560 
020 |a 3642112560 
020 |a 9783642112577  |q (ebk.) 
020 |a 3642112579  |q (ebk.) 
035 |a (OCoLC)spr663096585 
035 |a (OCoLC)663096585  |z (OCoLC)654384288  |z (OCoLC)701051523  |z (OCoLC)771442762 
037 |a spr10.1007/978-3-642-11257-7 
040 |a GW5XE  |b eng  |e pn  |c GW5XE  |d HNK  |d CDX  |d EBLCP  |d OCLCQ  |d E7B  |d OCLCQ  |d N$T  |d OCLCO  |d OCLCQ  |d YDXCP  |d OCLCF  |d DEBSZ  |d BEDGE  |d OCLCQ  |d A7U  |d OCLCQ  |d MYUML 
049 |a GWRE 
050 4 |a QA76.758 
100 1 |a Wolter, Katinka.  |0 http://id.loc.gov/authorities/names/nb2007022718  |1 http://isni.org/isni/0000000121323616. 
245 1 0 |a Stochastic models for fault tolerance  |h [electronic resource] :  |b restart, rejuvenation and checkpointing /  |c Katinka Wolter. 
260 |a Berlin ;  |a London :  |b Springer,  |c ©2010. 
300 |a 1 online resource (xvi, 269 pages) :  |b illustrations (some color) 
336 |a text  |b txt  |2 rdacontent. 
337 |a computer  |b c  |2 rdamedia. 
338 |a online resource  |b cr  |2 rdacarrier. 
504 |a Includes bibliographical references (pages 255-264) and index. 
505 0 |a Part I: Introduction -- 1) Basic Concepts and Problems -- 2) Task Completion Time -- Part II: Restart -- 3) Applicability Analysis of Restart -- 4) Moments of Completion Time under Restart -- 5) Meeting Deadlines through Restart -- Part III: Software Rejuvenation -- 6) Practical Aspects of Preventive Maintenance and Software Rejuvenation -- 7) Stochastic Models for Preventive Maintenance and Software Rejuvenation -- Part IV: Checkpointing -- 8) Checkpointing Systems -- 9) Stochastic Models for Checkpointing -- 10) Summary, Conclusion and Outlook -- Appendix -- A) Properties in Discrete Systems -- B) Important Probability Distributions -- C) Estimating the Hazard Time -- D) The Laplace and the Laplace-Stieltjes Transform. 
520 |a As modern society relies on the fault-free operation of complex computing systems, system fault-tolerance has become an indispensable requirement. Therefore, we need mechanisms that guarantee correct service in cases where system components fail, be they software or hardware elements. Redundancy patterns are commonly used, for either redundancy in space or redundancy in time. Wolter's book details methods of redundancy in time that need to be issued at the right moment. In particular, she addresses the so-called "timeout selection problem", i.e., the question of choosing the right time for different fault-tolerance mechanisms like restart, rejuvenation and checkpointing. Restart indicates the pure system restart, rejuvenation denotes the restart of the operating environment of a task, and checkpointing includes saving the system state periodically and reinitializing the system at the most recent checkpoint upon failure of the system. Her presentation includes a brief introduction to the methods, their detailed stochastic description, and also aspects of their efficient implementation in real-world systems. The book is targeted at researchers and graduate students in system dependability, stochastic modeling and software reliability. Readers will find here an up-to-date overview of the key theoretical results, making this the only comprehensive text on stochastic models for restart-related problems. 
588 0 |a Print version record. 
650 0 |a Fault-tolerant computing.  |0 http://id.loc.gov/authorities/subjects/sh85047488. 
650 0 |a Stochastic models.  |0 http://id.loc.gov/authorities/subjects/sh2005004376. 
650 7 |a Fault-tolerant computing.  |2 fast  |0 (OCoLC)fst00921988. 
650 7 |a Stochastic models.  |2 fast  |0 (OCoLC)fst01737780. 
776 0 8 |i Print version:  |a Wolter, Katinka.  |t Stochastic models for fault tolerance.  |d Berlin ; London : Springer, ©2010  |z 9783642112560  |z 3642112560  |w (OCoLC)503649523. 
856 4 0 |u https://colorado.idm.oclc.org/login?url=http://link.springer.com/10.1007/978-3-642-11257-7  |z Full Text (via Springer) 
907 |a .b63612616  |b 03-20-20  |c 10-14-10 
998 |a web  |b 05-01-17  |c g  |d b   |e -  |f eng  |g gw   |h 0  |i 1 
907 |a .b63612616  |b 07-02-19  |c 10-14-10 
944 |a MARS - RDA ENRICHED 
907 |a .b63612616  |b 07-06-17  |c 10-14-10 
907 |a .b63612616  |b 05-23-17  |c 10-14-10 
915 |a I 
956 |a Springer e-books 
956 |b Springer Nature - Springer Computer Science eBooks 2010 English International 
999 f f |i bceb2620-4e4e-5fda-8ff5-c0a662e6c34c  |s 8f30f156-33c7-59d3-b046-e38fc30bcb69 
952 f f |p Can circulate  |a University of Colorado Boulder  |b Online  |c Online  |d Online  |e QA76.758  |h Library of Congress classification  |i Ebooks, Prospector  |n 1