Faults, failures, availability and reliability
Reliability is the probability that a system or component will function within specified limits for a specified period of time under specified conditions. Reliability engineering consists of estimating the probability of failure of different components, analyzing component failure modes and examining the manner in which they can lead to failure of the service provided by a system. Metrics analyzed include the mean time to failure (MTTF), mean time to repair (MTTR) and MTBF (mean time between failures).
In these slides, we give an overview of the role of reliability engineering in system safety and the way in which it is integrated into risk analysis.
This submodule is a part of the risk analysis module.
Overview of reliability engineering
Python notebook with exercises in reliability calculations
We recommend the following sources of further information on this topic:
- Wired.com article Why Things Fail: From Tires to Helicopter Blades, Everything Breaks Eventually from 2010