Fault Tolerant Systems
EE-40188
Learn about Fault Tolerant Systems
Fault Tolerant Systems course introduces basic concepts of design and implementation of fault tolerance mechanisms in general systems. Specific kind of fault tolerance are addressed, such as physical fault tolerance (adding redundant physical components), fault tolerance for information (error detecting and correcting codes) and temporal fault tolerance (via retry mechanisms). The purpose of this course is to establish fault tolerance as a measure to improve the dependability of systems in the presence of faults and correlating this dependability with the effects to the system and functional safety.
Participants in this course will be familiarized with the quantitative and qualitative methodology used in the evaluation of specific fault tolerance principles. At the end of this course, students will be able to understand and perform key alterations to system functions, components, or mechanisms, to add an appropriate level of redundancy with the goal to achieve expected reliability of systems despite the residual faults in the system design.
Course Highlights:
- Basics of fault tolerance
- Physical fault tolerance
- Fault tolerance in information exchange
- Practical considerations of fault tolerance
Course Benefits:
- Understand system and safety integrity concepts
- Be aware of threats to system dependability stemming from residual design faults or security issues
- Perform basic analysis of high-level composite system diagrams
- Identify key areas to perform dependability improvement
- Design fault tolerance mechanism in appropriate stages
- Determine the required level of redundancy for the target reliability of a system, and perform its high-level design alterations
- Design fault tolerant information exchange and applying appropriate information fault tolerance mechanisms
- Understand the relation between fault tolerance and system repair.
Course Typically Offered: Live Online during Fall and Spring quarter
Prerequisites: Students should have previous knowledge of selected calculus topics, such as basic differential equations, operation with matrices, Galois fields, vector spaces, and Boolean algebra. Students shall have basic knowledge of system and safety engineering and system reliability concepts, such as failure probability, reliability, failure rate, constant failure rate, MTTF, FIT, etc. or have completed the course “Systems, Functions and Safety”.
Hardware (required): Computer with Internet connection, working speakers and microphone.
Next steps: Upon completion of this course, consider taking other courses in the Functional Safety Fundamentals For Automotive Certificate
More information: Contact unexengr@ucsd.edu to learn more about Functional Safety Fundamentals For Automotive and course offerings.
Course Information
Course sessions
Section ID:
Class type:
Textbooks:
All course materials are included unless otherwise stated.
Policies:
- No refunds after: 11/23/2024
Schedule:
Instructor: Milan Bjelica
Dr. Bjelica holds a Ph.D. degree in computer engineering from the University of Novi Sad, Serbia, where he also holds the position of Associate Professor for computer engineering and automotive engineering. He also received advanced training on system safety, functional safety, fault-tolerant design, and safety-relevant processes from Vienna Institute for Safety and Systems Engineering (Austria). His professional and research focus is on complex system and software architectures with specific interactions and virtualization. He authored more than 100 publications across major journals and scientific conferences, and also holds 30 patents.
Section ID:
Class type:
Textbooks:
All course materials are included unless otherwise stated.
Policies:
- No refunds after: 3/25/2025