Course Outline
Introduction
- How SRE marries traditional IT and software development.
- The need for automation and observability
- The role of a software engineers vs system administrators.
- Site Reliability Engineers vs DevOps engineers.
Overview of an IT System
- System architecture, on-premise and in the cloud.
Overview of SRE Principles and Practices
- Infrastructure as a Code.
- The role of containerization and orchestration (Docker, Kubernetes, etc.)
- Continuous Integration, Continuous Deployment and Continuous Delivery.
- Observability.
Evaluating an IT System
- Taking stock of the team and organizational resources.
- Maping out the systems and processes.
- Estimating the potential impact of SRE.
- The role the software engineering team.
- The role of the operational team.
- The role of management.
Maintaining the Reliability of a System
- Describing and measuring the desired reliability of a service.
- Understanding Service Level Objectives (SLOs)
- Understanding Service Level Indicators (SLIs) and Service Level Agreements (SLAs).
- Working with Error Budgets.
- Developing an SLO.
Optimizing System Administration
- Setting up a development environment
- Evaluating SRE tools
- Prioritizing tasks for automation.
- Writing software.
Deploying "Infrastructure as Code"
- Testing and iterating code
- Making a system anti-fragile
- Learning from failure
Monitoring a System
- Observing system performance.
- SRE tools and techniques.
The Future of SRE
Summary and Conclusion
Requirements
- A general understanding of IT infrastructure.
- A general idea of the software development process.
- Programming or scripting experience in any language.
Audience
- Developers
- System administrators
- Software Architects
- DevOps engneers
- IT Managers
Testimonials (7)
How detailed subjects are explained with real world examples
Brian Hlabane - African Bank
Course - Site Reliability Engineering (SRE) Fundamentals
Dia ahli di bidangnya dan memberikan pelatihan yang sangat bagus. Materi, pelatihannya benar-benar perpaduan antara contoh, diskusi dan
Peter Tutka - Deutsche Telekom IT & Telecommunications Slovakia s.r.o.
Course - Site Reliability Engineering (SRE) Fundamentals
Machine Translated
Lihat SRE/ DevOps dari sudut pandang bisnis/teoretis. Paling bermanfaat bagi orang yang sudah mempunyai pandangan praktis.
Michael Varhol - Deutsche Telekom IT & Telecommunications Slovakia s.r.o.
Course - Site Reliability Engineering (SRE) Fundamentals
Machine Translated
Pendekatan pelatihan dengan mengirimkan kuisioner sebelum pelatihan, sehingga pelatihan direncanakan sesuai dengan harapan. Membuat peserta lebih aktif.
Stefan Girman - Deutsche Telekom IT & Telecommunications Slovakia s.r.o.
Course - Site Reliability Engineering (SRE) Fundamentals
Machine Translated
Berpegang teguh pada survei awal dari peserta tentang apa yang harus menjadi fokus pelatihan.
Denis Majorsky - Deutsche Telekom IT & Telecommunications Slovakia s.r.o.
Course - Site Reliability Engineering (SRE) Fundamentals
Machine Translated
diskusi, definisi SRE
Daniel Horvath - Deutsche Telekom IT & Telecommunications Slovakia s.r.o.
Course - Site Reliability Engineering (SRE) Fundamentals
Machine Translated
Konsep pelatihan, menjaga masyarakat tetap fokus dengan mengajukan pertanyaan dan memicu diskusi. Sesi breakout kelompok juga sangat bagus untuk memikirkan berbagai hal dalam kelompok dan melihat hasil yang berbeda dari kelompok lain.
Blazej Farkas - Deutsche Telekom IT & Telecommunications Slovakia s.r.o.
Course - Site Reliability Engineering (SRE) Fundamentals
Machine Translated