Site Reliability Engineering: How Google Runs Production Systems
"O'Reilly Media, Inc.", Mar 23, 2016 - 552 pages
The overwhelming majority of a software system's lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems? In this collection of essays and articles, key members of Google's Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You'll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient: lessons directly applicable to your organization. This book is divided into four sections:
Contents
Part I Introduction | 1 |
Part II Principles | 23 |
Part III Practices | 103 |
Part IV Management | 389 |
Part V Conclusions | 457 |
Appendix A Availability Table | 477 |
Appendix B A Collection of Best Practices for Production Services | 479 |
Appendix C Example Incident State Document | 485 |
Appendix D Example Postmortem | 487 |
Appendix E Launch Coordination Checklist | 493 |
Appendix F Example Production Meeting Minutes | 497 |
501 | |
511 | |
About the Authors | 523 |
Other editions
Site Reliability Engineering: How Google Runs Production Systems Niall Richard Murphy, Betsy Beyer, Chris Jones, Jennifer Petoff Limited preview - 2016 |
Site Reliability Engineering: How Google Runs Production Systems Betsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy No preview available - 2016 |