Site Reliability Engineering

Niall Murphy leads the Ads Site Reliability Engineering team at Google Ireland. He has worked in the Internet industry for over 20 years and currently chairs INEX, Ireland's peering hub.


Betsy Beyer works for Google Site Reliability Engineering in New York City. She previously wrote documentation for Google's Datacenters and Hardware Operations teams.


Chris Jones works for Google App Engine, a cloud platform-as-a-service that handles over 28 billion queries each day.


Jennifer Petoff, based in Dublin, Ireland, is a Program Manager for Google's Site Reliability Engineering team. She has managed large global projects in a variety of fields, including scientific research, engineering, human resources, and advertising operations.


The vast majority of a software system's lifespan is spent in use rather than in design or implementation. So why does conventional wisdom hold that software engineers should be primarily concerned with the design and development of large-scale computing systems?


In this collection of essays and articles, key members of Google's Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the world's largest software systems. You'll learn the principles and practices that allow Google engineers to make systems more scalable, reliable, and efficient, lessons that are directly applicable to your organization. At its core, the work consists of developing and managing distributed computing systems.


Site Reliability Engineering is divided into four sections:

  • Introduction: Learn what site reliability engineering is and how it differs from standard IT industry procedures.
  • Principles: Examine the trends, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE).
  • Practices: Understand the theory and practice of an SRE's day-to-day work: developing and managing massive distributed computing systems.
  • Management: Investigate Google's best practices for training, communication, and meetings and how they may benefit your organization.


Authors: Niall Murphy, Betsy Beyer, Chris Jones, Jennifer Petoff

Link to buy: https://www.amazon.com/Site-Reliability-Engineering-Production-Systems-ebook/dp/B01DCPXKZ6/

Ratings: 4.7 out of 5 stars (from 774 reviews)

Best Sellers Rank: #101,312 in Kindle Store

#2 in System Administration Disaster & Recovery

#4 in Network Disaster & Recovery Administration

#5 in System Administration

