Site reliability engineers run some of the largest websites on the planet, and are inventing a new field of expertise while they do it. An important role in the DevOps practice, these engineers ...
Over the past few years, various executives have come to me for advice on how to build and implement a site reliability engineering (SRE) strategy within their organizations. Implementing this ...
Why is Site Reliability Engineering (SRE) becoming a more important business function in many firms? Firms like AWS, Google, Microsoft, Red Hat, and Firefly are already pushing the boundaries of ...
A monthly overview of things you need to know as an architect or aspiring architect.
OpenAI has hired Todd Underwood to head a new Site Reliability Engineering team focused on research and training workloads. The generative artificial intelligence company already has an SRE team for ...