Hosted By
16 Going

Boost Reliability with Problem Detection and Management

Hosted by Prequel
Zoom
Registration
Past Event
About Event

Featuring Niall Murphy, co-author of Site Reliability Engineering: How Google Runs Production Systems.

Maintaining application reliability is crucial for success and business survival. However, even the most robust systems are vulnerable to unforeseen issues that disrupt services and degrade the user experience. SREs and software engineers find themselves on a proverbial hamster wheel, chasing and resolving symptoms rather than underlying problems.

Join us for an in-depth webinar where we explore barriers to problem management and how emerging problem detection and analysis techniques can help overcome them.

We’ll dive into real-world scenarios, showcasing how focused detection of causes can prevent downtime, enhance system scalability, and drive customer satisfaction.

This session will cover:

  • Problem vs. Issue vs. Incident 

  • Common barriers to problem management

  • Emerging problem detection and analysis techniques

  • Integrating problem detection into your reliability strategy 

  • Case studies demonstrating the impact of problem detection on reliability

Whether you’re an SRE responsible for system uptime, or a reliability manager seeking to improve your team's effectiveness, this webinar will equip you with the knowledge and tools to enhance your system’s reliability through effective problem detection.

Speakers

Tony Meehan, Cofounder & CTO at Prequel

Tony Meehan is an engineering leader obsessed with bugs. He dedicated the first 10 years of his career to vulnerability and exploit development at the NSA. Tony spent the next 10 years leading Engineering at Endgame and Elastic, where he was responsible for a global engineering team. In 2023, Tony co-founded Prequel to change the way application failure is detected and resolved. He lives in Tulsa, Oklahoma.

Niall Murphy, Cofounder & CEO at Stanza Systems

Niall Richard Murphy has worked in computing infrastructure since the mid-1990s, and has been employed by every major cloud provider (specifically Amazon, Google, and Microsoft) from their Dublin, Ireland offices in a variety of roles from IC to Director. He is currently CEO/Founder of Stanza Systems, a small startup in the SRE/AI space. He is the instigator, co-author, and editor of multiple award-winning books on networking, reliability, and machine learning, and he is probably one of the few people in the world to hold degrees in Computer Science, Mathematics, and Poetry Studies. He lives in Dublin with his wife and two children.
