Conquering Chaos:
Where Resilience Meets Reliability

JANUARY 22, 2025

See how chaos engineering exposes the potential vulnerabilities in your infrastructure so you can address issues before they impact your users.

Featured Speakers

Reuben Rajan George

Resilience and Chaos Engineering

Gunnar Grosch

Principal Developer Advocate

Chetan Thakker

Technology Leader and Senior Cloud Engineer

Rahul Prasad

Director of Engineering, Platform & SRE Ops

Vamsi Rao

Principal Engineer

Ben Weitzer

Senior Product Manager

Rob Charlwood

Lead SRE

Mandi Walls

DevOps Advocate

Spiros Economakis

Director of ProductOps

Lex Neva

Staff Site Reliability Engineer

Jintao Zhang

Senior Software Engineer

Balaji Venkatesan

Co-Founder

Sivasubramanian Bagavathiappan

Co-Founder

Ron Vidal

Founding Partner

Empower your developers
Maximize business outcomes
Measure impact & insights
Achieve engineering excellence

Discover how leading organizations are leveraging machine learning (ML) and AI to automate their software delivery lifecycle.

Why Attend?

Leaders from the brands you know and love like Amazon, Wells Fargo, Marriott, Taco Bell, and more will share best practices on how you can get the most out of your AI initiatives, as well as what pitfalls to avoid.

Global experts share insights on strengthening chaos engineering and incident management to boost resilience, release with confidence, and proactively detect incidents.

Master Complex Systems

Gain insights to construct a resilient system that minimizes downtime, saves on costs, and uncovers hidden issues.

Improve System Resilience

Learn to build more resilient systems by identifying weaknesses and potential points of failure before they impact users.

Respond to Incidents Intelligently

Better prepare your teams and respond to incidents with more data and automation.

Build a Resilience Culture

Learn to foster a culture of reliability within teams and organizations to prioritize continuous improvement and recovery.

Two Reliability Tracks

Chaos Engineering

Experimenting on systems to build better resilience.

Incident Response

Discovering and responding to software-related threats and breaches.

Agenda

Time (PST)
All Tracks
8:00 am - 8:45 am
keynote
Revolutionizing Resilience: AI-Powered Chaos Engineering and Incident Response
Tina Huang, Harness
Umasankar Mukkara, Harness
8:45 am - 9:05 am
keynote
Banking on Resilience: How AZ Failure Testing Strengthens IT Disaster Recovery
Chetan Thakker, Deutsche Bank
9:05 am - 9:35 am
keynote
From Reactive to Proactive: Rethinking Incident Response in the Age of AI
Ryan Taylor, Harness
9:35 am - 10:15 am
Turning Incidents into Opportunities: An SLO-Centric Chaos Engineering Perspective
Sivasubramanian Bagavathiappan, GuhaTek Consulting Services
Balaji Venkatesan, GuhaTek Consulting Services
10:15 am - 10:35 am
Destroy on Friday: Taking Down an Availability Zone in Production
Lex Neva, Honeycomb
10:40 am - 11:15 am
Building Resilient Deployments with Google Cloud Build
Ben Weitzer, Google Cloud
Matt Schillerstrom, Harness
11:15 am - 12:00 pm
Break
Time
Chaos Engineering Track
Incident Response Track
12:00 pm - 12:30 pm
Building Custom Chaos: From Fault Design to Experimentation with Litmus SDK
Shubham Chaudhary, Litmus
Adarsh Kumar, Harness
RespondNow: Building Resilience with Open-Source Incident Management
Sagar Kumar, Harness
12:30 pm - 1:00 pm
Scaling Chaos Engineering with the Power of GenAI
Karthik Satchitanand, Harness
Turning Incidents into Learning Opportunities
Gunnar Grosch, AWS
1:00 pm - 1:30 pm
keynote
Resiliency at Scale: How a Bank Balances Innovation and Stability in the Modern Era
Rahul Prasad, Citizens Bank
Managing Vendor Incidents
Mandi Walls, PagerDuty
1:30 pm - 1:45 pm
Effectively Using Resilience Probes to Validate Resilience Design Patterns
Reuben Rajan George, Accenture
Empowering SRE Teams and Incident Management with AI
Spiros Economakis, Mattermost
1:45 pm - 2:45 pm
Taking Resilience: Navigating the Journey of End-to-End Resilience Engineering from Deployment to Testing to Resolving Incidents
Ron Vidal, BlackRock3
Vamsi Krishna Rao, Salesforce
Jintao Zhang, Kong
Ryan Taylor, Harness
Matt Schillerstrom, Harness
2:45 pm - 3:15 pm
Embracing the Chaos: How SREs Could Have Saved Jurassic Park
Rob Charlwood, Lloyds Banking Group
by
Chaos Carnival