Find partners
Incidentally Reliable Podcast by Xurrent

Incidentally Reliable Podcast by Xurrent

Hosted by Xurrent IMR

TechnologyInterviews guests

Episodes

19

Latest episode

Feb 2026

Language

EN

About the show

Welcome to the The Incidentally Reliable Podcast, where we dive into the world of engineering and bring you first-hand experiences and captivating insights from experts in the ever-evolving front lines of DevOps and Site Reliability. With a new guest every episode, learn how elusive reliability can be as we peek into their journey in the industry so far, engineering innovations made in distress, manoeuvred nightmares, their war-room stories, and their opinions on the current state of the space.

Listen to episodes

19 recent
February 13, 2026Episode 138 min

The Zenduty Journey, AI-Native Response, and a New Host | Incidentally Reliable S3E1

We are back for Season 3 of the Incidentally Reliable podcast! 🎙️In this season premiere, we are kicking off a new chapter as part of Xurrent. Our new host, Jim Hirschauer, sits down with podcast veteran and Zenduty co-founder, Vishwa, for a special throwback episode.They take a trip down memory lane to uncover the real story behind Zenduty, why it was built, how this podcast started, and the massive "needle in the haystack" observability problem that still plagues SRE teams today.💡 In this episode, we cover:-The transition from Zenduty to Xurrent: What’s changing?-Why observability tools often create more noise, not less.-The "Culture vs. Tooling" debate: Why you can't buy reliability.Jim’s vision for Season 3.----------------------------------------------------------------------🔗 Connect with us:Website: https://www.xurrent.com/podcastLinkedIn: https://www.linkedin.com/company/xurrent/#SRE #SiteReliabilityEngineering #Podcast #Zenduty #Xurrent #DevOps #IncidentManagement #TechPodcast #SeasonPremiere

August 1, 2025Episode 456 min

Once an SRE, always an SRE | Incidentally Reliable with Sudarshan Balakrishna

In this episode, Sudarshan shares his experience leading high-performing SRE and infrastructure teams at Rippling, Twilio, Walmart, and Epsilon. He talks about reducing CI/CD costs by 60 percent, cutting on-call alerts by 65 percent, and the mindset required to build resilient systems.

June 18, 2025Episode 354 min

CTRL + ALT + Scale: Building More Than Just Code | Incidentally Reliable with Sakshi Jain

In this episode, Madhu Rawat (CTO, Xurrent) sits down with Sakshi — Co-founder and Head of Engineering at Kapstan, with leadership experience at Sumo Logic and UpGrad. They discuss the evolution of observability, building for scale, the role of AI in incident management, and what it means to lead engineering teams through change.

May 21, 2025Episode 245 min

Redefining ITxM with Zenduty × Xurrent | Incidentally Reliable

In this episode, Phil (CPO) and Madhu (CTO) from Xurrent sit down with Vishwa and Ankur from Zenduty to talk about ITxM, building for reliability across teams, and how product and platform thinking come together in real-world incident workflows.

April 25, 2025Episode 142 min

S2 | #1 - Deepak Rajanna - From Cart Failures to Satellite Footprints

In this episode, we speak with Deepak Rajanna, CPTO at SatSure and ex-Amazon, Flipkart, xto10x, about pricing failures at scale, war room lessons from Big Billion Days, and building satellite-powered systems with SRE principles at their core.

December 20, 2024Episode 1439 min

#14 - Amit Rindhe - GoDaddy's Journey to Hosting Reliability

In this episode of Incidentally Reliable, we sit down with Amit Rhinde, Head of Engineering at GoDaddy, to uncover the secrets behind building resilient systems, scaling global operations, and ensuring uptime for millions of users.Amit takes us through his incredible journey, from pioneering SRE practices at Adobe and AWS to leading one of the world's most trusted hosting platforms.

September 27, 2024Episode 1329 min

#13 - Denys Pashutynski - Press Start to Scale: SRE in Gaming

In this episode of Incidentally Reliable, we chat with Denys Pashutynski, Senior Engineering Manager of Site Reliability at Roblox, about the challenges of maintaining gaming reliability for millions. Denys, with experience at companies like Twitter, AWS, and eBay, dives into how Roblox handles latency, traffic spikes, and customer expectations.

August 16, 2024Episode 1256 min

#12 - Abhishek Ghosh - Battle-Tested Reliability Strategies

We dive into the trenches with Abhishek Ghosh, a veteran who has led SRE teams at Pinterest, and now at Cribl. He shares gripping war room stories from Pinterest, strategies for maintaining uptime, insights into the role of AI in observability, and more! Discover the future of SRE and learn how to navigate the challenges of digital reliability. Tune in to gain valuable lessons from one of the industry's leading experts.

June 21, 2024Episode 1134 min

#11 - Ramiro Berrelleza - The Science of Building Cloud Native DevTools

Catch Ramiro Berrelleza — Founder and CEO at Okteto talk about how impactful DevTool startups are built, the importance of investing in Developer Experience, and the emerging issues in the Cloud Native ecosystem.

May 30, 2024Episode 1044 min

#10 - Krishnendu Majumdar - Credit-Worthy Reliability

Catch Krishnendu Majumdar (CPTO at Yubi) talk about his journey in the dynamic Indian startup ecosystem, strategies to build for scale from Day 1 and insights into building sustained user trust via exceptional product performance in high governance industries like credit and finance.

Is this your show?

Claim this listing to keep it up to date, reach guests who want to pitch you, and manage bookings with Guestify.

Claim this listing

More Technology podcasts