The Rootly AI Labs is a fellow-led community designed to redefine reliability engineering. We develop innovative prototypes, create open-source tools, and publish research to advance the standards of operational excellence.
- SRE-skills-bench: Can LLMs resolve real-world SRE tasks? A benchmark that tests LLMs on SRE-type tasks. Like SWE-bench, but for SREs.
- On-Call Health: Detects potential signs of overwork in incident responders, which could lead to burnout.
- Rootly MCP server: Resolve production incidents in under a minute without leaving your IDE.
- IncidentDiagram: Generates a diagram highlighting what happened during an incident by ingesting the retrospective and associated codebase.
Rootly began in 2021 by building a category-defining on-call and incident response platform, trusted by thousands, including Replit, NVIDIA, LinkedIn, and Dropbox.
Now, GenAI is simultaneously introducing new complexities and unlocking opportunities to redefine reliability forever.
- Allan Parson – Sr Staff Engineer at Venmo
- Casey Brown – Head of Infrastructure Engineering at Weights and Biases
- Kishan Rao – Engineering Manager at Okta
- Kishore Korathaluri – Staff Site Reliability Engineer at Cribl
- Laurence Liang – Student Researcher at McGill University
- Muhammad Hamza – Machine Learning Researcher at University of Toronto
- Sahil Kumar – Director of AI Product at Twilio
- Spencer Cheng – Software Engineer at Rivian
- Sylvain Kalache – Head of Rootly AI Labs
Thank you to our partners for supporting us.



