xMatters Open-Sources Chaos Engineering
May 22, 2019

xMatters has open-sourced Cthulhu, a chaos engineering tool that allows DevOps teams to design resilient, self-healing services across hybrid and multi-cloud infrastructures.

As more and more organizations move to a distributed systems model, chaos testing has become a leading-edge methodology for enhancing reliability. A major challenge, however, lies in executing these tests comprehensively across complex IT environments. Cthulhu enables automated cross-platform failure orchestration, using a data-driven approach to simulate complex disaster scenarios. This allows organizations to design more robust systems that better anticipate failure and — more importantly — improve self-healing mechanisms to accelerate automatic recovery.

“Microservice architecture can provide many benefits in scalability and functional encapsulation, but can also generate complex failure scenarios due to service dependencies. Chaos engineering can help expose these issues before they manifest themselves in production,” said Tobias Dunn-Krahn, CTO of xMatters. “Cthulhu exposes critical gaps in the self-healing ability of systems so that engineering teams can continuously re-fortify their applications against failures and keep the business running smoothly.”

The development of Cthulhu has been spearheaded by xMatters developer Gabrielle Gasse, who first identified an internal need to efficiently orchestrate failure experiments across xMatters’ varied infrastructure, which uses a mix of both virtual machines and Kubernetes deployments.

“No such tool existed that would allow us to comprehensively test failures across the multiple components of our environment, so we decided to build it ourselves — and then share it with the world,” said Gasse. “We’re excited to introduce Cthulhu to the developer community, and hope to engage and collaborate directly with those who will actually use and benefit from it.”

Cthulhu is designed to be extensible and easy to add to any platform. Startups can benefit from building better foundations from the get-go, while enterprises and established companies can rapidly identify critical areas for improvement and prioritize product roadmaps.

Core features of Cthulhu include:

- Cross-platform failure orchestration to automatically run random failure scenarios in any environment and on a schedule.

- Version-controllable scenarios so that once a vulnerability is identified, engineers can easily reproduce it in different environments.

- Automated communications to allow select team members to monitor the evolution of failure experiments and insights gained through targeted notifications.

Cthulhu is the second no-cost tool offered by xMatters. Last year, the company also released xMatters Free to allow smaller teams of up to 15 users to benefit from select collaboration features with no expiration date.

Cthulhu is available now.

Share this

Industry News

September 24, 2020

NetApp announced the availability of Elastigroup for Microsoft Azure Spot Virtual Machines (VMs).

September 24, 2020

CloudBees announced a robust new set of DevSecOps capabilities for CloudBees CI and CloudBees CD. The new capabilities enable customers to perform early and frequent security checks and ensure that security is an integral part of the whole software delivery pipeline workflow, without sacrificing speed or increasing risk.

September 24, 2020

Pulumi announced the release of a Pulumi-native provider for Microsoft Azure that provides 100% coverage of Azure Resource Manager (ARM), the deployment and management service for Azure that enables users to create, update and delete resources in their Azure accounts.

September 23, 2020

Puppet announced new Windows services, integrations and enhancements aimed at making it easier to automate and manage infrastructure using tools Windows admins rely on. The latest updates include services around Group Policy Migration and Chocolatey, as well as enhancements to the Puppet VS Code Extension, and a new Puppet PowerShell DSC Builder module.

September 23, 2020

Red Hat announced the release of Red Hat OpenShift Container Storage 4.5, delivering Kubernetes-based data services for modern, cloud-native applications across the open hybrid cloud.

September 23, 2020

Copado, a native DevOps platform for Salesforce, has acquired ClickDeploy.

September 22, 2020

CloudBees announced general availability of the first two modules of its Software Delivery Management solution.

September 22, 2020

Applause announced the availability of its Bring Your Own Testers (BYOT) feature that enables clients to manage their internal teams – employees, friends, family members and existing customers – and invite them to test cycles in the Applause Platform alongside Applause’s vetted and expert community of testers.

September 22, 2020

Kasten announced the integration of the K10 data management platform with VMware vSphere and Tanzu Kubernetes Grid Service.

September 21, 2020

PagerDuty entered into a definitive agreement to acquire Rundeck, a provider of DevOps automation for enterprise.

September 21, 2020

Grafana Labs announced the release of Grafana Metrics Enterprise, a modern Prometheus-as-a-Service solution designed for the scale, architecture, and security needs of enterprises as they expand their observability initiatives.

September 21, 2020

Portshift's Cloud Workload Protection platform is now available through the Red Hat Marketplace.

September 17, 2020

env0, a developer of Infrastructure-as-Code (IaC) management software, announced the availability of its new open source solution for Terraform users, Terratag.

September 17, 2020

Push Technology announced a partnership with Innova Solutions, an ACS Solutions company, specializing in global information technology services.

September 17, 2020

Alcide achieved the AWS Outposts Ready designation, part of the Amazon Web Services (AWS) Service Ready Program.