StackPulse Debuts Automated Kubernetes Troubleshooting and Remediation Tools

April 28, 2021

StackPulse announced a Kubernetes-centric “operations center” initiative as a part of its Reliability platform.

With these additions, StackPulse gives organizations running Kubernetes a set of capabilities that augment their existing incident response practices. Site Reliability Engineers (SREs) can understand and investigate issues faster and deploy well-tested outage mitigation strategies, helping prevent customer-facing downtime.

Since Kubernetes is the de facto standard for running containerized applications, StackPulse wanted to create a set of code-based tools engineers could use to operationalize incident response for production Kubernetes-based applications. When an error is detected in a Kubernetes environment, StackPulse automatically executes diagnostic steps to gather information from the clusters and assists engineers in performing root-cause analysis. This automation helps them quickly identify how to mitigate and resolve an issue.

Additionally, StackPulse has released more than a dozen playbooks built by SRE experts that remediate common Kubernetes problems. Using the StackPulse platform to automate these playbooks significantly reduces the time to resolution, helping teams restore services faster and meet SLOs.

“If you're serious about cloud-native, you're using Kubernetes, but it requires learning new concepts, and tuning applications alongside infrastructure for best performance,” said Leonid Belkind, CTO and Co-Founder of StackPulse. “While developer teams push to adopt K8s due to the benefits in velocity it brings, it can be hard for Ops teams or on-call developers to know how to respond to alerts, or fix issues in production. This leads to costly incidents and outages. What we’re releasing today is a set of automated tools for diagnostics, mitigation, and remediation that help any Kubernetes environment operate with the best practices of planet-scale Kubernetes shops.”

All the Kubernetes tools and automated diagnostics are available to teams in the same platform as StackPulse's incident response functionality so teams can communicate during outages, centralize event data, and take action to remediate. From detecting issues by correlating signals from multiple sources to enriching alerts sent to on-call teams with root cause and remediation information, StackPulse drastically decreases the customer impact of production issues, helping stop outages in their tracks.

Industry News

Opsera Announces New Patents for AI-powered, Cloud-Native Unified DevOps Platform

April 24, 2024

Opsera announced that two new patents have been issued for its Unified DevOps Platform, now totaling nine patents issued for the cloud-native DevOps Platform.

mabl Introduces Mobile Application Testing

April 23, 2024

mabl announced the addition of mobile application testing to its platform.

Spectro Cloud Achieves AWS Competency Designation

April 23, 2024

Spectro Cloud announced the achievement of a new Amazon Web Services (AWS) Competency designation.

GitLab Duo Chat Released

April 22, 2024

GitLab announced the general availability of GitLab Duo Chat.

SmartBear Integrates Stoplight's Spectral, Elements, and Prism into SwaggerHub

April 18, 2024

SmartBear announced a new version of its API design and documentation tool, SwaggerHub, integrating Stoplight’s API open source tools.

Red Hat Expands Red Hat Trusted Software Supply Chain

April 18, 2024

Red Hat announced updates to Red Hat Trusted Software Supply Chain.

Tricentis Copilot Released

April 18, 2024

Tricentis announced the latest update to the company’s AI offerings with the launch of Tricentis Copilot, a suite of solutions leveraging generative AI to enhance productivity throughout the entire testing lifecycle.

CIQ Launches Support for Upstream Kernels in Rocky Linux

April 17, 2024

CIQ launched fully supported, upstream stable kernels for Rocky Linux via the CIQ Enterprise Linux Platform, providing enhanced performance, hardware compatibility and security.

Redgate Monitor Enterprise Released

April 17, 2024

Redgate launched an enterprise version of its database monitoring tool, providing a range of new features to address the challenges of scale and complexity faced by larger organizations.

Snyk Supports Google Cloud's Gemini Code Assist

April 17, 2024

Snyk announced the expansion of its current partnership with Google Cloud to advance secure code generated by Google Cloud’s generative-AI-powered collaborator service, Gemini Code Assist.

Kong Konnect Dedicated Cloud Gateways Available on AWS

April 16, 2024

Kong announced the commercial availability of Kong Konnect Dedicated Cloud Gateways on Amazon Web Services (AWS).

Pega Infinity '24.1 Released

April 16, 2024

Pegasystems announced the general availability of Pega Infinity ’24.1™.

Sylabs Launches Singularity Containers Certification

April 16, 2024

Sylabs announced the launch of a new certification focusing on the Singularity container platform.

OpenText Cloud Editions 24.2 Announced Including OpenText DevOps Cloud and OpenText™ DevOps Aviator

April 15, 2024

OpenText™ announced Cloud Editions (CE) 24.2, including OpenText DevOps Cloud and OpenText™ DevOps Aviator.

Postman Acquires Orbit

April 15, 2024

Postman announced its acquisition of Orbit, the community growth platform for developer companies.

DEVOPSdigest
