The State of Digital Quality in AI 2025: Disconnect Between AI Investment and Testing

April 09, 2025

Rob Mason
Applause

GenAI technologies continue to create rapid transformation across industries. My organization, Applause, surveyed more than 4,400 independent software developers, QA professionals and consumers globally for our third annual State of Digital Quality in AI report to explore the latest use cases, tools, challenges and user experiences with GenAI. The findings highlight that while investment and use of AI continue to climb, the adoption of essential quality assurance (QA) and testing practices for AI is not keeping pace.

Software Development and QA with AI

Leveraging AI throughout the software development lifecycle (SDLC) is bringing businesses a competitive advantage. Still, some organizations are slow to adopt this approach, and many are not altering testing practices to account for AI:

■ More than half of software professionals surveyed said GenAI tools improve productivity significantly, with 25% saying it brings a boost of 25-49%, and 27% saying it boosts productivity by 50-74%.

■ 23% of software professionals say their integrated development environment (IDE) lacks GenAI tools (such as GitHub Copilot) and 16% don't know if they have AI tools embedded in their IDE.

■ Red teaming, a best practice for mitigating the risks of bias, toxicity, and inaccuracy in AI, is only used by 33% of respondents.

■ The top AI testing practices involving people include prompt and response grading (61%), UX testing (57%), and accessibility testing (54%).

■ 41% of developers and QA professionals said they lean on domain experts for AI training.

While the productivity benefits that come from using AI are clear for the majority of software professionals, the best practices for testing AI are often not being incorporated in tandem.

AI Investment for Customer Experience

Organizations are investing in AI to improve customer experiences and reduce operational costs. Despite this, flaws and bugs are still reaching consumers:

■ More than 70% of developers and QA professionals said their organization is developing AI features and applications. Chatbots and customer support tools are the most popular AI applications being built at 55%, and 19% have started to build AI agents.

■ 65% of respondents reported issues using GenAI in the past three months. The top issues were that the AI responses:
- Lacked detail (40%)
- Misunderstood prompts (38%)
- Showed bias (35%)
- Showed hallucinations (32%)
- Were clearly incorrect (23%)
- Included offensive content (17%)

■ Only 20% of respondents said the GenAI tools they use understand their questions and deliver helpful responses every time.

While organizations are investing in AI with the goal of improving customer experiences, that goal is not always being met. Without incorporating proper testing techniques like red teaming and training models with diverse datasets, AI continues to yield flawed and inaccurate results. There is clearly plenty of room for improvement.

Additional AI Findings

Some additional interesting findings from this year's survey include:

■ The favorite AI tools from our 2024 survey are still the favorites in 2025, with 37% of respondents preferring GitHub Copilot and 34% preferring OpenAI Codex.

■ 78% of users want their AI tools to have multimodal functionality, or the ability to interpret multiple types of media — an increase of 16% from last year.

The results of this year's survey bring to light the disconnect between building and adopting AI applications and testing them. With so much investment and emphasis on leveraging AI to improve operational efficiency and enhance customer experience, it is critical to incorporate end-to-end testing best practices too. On top of the rise in investment and adoption, agentic AI is contributing to the technology's rapid evolution. Organizations that don't account and budget for adequate AI testing put their AI investments and brands at risk.

Rob Mason is CTO of Applause

Industry News

Check Point Software Technologies Named One of America's Best Cybersecurity Companies by Newsweek and Statista

May 22, 2025

Check Point® Software Technologies Ltd.(link is external) has been recognized on Newsweek’s 2025 list of America’s Best Cybersecurity Companies(link is external).

Red Hat Introduces AI-Powered Management and Extends Container-Native Reach for Red Hat Enterprise Linux

May 22, 2025

Red Hat announced enhanced features to manage Red Hat Enterprise Linux.

StackHawk Raises $12 Million in Strategic Funding

May 22, 2025

StackHawk has taken on $12 Million in additional funding from Sapphire and Costanoa Ventures to help security teams keep up with the pace of AI-driven development.

Red Hat Introduces Cloud-Optimized Red Hat Enterprise Linux

May 21, 2025

Red Hat announced jointly-engineered, integrated and supported images for Red Hat Enterprise Linux across Amazon Web Services (AWS), Google Cloud and Microsoft Azure.

Komodor Integrates with Internal Developer Portals

May 21, 2025

Komodor announced the integration of the Komodor platform with Internal Developer Portals (IDPs), starting with built-in support for Backstage and Port.

Operant Launches Woodpecker

May 21, 2025

Operant AI announced Woodpecker, an open-source, automated red teaming engine, that will make advanced security testing accessible to organizations of all sizes.

Shopify Summer '25 Edition Released

May 21, 2025

As part of Summer '25 Edition, Shopify is rolling out new tools and features designed specifically for developers.

Lenses.io Releases Suite of AI Agents

May 21, 2025

Lenses.io announced the release of a suite of AI agents that can radically improve developer productivity.

Google Announces New Developer Tools

May 20, 2025

Google unveiled a significant wave of advancements designed to supercharge how developers build and scale AI applications – from early-stage experimentation right through to large-scale deployment.

Red Hat Advanced Developer Suite Introduced

May 20, 2025

Red Hat announced Red Hat Advanced Developer Suite, a new addition to Red Hat OpenShift, the hybrid cloud application platform powered by Kubernetes, designed to improve developer productivity and application security with enhancements to speed the adoption of Red Hat AI technologies.

Perforce Intelligence Introduced

May 20, 2025

Perforce Software announced Perforce Intelligence, a blueprint to embed AI across its product lines and connect its AI with platforms and tools across the DevOps lifecycle.

CloudBees Unify Introduced

May 20, 2025

CloudBees announced CloudBees Unify, a strategic leap forward in how enterprises manage software delivery at scale, shifting from offering standalone DevOps tools to delivering a comprehensive, modular solution for today’s most complex, hybrid software environments.

Azul and JetBrains Collaborate to Enhance Runtime Performance for Kotlin Workloads

May 20, 2025

Azul and JetBrains announced a strategic technical collaboration to enhance the runtime performance and scalability of web and server-side Kotlin applications.

Docker Announces Hardened Images Catalog

May 19, 2025

Docker, Inc.® announced Docker Hardened Images (DHI), a curated catalog of security-hardened, enterprise-grade container images designed to meet today’s toughest software supply chain challenges.

GitHub Introduces Coding Agent For GitHub Copilot

May 19, 2025

GitHub announced that GitHub Copilot now includes an asynchronous coding agent, embedded directly in GitHub and accessible from VS Code—creating a powerful Agentic DevOps loop across coding environments.

DEVOPSdigest

Software Development and QA with AI

AI Investment for Customer Experience

Additional AI Findings

Industry News

On-Demand Webinars

Analyst Reports

White Papers

Media Partners

The Latest

Hot Topics

Software Development and QA with AI

AI Investment for Customer Experience

Additional AI Findings

Related Links

Industry News

Search form

On-Demand Webinars

Analyst Reports

White Papers

Media Partners

User login

The Latest

Hot Topics