Decagon Introduces Duet Autopilot, the First Verified Self-Improving AI Agent for Customer Experience

Decagon, the leader in conversational AI agents for concierge customer experiences, today announced Duet Autopilot, the first agent to deliver automatic and verifiable self-improvement for CX agents. ...

SAN FRANCISCO: Decagon, the leader in conversational AI agents for concierge customer experiences, today announced Duet Autopilot, the first agent to deliver automatic and verifiable self-improvement for CX agents.

To measure Autopilot’s efficacy, Decagon also built DuetBench, the industry's first benchmark for evaluating agent self-improvement end-to-end. Against it, Duet Autopilot passed 93% of diagnostic tasks, exceeding the average human score.

"Autopilot is a shift from building agents by hand to managing agents that improve themselves," said Alan Yiu, VP of Product at Decagon. "Teams set the direction and review the work; Autopilot handles the diagnosing, testing, and editing that used to consume their week. Every fix compounds, which ultimately empowers businesses to provide their customers with a 24/7 AI concierge that gets measurably better with every interaction."

Closing the loop on agent improvement

Until now, improving an AI agent has been bottlenecked by manual work. As customer signals accumulate, teams must interpret feedback, decide on changes, test them, and ship improvements by hand. Too many cycles go into identifying and prioritizing high-impact updates, and even then, manual effort caps how much gets done. Duet Autopilot removes that constraint by acting on the full breadth of production signals.

Duet Autopilot delivers three core capabilities that work together as a continuous loop:

Automated agent improvement: Autopilot continuously translates production signals into proposed updates, acting on opportunities ranging from highest priority to small adjustments.
Self-validation: Every proposed change is tested against the original conversation that surfaced the issue, regression tests, and a curated golden set representing real customer personas and intents. If a change doesn’t pass those tests, Autopilot keeps iterating until it does.
Enterprise governance: Teams set guidance up front using brand voice, writing standards, policy preferences, and off-limits rules. Every change surfaces as a versioned update with the issues found, validation results, and exact diffs, requiring human approval before going live.

Because Autopilot is itself a Decagon agent, it is subject to its own improvement loop. Every reviewer correction and successful outcome feeds back into how it operates, so each cycle produces higher-quality updates than the last. This way, agent performance improves not at a fixed rate, but exponentially.

Proven in the field, formalized in the benchmark

Duet Autopilot is being validated with a cohort of enterprise customers and design partners across financial services, retail, and consumer technology, who are measuring its impact on resolution rates, escalation rates, and coverage.

“At our scale, manually reviewing conversations for errors isn't an option,” said Matt McCollum, senior manager of customer experience at Opendoor. “Decagon Autopilot frees our team to focus on decisions rather than digging through logs. It surfaces what changed, what was considered, and why. That transparency is what makes AI actually trustworthy in production.”

Furthermore, DuetBench fills a gap in how conversational AI agents are evaluated. Existing benchmarks measure whether an agent can resolve a fixed set of issues, but they don’t yet measure the improvement loop. By contrast, DuetBench measures whether Autopilot can make verifiable agent improvements, rather than producing plausible-looking changes.

Duet Autopilot is available to Decagon customers beginning today. To see how Autopilot can accelerate agent outcomes, visit decagon.ai/blog/autopilot.

About Decagon

Decagon is the leading conversational AI platform empowering every brand to deliver an AI concierge for every customer. Our technology helps enterprises like Avis Budget Group, Chime, Oura Health, 1-800-FLOWERS.COM, and Hunter Douglas deploy AI agents that power personalized, deeply satisfying interactions across voice, chat, email, SMS, and every other channel.

Headquartered in San Francisco with offices in New York City, London, Sydney, and Toronto, we’re proud to be backed by world-class investors who share our vision to help every business create the concierge experiences their customers deserve. To learn more, please visit www.decagon.ai.

Fonte: Business Wire

Last News

RSA at Cybertech Europe 2024

Alaa Abdul Nabi, Vice President, Sales International at RSA presents the innovations the vendor brings to Cybertech as part of a passwordless vision for…

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

G11 Media's SecurityOpenLab magazine rewards excellence in cybersecurity: the best vendors based on user votes

How Austria is making its AI ecosystem grow

Always keeping an European perspective, Austria has developed a thriving AI ecosystem that now can attract talents and companies from other countries

Sparkle and Telsy test Quantum Key Distribution in practice

Successfully completing a Proof of Concept implementation in Athens, the two Italian companies prove that QKD can be easily implemented also in pre-existing…

G11 Media Networks

InnovationOpenLab is a channel of BitCity, a newspaper registered at the court of Como ,
n. 21/2007 del 11/10/2007- Registration ROC n. 15698

G11 MEDIA S.R.L. Registered office Via NUOVA VALASSINA, 4 22046 MERONE (CO) - P.IVA/C.F.03062910132 Como business register n. 03062910132 - REA n. 293834 CAPITALE SOCIALE Euro 30.000 i.v.

LicenseFortress Completes SOC 2 Type 1 Examination

Zenlayer Named Equinix 2025 Global Partner of the Year

PMI Publishes World's First Global Standard for AI in Project Work as Governance Frameworks Struggle to Keep Pace

Fresha Strengthens Its Global Barbering Presence at Toronto Barbers Expo Ahead of HairCon Powered by Fresha

Ethena Selects Centrifuge as Strategic Tokenization Partner to Accelerate Institutional RWA Adoption

PubMatic and Havas Launch the First Agentic CTV Campaign in Spain for Telefónica, Achieving 18% Lower CPM Vs Target

dunnhumby Launches Retail:Vision, a Blueprint for Connected, AI-Powered Retail Decision-Making in an Age of Disruption

Pacific Defense Successfully Launches Moonraker Payload on Avalon Mission, Advancing Open Architecture in Space

Decagon Introduces Duet Autopilot, the First Verified Self-Improving AI Agent for Customer Experience

Related news

Last News

RSA at Cybertech Europe 2024

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

How Austria is making its AI ecosystem grow

Sparkle and Telsy test Quantum Key Distribution in practice

Most read

AI Moves IT Management Platforms Toward Autonomy, ISG says

K-Startup Grand Challenge 2026: Korea's Full-Cycle Launchpad for Global…

ScienceSoft to Present AI Workbench for Claims Handling at Insurance Tech…

Trio-Tech International Receives Additional $2.6 Million in Orders for…

G11 Media Networks

Decagon Introduces Duet Autopilot, the First Verified Self-Improving AI Agent for Customer Experience

Related news

Last News

Most read

Newsletter signup

G11 Media Networks