Speechmatics Achieves a World First in Bilingual Voice AI with New Arabic–English Medical Model

#MENAai--Speechmatics today launched its new Arabic–English bilingual model, a single production-ready model that handles Arabic dialects and English simultaneously. It can be deployed on-premises a...

The new industry-leading bilingual model includes the world's first Arabic–English bilingual medical model, achieving 6.3% WER on mixed-speech benchmarks and 35% fewer errors than the nearest competitor.

CAMBRIDGE, England: #MENAai--Speechmatics today launched its new Arabic–English bilingual model, a single production-ready model that handles Arabic dialects and English simultaneously. It can be deployed on-premises and on-device, supports speaker diarization and speaker focus, and runs across real-time and batch workflows.

As part of the rollout, Speechmatics introduces the world's first Arabic–English bilingual medical model: a specialized clinical variant trained on twice the vocabulary of its English Medical Model, built to ensure that patient records are always accurate and up to date.

Code-switching: going beyond monolingual AI

A doctor names a drug in English then switches back to Arabic. A Gulf contact center agent shifts registers without thinking. A finance officer moves across both languages in a single sentence. Across MENA, this is Monday morning.

Monolingual models weren't built for this. When a speaker shifts between Arabic and English mid-sentence, the model loses the thread - misattributing words, dropping terminology, or simply getting it wrong. In a contact centre or a clinical setting, that's not an edge case. It's the norm.

Our model, tailored to support both languages, resolves this issue, with speaker diarization and speaker focus ensuring every word is attributed to the right person throughout.

In new benchmarking, Speechmatics achieves a 35% lower Word Error Rate than Google on Arabic–English code-switching tasks (6.3% vs 9.7%), making it the most accurate code-switching model available

Dialect coverage that clears the field

Arabic carries distinct vocabulary, phonology, and rhythm across the region, including Gulf, Egyptian and Levantine dialects. Models trained on broadcast Modern Standard Arabic struggle the moment a real conversation starts.

Speechmatics leads major providers on Arabic-only transcription, delivering 24% lower Word Error Rate than Google (4.5% vs 5.9%) and outperforming OpenAI Whisper, AssemblyAI, Deepgram, Amazon, and Microsoft.

Built for enterprise deployment

Data sovereignty is a hard requirement across MENA, with regional data protection legislation in Saudi Arabia, the UAE, and beyond placing strict obligations on where voice data is processed and stored. Speechmatics meets this directly.

The model deploys across cloud SaaS, on-premises, and on-device, powered by NVIDIA AI infrastructure and optimized through NVIDIA Dynamo-Trito n for high-throughput, low-latency processing at scale. Sub-second latency is maintained across all deployment modes.

Real-time streaming and batch transcription run on the same model, removing the accuracy trade-off that typically comes with switching between the two. Speaker diarization, speaker focus, punctuated transcripts, and timestamped outputs are included as standard.

The world's first bilingual medical model

In clinical environments across MENA, English drug names, procedures, and dosages appear constantly inside Arabic speech. Generic models mishandle them, and those errors can land in the patient record.

Trained on twice the vocabulary of Speechmatics' English Medical Model, incorporating both English and Arabic clinical terminology, real dialect variation, and speech from actual clinical settings, the world's first Arabic–English bilingual medical model accurately transcribes ICD-10-CM codes, drug names, dosages, and clinical shorthand regardless of which language carries them. On-premises and on-device deployment make it viable for the regulated environments where clinical AI is increasingly being built across the region.

“This was critical to achieving meaningful outcomes for customers across the region who kept describing the same challenge. In a Cairo hospital or a Riyadh contact center, Arabic and English flow concurrently - the drug name arrives in English, the rest of the sentence is Arabic. Delivering significant impact meant removing that friction from voice interactions. We trained on real voices, real dialects and real clinical vocabulary - because that’s the only way to build something that truly works where it’s used.” - Katy Wigdahl, CEO, Speechmatics

"We ran extensive evaluations on complex clinical audio, including code-switching and dialect-heavy consultations common across MENA. Speechmatics' bilingual medical model was the only one that met the performance thresholds we require to maintain high-quality clinical documentation as we scale regionally. That alignment made the partnership a strong fit for our expansion." - Patrick Nguyen, Head of Engineering, MENA, Sully.ai

Both models are available now. Visit speechmatics.com for access and deployment options.

Fonte: Business Wire

Last News

RSA at Cybertech Europe 2024

Alaa Abdul Nabi, Vice President, Sales International at RSA presents the innovations the vendor brings to Cybertech as part of a passwordless vision for…

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

G11 Media's SecurityOpenLab magazine rewards excellence in cybersecurity: the best vendors based on user votes

How Austria is making its AI ecosystem grow

Always keeping an European perspective, Austria has developed a thriving AI ecosystem that now can attract talents and companies from other countries

Sparkle and Telsy test Quantum Key Distribution in practice

Successfully completing a Proof of Concept implementation in Athens, the two Italian companies prove that QKD can be easily implemented also in pre-existing…

G11 Media Networks

InnovationOpenLab is a channel of BitCity, a newspaper registered at the court of Como ,
n. 21/2007 del 11/10/2007- Registration ROC n. 15698

G11 MEDIA S.R.L. Registered office Via NUOVA VALASSINA, 4 22046 MERONE (CO) - P.IVA/C.F.03062910132 Como business register n. 03062910132 - REA n. 293834 CAPITALE SOCIALE Euro 30.000 i.v.

Emerald Announces Date for First Quarter 2026 Financial Results

StratEdge Honored with a Partner 2 Win Gold Tier Award from BAE Systems

Payward Completes Acquisition of Bitnomial, the First Fully CFTC-Licensed Crypto-Native Derivatives Stack in the US

CORRECTING and REPLACING Cardlytics Announces Timing of Its First Quarter 2026 Earnings Release

Upland Announces Commencement of New CEO Tenure and Inducement Awards

Extreme Networks Announces Investor Conferences for May and June 2026

Asana to Announce First Quarter Fiscal Year 2027 Financial Results on Thursday, May 28, 2026

Summary Notice of Proposed Settlement of Derivative Action

Speechmatics Achieves a World First in Bilingual Voice AI with New Arabic–English Medical Model

Related news

Last News

RSA at Cybertech Europe 2024

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

How Austria is making its AI ecosystem grow

Sparkle and Telsy test Quantum Key Distribution in practice

Most read

NCR Atleos Announces Date of First Quarter 2026 Earnings Results

Hitachi Digital Services Announces Strategic Partnership with Stripe to…

General Analysis Raises $10M in Seed Funding to Secure Agentic AI

Manifest OS Raises $60M to Scale the World’s First AI-Native Law Firm…

G11 Media Networks

Speechmatics Achieves a World First in Bilingual Voice AI with New Arabic–English Medical Model

Related news

Last News

Most read

Newsletter signup

G11 Media Networks