Langsmart Publishes Industry’s First p95 Semantic Cache Benchmarks for On-Premises AI Gateway, Challenges Market: “Show Me the p95”

Testing Confirms 10.2x Faster Response Times, Exceeding Cloud-Hosted Alternatives

SAN JOSE, Calif.: NVIDIA GTC 2026 - Langsmart, the enterprise AI governance company, today announced the successful completion of a rigorous enterprise evaluation with a Fortune 200 financial institution. The testing confirms that Langsmart’s Smartflow platform delivers a 10.2x speedup in response times, achieving sub-300ms latency on standard, low-resource hardware.

As enterprise adoption of AI gateways accelerates – with analysts projecting 70% of engineering teams will use them by 2028 – the industry has struggled with a lack of standardized performance data. Langsmart’s latest results provide a transparent blueprint for organizations requiring high-performance AI governance within strict on-premises and air-gapped environments.

Enterprise Evaluation: Performance Under Pressure

The evaluation focused on real-world financial services workloads, prioritizing reliability and speed within a secure infrastructure. Unlike cloud-based gateways that require data to leave the perimeter, Smartflow was deployed as a Docker container on a modest server with 4 vCPUs and 8 GB of memory.

Key performance milestones included:

  • 10.2x Response Speedup – Response time for requests served from the cache fell from 2.2 seconds to just 220 milliseconds.
  • Sub-300ms p95 Latency – The system maintained a 285ms p95 latency on semantic cache hits, comfortably meeting the sub-500ms Service Level Objectives (SLOs) required by global financial institutions.
  • High-Accuracy Cache Hits – Achieved a 40–50% hit rate at a 0.95 similarity threshold, with 100% of tested workloads showing measurable improvement.
  • Rigorous Reliability – Smartflow passed 24 out of 24 automated health, latency, and threshold checks.
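
The release does not describe how Smartflow implements its semantic cache. As a rough illustration of the general technique behind a "hit at a 0.95 similarity threshold", the sketch below assumes prompts have already been turned into embedding vectors and compares them by cosine similarity. The class name SemanticCache, its put and get methods, and the toy three-dimensional vectors are all hypothetical and chosen only for illustration.

```python
import math

SIMILARITY_THRESHOLD = 0.95  # threshold quoted in the release


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical in-memory cache keyed by embedding similarity."""

    def __init__(self, threshold=SIMILARITY_THRESHOLD):
        self.threshold = threshold
        self.entries = []  # list of (embedding, response) pairs

    def put(self, embedding, response):
        self.entries.append((list(embedding), response))

    def get(self, embedding):
        """Return the closest cached response, or None on a miss."""
        best_score, best_response = 0.0, None
        for cached_embedding, response in self.entries:
            score = cosine_similarity(embedding, cached_embedding)
            if score > best_score:
                best_score, best_response = score, response
        # Only count it as a hit if the best match clears the threshold.
        if best_score >= self.threshold:
            return best_response
        return None


# Toy vectors standing in for real prompt embeddings (assumption).
cache = SemanticCache()
cache.put([1.0, 0.0, 0.0], "cached answer")
near_hit = cache.get([0.99, 0.05, 0.0])  # nearly the same direction
miss = cache.get([0.7, 0.7, 0.0])        # similarity about 0.71
```

In this sketch a higher threshold trades hit rate for accuracy: fewer prompts qualify as matches, but those that do are closer in meaning to the cached one. A production system would typically replace the linear scan with a vector index.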

“For banking, insurance, and healthcare, routing prompts and model responses through a third-party cloud is a liability,” said Craig Alberino, Founder and CEO of Langsmart. “Smartflow eliminates that risk by deploying entirely within the client’s network, delivering performance that actually exceeds cloud-hosted alternatives.”

Raising the Industry Standard: "Show Me the p95"

While the evaluation highlights Smartflow’s technical achievements, it also exposes a critical transparency gap in the AI gateway market. Langsmart’s research found that while many vendors promise efficiency, none currently publish p95 or p99 latency data – the metrics most critical for production-grade enterprise stability.
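
For readers less familiar with tail-latency metrics, the sketch below shows one common way to compute p95 and p99 from raw latency samples, the nearest-rank method. It is not taken from Langsmart's methodology, which may use a different percentile definition; the function name percentile and the sample values are made up for illustration and are not benchmark data.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: smallest value with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Illustrative latencies in milliseconds (invented, not measured):
# 18 fast requests, one slower request, one severe outlier.
latencies_ms = [100] * 18 + [300, 2000]

p50 = percentile(latencies_ms, 50)
p95 = percentile(latencies_ms, 95)
p99 = percentile(latencies_ms, 99)
mean = sum(latencies_ms) / len(latencies_ms)
```

With these invented samples the median is 100 ms and the mean is 205 ms, while p95 is 300 ms and p99 is 2000 ms. An average hides the slow requests that the tail percentiles expose, which is why service level objectives are usually written against p95 or p99.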

“Enterprise buyers deserve to see real numbers on real hardware, not marketing claims,” said Alberino. “We are calling on all AI gateway vendors to follow our lead and publish standardized benchmarks. If you’re providing enterprise infrastructure, show me the p95.”

Langsmart’s push for transparency aims to provide CISOs and CTOs with the empirical data needed to evaluate AI governance tools effectively, ensuring that security does not come at the cost of performance.

For the full benchmarking methodology and results, visit langsmart.ai/blog/show-me-the-p95.

About Langsmart
Langsmart is the enterprise AI governance company building Smartflow – the on-premises AI firewall, gateway, and governance control plane for regulated industries. Smartflow enables financial services, healthcare, and insurance organizations to govern AI model traffic at the network layer without data leaving their infrastructure. Langsmart is headquartered in the New York City metro area with teams in Connecticut and Austin, TX.

Source: Business Wire
