InnovationOpenLab

Google DeepMind Features Hirundo’s Security-Hardened Gemma 4 Model – Outperforms LLMs 170x Its Size on Security

TEL AVIV, Israel: Google DeepMind has featured Hirundo’s security-hardened variant of Gemma 4 in its Gemmaverse – the official showcase for the Gemma open-model ecosystem. The feature validates Hirundo’s weight-level machine unlearning approach as a production-grade solution to one of the most persistent vulnerabilities in enterprise AI deployments: prompt injection attacks.

The hardened model is built on Google’s Gemma 4 E4B instruction-tuned base. At 4 billion parameters, it outperforms models more than 170 times its size on prompt injection resistance – including DeepSeek V3.2-Exp (685B), GPT-OSS-120B, and Qwen3-235B – while preserving full utility across standard benchmarks.

AI security is a behavioral problem, not a size problem

Prompt injection – where adversarial inputs manipulate a model into overriding its system instructions – remains one of the most damaging attack vectors in production large language model (LLM) deployments. The prevailing assumption has been that larger models are inherently more robust. Hirundo’s results directly contradict that assumption.

Rather than applying external filters or inference-time guardrails, Hirundo’s platform targets and removes the specific model weights responsible for susceptibility to adversarial manipulation. The model effectively forgets the behaviors that enable attacks – at the weight level, not the prompt level. Those behaviors are usually tied to instruction following, a desired trait of LLMs; even so, Hirundo’s technology preserved general instruction following while mitigating prompt injection susceptibility.

The results against industry-leading open-weight models, benchmarked using Meta’s PurpleLlama CyberSecEval dataset:

  • DeepSeek V3.2-Exp (685B): 73.33% attack success rate – 15.6x worse than Hirundo’s model
  • GPT-OSS-120B: more than 3x Hirundo’s attack success rate
  • Qwen3-235B: 10.8x more vulnerable

Hirundo’s hardened Gemma 4 E4B achieved a 4.78% attack success rate – a 74.47% reduction versus the base model – with no measurable degradation across utility benchmarks including AIME25, LiveCodeBench, GPQA, IFBench, SCICode, AutoPatchBench, and CyberSOCEval.
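
As a rough illustration of how the figures above relate, the sketch below computes an attack success rate as the share of attack attempts that succeed, then backs out the base-model rate implied by the reported 4.78% and 74.47% figures. It assumes the 74.47% reduction is relative rather than a difference in percentage points. The function names and the toy outcome list are hypothetical and are not part of Hirundo’s tooling or of CyberSecEval.

```python
def attack_success_rate(outcomes):
    """Percentage of attack attempts that succeeded (True = injection worked)."""
    if not outcomes:
        raise ValueError("need at least one attack attempt")
    return 100.0 * sum(outcomes) / len(outcomes)


def implied_baseline(hardened_rate, relative_reduction_pct):
    """Base-model rate implied by a hardened rate and a relative reduction.

    Assumption: the reduction is relative, i.e.
    hardened = baseline * (1 - reduction / 100).
    """
    return hardened_rate / (1.0 - relative_reduction_pct / 100.0)


# Toy data: 2 successful injections out of 8 attempts (made up for illustration).
toy_outcomes = [True, False, False, True, False, False, False, False]
toy_rate = attack_success_rate(toy_outcomes)

# Figures quoted in the release: 4.78% hardened rate, 74.47% reduction.
baseline_rate = implied_baseline(4.78, 74.47)

# Parameter-count ratio behind the "more than 170 times its size" claim.
size_ratio = 685 / 4

print(f"toy attack success rate: {toy_rate:.2f}%")
print(f"implied base-model rate: {baseline_rate:.2f}%")
print(f"DeepSeek V3.2-Exp vs. 4B parameters: {size_ratio:.2f}x")
```

Under the relative-reduction assumption, the base model’s attack success rate would be roughly 18.7%; the release itself does not state this figure.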

Quote

“Prompt injection is not a prompting problem – it is a representational one. The vulnerability lives in the weights. Addressing it at the weight level is more durable and more precise than guardrails applied after the fact.”

– Prof. Em. Oded Shmueli, Chief Scientist, Hirundo; former Executive VP for Research and Dean of the Computer Science Faculty, Technion – Israel Institute of Technology

The Google DeepMind-endorsed Gemma 4 case study joins a growing portfolio of Hirundo’s publicly available debiased and security-hardened models, including variants of GPT-OSS, Qwen, NemoTron, Llama, and others – with results demonstrating up to 90%+ improvement in security robustness and fairness compared to the original base models.

Availability

Hirundo’s Gemma 4 E4B Unlearned Variant on Hugging Face:
https://huggingface.co/hirundo-io/gemma-4-E4B-it-reduced-prompt-injection

Hirundo’s Gemma 4 E4B Case Study on Google DeepMind’s Gemmaverse:
https://deepmind.google/models/gemma/gemmaverse/hirundo

About Hirundo

Hirundo is the pioneer of machine unlearning – making it possible to change what trained AI models know and how they behave. Rather than applying filters or guardrails after the fact, Hirundo’s platform operates at the weight level, enabling teams to remove sensitive data, harmful behaviors, jailbreak vulnerabilities, and bias at the core – with no measurable degradation to model utility.

The company holds nine filed US patents and is led by Ben Luria (CEO), one of the first Israeli Rhodes Scholars; Prof. Em. Oded Shmueli (Chief Scientist), former Executive VP for Research and Dean of the Computer Science Faculty at the Technion – Israel Institute of Technology; and Michael Leybovich, an award-winning researcher.

www.hirundo.io

Source: Business Wire
