Google DeepMind Features Hirundo’s Security-Hardened Gemma 4 Model – Outperforms LLMs 170x Its Size on Security

Google DeepMind has featured Hirundo’s security-hardened variant of Gemma 4 in its Gemmaverse – the official showcase for the Gemma open-model ecosystem. The feature validates Hirundo’s weight-l...

TEL AVIV, Israel: Google DeepMind has featured Hirundo’s security-hardened variant of Gemma 4 in its Gemmaverse – the official showcase for the Gemma open-model ecosystem. The feature validates Hirundo’s weight-level machine unlearning approach as a production-grade solution to one of the most persistent vulnerabilities in enterprise AI deployments: prompt injection attacks.

The hardened model is built on Google’s Gemma 4 E4B instruction-tuned base. At 4 billion parameters, it outperforms models more than 170 times its size on prompt injection resistance – including DeepSeek V3.2-Exp (685B), GPT-OSS-120B, and Qwen3-235B – while preserving full utility across standard benchmarks.

AI security is a behavioral problem, not a size problem

Prompt injection – where adversarial inputs manipulate a model into overriding its system instructions – remains one of the most damaging attack vectors in production Large Language Model (LLM) deployments. The prevailing assumption has been that larger models are inherently more robust. Hirundo’s results directly contradict that.

Rather than applying external filters or inference-time guardrails, Hirundo’s platform targets and removes the specific model weights responsible for susceptibility to adversarial manipulation. The model effectively forgets the behaviors that enable attacks – at the weight level, not the prompt level. Despite those behaviors usually being tied to instruction-following, a desired trait of LLMs, Hirundo’s technology preserved general instruction following while mitigating prompt injection susceptibility.

The results against industry-leading open-weight models, benchmarked using Meta’s PurpleLlama CyberSecEval dataset:

DeepSeek V3.2-Exp (685B): 73.33% attack success rate – 15.6x worse than Hirundo’s model
GPT-OSS-120B: more than 3x Hirundo’s attack success rate
Qwen3-235B: 10.8x more vulnerable

Hirundo’s hardened Gemma 4 E4B achieved a 4.78% attack success rate – a 74.47% reduction versus the base model – with no measurable degradation across utility benchmarks including AIME25, LiveCodeBench, GPQA, IFBench, SCICode, AutoPatchBench, and CyberSOCEval.

Quote

“Prompt injection is not a prompting problem – it is a representational one. The vulnerability lives in the weights. Addressing it at the weight level is more durable and more precise than guardrails applied after the fact.”

– Prof. Em. Oded Shmueli, Chief Scientist, Hirundo; former Executive VP for Research and Dean of the Computer Science Faculty, Technion – Israel Institute of Technology

The Google DeepMind-endorsed Gemma 4 case study joins a growing portfolio of Hirundo’s publicly available debiased and security-hardened models, including variants of GPT-OSS, Qwen, NemoTron, Llama, and others – with results demonstrating up to 90%+ improvement in security robustness and fairness compared to the original base models.

Availability

Hirundo’s Gemma 4 E4B Unlearned Variant on HuggingFace:
https://huggingface.co/hirundo-io/gemma-4-E4B-it-reduced-prompt-injection

Hirundo’s Gemma 4 E4B Case Study on Google DeepMind’s Gemmaverse:
https://deepmind.google/models/gemma/gemmaverse/hirundo

About Hirundo

Hirundo is the pioneer of machine unlearning – making it possible to change what trained AI models know and how they behave. Rather than applying filters or guardrails after the fact, Hirundo’s platform operates at the weight level, enabling teams to remove sensitive data, harmful behaviors, jailbreak vulnerabilities, and bias at the core – with no measurable degradation to model utility.

The company holds nine filed US patents and is led by Ben Luria (CEO), one of the first Israeli Rhodes Scholars; Prof. Em. Oded Shmueli (Chief Scientist), former Executive VP for Research and Dean of the Computer Science Faculty at the Technion – Israel Institute of Technology; and Michael Leybovich, an award-winning researcher.

www.hirundo.io

Fonte: Business Wire

Last News

RSA at Cybertech Europe 2024

Alaa Abdul Nabi, Vice President, Sales International at RSA presents the innovations the vendor brings to Cybertech as part of a passwordless vision for…

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

G11 Media's SecurityOpenLab magazine rewards excellence in cybersecurity: the best vendors based on user votes

How Austria is making its AI ecosystem grow

Always keeping an European perspective, Austria has developed a thriving AI ecosystem that now can attract talents and companies from other countries

Sparkle and Telsy test Quantum Key Distribution in practice

Successfully completing a Proof of Concept implementation in Athens, the two Italian companies prove that QKD can be easily implemented also in pre-existing…

G11 Media Networks

InnovationOpenLab is a channel of BitCity, a newspaper registered at the court of Como ,
n. 21/2007 del 11/10/2007- Registration ROC n. 15698

G11 MEDIA S.R.L. Registered office Via NUOVA VALASSINA, 4 22046 MERONE (CO) - P.IVA/C.F.03062910132 Como business register n. 03062910132 - REA n. 293834 CAPITALE SOCIALE Euro 30.000 i.v.

ISG Event to Explore How Enterprises Are Delivering Strategic Value From AI

Hark Raises $700M Series A at a $6B Valuation

Massive Bio and BeeKeeperAI Deploy Federated Confidential Computing to Expand Oncology Trial Access Across Atlanta’s Underserved Communities

YourStake Acquires First Affirmative and Relaunches the Advisory Firm as Formative, the Home for Modern Values-Based Advisors

Masters AI Legal Expands to Los Angeles with First Hybrid Conference as Momentum Builds Nationally

Cranium AI Acquires Aiceberg to Strengthen its End-to-End AI Security, Governance and Agentic AI Platform

Rune Technologies Joins Project Dynamis to Deliver Predictive Logistics for the Marine Corps

Rajant Health (RHI) and Chord Robotics Expand Cowbell Platform to Enable Scalable, Multi-Domain Collaborative Autonomy

Google DeepMind Features Hirundo’s Security-Hardened Gemma 4 Model – Outperforms LLMs 170x Its Size on Security

Related news

Last News

RSA at Cybertech Europe 2024

Italian Security Awards 2024: G11 Media honours the best of Italian cybersecurity

How Austria is making its AI ecosystem grow

Sparkle and Telsy test Quantum Key Distribution in practice

Most read

Former Amazon, Meta Scientists Unveil Graphon AI, the First Pre-Model…

NTT DATA Announces Intent to Acquire WinWire to Scale Enterprise AI Adoption…

SUI Group Co-Leads $15 Million Funding Round for AI Trading Lab Nof1,…

Datavault AI Provides Q1 2026 Business Update Highlighting Tokenization…

G11 Media Networks

Google DeepMind Features Hirundo’s Security-Hardened Gemma 4 Model – Outperforms LLMs 170x Its Size on Security

Related news

Last News

Most read

Newsletter signup

G11 Media Networks