Quesma Explores Novel AI's Security Capabilities Against Supply-Chain Attacks

Built with world-class reverse engineer Michał "Redford" Kowalczyk, this open-source benchmark has sparked excitement among security experts, opening a new frontier in binary analysis.

WARSAW, Poland: Quesma, Inc. announced BinaryAudit, an independent benchmark that tests whether AI can find hidden threats in software binaries before they cause damage. The results show both promise and limitations: while AI can detect some threats, even the best-performing model, Claude Opus 4.6, succeeded only 49% of the time and frequently flagged safe software as dangerous.

Supply-chain attacks are already causing real-world damage. State-sponsored actors recently hijacked Notepad++, replacing legitimate binaries with infected ones. Shai Hulud 2.0 compromised thousands of organizations, including Fortune 500 companies and governments, and stole credentials. In the XZ Utils case, a long-term contributor legitimately gained ownership access and then used it to insert malicious code. Security weaknesses can also originate from vendors themselves, as with manufacturer-planted code that disabled trains and hardcoded credentials in Cisco devices. These public cases are only a fraction of what exists.

Traditional binary reverse engineering is a last-resort method. It’s performed by a small pool of specialists, typically only after a breach or major incident. AI has the potential to transform this reactive approach into a proactive layer of defense, making it feasible to inspect software at any point: before purchase, before deployment, during updates, or years after release. This could change how organizations approach supply-chain security, turning what was once an emergency response tool into a preventive safeguard.

“We were genuinely surprised that today’s LLMs can detect malicious code at all. At current performance levels, it’s an assistant, not a solution,” said Jacek Migdał, CEO of Quesma. “AI binary analysis could be a new layer of defence in supply-chain security. We hope new AI models released in the next 1-2 years will make binary analysis go mainstream. BinaryAudit helps to track and encourage progress in this field.”

BinaryAudit is available today at https://quesma.com/benchmarks/binaryaudit/.

ABOUT QUESMA:

Quesma is a technology company that evaluates and tests advanced AI models. It creates benchmarks that measure how frontier LLMs perform across critical domains such as DevOps, security, and database migrations. Quesma is backed by Heartcore Capital, Inovo, Firestreak Ventures, and several angels, including Christian Beedgen, co-founder of Sumo Logic. For more information, visit www.quesma.com or follow Quesma on LinkedIn.

Source: Business Wire
