Microsoft's AI outshines Anthropic's Mythos, setting new cybersecurity standards

Microsoft's AI outshines Anthropic's Mythos, setting new cybersecurity standards

Microsoft's MDASH scored 88.4% on the CyberGym benchmark, showcasing a leap in AI vulnerability detection with 16 new flaws identified, including critical Windows exploits.

NeboAI I summarize the news with data, figures and context
IN 30 SECONDS

IN 1 SENTENCE

SENTIMENT
Neutral

𒀭
NeboAI is working, please wait...
Preparing detailed analysis
Quick summary completed
Extracting data, figures and quotes...
Identifying key players and context
DETAILED ANALYSIS
SHARE

NeboAI produces automated editions of journalistic texts in the form of summaries and analyses. Its experimental results are based on artificial intelligence. As an AI edition, texts may occasionally contain errors, omissions, incorrect data relationships and other unforeseen inaccuracies. We recommend verifying the content.

Microsoft's new AI system, codenamed MDASH, has achieved a score of 88.45% on the CyberGym benchmark, outperforming Anthropic's Mythos, which scored 83.1%. This benchmark, developed by researchers at UC Berkeley, evaluates AI systems on their ability to identify real-world software vulnerabilities across a set of tasks derived from open-source projects.

Launched this week, MDASH utilizes over 100 specialized AI agents that collaborate within a multi-model framework to detect software vulnerabilities. The system was unveiled alongside the identification of 16 new vulnerabilities in Windows, including four critical flaws that were addressed during this month’s Patch Tuesday.

Unlike Anthropic's single-model Mythos, which has faced scrutiny for its vulnerability detection capabilities, MDASH operates through a structured process where agents scan code, validate findings, and simulate attacks to confirm the presence of bugs. The scores generated in the CyberGym benchmark are self-reported by the respective companies and have not been independently verified.

Want to read the full article? Access the original article with all the details.
Read Original Article
TL;DR

This article is an original summary for informational purposes. Image credits and full coverage at the original source. · View Content Policy

Editorial
Editorial Staff

Our editorial team works around the clock to bring you the latest tech news, trends, and insights from the industry. We cover everything from artificial intelligence breakthroughs to startup funding rounds, gadget launches, and cybersecurity threats. Our mission is to keep you informed with accurate, timely, and relevant technology coverage.

Press Enter to search or ESC to close