Microsoft's AI outshines Anthropic's Mythos, setting new cybersecurity standards

Microsoft's MDASH scored 88.4% on the CyberGym benchmark, showcasing a leap in AI vulnerability detection with 16 new flaws identified, including critical Windows exploits.

Editorial Staff

1 month ago 1 min read

Microsoft's new AI system, codenamed MDASH, has achieved a score of 88.45% on the CyberGym benchmark, outperforming Anthropic's Mythos, which scored 83.1%. This benchmark, developed by researchers at UC Berkeley, evaluates AI systems on their ability to identify real-world software vulnerabilities across a set of tasks derived from open-source projects.

Launched this week, MDASH utilizes over 100 specialized AI agents that collaborate within a multi-model framework to detect software vulnerabilities. The system was unveiled alongside the identification of 16 new vulnerabilities in Windows, including four critical flaws that were addressed during this month’s Patch Tuesday.

Unlike Anthropic's single-model Mythos, which has faced scrutiny for its vulnerability detection capabilities, MDASH operates through a structured process where agents scan code, validate findings, and simulate attacks to confirm the presence of bugs. The scores generated in the CyberGym benchmark are self-reported by the respective companies and have not been independently verified.

Related Articles

Scattered Spider Hackers Face Serious Consequences as Trial Kicks Off Today

Cybersecurity Industry Faces Growing Threats as Klue Hack Reveals Vulnerabilities

Members at Risk: Dialog Faces Fallout from Major Security Breach Due to Website Error

LastPass users face heightened security risks as data breach exposes sensitive information

LastPass breach exposes customer support data, raising security concerns for users

Witness challenges key evidence in El-Rufai phone-tapping case, raising doubts about hacking claims

Data breach from Klue hack exposes sensitive information at major cybersecurity firms

Android's June 2026 Updates Promise Enhanced Security Features for Users

Criminal Hacking Group's Guilty Pleas Expose £39M TfL Theft Scheme

Share article