Recent evaluations of OpenAI’s GPT-5.5 by the AI Security Institute (AISI) have revealed alarming findings about its capability to conduct sophisticated cyberattacks. The new model successfully completed a challenging 32-step corporate network attack simulation in 20% of attempts, raising serious concerns about the risks advanced AI poses to cybersecurity.
The simulation, known as “The Last Ones” and conducted in collaboration with cybersecurity firm SpecterOps, showcased GPT-5.5’s proficiency: the model solved a complex reverse-engineering puzzle in just over ten minutes, a task that took a human expert twelve hours. AISI’s assessment found that GPT-5.5 achieved a 71.4% pass rate on the most challenging “Expert” tier, outperforming Anthropic’s Claude Mythos, at 68.6%, and the previous model, GPT-5.4, at 52.4%.
Despite these impressive results, the report also highlighted a significant vulnerability: GPT-5.5’s safety measures could be bypassed, allowing the model to generate harmful content. The flaw emerged during a six-hour red-teaming session and prompted OpenAI to revise its safety protocols, though a configuration issue prevented AISI from confirming the effectiveness of the updates. The findings arrive as the U.K. government’s Cyber Security Breaches Survey reports that 43% of businesses experienced a cyber breach in the past year.