Anthropic built Mythos to be unreachable: a model so capable at breaking into computer systems that the company said it would never be released publicly. Someone got in using a credential stolen from Mercor, an AI staffing firm, combined with an educated guess about where the model was hosted. The entry point was a data dump from a staffing platform breach, not a sophisticated exploit.
Sixteen days after Bloomberg first reported the unauthorized access, Anthropic has declined to say whether other models hosted on the same infrastructure were also accessed, whether the contractor whose credentials were used had been notified their password was circulating in a March data dump, or what specific security requirements applied to accounts with access to unreleased frontier models.
The access method was not complex. According to The Verge, the group obtained a credential from the March breach of Mercor, an AI staffing platform Anthropic uses to hire contractors, then guessed the URL where Mythos was hosted using details from the same breach. No zero-day vulnerabilities were involved. The breach was low-tech: a leaked password from the Mercor dump and inside knowledge of where to point it.
Anthropic presented Mythos as requiring coordinated international action to prevent the model from destabilizing global cybersecurity, calling it a watershed moment that would not be made generally available. The UK AI Security Institute warned that Mythos was a step up from previous models in the cyber threat it posed. The agency also found it could complete expert-level security tasks 73 percent of the time and was the first to solve a 32-step corporate network intrusion simulation, clearing three of ten attempts. Mozilla used early access to find 271 vulnerabilities in Firefox before its release. But less than 24 hours after Anthropic announced these capabilities, unauthorized users were already inside.
What comes next is a test of whether the Glasswing controlled-release model can survive a breach accomplished with a staffing-firm password dump and an educated URL guess. Anthropic has sixteen days of unanswered questions. The next one may be whether the model that was supposed to be unreachable stayed that way.