Project Glasswing: what Mythos showed us

2026-05-18
Grant Bourzikas
9 min read

For the last few months, we've been testing a range of security-focused LLMs on our own infrastructure. These LLMs help identify potential vulnerabilities in our own systems, so we can fix them – and they also show us what attackers are going to be able to do with the latest models.

None of these LLMs has captured more attention than Mythos Preview, from Anthropic. A few weeks ago, we were invited to use Mythos Preview as part of Project Glasswing. We soon pointed it at more than fifty of our own repositories – to see what it would find, and to see how it works.

This post shares what we observed, what the models did well and what they didn't, and how the architecture and process around them need to change so they can be used at scale.

What changed with Mythos Preview

Mythos Preview is a real step forward, and it's worth saying that plainly before getting into anything else. We've been running models against our code for a while now, and the jump from what was possible with previous general-purpose frontier models to what Mythos Preview does today is not just a refinement of what came before. It's a different kind of tool doing a different kind of work, and that makes a clean apples-to-apples comparison to earlier models difficult.

So rather than trying to benchmark Mythos Preview against general-purpose frontier models, it's more useful to describe what it can actually do. Two features stood out across the work we did with Mythos Preview:

Exploit chain construction - A real attack rarely uses one bug. It chains several small attack primitives together into a working exploit. For instance, it might turn a use-after-free bug into an arbitrary read and write primitive, hijack the control flow, and use return-oriented programming (RO
📰Cloudflare Blog — blog.cloudflare.com