The AI Security Institute (AISI) conducted evaluations of Anthropic’s Claude Mythos Preview (announced on 7th April) to assess its cybersecurity capabilities. Our results show that Mythos Preview represents a step up over previous frontier models in a landscape where cyber performance was already rapidly improving.
It’s hacking stuff, but at what cost? Can I use it to direct an agent at my self-hosted stuff and see how secure (or insecure) it is? Or will that set me back thousands, and thus make it feasible and worth it only for the wealthy?



