Our evaluation of Claude Mythos Preview’s cyber capabilities

codeinabox@programming.dev · 13 days ago

Our evaluation of Claude Mythos Preview’s cyber capabilities

onlinepersona@programming.dev · 13 days ago

It’s hacking stuff but at what cost? Can I use it to direct an agent at my self-hosted stuff and see how secure (or insecure it is)? Or will that set me back thousands and thus make it only feasible/worth it for the rich and wealthy?

Our evaluation of Claude Mythos Preview’s cyber capabilities

Our evaluation of Claude Mythos Preview’s cyber capabilities

Our evaluation of Claude Mythos Preview’s cyber capabilities | AISI Work