Anthropic Open Sources Petri: Automating Model Safety Audits with AI Agents
Anthropic's open-source AI safety auditing tool, Petri, automatically tests the behavior of complex AI models using AI agents. The tool is based on the Inspect framework developed by the UK's AISI, aiming to address the limitations of manual testing. It has been released on GitHub.