To elicit the blackmailing behavior from Claude Opus 4, Anthropic designed the scenario to make blackmail the last resort.

Today’s breaking news: LLM prompted to blackmail, attempts blackmail. Who woulda thought?
