Dual-Use

A new ICML paper finds that frontier LLMs generate harmful content at a 95% rate, not because of adversarial attacks but because they are doing their jobs correctly. The most capable models are the most vulnerable.