Researchers Propose New Frameworks to Keep AI Agents from Going Rogue
As AI agents gain the ability to execute code, call APIs, and interact with external systems with minimal human oversight, two research teams have published independent frameworks aimed at making those agents safer — one by automatically refining safety rules over time, the other by placing an inviolable enforcement layer outside the agent's own control.