Make Trust Irrelevant: A Gamer’s Take on Agentic AI Safety

via news.ycombinator.com

Short excerpt below. Read at the original source.

I wrote a short position paper arguing that current agentic AI safety failures are the confused deputy problem on repeat. We are handing agents ambient authority and trying to contain it with soft constraints like prompts and userland wrappers. My take: you need hard, reduce-only authority enforced at a real boundary (kernel control plane class), […]

Read at Source