Show HN: Agent Arena – Test How Manipulation-Proof Your AI Agent Is

via news.ycombinator.com

Short excerpt below. Read at the original source.

Creator here. I built Agent Arena to answer a question that kept bugging me: when AI agents browse the web autonomously, how easily can they be manipulated by hidden instructions? How it works: 1. Send your AI agent to ref.jock.pl/modern-web (looks like a harmless web dev cheat sheet) 2. Ask it to summarize the page […]

Read at Source