Show HN: Agent Arena – Test How Manipulation-Proof Your AI Agent Is
via news.ycombinator.com
Short excerpt below. Read at the original source.
Creator here. I built Agent Arena to answer a question that kept bugging me: when AI agents browse the web autonomously, how easily can they be manipulated by hidden instructions? How it works: 1. Send your AI agent to ref.jock.pl/modern-web (looks like a harmless web dev cheat sheet) 2. Ask it to summarize the page […]