Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview
Scored 65.2% vs google’s official 47.8%, and the existing top closed source model Junie CLI’s 64.3%. Since there are a lot of reports of deliberate cheating on TerminalBench 2.0 lately (), I would like to also clarify a few things…