OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)
via quesma.com
Short excerpt below. Read at the original source.
Article URL: https://quesma.com/blog/introducing-otel-bench/ Comments URL: https://news.ycombinator.com/item?id=46811588 Points: 15 # Comments: 9