A new AI benchmark tests whether chatbots protect human wellbeing

Short excerpt below. Click through to read at the original source.

Most AI benchmarks measure intelligence and instruction-following rather than psychological safety. Humane Bench evaluates models based on core principles of human flourishing, prioritizing wellbeing, and respecting user attention.

Read at Source