AI models know when they’re being tested – and change their behavior, research shows
Short excerpt below. Click through to read at the original source.
OpenAI and Apollo Research tried to stop models from lying – and discovered something else altogether.