Researchers show that training on “junk data” can lead to LLM “brain rot”
via llm-brain-rot.github.io
Short excerpt below. Read at the original source.
On the surface, it seems obvious that training an LLM with “high quality” data will lead to better performance than feeding it any old “low quality” junk you can find. Now, a group of researchers is attempting to quantify just how much this kind of low quality data can cause an LLM to experience effects […]