There are common threads though. LLMs do terribly in certain areas. They also do terribly when not supervised well.