|
|
|
|
|
by ALittleLight
15 days ago
|
|
The paper says the professors have a median of 200 comparisons each. It also says they only used 2 models because using more models would require more comparisons and they selected Google models because Google was branded/advertised as being education focused. When you see other models show up elsewhere, that's because they extended the main idea to other models but using LLMs to judge instead of human professors. |
|
But is it a surprise law professors aren't great statisticians?