by chaxor
589 days ago
Also importantly, they do have a 'not attempted' or 'do not know' type of response, though the article doesn't really discuss how it is used. As has been the case for decades now, the 'NaN' type of answer in NLP is important, adds great capability, and is often glossed over.

They don't really describe what "success" would look like, but it seems to me that the primary goal is to minimize "incorrect" rather than to maximize "correct". The mini models would get there by maximizing "not attempted", while the larger models would have a much higher "correct". Both model sizes could then hopefully reach 90%+ "correct" when given access to external lookup tools.
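
To make the "minimize incorrect vs. maximize correct" distinction concrete, here is a rough sketch of how the three buckets could be tallied. The label names, the `grade_summary` helper, and the grades for the two models are all made up for illustration; none of it comes from the article.

```python
from collections import Counter


def grade_summary(grades):
    """Return the share of each grade in a list of per-question grades.

    Each grade is "correct", "incorrect", or "not_attempted"
    (hypothetical label names chosen for this sketch).
    """
    total = len(grades)
    if total == 0:
        raise ValueError("need at least one grade")
    counts = Counter(grades)
    attempted = counts["correct"] + counts["incorrect"]
    return {
        "correct": counts["correct"] / total,
        "incorrect": counts["incorrect"] / total,
        "not_attempted": counts["not_attempted"] / total,
        # Accuracy on only the questions the model chose to answer.
        "correct_given_attempted": (
            counts["correct"] / attempted if attempted else 0.0
        ),
    }


# Invented grades for two hypothetical models over 10 questions each.
small = ["correct"] * 2 + ["incorrect"] * 1 + ["not_attempted"] * 7
large = ["correct"] * 6 + ["incorrect"] * 3 + ["not_attempted"] * 1

for name, grades in [("small", small), ("large", large)]:
    print(name, grade_summary(grades))
```

In this invented example the small model has far fewer "correct" answers but also a lower "incorrect" share, because it abstains on most questions. That is the trade-off I mean: a model can do well on "minimize incorrect" without doing well on "maximize correct".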