Y
Hacker News
new
|
ask
|
show
|
jobs
by
2-718-281-828
848 days ago
but then it will either overfit or you need to train it on 20 times the amount of data ...
1 comments
Buttons840
848 days ago
I'm taking about when using a LLM, which doesn't involve training and thus no overfitting.
link
2-718-281-828
847 days ago
for an llm to exhibit a verbal relationship between counting and tokens you have to train it on that. maybe you mean something like a plugin or extension but that's something else and has nothing to do with llms specifically.
link