Hacker News
by
concinds
177 days ago
And it's a 4B model. I worry that nontechnical users will dramatically overestimate its accuracy and underestimate hallucinations, which makes me wonder how it could really be useful for academic research.
DGoettlich
177 days ago
Valid point. It's more of a stepping stone towards larger models. We're figuring out the best way to do this before scaling up.
spicyusername
175 days ago
If there's very little text before the internet, what would scaling up look like?