Hacker News new | ask | show | jobs
by alfiedotwtf 7 days ago
Wasn’t it already obvious given the awfully familiar parameter numbers?
1 comments

That only tells what base architecture they used, but fine tuning does not increase the number of weights, it just adapts the weights to improve better on a fine tuning dataset- something they claimed they had done