Hacker News
by TuringTest, 1186 days ago
It's larger, but there are fewer parameters to train for your specific use case, since you train only the small matrix while the original ones remain unaltered.
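
To put rough numbers on that, here is a minimal sketch. It assumes the "small matrix" is a low-rank add-on stored as two thin factors next to a frozen weight matrix, which is my reading of the comment rather than something it spells out. The function name `param_counts` and the sizes (4096 x 4096, rank 8) are made up for illustration.

```python
def param_counts(d_in: int, d_out: int, rank: int) -> tuple[int, int]:
    """Return (frozen, trainable) parameter counts for one layer.

    Assumption: the original d_out x d_in weight matrix stays frozen,
    and the trainable add-on is stored as two thin factors of shapes
    d_out x rank and rank x d_in.
    """
    frozen = d_out * d_in
    trainable = rank * (d_out + d_in)
    return frozen, trainable


# Hypothetical sizes, chosen only to show the arithmetic.
frozen, trainable = param_counts(d_in=4096, d_out=4096, rank=8)
total = frozen + trainable

print(f"frozen:    {frozen:,}")
print(f"trainable: {trainable:,}")
print(f"total:     {total:,}")
print(f"trainable share of total: {trainable / total:.2%}")
```

Under these assumptions the total stored is larger than the original layer alone, but only a small fraction of it receives gradient updates.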