Hacker News
by TuringTest 1186 days ago
It's larger, but there are fewer parameters to train for your specific use case, since you train only the small matrix while the original ones remain unaltered.
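
To put rough numbers on that, here is a minimal sketch. It assumes the "small matrix" is a low-rank pair added next to one frozen weight matrix (LoRA-style), which is my reading of the comment rather than something it spells out. The sizes `d_out`, `d_in`, and `rank` are hypothetical, picked only for illustration.

```python
# Hypothetical sizes, chosen only for illustration (not from the thread).
d_out, d_in = 4096, 4096   # shape of one frozen original weight matrix W
rank = 8                   # assumed rank of the small trainable update


def param_counts(d_out, d_in, rank):
    """Return (frozen, trainable) parameter counts for one adapted layer."""
    frozen = d_out * d_in              # W stays unaltered during fine-tuning
    trainable = rank * (d_in + d_out)  # B is d_out x rank, A is rank x d_in
    return frozen, trainable


frozen, trainable = param_counts(d_out, d_in, rank)
total = frozen + trainable  # the adapted model is larger than the original

print(f"frozen: {frozen:,}  trainable: {trainable:,}  total: {total:,}")
print(f"trainable share: {trainable / total:.2%}")
```

With these made-up sizes the total goes up slightly, but well under 1% of the parameters actually get gradient updates.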