Y
Hacker News
new
|
ask
|
show
|
jobs
by
niyazpk
3114 days ago
>> We've seen ~200MB production models get down to ~4MB and not lose much precision.
Details please. What techniques are used to reduce the model size?
1 comments
hidenotslide
3113 days ago
I saw a talk on this paper a couple years ago.
https://arxiv.org/abs/1503.02531
The method is to train a smaller model on the predictions of a large model or ensemble. I'd be interested in knowing other techniques as well.
link