Y
Hacker News
new
|
ask
|
show
|
jobs
by
scotty79
1150 days ago
Do you know of any research that tries to take large pre-trained model and make it smaller by cutting out least activated neurons and training it a bit not to loose performance?
2 comments
sebzim4500
1150 days ago
https://arxiv.org/pdf/2301.00774.pdf
link
KRAKRISMOTT
1150 days ago
The entire field of ML distillation.
link