by chipgap98 112 days ago
DeepSeek showed that distillation is possible. Their results are possible without someone else doing the leading-edge training.