by chipgap98 112 days ago
DeepSeek showed that distillation is possible. Their results are possible without someone else doing the leading-edge training.