Hacker News
by sinenomine 454 days ago
Impressive use of the reasoner CoT distillation method applied to DeepSeek R1. MIT license for the weights. Thanks, DeepSeek!