by littlestymaar 490 days ago
Submit a bunch of prompts to DeepSeek R1 (a few tens of thousands), and then do a full fine-tuning of the target model on the resulting prompt/response pairs.
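A rough sketch of the data-collection half, in case it helps. Everything named here is an assumption for illustration: `fake_teacher` is a hypothetical stand-in for whatever call you use to query R1, and the `prompt`/`response` field names are just one common JSONL layout, not something a particular trainer requires.

```python
import json
from typing import Callable, Dict, Iterable, List


def build_distillation_set(
    prompts: Iterable[str], teacher: Callable[[str], str]
) -> List[Dict[str, str]]:
    """Query the teacher once per prompt and collect prompt/response pairs."""
    pairs = []
    for prompt in prompts:
        response = teacher(prompt)
        # Skip empty or whitespace-only generations so they don't pollute training.
        if response.strip():
            pairs.append({"prompt": prompt, "response": response})
    return pairs


def to_jsonl(pairs: List[Dict[str, str]]) -> str:
    """Serialize the pairs as one JSON object per line."""
    return "\n".join(json.dumps(p, ensure_ascii=False) for p in pairs)


def fake_teacher(prompt: str) -> str:
    # Hypothetical placeholder: in practice this would call the teacher model.
    return f"<think>reasoning about: {prompt}</think> final answer"


if __name__ == "__main__":
    dataset = build_distillation_set(["What is 2 + 2?", "Name a prime."], fake_teacher)
    print(to_jsonl(dataset))
```

The resulting JSONL would then be fed to a supervised fine-tuning run on the target model; that part depends on your training stack, so it is left out here.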