by nielsole 491 days ago
With a bit of fiddling you should be able to get the LLM to translate/summarize the thinking process. Not a 1:1 thing, but still

How would you do it?
My naive approach would be to train a seq2seq model with the hidden state as input. Not sure what would replace the supervised samples, though.
OK but what would you use as ground truth?
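
For what it's worth, here is a rough sketch of what "seq2seq with the hidden state as input" could look like structurally. Everything in it is an assumption for illustration: the `ToyDecoder` class, the tiny `VOCAB`, the dimensions, and the random weights are all hypothetical, and the hidden state is just a list of floats standing in for whatever you would capture from the LLM. It deliberately has no training loop, because the ground-truth question above is still open; it only shows how a decoder could be conditioned on a hidden state and greedily emit tokens.

```python
import math
import random

# Hypothetical toy vocabulary; index 0 doubles as the start and end token.
VOCAB = ["<eos>", "the", "model", "is", "thinking"]


def matvec(matrix, vector):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Small random weights; a real decoder would have to learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class ToyDecoder:
    """Untrained RNN decoder whose initial state is derived from an LLM hidden state."""

    def __init__(self, llm_dim, dec_dim, seed=0):
        rng = random.Random(seed)
        self.w_init = random_matrix(dec_dim, llm_dim, rng)     # LLM state -> decoder state
        self.w_rec = random_matrix(dec_dim, dec_dim, rng)      # recurrent weights
        self.w_emb = random_matrix(dec_dim, len(VOCAB), rng)   # one column per previous token
        self.w_out = random_matrix(len(VOCAB), dec_dim, rng)   # decoder state -> logits

    def decode(self, llm_hidden, max_len=8):
        # Condition the decoder on the (assumed) captured hidden state.
        state = [math.tanh(v) for v in matvec(self.w_init, llm_hidden)]
        prev = 0  # start token
        tokens = []
        for _ in range(max_len):
            rec = matvec(self.w_rec, state)
            # Mix the recurrent signal with the previous token's embedding column.
            state = [math.tanh(r + row[prev]) for r, row in zip(rec, self.w_emb)]
            logits = matvec(self.w_out, state)
            prev = max(range(len(VOCAB)), key=logits.__getitem__)  # greedy pick
            if VOCAB[prev] == "<eos>":
                break
            tokens.append(VOCAB[prev])
        return tokens


if __name__ == "__main__":
    decoder = ToyDecoder(llm_dim=4, dec_dim=3)
    # Placeholder vector; in practice this would come from the LLM.
    print(decoder.decode([0.1, -0.2, 0.3, 0.4], max_len=5))
```

With random weights the output is meaningless; making it say anything about the thinking process would need target sequences to train against, which is exactly the part nobody here has an answer for yet.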