Hacker News new | ask | show | jobs
by atty 955 days ago
As another commenter pointed out, you can give context for the decoder. So you can feed previous chunks into the model as the context. This is how we do it for streaming, at least.