Hacker News
by astrange 769 days ago
That's called instruction tuning.

https://arxiv.org/abs/2308.10792