Hacker News new | ask | show | jobs
by PaulHoule 809 days ago
It is an active research topic

https://arxiv.org/pdf/1909.02803.pdf

Note that the “fine tuning” and “alignment” (RLHF) stages are much shorter than the early training so technically savvy people can already customize models.