| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by PaulHoule 809 days ago

It is an active research topic

https://arxiv.org/pdf/1909.02803.pdf

Note that the “fine tuning” and “alignment” (RLHF) stages are much shorter than the early training so technically savvy people can already customize models.