Hacker News new | ask | show | jobs
by Jax_Hax 959 days ago
There are models trained via RNN, but LLMs usually use Transformer architecture with a bit of human feedback on top which sort of uses reinforcement learning like AlphaGo