Hacker News
by
Jax_Hax
959 days ago
There are RNN-based language models, but LLMs usually use the Transformer architecture, with a bit of human feedback on top (RLHF), which sort of uses reinforcement learning, like AlphaGo does.