|
|
|
|
|
by ACCount37
207 days ago
|
|
RL is very important - because while it's inefficient, and sucks at creating entirely new behaviors or features in LLMs, it excels at bringing existing features together and tuning them to perform well. It's a bit like LLM glue. The glue isn't the main material - but it's the one that holds it all together. |
|