|
|
|
|
|
by uh_uh
442 days ago
|
|
Examples of emergence: 1. Multi-step reasoning with backtracking when DeepSeek R1 was trained via GRPO. 2. Translation of languages they haven't even seen via in-context learning. 3. Arithmetic: heavily correlated with model size, but it does appear. I could go on. Albeit it's not an LLM, but a deep learning model trained via RL, would you say that AlphaZero's move 37 also doesn't count as emergence and the model has no understanding of Go? |
|