|
|
|
|
|
by prometheon1
565 days ago
|
|
I don't know if this is being done already, but couldn't we add some training data to teach the LLM how to spell? We also teach kids what each letter means and how they combine into words. Maybe we can do this with tokens as well? E.g.: Token 145 (ar) = Token 236 (a) + Token 976 (r) Repeat many times with different combinations and different words? |
|
Sure, but then we would lose a benchmark to measure progress of emergent behavior.
The goal is not to add one capability at a time by hand - because this doesn’t scale and we would never finish. The goal is that it picks up new capabilities automatically, all on its own.