by numba888
492 days ago
o1 is more than just a math solver, and you cannot possibly train that much into a small model. However, smaller specialized models look like the right way to handle the world's complexity: a sort of mixture of experts, one level up. Orchestrating them will be another problem. A possible solution is a generalist model "to rule them all".