|
|
|
|
|
by aantthony
713 days ago
|
|
I’m not asserting why they were developed in the first place. The comment is just about which one is used, supposing that they already exist. Choosing the more stereotypical option (even if it’s only 51%) is a more efficient encoding in an LLM model. |
|