by karterk
637 days ago
Solving the strawberry problem will probably require a model that works directly on the bytes of the text. There have been a few attempts at building this [1], but they do not work as well as models that consume pre-tokenized strings.

[1]: https://arxiv.org/abs/2106.12672
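
To illustrate the difference, here is a rough sketch of what each kind of model sees for the word "strawberry". The byte view uses plain UTF-8 encoding. The subword split and the token ids are made up for illustration (`HYPOTHETICAL_TOKENS` and `HYPOTHETICAL_VOCAB` are assumptions, not the output of any real tokenizer), so treat this as a toy, not as how any particular model works.

```python
def to_byte_ids(text: str) -> list[int]:
    """Return the UTF-8 byte values of text, one integer per byte."""
    return list(text.encode("utf-8"))


def count_letter_in_bytes(byte_ids: list[int], letter: str) -> int:
    """Count occurrences of a single ASCII letter in a byte sequence."""
    return byte_ids.count(ord(letter))


# Hypothetical subword split and ids, invented for this example only.
HYPOTHETICAL_TOKENS = ["str", "aw", "berry"]
HYPOTHETICAL_VOCAB = {"str": 1001, "aw": 1002, "berry": 1003}

byte_ids = to_byte_ids("strawberry")
token_ids = [HYPOTHETICAL_VOCAB[t] for t in HYPOTHETICAL_TOKENS]

# Byte view: every letter is its own input symbol, so counting is direct.
print(byte_ids)
print(count_letter_in_bytes(byte_ids, "r"))

# Token view: three opaque ids; the letters inside each piece are not
# visible in the input sequence itself.
print(token_ids)
```

In the byte view the sequence is longer (ten symbols instead of three), which is part of the cost of going byte-level.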