by _akhe
784 days ago
Thanks for clarifying; this is exactly where I was confused. I just read about how both sentencepiece and tiktoken tokenize. Thanks for making this (in JavaScript, no less!) and putting it online! I'm going to use it in my auto-completion library (here: https://github.com/bennyschmidt/next-token-prediction/blob/m...) instead of just `.split(' ')`, as I'm pretty sure it will be more nuanced :) Awesome work!

I really love that it has zero dependencies and that you published it on npm, and I'd love to hand this part of my work off to an efficient library like this.