|
|
|
|
|
by zerojames
841 days ago
|
|
Another day, another story on HN that will have me down a rabbit hole (yesterday's was a three-hour tangent into ternary bit compression after the Microsoft paper) :D Your project is delightful -- thank you for sharing. I have explored this realm a bit before [0] [1], but in Python. The tool I made was for personal use, but streaming every keystroke through a network connection added a lot of unnecessary latency. I used word surprisals (entropy) to calculate the most likely candidates, and gave a boost to words from my own writing (thus, the predictive engine was "fine-tuned" on my writing). The result is a dictionary of words with their probabilities of use. This can be applied to bigrams, too. Your project has me thinking: how could that be pruned, massively, to create the smallest possible structure. Your engine feels like the answer. My use case is technical writing: you know what you want to say, including long words you have to repeat over again, but you want a quicker way of typing. [0]: https://jamesg.blog/2023/12/15/auto-write/ [1]: https://github.com/capjamesg/autowrite |
|