Hacker News new | ask | show | jobs
by CjHuber 231 days ago
Took me a minute to realize this is not about Chonkie. I would be interested in how this compares to the other's semantic chunking approach
1 comments

you can read the labels this (-y) uses modernBERT and even has an eval comparison to the (-ie) in it's GitHub so you can see the improvement as tested -- although if you want to do vanilla rules based chinking for whatever reason your data needs then (-ie) is still good.