by thelittleone 565 days ago
Do you have a blog, by any chance? Or any recommended reading on preprocessing and data chunking strategies to improve results?
1 comment

I recently came across a project that looks promising: [WordLlama](https://github.com/dleemiller/WordLlama?tab=readme-ov-file#s...). It appears to be well-suited for semantic chunking, though I haven’t had a chance to try it out yet.
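For anyone unfamiliar with the term, here is a rough sketch of what semantic chunking generally means: split the text into sentences, compare each sentence to the one before it, and start a new chunk when the similarity drops. This is not WordLlama's API. The `embed` function is a toy bag-of-words stand-in for a real embedding model, and `semantic_chunks` and its `threshold` parameter are hypothetical names picked for illustration.

```python
import math
import re
from collections import Counter


def embed(sentence: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_chunks(text: str, threshold: float = 0.15) -> list[str]:
    """Group consecutive sentences; break where adjacent similarity < threshold.

    The threshold is an arbitrary illustrative value, not a recommended setting.
    """
    # Naive sentence split on whitespace that follows ., ! or ?
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    if not sentences:
        return []
    chunks = [[sentences[0]]]
    for prev, cur in zip(sentences, sentences[1:]):
        if cosine(embed(prev), embed(cur)) < threshold:
            chunks.append([cur])      # topic shift: start a new chunk
        else:
            chunks[-1].append(cur)    # same topic: extend the current chunk
    return [" ".join(chunk) for chunk in chunks]


if __name__ == "__main__":
    sample = (
        "Cats purr when happy. Cats sleep most of the day. "
        "Postgres stores rows in pages. Postgres pages are 8 kB by default."
    )
    for chunk in semantic_chunks(sample):
        print(chunk)
```

With a real embedding model you would swap out `embed` and tune the threshold on your own documents; the word-overlap version above only catches topic shifts that also change the vocabulary.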