|
|
|
|
|
by snyy
158 days ago
|
|
Which language are you thinking of? Ideally, how would you identify split points in this language? I suppose we've only tested this with languages that do have delimiters - Hindi, English, Spanish, and French There are two ways to control the splitting point. First is through delimiters, and the second is by setting chunk size. If you're parsing a language where chunks can't be described by either of those params, then I suppose memchunk wouldn't work. I'd be curious to see what does work though! |
|