| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by iachimoe 1881 days ago
	One usecase where Lucene’s tokenizing approach tends to work less well than something like grep is when for some reason we want to query for a substring of a token, e.g. if the text is “I walked through the town” and I want to search for “oug”. Does lmgrep offer a performant solution for this kind of case, or would it be a situation where it’s better sticking with regular grep?

1 comments

Siira 1869 days ago

Opened an issue and asked this: https://github.com/dainiusjocas/lucene-grep/issues/99