Hacker News new | ask | show | jobs
by mcswell 1957 days ago
I've added this to my bookmarks, thanks!

I haven't looked at the code, but I'm guessing it indexes all the Unicode names of characters, and allows lookup by any substrings of those names, right? Maybe using a suffix array. But it would be nice if there were wildcards. Regular expressions would IMO be overkill, but s.t. like Unix globs might be nice, e.g. "a * acute" to find "LATIN CAPITAL LETTER A WITH ACUTE" or "LATIN SMALL LETTER A WITH ACUTE.

1 comments

Actually, it supports regex!

I detail this a bit more in the readme, but it supports general JS notation for certain things.

Surround your search with /these/ to perform a regex match on the character. To match on the name instead you can use the advanced search tab which is accessed via the button on the top left.

I may also add glob notation though, since that sounds like a pretty valid use case.

Got it, thanks! works well (and as others have mentioned, it's fast)