| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by the-smug-one 1889 days ago
	Regexes are at least useful for parsing numbers and symbols. But yeah, that shouldn't be where you get stuck.

1 comments

macintux 1889 days ago

[\s,](~@|[\[\]{}()'`~^@]|"(?:\\.|[^\\"])"?|;.|[^\s\[\]{}('"`,;)])

Step 0, so I didn't get very far.

https://github.com/kanaka/mal/blob/master/process/guide.md#s...

link

kanaka 1888 days ago

It's a long regex, but it's just whitespace followed by an alternation with 5 different types of data: split-unquote, special characters, strings, comments, symbols. The string tokenizing branch is a bit complicated because it has to allow internal escaping of quotes. Early iterations of the guide didn't explain the regex in detail but the section now describes each of the regex components.

There are online tools to help visualize regex's. Here is a recent tweet including a visualization of mal's tokenizer regex: https://twitter.com/Mehulwastaken/status/1382292764834996230

link

cellularmitosis 1889 days ago

Whoa that regex is a monster. Try starting with simpler pieces and see if you get further this time around. Good luck! https://gist.github.com/cellularmitosis/75dc4aefe88438c14e94...

link

the-smug-one 1888 days ago

Well, you certainly don't need that regex to implement a Lisp.

link