|
|
|
|
|
by ethhics
2985 days ago
|
|
Perhaps I’m missing the question, but there is a regular expression for matching a URI. Remove the leading carat and it can match anywhere in a text. https://tools.ietf.org/html/rfc3986#appendix-B Edit: I see it's not quite that simple. However, I still think that with some stricter matching requirements this could work. |
|
The URI language is of course regular, so it would be possible to construct a regexp that matches only URIs. But naively applying such regexp wouldn't work in practice, because many punctuation characters are allowed in URIs. For example, single quotes are allowed, so in this Python code the regexp would match too much: