Find URI in text: Difference between revisions

Content added Content deleted
m (added note on what IRIs are)
Line 2: Line 2:
Write a function to search plain text for URIs or IRIs.
Write a function to search plain text for URIs or IRIs.


The function should return a list of URIs or IRIs found in the text.
The function should return a list of URIs or IRIs found in the text.


The definition of a URI is given in RFC 3986.
The definition of a URI is given in RFC 3986.
Line 8: Line 8:


For searching URIs in particular "Appendix C. Delimiting a URI in Context" is noteworthy.
For searching URIs in particular "Appendix C. Delimiting a URI in Context" is noteworthy.

The abbreviation IRI isn't as well known as URI and the short description is that an IRI is just an alternate form of a URI that supports Internationalization and hence Unicode. While many specifications support both form this isn't universal.


Consider the following issues:
Consider the following issues: