Unicode strings: Difference between revisions

(Rename Perl 6 -> Raku, alphabetize, minor clean-up)
Line 793:
 
=={{header|langur}}==
Source code in langur is UTF-8. For clarity, identifiers are ASCII only.
 
Comments and string literals may use Unicode.
 
A string or regex literal using an "any" modifier may include any code point (without using an escape sequence). Otherwise, they are restricted to Graphic, Space, and SpacePrivate Use Area code points, and a select set of invisible spaces. The idea around the "allowed" characters is to keep source code from having hidden text or codes and to allay confusion and deception.
 
The following is an example of using the "any" modifier on a string literal.
For clarity, identifiers are ASCII only.
 
<lang langur>q:any"any code points here"</lang>
 
Indexing on a string indexes by code point. The index may be a single number, a range, or an array of such things.
1,006

edits