# Literals/String

Literals/String
You are encouraged to solve this task according to the task description, using any language you may know.

Show literal specification of characters and strings.

If supported, show how the following work:

•   verbatim strings   (quotes where escape sequences are quoted literally)
•   here-strings

Also, discuss which quotes expand variables.

Single character literals require single quotes

ch : character := 'a';

String literals use double quotes

msg : string := "hello world";
empty : string := ""; -- an empty string

The length of a string in Ada is equal to the number of characters in the string. Ada does not employ a terminating null character like C. A string can have zero length, but zero length strings are not often used. Ada's string type is a fixed length string. It cannot be extended after it is created. If you need to extend the length of a string you need to use either a bounded string, which has a pre-determined maximum length, similar to C strings, or an unbounded string which can expand or shrink to match the data it contains.

## Aime

Aime has no character representation, but it allows single quoted character constants. Their implied typed is integer.

integer c;
c = 'z';

String literals are double quoted.

text s;
s = "z";

hellox

## Axe

Character literal:

'A'

String literal:

"ABC"

Note that string literals are only null-terminated if they are assigned to a variable (e.g. Str1).

## BASIC

Traditional BASIC implementations do not use literal character notation within a string or here document notation. However, literal characters can be referenced using their character code and these can be added to strings as required. Here we use the ASCII code for doublequotes to get the characters into a string:

10 LET Q$=CHR$(34): REM DOUBLEQUOTES
20 LET D$=Q$+Q$: REM A PAIR OF DOUBLEQUOTES 30 LET S$=Q$+"THIS IS A QUOTED STRING"+Q$
40 PRINT Q$;"HELLO";Q$:REM ADD QUOTES DURING OUTPUT

### ZX Spectrum Basic

The ZX Spectrum supports the use of CHR$(34). Alternatively, it is possible to print the doublequotes, by adding an extra pair of doublequotes: 10 REM Print some quotes 20 PRINT CHR$(34)
30 REM Print some more doublequotes
40 PRINT """"
50 REM Output the word hello enclosed in doublequotes
60 PRINT """Hello"""

## BBC BASIC

Quoted (literal) strings consist of 8-bit characters and support both ANSI and UTF-8 encodings; they may not contain 'control' characters (0x00 to 0x1F). The only special character is the double-quote " which must be escaped as "". There is no special representation for a single character (it is just a string of length one). 'Here strings' are not supported.

PRINT "This is a ""quoted string"""
Output:
This is a "quoted string"

## bc

The double-quote " starts a literal string which ends at the next double-quote. Thus strings can span multiple lines and cannot contain a double-qoute (there is no escaping mechanism).

Characters are just strings of length one.

## Befunge

The double quote character (") enters a string literal mode, where ASCII values of characters encountered in the current instruction pointer direction up to the next quote are pushed onto the stack. Thus, any character may be used in a string except for a quote (ascii 34), which may be pushed using 57*1-. Note: since you are pushing the string onto a stack, you usually want to define the string in reverse order so that the first character is on top.

"gnirts">:#,[email protected]

## Bracmat

Strings of any length can always be surrounded by quotes. They must be surrounded by quotes if the string contains white space characters or one of the characters =.,|&:+*^'$_;{} or character sequences \D or \L. They must also be surrounded by quotes if they start with one or more characters from the set [~/#<>%@?!-. Inside strings the characters " and \ must be escaped with a backslash \. White space characters for carriage return, newline and tabulator can be expressed as \r, \n and \t, respectively. Escape sequences need not be enclosed in quotes. A string expression prepended with @ or % has no escape sequences: all characters except quotes are taken litterally. These are 10 examples of valid string expressions. (The last example is a multiline string.) Examples: string "string" stri\t-\tng\r\n "-10" "+10" "{a*b}" ".,|&:+*^'$_"
"[~/#<>%@?!-"
string[~/#<>%@?!-
"str; ing
termthe($adjectives*, fox) # "term" quasiliteral Strings and characters use syntax similar to Java; double and single quotes, respectively, and common backslash escapes. Quasiliterals are a user-extensible mechanism for writing any type of object "literally" in source, with "holes" for interpolation or pattern-matching. The fourth example above represents a Term object (terms are a tree language like XML or JSON), with the items from the variable adjectives spliced in. Quasiliterals can be used for strings as well. The third example above is the built-in simple interpolator, which also supports pattern matching. There is also a regular-expression quasi-pattern: ? if ("<abc,def>" =~ <@a,@b>) { [a, b] } else { null } # value: ["abc", "def"] ? if (" >abc, def< " =~ rx\W*(@a\w+)\W+(@b\w+)\W*) { [a, b] } else { null } # value: ["abc", "def"] ## Ela Ela has both characters: c = 'c' and strings: str = "Hello, world!" Both support C-style escape codes: c = '\t' str = "first line\nsecond line\nthird line" Also Ela supports verbatim strings with the following syntax: vs = <[This is a verbatim string]> ## Elena ELENA 3.2 : var c :=$65. // character
var s := "some text". // UTF-8 literal
var w := "some wide text"w. // UTF-16 literal
var s2 := "text with ""quotes"" and "$13$10"two lines".

## Elixir

### String

Strings are between double quotes; they're represented internally as utf-8 encoded bytes and support interpolation.

IO.puts "Begin String \n============"
str = "string"
str |> is_binary # true

While internally represented as a sequence of bytes, the String module splits the codepoints into strings.

str |> String.codepoints

The bytes can be accessed by appending a null byte to the string

str <> <<0>>

Strings can be evaluated using ? before a character in the command line or in a string, then evaluating the string

?a # 97
Code.eval_string("?b") # 98
Code.eval_string("?ł") # 322

### Char Lists

Char lists are simply lists of characters. Elixir will attempt to convert number values to characters if a string could be formed from the values. Char lists represent characters as single quotes and still allow for interpolation.

IO.inspect "Begin Char List \n============="
[115, 116, 114, 105, 110, 103]
ch = "hi"
'string #{ch}'

Again, since 0 cannot be rendered as a character, adding it to a char list will return the char list

'string #{ch}'++[0]

Output:

Begin String
============
"string"
true
["s", "t", "r", "i", "n", "g"]
<<115, 116, 114, 105, 110, 103, 0>>
97
98
322
Begin Char List
===============
'string'
'string hi'
[115, 116, 114, 105, 110, 103, 32, 104, 105, 0]

## Emacs Lisp

### Strings

The only string literal is a double-quote

"This is a string."

Backslash gives various special characters similar to C, such as \n for newline and \" for a literal double-quote. \\ is a literal backslash. See "Syntax for Strings" in the elisp manual.

### Characters

A character is an integer in current Emacs. (In the past character was a separate type.) ? is the read syntax.

?z    => 122
?\n => 10

See "Basic Char Syntax" in the elisp manual.

## Erlang

Erlang strings are lists containing integer values within the range of the ASCII or (depending on version and settings) Unicode characters.

"This is a string".
[$T,$h,$i,$s,$,$a,$,$s,$t,$r,$i,$n,$g,$,,$,$t,$o,$o].

Characters are represented either as literals (above) or integer values.

97 == $a. % => true With the string syntax, characters can be escaped with \. "\"The quick brown fox jumps over the lazy dog.\"". ## Forth In the interpreter: char c emit s" string" type In the compiler: : main [char] c emit s" string" type ; Strings may contain any printable character except a double quote, and may not span multiple lines. Strings are done via the word S" which parses ahead for a terminal quote. The space directly after S" is thus not included in the string. Works with: GNU Forth GNU Forth has a prefix syntax for character literals, and another string literal word S\" which allows escaped characters, similar to C. 'c emit s\" hello\nthere!" ## Fortran First Fortran (1958) did not offer any means to manipulate text except via the H (for Hollerith) code in FORMAT statements of the form nH where n was an integer that counted the exact numbers of characters following the H, any characters, that constituted the text literal. Miscounts would cause a syntax error, if you were lucky. This would be used for output to annotate the numbers, but consider the following: DIMENSION ATWT(12) PRINT 1 1 FORMAT (12HElement Name,F9.4) DO 10 I = 1,12 READ 1,ATWT(I) 10 PRINT 1,ATWT(I) END Evidently, the syntax highlighter here does not recognise the Hollerith style usage. Nor do some compilers, even if in its original home within FORMAT statements. The first PRINT statement writes out a heading, here with lower case letters as an anachronism. Then the loop reads a deck of cards containing the name of an element and its atomic weight into an array ATWT, but the special feature is that the first twelve characters of each card replace the text in the FORMAT statement, and thus the following PRINT statement shows the name of the element followed by its atomic weight as just read. Fortran IV introduced a text literal, specified within apostrophes, with two apostrophes in a row indicating an apostrophe within the text. Later, either an apostrophe or a double quote could be used to start a text string (and the same one must be used to end it) so that if one or the other were desired within a text literal, the other could be used as its delimiters. If both were desired, then there would be no escape from doubling for one. Because spaces are significant within text literals, a long text literal continued on the next line would have the contents of column seven of the continuation line immediately following the contents of column 72 of the continued line - except that (for some compilers reading disc files) if such lines did not extend to column 72 (because trailing spaces were trimmed from the records) rather less text would be defined. So, even though this is in fixed-format (or card image) style, again misinterpreted by the syntax highlighter, BLAH = " 1Stuff" might be the equivalent of only BLAH = "Stuff" instead of defining a text literal with many leading spaces. F90 formalised an opportunity for free-format source files; many compilers had also allowed usage beyond column 72. Within the text literal, any character whatever may be supplied as text grist, according to the encodement recognised by the card reader as this was a fixed-format file - cards have an actual physical size. This applied also to source text held in disc files, as they were either fixed-size records or, for variable-length records, records had a length specification and the record content was not involved. Variable-length records were good for omitting the storage of the trailing spaces on each line, except that the sequence numbers were at the end of the line! In this case they might be omitted (unlike a card deck, a disc file's records are not going to be dropped) or there may be special provision for them to be at the start of each line with the source text's column one staring in column nine of the record. But, for the likes of paper tape, the question "How long is a record?" has no natural answer, and record endings were marked by a special symbol. Such a symbol (or symbol sequence) could not appear within a record, such as within a text literal and be taken as a part of the text. This style has been followed by the ASCII world, with variously CR, CRLF, LFCR and CR sequences being used to mark end-of-record. Such characters cannot be placed within a text literal, but the CHAR(n) function makes them available in character expressions. Some compilers however corrupt the "literal" nature of text literals by allowing escape sequences to do so, usually in the style popularised by C, thus \n, and consequently, \\ should a single \ be desired. Some examples, supposing that TEXT is a CHARACTER variable. TEXT = 'That''s right!' !Only apostrophes as delimiters. Doubling required. TEXT = "That's right!" !Chose quotes, so that apostrophes may be used freely. TEXT = "He said ""That's right!""" !Give in, and use quotes for a "quoted string" source style. TEXT = 'He said "That''s right!"' !Though one may dabble in inconsistency. TEXT = 23HHe said "That's right!" !Some later compilers allowed Hollerith to escape from FORMAT. A similar syntax enables the specification of hexadecimal, octal or binary sequences, as in X = Z"01FE" for hexadecimal (the "H" code already being used for "Hollerith" even if the H-usage is not supported by the compiler) but this is for numerical values, not text strings. While one could mess about with EQUIVALENCE statements, numbers fill up from the right while text strings fill from the left and there would be "endian" issues as well, so it is probably not worth the bother. Just use the CHAR function in an expression, as in TEXT = "That's"//CHAR(10)//"right!" !For an ASCII linefeed (or newline) character. Which may or may not be acted upon by the output device. A lineprinter probably would ignore a linefeed character but a teletype would not - it would roll the printing carriage one line up without returning to the column one position, thus the usage LFCR (or CRLF) to add the carriage return action. Some systems regard the LF as also implying a CR and for these the notation \n for "newline" is mnemonic even though there is no "newline" character code in ASCII - though there is in EBCDIC. Display screens do not handle glyph construction via overprinting though teletypes (and lineprinters) do. Similarly, a display screen may or may not start a new screen with a formfeed character and a lineprinter won't start a new page - at least if attached to a mainframe computer. ## friendly interactive shell echo Quotes are optional in most cases. echo echo 'But they are when using either of these characters (or whitespace):' echo '#$ % ^ & * ( ) { } ; \' " \\ < > ?'
echo
echo Single quotes only interpolate \\ and \' sequences.
echo '\In \other \cases, \backslashes \are \preserved \literally.'
echo
set something variable
echo "Double quotes interpolates \\, \" and \$sequences and$something accesses."

## FurryScript

A name literal starts with  and is one word long; it functions like a string.

A normal string literal uses angle brackets (< and >) around it. You can have additional < > pairs inside (which can nest to any level) in order to represent subroutine calls (they are not called where the string literal appears; they are called only once it is processed).

A story text uses {|| and ||} around it, and can contain any text, with no escapes supported.

All three kinds are string literals.

IsChar('a');
# true
IsString("abc");
# true
IsString('a');
# false
IsChar("a");
# false

## gecho

'a outascii

Just one character.

'yo...dawg. print

A string.

## Go

See the language specification sections on rune literals and string literals.

In Go, character literals are called "rune literals" and can be any single valid Unicode code point. They are written as an integer value or as text within single quotes.

ch := 'z'
ch = 122 // or 0x7a or 0172 or any other integer literal
ch = '\x7a' // \x{2*hex}
ch = '\u007a' // \u{4*hex}
ch = '\U0000007a' // \U{8*hex}
ch = '\172' // \{3*octal}

A rune literal results in an untyped integer. When used in a typed constant or stored in a variable, usually the type is either byte or rune to distinguish character values from integer values. These are aliases for uint8 and int32 respectively, but like other integer types in Go, they are distinct and require an explicate cast.

ch := 'z'          // ch is type rune (an int32 type)
var r rune = 'z' // r is type rune
var b byte = 'z' // b is type byte (an uint8 type)
b2 := byte('z') // equivalent to b
const z = 'z' // z is untyped, it may be freely assigned or used in any integer expression
b = z
r = z
ch2 := z // equivalent to ch (type rune)
var i int = z
const c byte = 'z' // c is a typed constant
b = c
r = rune(c)
i = int(c)
b3 := c // equivalent to b

Strings literals are are either interpreted or raw.

Interpreted string literals are contained in double quotes. They may not contain newlines but may contain backslash escapes.

str := "z"
str = "\u007a"
str = "two\nlines"

This means that 'z' and "z" are different. The former is a character while the latter is a string.

Unicode may be included in the string literals. They will be encoded in UTF-8.

str := "日本語"

Raw string literals are contained within back quotes. They may contain any character except a back quote. Backslashes have no special meaning.

\n == "\\n"

Raw string literals, unlike regular string literals, may also span multiple lines. The newline is included in the string (but not any '\r' characters):

abc
def
== "abc\ndef", // never "abc\r\ndef" even if the source file contains CR+LF line endings

Go raw string literals serve the purpose of here-strings in other languages. There is no variable expansion in either kind of string literal (the Go text/template package provides something like variable expansion).

## Groovy

In Groovy, unlike in Java, a String literal is delimited with single quotes (apostrophes(')).

def string = 'Able was I'

There is a double quote (quotation mark(")) delimited syntax in Groovy, but it represents an expression construct called a GString (I know, I know). Inside of a GString, sub-expression substitution of the form ${subexpression} may take place. Thus the following results: def gString = "${string} ere I saw Elba"

println gString

//Outputs:
//Able was I ere I saw Elba

UNIX Shell command line users should recognize these forms of syntax as strong ('-delimited) and weak ("-delimited) quoting. And like UNIX Shell weak quoting syntax, the evaluated subexpression part of the GString syntax loses its special meaning when preceded by a backslash (\):

def gString2 = "1 + 1 = ${1 + 1}" assert gString2 == '1 + 1 = 2' def gString3 = "1 + 1 = \${1 + 1}"
assert gString3 == '1 + 1 = ${1 + 1}' Groovy also supports multi-line String literals and multi-line GString expressions. def multiLineString = ''' A man A plan A canal ''' def multiLineGString = """${multiLineString.trim()}:
Panama!
"""

println multiLineGString

//Outputs:
//
//A man
//A plan
//A canal:
//Panama!
//

UNIX Shell programmers should recognize these forms of syntax as similar in function to the strong and weak forms of Here Document syntax.

Both String literals and GString expressions support a number of special characters (usually non-printable characters) which consist of a single character preceded by a backslash (hence \X), or a 4-hexadecimal-digit UNICODE encoding preceded by a backslash and a lowercase "u" (hence \uXXXX).

One of these special characters is the backslash itself, denoted with in a String or GString as \\. This actually interferes with regular expression syntax in which literal backslashes play various important regular-expression-specific roles. Thus it can become quite onerous to write regular expressions using String or GString quoting syntax, since every regex backslash would have to be written as \\.

However, Groovy has a special GString syntax that uses slash (/) as a GString delimiter rather that quote ("). In this special syntax, most backslash usages that would require a double backslash in a regular String or GString require only a single backslash (\). This does not create a "regular expression object" (there is not such a thing in Groovy); however, it does evaluate to form a "regular expression ready" String, as demonstrated in the following:

def regexString = /($[Tt]itle$|$[Ss]ubject$)${10 * 5}/ assert regexString == '(\$[Tt]itle\$|\$[Ss]ubject\$)50' Javascript users (and others) will recognize the roots of this "regex-ready" syntax as a feature in their own language. Since apostrophe is used to delimit String literals, that delimiter syntax is not available, as it is in Java, to denote single character literals (type char or Character). However, single character string literals can be converted to character literals by casting. Shown in the examples below are casting using the as operator, Java-style parenthetical casting, and forced coercion in the intialization of a variable of type char or Character. assert 'a' instanceof String assert ('a' as char) instanceof Character assert ((char)'a') instanceof Character char x = 'a' assert x instanceof Character Character y = 'b' assert y instanceof Character && (x+1 == y) As in Java, backslash is also used to mask a string delimiter. Thus the following two assignments represent strings containing a single quote and a single apostrophe respectively def quote = "\"" def apostrophe = '\'' Of course, if you are not using GString subexpression evaluation, you can just use apostrophe delimiters to contain a quote, or quote delimiters to contain an apostrophe. def quote2 = '"' def apostrophe2 = "'" assert quote == quote2 assert apostrophe == apostrophe2 ## Haskell ### language support Characters use single quotes, strings use double quotes. Both allow Unicode. Escape sequences start with a backslash. There are no verbatim strings, no here-strings, and no expansion of variables in strings. Strings may be split across lines, even indented, using the 'gap' syntax: "abcdef" == "abc\ \def" "abc\ndef" == "abc\n\ \def" You can also use \& which expands into nothing (but can be useful to interrupt another escape sequence). The Haskell 98 Report section Character and String Literals has more information. ### using raw-strings-qq package using raw-strings-qq package: {-# LANGUAGE QuasiQuotes #-} import Text.RawString.QQ "abc\ndef" == [r|abc def|] ## HicEst HicEst makes no distinction between single characters and strings. One can use single quotes, or double quotes, or most non-standard characters. CHARACTER c1='A', c2="B", c3=&C& CHARACTER str1='single quotes', str2="double quotes", str3*100 str3 = % delimit "Nested 'strings' " if needed % A null character CHAR(0) is printed as " ", displayed as "." in dialogs, but ends the string in Windows controls such as StatusBar or ClipBoard str3 = 'a string' // CHAR(0) // "may contain" //$CRLF // ~ any character ~

Named literal constants in HicEst:

$TAB == CHAR(9) ! evaluates to 1 (true)$LF == CHAR(10)
$CR == CHAR(13)$CRLF == CHAR(13) // CHAR(10) ! concatenation

## Icon and Unicon

Below is a little program to demonstrate string literals.

procedure main()

# strings are variable length are not NUL terminated
# at this time there is no support for unicode or multi-byte charactersets

c1 := 'aaab' # not a string - cset
s1 := "aaab" # string
s2 := "\"aaab\b\d\e\f\n\l\n\r\t\v\'\"\\\000\x00\^c" # with escapes and imbedded zero

# no native variable substitution, a printf library function is available in the IPL

every x := c1|s1|s2 do # show them
write(" size=",*x,", type=", type(x),", value=", image(x))
end
Output:
size=2, type=cset, value='ab'
size=4, type=string, value="aaab"
size=21, type=string, value="\"aaab\b\d\e\f\n\n\n\r\t\v'\"\\\x00\x00\x03"

## IDL

The single and double quotes are fairly interchangeable allowing one to use whichever isn't to be quoted (though single-quotes seem more well-behaved around integers in strings). Thus the following are both valid character-constant assignments:

a = " that's a string "
b = ' a "string" is this '

In a pinch, a character constant doesn't absolutely have to be terminated, rendering the following valid:

a = " that's a string

Duplicating either of them quotes them. Thus the following contains three single quotes and no double-quotes:

a = ' that<nowiki>''</nowiki>s a string
print,a
;==> that'
s a string

Things in quotes are not expanded. To get to the content of a variable, leave it unquoted:

b = 'hello'
a = b+' world
print,a
;==> hello world

Single-quoted strings of valid hex or octal digits will be expanded if followed by "x" or "o":

print,'777'x
;==> 1911
print,'777'o
;==> 511
print,'777'
;==> 777

so will be unterminated double-quoted strings if they represent valid octal numbers:

print,"777
;==> 511
print,"
877
;==> 877

Note that this renders the following false (common trip-up for IDL newbies):

a = "0"
;==> Syntax error.

...because the number zero indicates that an octal number follows, but the second double-quote is not a valid octal digit.

Byte-arrays that are converted into strings are converted to the ascii-characters represented by the bytes. E.g.

crlf = string([13b,10b])

## Inform 7

String literals are enclosed in double quotes. These may include raw line breaks, or expressions to be substituted enclosed in square brackets.

Home is a room. The description is "This is where you live...

...with your [number of animals in Home] pet[s]."
Single quotes in a string are translated to double quotes when they occur outside of a word: the string literal
"'That's nice,' said the captain."

will print as

"That's nice," said the captain.

There are no character literals: phrases that manipulate characters pass them as single-character strings.

## J

Like C, J treats strings as lists of characters. Character literals are enclosed in single quotes, and there is no interpolation. Therefore, the only "escape" character neccessary is the single-quote itself, and within a character literal is represented by a pair of adjacent single quotes (much like in C, where within a character literal, a slash is represented by a pair of adjacent slashes).

Examples:

'x'                   NB.  Scalar character
'string' NB. List of characters, i.e. a string
'can''t get simpler' NB. Embedded single-quote

Like VB, J can include newlines and other special characters in literals with concatentation. Also like VB, J comes with certain constants predefined for some characters:

'Here is line 1',LF,'and line two'

'On a mac, you need',CR,'a carriage return'

'And on windows, ',CRLF,'you need both'

TAB,TAB,TAB,'Everyone loves tabs!'

These constants are simply names assigned to selections from the ASCII alphabet. That is, the standard library executes lines like this:

CR   =:  13 { a.
LF =: 10 { a.
CRLF =: CR,LF NB. Or just 10 13 { a.
TAB =: 9 { a.

Since these constants are nothing special, it can be seen that any variable can be similarly included in a literal:

NAME =:  'John Q. Public'
'Hello, ',NAME,' you may have already won $1,000,000' For multiline literals, you may define an explicit noun, which is terminated by a lone ) template =: noun define Hello, NAME. My name is SHYSTER, and I'm here to tell you that you my have already won$AMOUNT!!

To collect your winnings, please send $PAYMENT to ADDRESS. ) Simple substitution is most easily effected by using loading a standard script: load 'strings' name =: 'John Q. Public' shyster =: 'Ed McMahon' amount =: 1e6 payment =: 2 * amount address =: 'Publisher''s Clearing House' targets =: ;: 'NAME SHYSTER AMOUNT PAYMENT ADDRESS' sources =: ":&.> name;shyster;amount;payment;address message =: template rplc targets,.sources While C-like interpolation can be effected with another: load 'printf' 'This should look %d%% familiar \nto programmers of %s.' sprintf 99;'C' This should look 99% familiar to programmers of C. ## Java char a = 'a'; // prints as: a String b = "abc"; // prints as: abc char doubleQuote = '"'; // prints as: " char singleQuote = '\''; // prints as: ' String singleQuotes = "''"; // prints as: '' String doubleQuotes = "\"\""; // prints as: "" Null characters ('\0') are printed as spaces in Java. They will not terminate a String as they would in C or C++. So, the String "this \0is \0a \0test" will print like this: this is a test ## JavaScript A JavaScript string is a sequence of zero or more characters enclosed in either 'single quotes' or "double quotes". Neither form prevents escape sequences: "\n" and '\n' are both strings of length 1. There is no variable interpolation. Unicode characters can be entered as literals or as 4 character hexadecimal escapes. The following expressions are equivalent: (function () { return "αβγδ 中间来点中文 🐫 אבגד" })(); (function() { return "\u03b1\u03b2\u03b3\u03b4 \u4e2d\u95f4\u6765\u70b9\u4e2d\u6587 \ud83d\udc2b \u05d0\u05d1\u05d2\u05d3"; })(); Note that in the case of the Emoji character above, where more than 4 hexadecimal characters are needed, ES5 requires us to separately write a pair of surrogate halves, and the String.length of such characters is 2. ES6 introduces Unicode code point escapes such as '\u{2F804}' allowing direct escaping of code points up to 0x10FFFF. ## jq jq supports all JSON types, including JSON strings; jq also supports "string interpolation". The rules for constructing JSON string literals are explained elsewhere (notably at json.org), so here we'll focus on "string interpolation" -- a technique for creating JSON strings programmatically using string literals, much like ruby's "#{...}", for example. The twist is that the string literal for specifying string interpolation is (by design) not itself a valid JSON string. Suppose that: • s is (or is a reference to) a JSON entity (e.g. a string or a number), and • we wish to create a JSON string that is some combination of JSON strings and the string value of s, for example: "The value of s is " + (s|tostring). jq allows the shorthand: "The value of s is $$s)", and in general, arbitrarily many such interpolations may be made. ## Kotlin Kotlin supports two kinds of string literals (UTF-16 encoded): • escaped string literals, enclosed in double-quotes, which can contain 'escaped characters'. • raw string literals, enclosed in triple double-quotes, which ignore escaping but can contain new lines. The language also supports character literals - a single UTF-16 character (including an escaped character) enclosed in single quotes. Here are some examples of these : // version 1.0.6 fun main(args: Array<String>) { val cl = 'a' // character literal - can contain escaped character val esl = "abc\ndef" // escaped string literal - can contain escaped character(s) val rsl = """ This is a raw string literal which does not treat escaped characters (\t, \b, \n, \r, \', \", \\, \ and \u) specially and can contain new lines. " Quotes" or doubled ""quotes"" can be included without problem but not tripled quotes. " "" val msl = """ |Leading whitespace can be removed from a raw |string literal by including |a margin prefix ('|' is the default) |in combination with the trimMargin function. " "".trimMargin() println(cl) println(esl) println(rsl) println(msl) } Output: a abc def This is a raw string literal which does not treat escaped characters (\t, \b, \n, \r, \', \", \\, \ and \u) specially and can contain new lines. "Quotes" or doubled ""quotes"" can be included without problem but not tripled quotes. Leading whitepace can be removed from a raw string literal by including a margin ('|' is the default) in combination with the trimMargin function. ## Lasso All strings in Lasso are Unicode strings. This means that a string can contain any of the characters available in Unicode. Lasso supports two kinds of string literals: quoted and ticked. Quoted strings can contain escape sequences, while ticked strings cannot. Both quoted and ticked string literals can contain line breaks and they both return same type of string object. [1] ## Quoted Strings 'I\'m a 2" string\n' "I'm a 2\" string\n" ## Ticked Strings In the below example here \n would not be a line feed, it represents a backslash and n. I'm also a 2" string\n ## LabVIEW LabVIEW is a graphical language so it uses graphical string delimiters. No escaping is needed. This image is a VI Snippet, an executable image of LabVIEW code. The LabVIEW version is shown on the top-right hand corner. You can download it, then drag-and-drop it onto the LabVIEW block diagram from a file browser, and it will appear as runnable, editable code. ## LaTeX Since LaTeX is a markup language rather than a programming language, quotes are displayed rather than interpreted. However, quotes do deserve special mention in LaTeX. Opening (left) quotes are denoted with backquotes and closing (right) quotes are denoted with quotes. Single quotes use a single symbol and double quotes use double symbols. For example, to typeset 'a' is for "apple" in LaTeX, one would type a' is for apple''''''' One common mistake is to use the same symbol for opening and closing quotes, which results in the one of the quotes being backward in the output. Another common mistake is to use a double quote symbol in the input file rather than two single quotes in order to produce a double quote in the output. ## Liberty BASIC 'Liberty BASIC does not support escape characters within literal strings. print "Quotation mark:" print chr(34) print 'Print literal string print "Hello, World." 'Print literal string displaying quotation marks. print chr(34);"Hello, World.";chr(34) ## Lingo • Lingo only supports single quotes for string literals. Single quotes inside string literals have to be replaced by "&QUOTE&": str = "Hello "&QUOTE&"world!"&QUOTE put str -- "Hello "world!"" • Lingo does not support heredoc syntax, but only multiline string literals by using the line continuation character "\": str = "This is the first line.\ This is the second line.\ This is the third line." • Lingo does not support automatic variable expansion in strings. But the function value() can be used to expand template strings in the current context: template = QUOTE&"Milliseconds since last reboot: "&QUOTE&"&_system.milliseconds" -- expand template in current context str = value(template) put str -- "Milliseconds since last reboot: 20077664" ## Lisaac Characters: c1 := 'a'; c2 := '\n'; // newline c3 := '\''; // quote c4 := '\101o'; // octal c5 := '\10\'; // decimal c6 := '\0Ah\'; // hexadecimal c7 := '\10010110b\'; // binary Strings: s1 := "this is a\nsample"; // newline s2 := "\""; // double quote s3 := "abc\ \xyz"; // "abcxyz", cut the gap ## LiveCode LiveCode has only one string representation using quotes. Characters are accessed through chunk expressions, specifically char. Some special characters are built-in constants such as quote, space, comma, cr, return. There is no support for escaping characters or multiline literals. put "Literal string" -- Literal string put char 1 of "Literal string" -- L put char 1 to 7 of "Literal string" -- Literal put word 1 of "Literal string" -- Literal put quote & "string" & quote -- "string" ## Logo Logo does not have a string or character type that is separate from its symbol type ("word"). A literal word is specified by prefixing a double-quote character. Reserved and delimiting characters, ()[];~+-*/\=<>| and newline, may be used if preceded by a backslash. Alternatively, the string may be wrapped in vertical bars, in which case only backslash and vertical bar need be escaped. print "Hello\,\ world print "|Hello, world| ## Lua Strings can be enclosed using singlequotes or doublequotes. Having two different types of quotation symbols enables either of the symbols to be embedded within a string enclosed with the other symbol. singlequotestring = 'can contain "double quotes"' doublequotestring = "can contain 'single quotes'" longstring = [[can contain newlines]] longstring2 = [==[ can contain [[ other ]=] longstring " and ' string [===[ qualifiers]==] Note that interpolation of variables names within a string does not take place. However, interpolation of literal characters escape sequences does occur, irrespective of whether singlequote or doublequote enclosures are being used. ## M4 The quoting characters are  and ', but can be changed by the changequote macro: this is quoted string' changequote([',]')dnl [this is a quoted string] ## Maple There is no separate character type in Maple; a character is just a string of length equal to 1. > "foobar"; "foobar" > "foo\nbar"; # string with a newline "foo bar" > "c"; # a character "c" Note that adjacent strings in the input (separated only by white-space) are concatenated automatically by the parser. > "foo" "bar"; "foobar" Since variable names are not distinguished lexically from other text (such as by using a "" prefix, as in some shells), Maple does not do any kind of variable expansion inside strings. ## Mathematica There is no character type in Mathematica, only string type. "c"; // String (result: "c") "\n"; // String (result: newline character) ## MATLAB Strings start and end with single quotes, the escape sequence for a single quote with in a string, is the use of two consequtive single quotes s1 = 'abcd' % simple string s2 = 'ab''cd' % string containing a single quote Output: >> s1 = 'abcd' % simple string s1 = abcd >> s2 = 'ab''cd' % string containing a single quote s2 = ab'cd ## Maxima /* A string */ "The quick brown fox jumps over the lazy dog"; /* A character - just a one character string */ "a" ## ML/I ML/I treats all input and programs as character streams. Strings do not have to be quoted; they are taken 'as is'. If one wishes to ensure that a string is taken literally (i.e. not evaluated), it is enclosed in literal brackets. There are no predefined literal brackets; the programmer can define anything suitable, usually by setting up a matched text skip, using the MCSKIP operation macro. By convention, the pair <> is used for literal brackets, unless this clashes in the case of a particular processing task. ### Input MCSKIP "WITH" NL "" Literals/String MCINS %. MCSKIP MT,<> "" Demonstration of literal string MCDEF Bob AS Alice "" The following two lines both mention Bob. The first line is "" evaluated, but the second is surrounded by literal brackets and is not "" evaluated. This is the first mention of Bob <and here we mention Bob again> ### Output This is the first mention of Alice and here we mention Bob again ## Metafont In Metafont there's no difference between a single character string and a single character. Moreover, the double quotes (which delimites a string) cannot be inserted directly into a string; for this reason, the basic Metafont macro set defines string ditto; ditto = char 34; i.e. a string which is the single character having ASCII code 34 ("). Macro or variables expansion inside a string block is inhibited. message "You've said: " & ditto & "Good bye!" & ditto & "."; ## Modula-3 Characters in Modula-3 use single quotes. VAR char: CHAR := 'a'; Strings in Modula-3 use double quotes. VAR str: TEXT := "foo"; TEXT is the string type in Modula-3. Characters can be stored in an array and then converted to type TEXT using the function Text.FromChars in the Text module. Strings (of type TEXT) can be converted into an array of characters using the function Text.SetChars. VAR str: TEXT := "Foo"; VAR chrarray: ARRAY [1..3] OF CHAR; Text.SetChars(chrarray, str); (* chrarray now has the value ['F', 'o', 'o'] *) ## MUMPS All strings are delimited by the double quotes character. But you can escape the double quotes to add a double quotes character to a string. USER>SET S1="ABC" USER>SET S2="""DEF""" USER>SET S3="""GHI" USER>W S1 ABC USER>W S2 "DEF" USER>W S3 "GHI ## Nemerle Character literals are enclosed in single quotes. Regular strings are enclosed in double quotes, and use \ to delimit special characters, whitespace similar to C. A @ preceding the double quotes indicates a literal string. A preceding the double quote indicates string interpolation, identifiers prefixed with inside the string literal will be replaced with their value. Nemerle also has a recursive string literal, enclosed within <# #>, that is the same as a literal string, except that it allows nesting of strings. 'a' // character literal '\n' // also a character literal "foo\nbar" // string literal @"x\n" // same as "x\\n" @"x y" // same as "x\n y" @"""Hi!""" // "" replaces \" to escape a literal quote mark <#This string type can contain any symbols including " and new lines. It does not support escape codes like "\n".#> // same as "This string type can contain any symbols including \"\nand new lines. " // + "It does not\nsupport escape codes\nlike \"\\n\"." <#Test <# Inner #> end#> // same as "Test <# Inner #> end" (i.e. this string type support recursion. ## Nim var c = 'c' var s = "foobar" var l = """foobar and even more test here""" var f = r"C:\texts\text.txt" # Raw string ## OASYS Assembler There are two kinds, strings with quotation marks and strings with braces. Both kinds are treated exactly like numeric tokens for all purposes. Strings with quotation marks can contain repeated quotation marks to represent a quotation mark, a tilde to represent a line break, or a line break to represent a space (in which case any leading spaces on the following line are ignored). Strings with braces start with { and end with the next } (they don't nest), and all characters (except a right-brace) are treated as-is. There are no character literals. ## Objective-C The same as C, with the addition of the new string literal @"Hello, world!" which represents a pointer to a statically allocated string object, of type NSString *, similar to string literals in Java. You can use this literal like other object pointers, e.g. call methods on it [@"Hello, world!" uppercaseString]. ## Objeck Objeck string support is similar to Java except that string elements are 1-byte in length. In addition, string literals may terminated using a NULL character or the string's length calculation. ## OCaml Characters are contained in single quotes: # 'a';; - : char = 'a' Strings are contained in double quotes: # "Hello world";; - : string = "Hello world" Strings may be split across lines and concatenated using the following syntax: (the newline and any blanks at the beginning of the second line is ignored) # "abc\ def" ;; - : string = "abcdef" If the above syntax is not used then any newlines and whitespace are included in the string: # "abc def" ;; - : string = "abc\n def" ## Octave Strings can be defined in Octave with single or double quotes. In order to maintain compatible with Matlab, it is recommended to use single quotes for defining strings. s1 = 'abcd' % simple string s2 = 'ab''cd' % string containing a single quote using an escaped single quote s3 = 'ab"cd' % simple string containing a double quote s4 = "ab'cd" % string containing a single quote s5 = "ab""cd" % string containing a double quote using an escaped double quote Output: octave:5> s1 = 'abcd' % simple string s1 = abcd octave:6> s2 = 'ab''cd' % string containing a single quote using an escaped single quote s2 = ab'cd octave:7> s3 = 'ab"cd' % simple string containing a double quote s3 = ab"cd octave:8> s4 = "ab'cd" % string containing a single quote s4 = ab'cd octave:9> s5 = "ab""cd" % string containing a double quote using an escaped double quote s5 = ab"cd ## Oforth Oforth uses single quotes for characters and double quotes for strings. There is no character type : characters are integers representing unicode value of the character. 'a' '\'' "abcd" "ab\ncd" "ab\" and \" cd" ## Oz declare Digit0 = &0 %% the character '0' NewLine = &\n %% a character with special representation NewLine = &\012 %% characters can also be specified with octals %% Strings are lists of characters, but can also be written in double quotes: [&H &e &l &l &o] = "Hello" AnAtom = 'Hello' %% single quotes are used for atoms Atom2 = hello = 'hello' %% for atoms starting with a lower case letter, they are optional %% To build strings out of other values, so-called virtual strings are used: MyName = "Peter" MyAge = 8 {System.showInfo MyName # " is " # MyAge # " years old."} ## PARI/GP There are just three escapes: \e escape \n newline \t tab Any other escaped character simply represents itself; \\ and \" are the most useful. There are no characters or character strings as such, but Vectorsmall("string") is very similar to a character array. Version 2.4.3 added the functions printf and Strprintf which allow interpolation (typically with %Ps). ## Pascal See Delphi ## Perl Perl makes no distinction between single characters and strings. One can use single or double quotes, but they are different. Double-quotes allows you to interpolate variables and escape sequences, while single-quotes do not. 'c'; # character 'hello'; # these two strings are the same "hello"; 'Hi name. How are you?'; # result: "Hi name. How are you?" "Hi name. How are you?"; # result: "Hi Bob. How are you?" '\n'; # 2-character string with a backslash and "n" "\n"; # newline character ls; # runs a command in the shell and returns the output as a string q/hello/; # same as 'hello', but allows custom delimiters, eg: q(hi) and q!hi! qq/hello/; # same as "hello", but allows custom delimiters, eg: qq{hi} and qq#hi# qw/one two three/; # same as ('one', 'two', 'three'); constructs a list of the words qx/ls/; # quoted execution, same as ls qr/regex/; # creates a regular expression <<END; # Here-Document Hi, whatever goes here gets put into the string, including newlines and variables, until the label we put above END <<'END'; # Here-Document like single-quoted Same as above, but no interpolation of variables. END ## Perl 6 Unlike most languages that hardwire their quoting mechanisms, the quote mechanism in Perl 6 is extensible, and all normal-looking quotes actually derive from a parent quoting language called Q via grammatical mixins, applied via standard Perl 6 adverbial syntax. The available quote mixins, straight from current spec S02, are: Short Long Meaning ===== ==== ======= :x :exec Execute as command and return results :w :words Split result on words (no quote protection) :ww :quotewords Split result on words (with quote protection) :v :val Evaluate word or words for value literals :q :single Interpolate \\, \q and \' (or whatever) :qq :double Interpolate with :s, :a, :h, :f, :c, :b :s :scalar Interpolate vars :a :array Interpolate @ vars :h :hash Interpolate % vars :f :function Interpolate & calls :c :closure Interpolate {...} expressions :b :backslash Interpolate \n, \t, etc. (implies :q at least) :to :heredoc Parse result as heredoc terminator :regex Parse as regex :subst Parse as substitution :trans Parse as transliteration :code Quasiquoting :p :path Return a Path object (see S16 for more options In any case, an initial Q, q, or qq may omit the initial colon to form traditional Perl quotes such as qw//. And Q can be used by itself to introduce a quote that has no escapes at all except for the closing delimiter: my raw = Q'[email protected]\@#)&!#'; Note that the single quotes there imply no single quoting semantics as they would in Perl 5. They're just the quotes the programmer happened to choose, since they were most like the raw quoting. Single quotes imply :q only when used as normal single quotes are, as discussed below. As in Perl 5, you can use any non-alphanumeric, non-whitespace characters for delimiters with the general forms of quoting, including matching bracket characters, including any Unicode brackets. Using the definitions above, we can derive the various standard "sugar" quotes from Q, including: Normal Means ====== ===== q/.../ Q :q /.../ qq/.../ Q :qq /.../ '...' Q :q /.../ "..." Q :qq /.../ <...> Q :q :w :v /.../ «...» Q :qq :ww :v /.../ /.../ Q :regex /.../ quasi {...} Q :code {...} The :qq-derived languages all give normal Perlish interpolation, but individual interpolations may be chosen or suppressed with extra adverbs. Unlike in Perl 5, we don't use backticks as shorthand for what is now expressed as qqx// in Perl 6. (Backticks are now reserved for user-defined syntax.) Heredocs now have no special << syntax, but fall out of the :to adverb: say qq:to/END/; Your ad here. END Indentation equivalent to the ending tag is automatically removed. Backslash sequences recognized by :b (and hence :qq) include: "\a" # BELL "\b" # BACKSPACE "\t" # TAB "\n" # LINE FEED "\f" # FORM FEED "\r" # CARRIAGE RETURN "\e" # ESCAPE "\x263a" # ☺ "\o40" # SPACE "\0" # NULL "\cC" # CTRL-C "\c8" # BACKSPACE "\c[13,10]" # CRLF "\c[LATIN CAPITAL LETTER A, COMBINING RING ABOVE]" Leading 0 specifically does not mean octal in Perl 6; you must use \o instead. ## Phix single character literals (incidentally entirely equivalient to their ascii value) require single quotes, eg constant UPPERCASEJ = 'J' -- equivalent to 74 string literals use double quotes, eg constant hw = "Hello World!", mt = "" -- empty string Note that 'z' and "z" are quite different. In Phix there is a strong difference between a character and a string. All strings are ansi or utf8, depending on the encoding of the source file, eg s = "日本語" Utf8 strings are byte-subscripted rather than character-subscripted, so s[3] is not necessarily the third character. Phix strings have a length field in the (internal) header, /and/ a terminating null, so they can be used directly when interfacing to C-style languages. Phix strings can also be used to hold "raw binary", ie instead of a sequence of characters, a sequence of any bytes in the range 0 to 255. Strings are fully mutable: you can append, prepend, replace, substitute, and crop characters and slices (/substrings) any way you like, eg string s = "food" s[2..3] = 'e' -- s is now "feed" (replace all) s[2..2] = "east" -- s is now "feasted" (replace substring) s[2..5] = "" -- s is now "fed" Special characters may be entered (between quotes) using a back-slash: Code Value Meaning \n #10 newline \r #13 carriage return \b #08 backspace \t #09 tab \\ #5C backslash \" #22 double quote \' #27 single quote \0 #00 null \#HH #HH any hexadecimal byte \xHH #HH any hexadecimal byte (\u, \U currently omitted, see note below) There are no other automatic substitutions, other than through explict function calls such as printf. Strings can also be entered by using triple quotes or backticks intead of double quotes to include linebreaks and avoid any backslash interpretation. If the literal begins with a newline, it is discarded and any immediately following leading underscores specify a (maximum) trimming that should be applied to all subsequent lines. Examples: ts = """ this string\thing""" ts = """ _____this string\thing""" ts = 'this string\thing' ts = "this\nstring\\thing" which are all equivalent. On a practical note, as long as you have at least 2GB of physical memory, you should experience no problems whatsoever constructing a string with 400 million characters, and you could more than triple that by allocating things up front, however deliberately hogging the biggest block of memory the system will allow is generally considered bad programming practice, and may lead to disk thrashing. Hex string literals are also supported (mainly for compatibility with OpenEuphoria, x/u/U for 1/2/4 byte codes), eg: ?x"68 65 6c 6c 6f"; -- displays "hello" As noted above, escapes \u and \U are currently omitted from the previous table, but as the source (ptok.e) notes, it is more about testing/documenting than any technical difficulty handling them in the tokeniser. ## PHP PHP makes no distinction between single characters and strings. One can use single or double quotes, but they are different. Double-quotes allows you to interpolate variables and escape sequences, while single-quotes do not. 'c'; # character 'hello'; # these two strings are the same "hello"; 'Hi name. How are you?'; # result: "Hi name. How are you?" "Hi name. How are you?"; # result: "Hi Bob. How are you?" '\n'; # 2-character string with a backslash and "n" "\n"; # newline character ls; # runs a command in the shell and returns the output as a string <<END # Here-Document Hi, whatever goes here gets put into the string, including newlines and variables, until the label we put above END; <<'END' # Here-Document like single-quoted Same as above, but no interpolation of variables. END; ## PicoLisp PicoLisp doesn't have a string data type. Instead, symbols are used. Certain uninterned symbols, called "transient symbols", however, look and behave like strings on other languages. Syntactically, transient symbols (called "strings" in the following) are surrounded by double quotes. : "ab\"cd" -> "ab\"cd" Double quotes in strings are escaped with a backslash. ASCII control characters can be written using the hat ('^') character: : "ab^Icd^Jef" # Tab, linefeed There is no special character type or representation. Individual characters are handled as single-character strings: : (chop "abc") -> ("a" "b" "c") : (pack (reverse @)) -> "cba" A limited handling of here-strings is available with the 'here' function. ## Pike 'c'; // Character code (ASCII) (result: 99) "c"; // String (result: "c") "\n"; // String (result: newline character) "hi " + world // String (result: "hi " and the contents of the variable world) #"multiple line string using the preprocessor" // single literal string with newlines in it ## PL/I 'H' /* a single character as a literal. */ 'this is a string' '' /* an empty string literal. */ 'John''s cat' /* a literal containing an embedded apostrophe. */ /* stored are <<John's cat>> */ '101100'b /* a bit string, stored as one bit per digit. */ ## Plain TeX The same as LaTeX case, even though one should say the opposite. The  and '' in TeX (plainTeX, LaTeX and many more) are just examples of ligatures. ## Pop11 In Pop11 charaters literals are written in inverted quotes (backticks) a ;;; charater a String are written in quotes 'a' ;;; string consisting of single character Backslash is used to insert special charaters into strings: '\'\n' ;;; string consisting of quote and newline ## PowerShell PowerShell makes no distinction between characters and strings. Single quoted strings do not interpolate variable contents but double quoted strings do. Also, escape sequences are quoted literally as separate characters within single quotes. PowerShell here-strings begin with @' (or @") followed immediately by a line break and end with a line break followed by '@ (or "@). Escape sequences and variables are interpolated in @" quotes but not in @' quotes. ## Prolog Standard Prolog has no string types. It has atoms which can be formed in two ways, one of which is wrapping arbitrary text in single quotation marks: 'This is an "atom" and not a string.' Such atoms can be (and are) treated as immutable strings in Prolog in many cases. Another string-like form wraps text in double quotation marks: "This 'string' will fool you if you're in a standard Prolog environment." While this appears as a string to non-Prolog users, it is in reality a linked list of integers with each node containing the integer value of the character (or for Unicode-capable systems, code point) at that location. For example: ?- [97, 98, 99] = "abc". true. Individual character constants are special forms of integer (syntax sugar) using a 0' prefix: ?- 97 = 0'a. true. Works with: SWI Prolog version 7.0 SWI-Prolog, beginning with version 7, introduced a new native string type. Unless options are specifically set by the user, character sequences wrapped in double quotes are now a string data type. The older list-based version uses back quotes instead: ?- [97, 98, 99] = "abc". false. ?- [97, 98, 99] = abc. true. Also starting with SWI-Prolog version 7, quasiquotation became possible. While not exactly a string type directly, they can be (ab)used to give multi-line strings. More importantly, however, they permit special string handling to be embedded into Prolog code, in effect permitting entire other languages inside of Prolog to be used natively as per this example: test_qq_odbc :- myodbc_connect_db(Conn), odbc_query(Conn, {|odbc|| select P.image,D.description,D.meta_keywords,C.image,G.description from product P, product_description D, category C, category_description G, product_to_category J where P.product_id=D.product_id and P.product_id=J.product_id and C.category_id=J.category_id and C.category_id=G.category_id |}, Row), writeln(Row). In this example, the test_qq_odbc/0 predicate connects to an ODBC database and performs a query. The query is wrapped into a multi-line quasiquotation (beginning with {| and ending with |}) that checks the syntax and security of the query, so not only is the query a multi-line string, it is a **checked** multiline string in this case. ## PureBasic PureBasic supports char in ASCII and UNICODE as well as both dynamic and fixed length strings. ; Characters (*.c), can be ASCII or UNICODE depending on compiler setting Define.c AChar='A' ; defines as *.a it will be ASCII and *.u is always UNICODE Define.a A='b' Define.u U='水' ; Strings is defined as **.s or ending with '' Define.s AStrion ="String #1" Define BStrion.s ="String #2" Define CString ="String #3" ; Fixed length stings can be defined if needed Define XString.s{100} ="I am 100 char long!" ; '"' can be included via CHR() or its predefined constant Define AStringQuotes=Chr(34)+"Buu"+Chr(34)+" said the ghost!" Define BStringQuotes=#DOUBLEQUOTE+"Buu"+#DOUBLEQUOTE+" said yet a ghost!" To dynamically detect the current sizes of a character, e.g. ASCI or UNICODE mode, StringByteLength() can be used. Select StringByteLength("X") Case 1 Print("ASCII-mode; Soo, Hello world!") Case 2 Print("UNICODE-mode; Soo, 您好世界!") EndSelect ## Python Python makes no distinction between single characters and strings. One can use single or double quotes. 'c' == "c" # character 'text' == "text" ' " ' " ' " '\x20' == ' ' u'unicode string' u'\u05d0' # unicode literal As shown in the last examples, Unicode strings are single or double quoted with a "u" or "U" prepended thereto. Verbatim (a.k.a. "raw") strings are contained within either single or double quotes, but have an "r" or "R" prepended to indicate that backslash characters should NOT be treated as "escape sequences." This is useful when defining regular expressions as it avoids the need to use sequences like \\\\ (a sequence of four backslashes) in order to get one literal backslash into a regular expression string. r'\x20' == '\\x20' The Unicode and raw string modifiers can be combined to prefix a raw Unicode string. This must be done as "ur" or "UR" (not with the letters reversed as it: "ru"). Here-strings are denoted with triple quotes. ''' single triple quote ''' """ double triple quote """ The "u" and "r" prefixes can also be used with triple quoted strings. Triple quoted strings can contain any mixture of double and single quotes as well as embedded newlines, etc. They are terminated by unescaped triple quotes of the same type that initiated the expression. They are generally used for "doc strings" and other multi-line string expressions --- and are useful for "commenting out" blocks of code. ## R R makes no distinction between characters and strings, and uses single and double quotes interchangeably, though double quotes are considered to be preferred. Verbatim strings are not supported. See ?Quotes for more information. str1 <- "the quick brown fox, etc." str2 <- 'the quick brown fox, etc.' identical(str1, str2) #returns TRUE R also supports testing string literals with ==, e.g., modestring <- 'row,col' mode.vec <- unlist(strsplit(modestring, ',')) mode.vec[1] # "row" mode.vec[2] # "col" if (mode.vec[2] == 'col') { cat('Col!\n') } # Col! (with no quotes) if (mode.vec[1] == "row") { cat('Row!\n') } # Row! R also uses backticks, for creating non-standard variable names (amongst other things). a b <- 4 a b # 4 a b # Error: unexpected symbol in "a b" R will print different styles of single and double quote using sQuote and dQuote options(useFancyQuotes=FALSE) cat("plain quotes: ", dQuote("double"), "and", sQuote("single"), "\n") returns plain quotes: "double" and 'single' options(useFancyQuotes=TRUE) cat("fancy quotes: ", dQuote("double"), "and", sQuote("single"), "\n") returns fancy quotes: “double” and ‘single’ options(useFancyQuotes="TeX") cat("fancy quotes: ", dQuote("double"), "and", sQuote("single"), "\n") returns TeX quotes: double'' and single' ## Racket Characters are specified as hash-backslash-character, sometime using a name for the character. #\a #\space #\return Strings are double-quoted, and have most of the usual C-style escapes. To include a double-quote in strings, escape it with a backslash, and the same goes for doubly-escaped backslashes (leading to the usual regexp fun). Racket source code is read as UTF-8 text so strings can include Unicode characters -- but the internal representation is UCS-4. This includes "\NNN" for octals and "\xHH" for hex and "\uHHHH" for higher characters. See the docs for a complete specification. Racket also has here strings, and a more sophisticated facility for text that includes interpolation-like features, which is described in the Here Document entry ## Retro Strings are enclosed in double quotes, and can contain anything other than a double quote. ASCII characters are prefixed by a single quote. 'c "hello, world!" ## REXX There are two types of quotes used for REXX literals: • " (sometimes called a double quote or quote) • ' (sometimes called a single quote or apostrophe) There is no difference between them as far as specifying a REXX literal. You can double them (code two of them adjacent) to specify a quote within the string. char1 = "A" char2 = 'A' str = "this is a string" another = 'this is also a string' escape1 = "that's it!" escape2 = 'that''s it!' Variable expansion is not possible within REXX literals. Simply concatenate the string with the variable: amount = 100 result = "You got" amount "points." say result Output: You got 100 points. It's also possible to express characters in hexadecimal notation in a string: lf = '0A'x cr = '0D'x mmm = '01 02 03 34 ee'x ppp = 'dead beaf 11112222 33334444 55556666 77778888 00009999 c0ffee'X lang = '52455858'x /*which is "REXX" on ASCII-computers.*/ Binary strings are also possible: jjj = '01011011'B jjj = '01011011'b jjj = "0101 1011"b jjj = '0101 1011 1111'b longjjj = '11110000 10100001 10110010 11100011 11100100'B ## Ring see 'This is a "quoted string"' ## Ruby Quotes that do not interpolate: 'single quotes with \'embedded quote\' and \\backslash' %q(not interpolating with (nested) parentheses and newline) Quotes that interpolate: a = 42 "double quotes with \"embedded quote\"\nnewline and variable interpolation: #{a} % 10 = #{a % 10}" %Q(same as above) %|same as above| Heredocs print <<HERE With an unquoted delimiter, this interpolates: a = #{a} HERE print <<-INDENTED This delimiter can have whitespace before it INDENTED print <<'NON_INTERPOLATING' This will not interpolate: #{a} NON_INTERPOLATING ## Scala Character literals use single quotes marks: val c = 'c' However, symbols are denoted with a single quote, so care must be taken not to confuse the two: val sym = 'symbol Strings can use either double quotes, or three successive double quotes. The first allows special characters, the second doesn't: scala> "newline and slash: \n and \\" res5: java.lang.String = newline and slash: and \ scala> """newline and slash: \n and \\""" res6: java.lang.String = newline and slash: \n and \\ However, Unicode characters are expanded wherever they happen, even inside comments. So, for instance: scala> val uniquote = \u0022normal string" uniquote: java.lang.String = normal string scala> val insidequote = " ""an inside \u0022 quote""" insidequote: java.lang.String = an inside " quote Finally, on version 2.7, the triple-double-quoted string ends at the third consecutive quote, on version 2.8 it ends on the last quote of a series of at least three double-quotes. Scala 2.7 scala> val error = """can't finish with a quote: """" <console>:1: error: unterminated string val error = " ""can't finish with a quote: """" ^ Scala 2.8 scala> val success = """but it can on 2.8: """" success: java.lang.String = but it can on 2.8: " ## Scheme Characters are specified using the "#\" syntax: #\a #\A #\? #\space #\newline Strings are contained in double quotes: "Hello world" Literal symbols, lists, pairs, etc. can be quoted using the quote syntax: 'apple '(1 2 3) ; same as (list 1 2 3) '() ; empty list '(a . b) ; same as (cons 'a 'b) ## Seed7 The type char describes Unicode characters encoded with UTF-32. A character literal is written as UTF-8 encoded Unicode character enclosed in single quotes. var char: ch is 'z'; The type string describes sequences of Unicode characters. The characters in the string use the UTF-32 encoding. A string literal is a sequence of UTF-8 encoded Unicode characters surrounded by double quotes. var string: stri is "hello"; This means that 'z' and "z" are different. The former is a character while the latter is a string. Seed7 strings are not null terminated (they do not end with \0). They can contain any sequence of UNICODE (UTF-32) characters (including a \0). Empty strings are also allowed. In order to represent non-printable characters and certain printable characters the following escape sequences may be used.  audible alert BEL \a backspace BS \b escape ESC \e formfeed FF \f newline NL (LF) \n carriage return CR \r horizontal tab HT \t vertical tab VT \v backslash ($$ \\ apostrophe (') \' double quote (") \" control-A \A ... control-Z \Z A backslash followed by an integer literal and a semicolon is interpreted as character with the specified ordinal number. Note that the integer literal is interpreted decimal unless it is written as based integer. "Euro sign: \8364;" There is also a possibility to break a string into several lines. var string: example is "this is a string\ \ which continues in the next line\n\ \and contains a line break"; There is no built-in mechanism for expanding variables within strings. ## Sidef Quotes that do not interpolate: 'single quotes with \'embedded quote\' and \\backslash'; ‚unicode single quoted’; %q(not interpolating with (nested) parentheses and newline); Quotes that interpolate: var a = 42; "double \Uquotes\E with \"embedded quote\"\nnewline and variable interpolation: #{a} % 10 = #{a % 10}"; „same as above”; %Q(same as above); Heredocs: print <<EOT Implicit double-quoted (interpolates): a = #{a} EOT print <<"EOD" Explicit double-quoted with interpolation: a = #{a} EOD print <<'NON_INTERPOLATING' This will not interpolate: #{a} NON_INTERPOLATING ## Slate Characters are specified using the$ syntax:

$a$D
$8$,
$\s$\n

Strings are contained in single quotes, with backslash for escaping:

'Hello\'s the word.'

## SQL

String literals in SQL use single-quotation. There are no escapes, but you can double a ' mark to make a single ' in the text.

SELECT 'The boy said ''hello''.';

## Standard ML

Characters are contained in the #"" syntax:

- #"a";
val it = #"a" : char

Strings are contained in double quotes:

- "Hello world";
val it = "Hello world" : string

Strings may be split across lines and concatenated by having two backslashes around the newline and whitespace:

- "abc\
\def";
val it = "abcdef" : string

## Swift

let you = "You"
let str1 = "\(you) can insert variables into strings."
let str2 = "Swift also supports unicode in strings ı∫ƒ∂ß´™¡à"
let str3 = "Swift also supports control characters \n\tLike this"
let str4 = "'" // '
let str5 = "\"" // "
println(str3)
Output:
Swift also supports control characters
Like this

## Tcl

Tcl makes no distinction between single characters and strings.

Double quotes allow command and variable interpolation:

set str "This is Tcl $::tcl_version\tIt is [clock format [clock seconds]]" puts$str ;# ==> This is Tcl 8.5 It is Mon Apr 06 16:49:46 EDT 2009

Braces prevent interpolation

set str {This is Tcl $::tcl_version\tIt is [clock format [clock seconds]]} puts$str ;# ==> This is Tcl $::tcl_version\tIt is [clock format [clock seconds]] ## TI-89 BASIC Double quotes enclose strings, e.g. "Hello Rosetta Code". There are no escape characters. Quotes in strings are doubled: "This > "" < is one double-quote." ## TUSCRIPT$\$ MODE TUSCRIPT,{}
s1=*
DATA "string"
s2=*
DATA + "double" quotes
s3=*
DATA + 'single' quotes
s4=*
DATA + "double" + 'single' quotes
show=JOIN(s1," ",s2,s3,s4)
show=JOIN(show)
PRINT show

Output:
"string" + "double" quotes + 'single' quotes + "double" + 'single' quotes

## UNIX Shell

Works with: Bourne Shell
Works with: bash

The Unix shell supports several types of quotation marks:

• singlequotes - for literal string quotation
• doublequotes - for interpolated string quotation
• backticks - Used to capture the output from an external program

### Quotation marks within a literal String

It is possible to place singlequote characters within a string enclosed with doublequotes and to put doublequote characters in a string enclosed within singlequotes:

echo "The boy said 'hello'."
echo 'The girl said "hello" too.'

We can also use an escapesequence to put doublequote characters in an interpolated string:

print "The man said \"hello\".";

### Here documents

The shell supports the use of here documents for the passing of quoted text as input into a command. Here documents cannot be used to represent literal strings as an expression for variable assignment.

cat << END
1, High Street,
SMALLTOWN,
West Midlands.
WM4 5HD.
END

## Ursala

Single characters are denoted with a back quote.

a = x

Unprintable character constants can be expressed like this.

cr = 13%cOi&

Strings are enclosed in single forward quotes.

b = 'a string'

A single quote in a string is escaped by another single quote.

c = 'Hobson''s choice'

Multi-line strings are enclosed in dash-brackets.

d =

-[this is a list
of strings]-

Dash-bracket enclosed text can have arbitrary nested unquoted expressions, provided they evaluate to lists of character strings.

e = -[the front matter -[ d ]- the rest of it]-

f = -[text -[ d ]- more -[ e ]- text ]-

This notation can also be used for defining functions.

g "x" = -[ Dear -[ "x" ]- bla bla ]-

The double quotes aren't for character strings but dummy variables.

## V

A simple quoted string is of the form 'string' e.g

'hello world' puts

## Vim Script

A string constant delimited by double quotes " may contain escape sequences like \n, \t, \123 (byte value in octal), \xab (byte value in hexadecimal), \\ (backslash), \" or \u12ff (character code in hexadecimal according to the current encoding).

If a string constant is delimited by single quotes ' all characters are taken as they are. In order to use a single quote inside a literal string it must be escaped with another single quote, i.e. two single quotes stand for one.

Strings must always end at the current line and characters are just strings of length one.

## Visual Basic .NET

Visual Basic only supports single-line strings. The only escape sequence supported is the double double-quote (""), which is translated into a single double-quote.

Dim s = "Tom said, ""The fox ran away."""
Result: Tom said, "The fox ran away."

## WEB

WEB supports single-quoted strings exactly like Pascal strings (duplicate a ' to represent a literal ').

Double-quoted strings are "pool strings"; they are replaced by a numeric literal by the preprocessor, and placed into a string pool file (duplicate a " to represent a literal ").

Pool strings consisting of exactly one character represent numbers 0-255 according to their ASCII character code.

## XPL0

The literal value of a character is specified by preceding it with a
caret. For example, this assigns the ASCII value of A (hex 41) to Ch:

Ch:= ^A;

Strings are one-dimensional byte arrays.
For example, this assigns the address of the string enclosed in quote marks to Str:

Str:= "Hello, world!";

The ASCII code for each character is stored in sequential bytes.
By default strings are terminated with the most significant bit set
on the last character.
The exclamation point would be stored as hex A1.
Strings can also be terminated with a zero byte (like in the C language).
If the command:

string 0;

occurs anywhere before a string is set up then it will have an extra zero
byte at the end.

A quote mark can be included in a string by preceding it with a caret.
Carets are also included this way.
For example:

"^"^^^" is a ^"caret^""

results in:

"^" is a "caret"

Carets can also be used to specify control characters.
For example, this is escape E (hex 1B C5):

"^[E"

Strings can any length and can span lines, for example:

"Hello,
world!"

A carriage return (hex 0D) and line feed (hex 0A) are in the middle.

Strings are output to various devices (such as the console screen or
printer) with the Text intrinsic.

## XSLT

XSLT is based on XML, and so can use either " or ' to delimit strings. Since XML attribute values are defined using double-quotes, one must use single-quotes for string literals within attributes.

<xsl:if test="starts-with(@name, 'Mr.')">Mister</xsl:if>

Double and single quote characters may also be escaped with XML entities: &quot; and &apos; respectively.

## zkl

Interpreted string: "hoho". Raw string: 0'|hoho| where | is user choosen. \b, \f, \n, \r, \t, \e escapes are supported. Two adjacent strings are treated as one: "foo" 0'~bar~ --> "foobar". No variable expansion.

here-strings:

text:=
0'|foo|
"bar\n";
Output:
"foobar\n"
n:=7; text:=String(
"foo = ",3,"\n"
"bar=",n,"\n"
);
Output:
text = "foo = 3\nbar=7"