Word frequency: Difference between revisions
Content added Content deleted
Line 2,379: | Line 2,379: | ||
// Informally, this set is the set of all non-whitespace characters used to separate linguistic units in scripts, such as periods, dashes, parentheses, and so on. |
// Informally, this set is the set of all non-whitespace characters used to separate linguistic units in scripts, such as periods, dashes, parentheses, and so on. |
||
MutableCharacterSetFormUnionWithCharacterSet( separators, fn |
MutableCharacterSetFormUnionWithCharacterSet( separators, fn CharacterSetPunctuationSet ) |
||
// A character set containing all the whitespace and newline characters including characters in Unicode General Category Z*, U+000A U+000D, and U+0085. |
// A character set containing all the whitespace and newline characters including characters in Unicode General Category Z*, U+000A U+000D, and U+0085. |