Revision as of 18:24, 15 March 2012 (view source) MikeMol (talk \| contribs) (→‎Incorrect: new section) ← Older edit		Revision as of 18:31, 15 March 2012 (view source) Rdm (talk \| contribs) (→‎Incorrect) Newer edit →
Line 18: The byte length calculations for unicode appear generally incorrect. They're only valid for codepoints which are in the Basic Multilingual Plane, but not for the Supplemental planes. I.e. 🀁 wouldn't fit within a single wide character; it would be represented in UTF-16 as 0xD38C and 0xDC01 (if I've done the math right). --[[User:Short Circuit\|Michael Mol]] 18:24, 15 March 2012 (UTC) : If you want to be completely general, there exist [https://en.wikipedia.org/wiki/Unicode_normalization other issues to consider]. Note, in particular, that not all combining forms have codepoints. --[[User:Rdm\|Rdm]] 18:31, 15 March 2012 (UTC)

Talk:String length: Difference between revisions

Talk:String length (view source)