Talk:String length: Difference between revisions
Content deleted Content added
Component Pascal is UTF-16 only. |
→Incorrect: new section |
||
Line 14: | Line 14: | ||
The example for character length does not deal with utf-8 and as much as I understand also fails with Non-BMP code points. |
The example for character length does not deal with utf-8 and as much as I understand also fails with Non-BMP code points. |
||
== Incorrect == |
|||
The byte length calculations for unicode appear generally incorrect. They're only valid for codepoints which are in the Basic Multilingual Plane, but not for the Supplemental planes. I.e. 🀁 wouldn't fit within a single wide character; it would be represented in UTF-16 as 0xD38C and 0xDC01 (if I've done the math right). --[[User:Short Circuit|Michael Mol]] 18:24, 15 March 2012 (UTC) |