beta BLOG dot NET

/* bugs, features, drafts, and solutions. */

// recently in “.NET” category

Sebastian blogged on 2009-11-17T20:23:58+00:00

understanding unicode surrogates / or: how to deal with Linear B strings in .NET


Remember that a String object in .NET is a collection of Char objects, where a Char object in turn is announced as a Unicode character, encoded by a 16-bit unsigned integer. Thus, more precisely speaking, a single Char object is able to encode any code point within the Basic Multilingual Plane (BMP), i.e. between U+0000 and U+FFFF. So, where does the rest of the story go? Unicode, as a universal character set, is designed to support many more than 65,536 characters, of course.  → read more …
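
As a minimal sketch of what that means in practice (assuming U+10000, the first Linear B code point, as the sample character; the class name `SurrogateDemo` is made up for illustration), a code point outside the BMP occupies two Char objects in a String, namely a high and a low surrogate:

```csharp
using System;

class SurrogateDemo
{
    static void Main()
    {
        // U+10000 (LINEAR B SYLLABLE B008 A) lies outside the BMP,
        // so it cannot be stored in a single 16-bit Char.
        string s = char.ConvertFromUtf32(0x10000);

        // The string holds two Char objects (UTF-16 code units).
        Console.WriteLine(s.Length);                                  // 2

        // The first is a high surrogate, the second a low surrogate.
        Console.WriteLine(char.IsHighSurrogate(s[0]));                // True
        Console.WriteLine(char.IsLowSurrogate(s[1]));                 // True
        Console.WriteLine("{0:X4} {1:X4}", (int)s[0], (int)s[1]);     // D800 DC00

        // Combining the pair again yields the original code point.
        Console.WriteLine("U+{0:X}", char.ConvertToUtf32(s, 0));      // U+10000
    }
}
```

Note that `Length` counts Char objects, not characters, so any code that indexes into such a string char by char has to check for surrogates before treating a position as a whole character.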

