Unicode in C++

James McNellis (@JamesMcNellis)
Senior Software Development Engineer, Microsoft Visual C++

Before there was Unicode...

Single-Byte Encodings

ASCII

!"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~

ASCII

H   e   l   l   o   !   \0
48  65  6C  6C  6F  21  00
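The byte values above can be reproduced with a few lines of C++. This is a minimal sketch, not part of the original slides: the helper name `to_hex_bytes` is hypothetical, and it assumes the compiler's execution character set is ASCII-compatible (true of mainstream desktop toolchains, but not guaranteed by the language).

```cpp
#include <cstdio>
#include <string>

// Hypothetical helper: formats every byte of a NUL-terminated string,
// including the terminating NUL, as space-separated uppercase hex pairs.
// Assumes an ASCII-compatible execution character set.
std::string to_hex_bytes(const char* text)
{
    std::string result;
    for (const char* p = text; ; ++p)
    {
        // Convert through unsigned char so bytes >= 0x80 are not sign-extended.
        const unsigned value = static_cast<unsigned char>(*p);

        char buffer[4];
        std::snprintf(buffer, sizeof buffer, "%02X", value);

        if (!result.empty())
            result += ' ';
        result += buffer;

        if (*p == '\0')
            break; // the terminator has been emitted; stop
    }
    return result;
}
```

Calling `to_hex_bytes("Hello!")` yields the string `48 65 6C 6C 6F 21 00`, matching the row of byte values on the slide.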

ASCII

A 7-bit character encoding
32 control characters (0x00 to 0x1F), plus DEL (0x7F)
95 printable characters (0x20 to 0x7E)
Near-ubiquitous, but often with substitutions or extensions
Great for English; not so great for most other languages
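Because ASCII uses only 7 bits, classifying a byte comes down to a few range checks. The sketch below is illustrative only; the function names are hypothetical and not taken from the talk. It deliberately avoids `std::isprint` and friends, since those depend on the current C locale.

```cpp
#include <string>

// Hypothetical helpers; each works on a raw byte value.

// A byte is ASCII if its high bit is clear (value fits in 7 bits).
constexpr bool is_ascii(unsigned char c)
{
    return c <= 0x7F;
}

// Control characters: 0x00 to 0x1F, plus DEL at 0x7F.
constexpr bool is_ascii_control(unsigned char c)
{
    return c <= 0x1F || c == 0x7F;
}

// Printable characters: space (0x20) through tilde (0x7E), 95 in total.
constexpr bool is_ascii_printable(unsigned char c)
{
    return c >= 0x20 && c <= 0x7E;
}

// True if every byte of the string lies within the 7-bit ASCII range.
bool is_all_ascii(const std::string& s)
{
    for (unsigned char c : s)
    {
        if (!is_ascii(c))
            return false;
    }
    return true;
}
```

For example, `is_all_ascii("Hello!")` is true, while a string containing the byte 0xE9 fails the check, because that byte falls outside the 7-bit range.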

Extended ASCII

IBM code pages, e.g. CP437 ("Latin US"), CP737 ("Greek")
Mac OS Roman
DEC Multinational Character Set
ISO/IEC 8859
...and many more
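The practical consequence of having many extended-ASCII encodings is that a byte above 0x7F has no meaning until you know which code page it came from. The sketch below illustrates this with a deliberately tiny, partial table; the `CodePage` enum and `decode_byte` function are hypothetical names, and only two CP437 entries are filled in. It also treats the range 0x00 to 0x7F as plain ASCII in both encodings, which is a simplification.

```cpp
// Hypothetical, partial decoder: maps one byte to a Unicode code point
// under a given single-byte encoding. Not a complete code page table.
enum class CodePage { Iso8859_1, Cp437 };

char32_t decode_byte(CodePage page, unsigned char byte)
{
    // Simplifying assumption: both encodings agree with ASCII below 0x80.
    if (byte <= 0x7F)
        return byte;

    switch (page)
    {
    case CodePage::Iso8859_1:
        // ISO/IEC 8859-1 byte values coincide with Unicode code points.
        return byte;

    case CodePage::Cp437:
        switch (byte)
        {
        case 0x82: return 0x00E9; // LATIN SMALL LETTER E WITH ACUTE
        case 0xE9: return 0x0398; // GREEK CAPITAL LETTER THETA
        default:   return 0xFFFD; // not covered by this sketch
        }
    }
    return 0xFFFD; // REPLACEMENT CHARACTER for anything unhandled
}
```

The same byte, 0xE9, decodes to U+00E9 (é) under ISO/IEC 8859-1 but to U+0398 (Θ) under CP437, while CP437 stores é at 0x82 instead. Text written under one code page and read under another is silently misinterpreted.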
