Unicode and UTF-8
Unicode and UTF-8
Computer Science and Engineering College of Engineering The Ohio State University
Lecture 33
A standard for the discrete representation of written text
The Big Picture
Computer Science and Engineering The Ohio State University
glyphs
m
'
characters
code points
code units
Cyrillic ef
Euro sign
Latin M
Apostrophe
Tei chou ten
U+0444
U+20AC
U+006D
U+2019
U+5975
code unit
6D
D1 84
E2 82 AC
E2 80 99
E5 A5 BD
The Big Picture
Computer Science and Engineering The Ohio State University
glyphs
m
'
characters
code points
code units
Cyrillic ef
Euro sign
Latin M
Apostrophe
Tei chou ten
U+0444
U+20AC
U+006D
U+2019
U+5975
6D
D1 84
E2 82 AC
E2 80 99
E5 A5 BD
Computer Science and Engineering The Ohio State University
Text: A Sequence of Glyphs
Computer Science and Engineering The Ohio State University
Glyph: "An individual mark on a written medium that contributes to the meaning of what is written."
See foyer floor in main library
One character can have many glyphs
Example: Latin E can be e, e, e, e, e, e, e...
One glyph can be different characters
A is both (capital) Latin A and Greek Alpha
One unit of text can consist of multiple glyphs
An accented letter (?) is two glyphs The ligature of f+i (fi) is two glyphs
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- unicode characters and utf 8 city university of new york
- the impact of change from wlatin1 to utf 8 encoding in sas pharmasug
- the unicode standard version 13 0 core specification
- utf what a guide for handling sas transcoding errors with utf 8
- unicode and utf 8
- character sets and unicode in firebird
- what are double byte single byte and multi byte encodings
- ascii encoding stanford university
- if you have to process difficult characters utf 8 encoding and sas
- computer hardware and data representation ascii jon garvin