Python and Unicode

嚜燕ython and Unicode

Unicode Support in Python

EuroPython Conference 2002

Charleroi, Belgium

Marc-Andr谷 Lemburg

Software GmbH

Germany

(c) 2001 Software, Skills and Services GmbH, info@

European Python Meeting at Bordeaux 2001

Python & Unicode

Python & Unicode: Overview

1. Introduction to Unicode

2. Python&s Path to Unicode

3. Using Unicode in Python

4. The Future

(c) 2002 Software, Skills and Services GmbH, info@

EuroPython Confrence 2002

Python & Unicode

Python & Unicode: Part 1

1. Introduction to Unicode

2. Python&s Path to Unicode

3. Using Unicode in Python

4. The Future

(c) 2002 Software, Skills and Services GmbH, info@

EuroPython Confrence 2002

Python & Unicode

Introduction to Unicode: The Problem

? Storing scripts: human readable text data

每 Localization (l10n) and Internationalization (i18n) of software and

GUIs

每 Basis for national language and script support

每 Common ground for textual information

exchange

(c) 2002 Software, Skills and Services GmbH, info@

EuroPython Confrence 2002

Python & Unicode

Introduction to Unicode: First Approximations

? Mappings of bytes to characters: Code Pages (CP)

每 Problem: Attaching the encoding information to the data

No support in the OS for maintaining per data buffer encoding information

Each application/protocol has to implement its own way of dealing with encodings

每 Problem: Scripts with many characters

e.g. Asian scripts use shift information to address all characters using 8 bits

每 Problem: Not available for ancient scripts

e.g. Old Italic

每 Problem: Incompatible mappings for the same script

e.g. Latin-1 and Windows CP-152x

(c) 2002 Software, Skills and Services GmbH, info@

EuroPython Confrence 2002

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download