Using database engines and unicode
[Pages:15]Databases
Unicode
References
Using database engines and unicode
Marcin Szewczyk PhD student
msz@imm.dtu.dk
DTU Informatics Technical University of Denmark
21 IX 2009 / web mining
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
university-logo Technical University of Denmark
Databases
Outline
Unicode
1 Databases Introduction Sample usage
2 Unicode Background Unicode in Python Example
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
References
university-logo Technical University of Denmark
Databases
Unicode
Introduction
Python database interface
References
Python database interface: DB-API 2.0 Many implementations: sqllite MySQL IBM DB2 ...
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
university-logo Technical University of Denmark
Databases
Introduction
Logical steps
Unicode
Independent from actual implementation: connect to the database acquire cursor execute SQL statement fetch the results commit changes close connection
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
References
university-logo Technical University of Denmark
Databases
Unicode
References
Sample usage
Sample usage
1 import s q l l i t e 3
2
3 t e s t c o n n e c t i o n = s q l l i t e 3 . connect ( '/tmp/db' ) #
or fx. ':memory' 4 test cursor= test connection . cursor ()
5
6 t e s t c u r s o r . execute ( 'CREATE TABLE people (id INTEGER,name TEXT)' )
7
8 t e s t c u r s o r . execute ( 'INSERT INTO people(id, name) VALUES(?,?)' , ( 7 8 , 'Marcin' ) )
9 test connection . commit ( )
university-logo
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
Technical University of Denmark
Databases
Unicode
References
Sample usage
Example continued
When changing the database remember:
1 i d = 78 #our query data 2 # in MySQLdb use %s instead of ? 3 t e s t c u r s o r . execute ( 'SELECT name FROM people
WHERE id=?' , ( i d , ) ) 4 # OR in sqllite3 5 t e s t c u r s o r . execute ( 'SELECT name FROM people
WHERE id=:id' , { 'id' : i d } )
6
7 p r i n t t e s t c u r s o r . f e t c h o n e ( ) #or .fetchall() or .fetchmany(n)
8
9 test connection . commit ( ) Ma1rc0in S#zOewRczyk:imfsz@simom.mdtue.dtk hing went wrong:
SQL + Unicode
university-logo Technical University of Denmark
Databases
Background
Codepage chaos
Unicode
References
Pre-Unicode:
ASCII - 7bit: 127 symbols based on latin alphabet Many regional 8-bit code pages Central Europe: Latin2, cp852, windows1250, iso-8859-2
Unicode: >1000000 symbols - about 220 16 planes each able to contain about 216 symbols most alphabets belong to plane 0 - BMP symbols referenced as U+xxxx U+1xxxx .... 'A' is U+0041
university-logo
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
Technical University of Denmark
Databases
Background
Unicode encodings
Unicode
References
How to encode unicode UTF32(BE,LE), BOM UTF16(BE,LE), BOM UTF8
Byte Order Mark: U+FEFF to distinguish Big-Endian, Little-Endiax
Marcin Szewczyk: msz@imm.dtu.dk SQL + Unicode
university-logo Technical University of Denmark
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- the python c api
- using database engines and unicode
- anna henningsen addaleax she her js character encodings
- understanding json schema
- appendix text processing with unicode
- rauth documentation
- python and unicode
- interfacing c c and python with swig
- overcoming frustration correctly using unicode in python
- character sets and unicode in firebird
Related searches
- stock search engines and charts
- recipes using boxed macaroni and cheese
- jasper engines remanufactured engines prices
- used chevy engines and transmissions
- ford crate engines and transmissions packages
- ford engines and transmissions packages
- derivatives using function notation and a table
- bundt cake using cake mix and pudding
- using baking soda and honey
- advantages of using database system
- probability calculator using standard deviation and mean
- using modifier 25 and 57