Unicode

unicode

#unicode

Table of Contents

About

1

Chapter 1: Getting started with unicode

2

Remarks

2

Versions

2

Examples

3

Installation or Setup

3

Chapter 2: Characters can consist of multiple code points

4

Remarks

4

Examples

4

Diacritics

4

combined forms

4

Zalgo Text

4

Emoji and flags

5

Chapter 3: English text is not ASCII only

6

Remarks

6

Examples

6

Diacritics

6

Emoji

6

Punctuation

6

Special symbols

7

Chapter 4: UTF-8 as an encoding way of Unicode

8

Remarks

8

Examples

8

How to convert a byte array of UTF-8 data to a Unicode string in Python

9

How to change the default encoding of the server to UTF-8

9

Save an Excel file in UTF-8

9

Credits

11

About

You can share this PDF with anyone you feel could benefit from it, downloaded the latest version from: unicode

It is an unofficial and free unicode ebook created for educational purposes. All the content is extracted from Stack Overflow Documentation, which is written by many hardworking individuals at Stack Overflow. It is neither affiliated with Stack Overflow nor official unicode.

The content is released under Creative Commons BY-SA, and the list of contributors to each chapter are provided in the credits section at the end of this book. Images may be copyright of their respective owners unless otherwise specified. All trademarks and registered trademarks are the property of their respective company owners.

Use the content presented in this book at your own risk; it is not guaranteed to be correct nor accurate, please send your feedback and corrections to info@



1

Chapter 1: Getting started with unicode

Remarks

The Unicode Standard is an international standardized character set. It attempts to assign characters and symbols from every writing system a unique number. With every major new version, additional characters are added to the Standard to achieve this goal. In providing a unified character set for all writing systems, text information can be exchanged in a Unicode format independent of any given platform.

The Unicode Standard also contains property data on the characters, and defines algorithms on how to properly manipulate characters. For example, these algorithms provide the correct method to search and display Unicode text.

Versions

Version Release Date 2.0.0 1996-07-01 3.0.0 1999-09-01 3.1.0 2001-03-01 3.2.0 2002-03-01 4.0.0 2003-04-01 4.0.1 2004-03-01 4.1.0 2005-03-31 5.0.0 2006-07-14 5.1.0 2008-04-04 5.2.0 2009-10-01 6.0.0 2010-10-11 6.1.0 2012-01-31 6.2.0 2012-09-26 6.3.0 2013-09-30 7.0.0 2014-06-16



2

Version Release Date 8.0.0 2015-06-17 9.0.0 2016-06-21

Examples

Installation or Setup

Detailed instructions on getting unicode set up or installed. Read Getting started with unicode online:



3

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download