STATS 507 Data Analysis in Python
Lecture 13: Text Encoding and Regular Expressions
Some slides adapted from C. Budak
Structured data
Today
Encoding: how do bits correspond to symbols?
Interpretation/meaning: e.g., characters grouped into words
Delimited files: words grouped into sentences, documents
Structured content: metadata, tags, etc
Collections: databases, directories, archives (.zip, .gz, .tar, etc)
Lectures 13 and 14
Increasing structure
Storage: bits on some storage medium (e.g., hard-drive)
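A minimal sketch of the lower levels of this hierarchy, using only Python's built-in str and bytes types. The sample string and the variable names are illustrative assumptions, not taken from the lecture. It shows one string moving between symbols, bytes, and bits, and what happens when the bytes are decoded with a different encoding than the one used to write them.

```python
# Illustrative sample text (an assumption for this sketch): "naive cafe"
# written with two non-ASCII characters, i-diaeresis and e-acute.
text = "na\u00efve caf\u00e9"

# Encoding: each symbol is mapped to one or more bytes.
utf8_bytes = text.encode("utf-8")      # non-ASCII characters take 2 bytes each
latin1_bytes = text.encode("latin-1")  # every character takes exactly 1 byte

# Storage: the same UTF-8 bytes viewed as raw bits, 8 per byte.
bits = " ".join(f"{b:08b}" for b in utf8_bytes)

# Decoding with the matching encoding recovers the original symbols.
round_trip = utf8_bytes.decode("utf-8")

# Decoding the same bytes as Latin-1 succeeds but yields different symbols.
mismatched = utf8_bytes.decode("latin-1")

# Interpretation: characters grouped into words (split on whitespace).
words = round_trip.split()

print(len(text), len(utf8_bytes), len(latin1_bytes))
print(bits)
print(mismatched)
print(words)
```

The point of the mismatched decode is that bits carry no record of their encoding: the reader has to know, or guess, which encoding the writer used.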
Text data is ubiquitous
Examples:
Biostatistics (DNA/RNA/protein sequences)
Databases (e.g., census data, product inventory)
Log files (program names, IP addresses, user IDs, etc)
Medical records (case histories, doctors' notes, medication lists)
Social media (Facebook, Twitter, etc)