CS102 Spring2020 - Stanford University
Unstructured Data
CS102 Spring 2020
Unstructured Data
CS102
Data Tools and Techniques
? Basic Data Manipulation and Analysis
Performing well-defined computations or asking well-defined questions ("queries")
? Data Mining
Over
Looking for patterns in data "unstructured" data
? Machine Learning
(text, images, video)
Using data to make inferences or predictions
? Data Visualization
Graphical depiction of data
? Data Collection and Preparation
Unstructured Data
CS102
Analyzing Text
? Much of the world's data is in the form of free-text
? Different sizes of text elements
? Small ? tweets ? Medium ? emails, product reviews ? Large ? documents ? Very large ? books
? Different sizes of text corpuses
? Ultimate text corpus: the web
Unstructured Data
CS102
Types of Text Processing
1. Search "Text analytics", "Text mining"
? For specific words, phrases, patterns ? Find common patterns, phrase completions
2. Understand "Natural language processing/understanding"
? Parts of speech, parse trees ? Entity recognition ? people, places, organizations ? Disambiguation - "ford", "jaguar" ? Sentiment analysis - happy, sad, angry, ...
Unstructured Data
CS102
Applications of Text Analytics / NLP
? Search engines ? Spam classification ? News feed management ? Document summarization ? Language translation ? Speech-to-text ? Many, many more ...
Unstructured Data
CS102
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- understanding benefits and limitations of unstructured data collection
- from unstructured data to intelligence ipsos
- cs102 spring2020 stanford university
- unstructured data management with oracle database 12c
- text analytics unlocking the value of unstructured data
- unstructured data—growth and challenges
- unstructured data an overview of the data of big data researchgate
- separating historical data look to the future deloitte
- bringing more discipline to the management of unstructured data
- idc futurescape worldwide data integration and analytics oracle
Related searches
- stanford university philosophy department
- stanford university plato
- stanford university encyclopedia of philosophy
- stanford university philosophy encyclopedia
- stanford university philosophy
- stanford university ein number
- stanford university master computer science
- stanford university graduate programs
- stanford university computer science ms
- stanford university phd programs
- stanford university phd in education
- stanford university online doctoral programs