Prototyping Data Intensive Apps: TrendingTopics
[Pages:34]Prototyping Data Intensive Apps:
Pete Skomoroch Research Scientist at LinkedIn Consultant at Data Wrangling
@peteskomoroch
09/29/09
1
Talk Outline
? TrendingTopics Overview ? Wikipedia Page View Dataset ? Hadoop on Amazon EC2 ? Loading Data on EC2: Amazon EBS & S3 ? Daily Timelines with Hadoop Streaming ? Hive Data Warehouse Layer ? Trend Computation with Hive ? Hooking It All Together ? Front End & Visualizations
Data Intensive Web Apps
? Batch data mining or prediction with Hadoop ? Iterate quickly with high level languages & tools
? Pig, Hive, Clojure, Cascading, Python, Ruby
? EC2: Get running with limited initial capital ? Use external data and APIs in novel ways ? Recent real world example: FlightCaster
3
4
Daily Pageview Timeline Charts
5
Detects Rising Trends with Hadoop
6
TrendingTopics is Open Source
? Built as a side project at Data Wrangling ? Core code completed over 2 weeks in June ? Code on Github ? Data released on Amazon Public Datasets
7
Technology Stack
8
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- top 8 ecommerce marketplace trends in 2018
- professional weather center images
- the kuoni worldwide trends report 2018 amazon web services
- prototyping data intensive apps trendingtopics
- save this manual for future reference
- the state of ecommerce order fulfillment shipping
- prepare for the future of shopping amazon s3
- 6 new trends impacting festival and consumer events
Related searches
- data analysis quantitative data importance
- example of data analysis what is data analysis in research
- data scientist vs data analyst
- data science vs data analysis
- cardiac intensive care unit nurse
- intensive outpatient program los angeles
- harvard intensive review internal medicine
- key data elements data quality
- key data elements data governance
- data analytics vs data science
- structured data vs unstructured data examples
- data collection and data analysis