Python and Web Data Extraction: Introduction

Python and Web Data Extraction: Introduction

Alvin Zuyin Zheng

zheng@temple.edu

Outline

? Overview ? Steps in Web Scraping

? Fetching a Webpage ? Download the webpage ? Extracting information from the webpage ? Storing information in a file

? Tutorial 2: Extracting Textual Data from 10-K

Web scraping typically consist of

Step 1. Fetching a webpage

Step 2. Downloading the

webpage (Optional)

Step 4. Storing information in a file

Step 3. Extracting information from

the webpage

Example: 10-K

URL:

Example: Table with Links

URL:



................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download