Python and Web Data Extraction: Introduction


Alvin Zuyin Zheng

zheng@temple.edu



Outline

• Overview

• Steps in Web Scraping

  – Fetching a webpage

  – Downloading the webpage

  – Extracting information from the webpage

  – Storing information in a file

• Tutorial 2: Extracting Textual Data from 10-K

Web scraping typically consists of four steps:

Step 1. Fetching a webpage

Step 2. Downloading the webpage (optional)

Step 3. Extracting information from the webpage

Step 4. Storing information in a file
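
The four steps above can be sketched with the Python standard library alone. This is a minimal illustration, not the tutorial's own code: the function names (fetch, download, extract_text, store), the User-Agent string, the sample page, and the output file name are all assumptions made for the example. The fetch function is defined but not called, so the sketch runs offline on a small made-up page.

```python
import csv
import tempfile
from html.parser import HTMLParser
from pathlib import Path
from urllib.request import Request, urlopen


def fetch(url, timeout=10):
    """Step 1: request the page and return its HTML as text."""
    # Some servers reject requests without a User-Agent; this value is a placeholder.
    request = Request(url, headers={"User-Agent": "example-scraper/0.1"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def download(html, path):
    """Step 2 (optional): keep a local copy of the page."""
    Path(path).write_text(html, encoding="utf-8")


class TextExtractor(HTMLParser):
    """Step 3: collect visible text, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self._skip_depth:
            self.chunks.append(text)


def extract_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


def store(chunks, path):
    """Step 4: write one extracted text chunk per CSV row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["index", "text"])
        writer.writerows(enumerate(chunks))


# Stand-in for fetch(url): a small made-up page, so the sketch runs offline.
sample_html = (
    "<html><head><title>Demo</title><style>p {color: red}</style></head>"
    "<body><h1>Annual Report</h1><p>Item 1. Business</p></body></html>"
)
chunks = extract_text(sample_html)

# Hypothetical output location inside the system temp directory.
output_path = Path(tempfile.gettempdir()) / "extracted.csv"
store(chunks, output_path)
```

With a real page, the first line of the pipeline would be something like html = fetch(url), optionally followed by download(html, "page.html"), before the same extract and store calls. Third-party packages such as requests and BeautifulSoup are common alternatives for Steps 1 and 3.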

Example: 10-K

URL:



Example: Table with Links

URL:
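
Because the page address is not given here, the following sketch uses a made-up HTML table to show how links inside a table might be extracted. The class name TableLinkParser, the sample markup, and the base URL https://example.com/filings/ are all hypothetical. It assumes only the standard library, and resolves relative links against the base URL with urljoin.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class TableLinkParser(HTMLParser):
    """Collect (link text, absolute URL) pairs for <a> tags inside a <table>."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._table_depth = 0   # > 0 while inside a <table>
        self._href = None       # URL of the link currently being read
        self._text = []         # text pieces of the current link

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._table_depth += 1
        elif tag == "a" and self._table_depth:
            href = dict(attrs).get("href")
            if href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None
        elif tag == "table" and self._table_depth:
            self._table_depth -= 1


# Made-up markup: one link outside the table, two links inside it.
sample_html = """
<p><a href="/home">Home</a></p>
<table>
  <tr><th>Filing</th><th>Document</th></tr>
  <tr><td>10-K</td><td><a href="doc1.htm">Annual report</a></td></tr>
  <tr><td>10-Q</td><td><a href="/archive/doc2.htm">Quarterly report</a></td></tr>
</table>
"""

parser = TableLinkParser("https://example.com/filings/")
parser.feed(sample_html)
parser.close()
links = parser.links

for text, url in links:
    print(text, url)
```

The link outside the table is ignored, and each table link is returned with its text so that the pairs can be written to a file in Step 4.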




