Introduction

The Dataframe objects in Pandas allow easy manipulation of data in tabular format, which is how the data from the previous project (Doan 2019) was organized. The Dataframes are used once to filter information from the Parquet files into CSV files and then used again to read the CSV files to perform natural language processing on the data. ................
................