1 Introduction to Apache Spark - Brigham Young University

Natalia Myronova
Approaches to create DataFrame
• You can create a DataFrame using the toDF() and createDataFrame() methods. The two have different signatures and can build a DataFrame from an existing RDD, list, or DataFrame.
• You can also create a DataFrame from data sources such as TXT, CSV, JSON, ORC, Avro, Parquet, and XML by reading from HDFS, S3, DBFS, Azure …
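The approaches above can be sketched in PySpark roughly as follows. This is a minimal illustration, not code from the slides: the sample rows, the column names, the people.csv file, and the helper names (write_sample_csv, build_dataframes, main) are all hypothetical, and it assumes a local Spark installation with the pyspark package available.

```python
import csv
import os
import tempfile

# Hypothetical sample data, used only for illustration.
ROWS = [("Alice", 34), ("Bob", 45)]
COLUMNS = ["name", "age"]


def write_sample_csv(path):
    """Write ROWS to a CSV file with a header so Spark has something to read."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(COLUMNS)
        writer.writerows(ROWS)


def build_dataframes(spark, csv_path):
    """Build DataFrames from a list, an RDD, another DataFrame, and a CSV file."""
    # 1. From a local list, via SparkSession.createDataFrame().
    from_list = spark.createDataFrame(ROWS, schema=COLUMNS)

    # 2. From an existing RDD, via RDD.toDF().
    rdd = spark.sparkContext.parallelize(ROWS)
    from_rdd = rdd.toDF(COLUMNS)

    # 3. From an existing DataFrame, via DataFrame.toDF(), which renames columns.
    renamed = from_list.toDF("person", "years")

    # 4. From a data source, here a CSV file on the local filesystem.
    from_csv = spark.read.csv(csv_path, header=True, inferSchema=True)

    return from_list, from_rdd, renamed, from_csv


def main():
    # Imported here so the helpers above can be loaded without PySpark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder.master("local[1]")
        .appName("create-df-sketch")
        .getOrCreate()
    )
    try:
        with tempfile.TemporaryDirectory() as tmp:
            csv_path = os.path.join(tmp, "people.csv")
            write_sample_csv(csv_path)
            for df in build_dataframes(spark, csv_path):
                df.show()
    finally:
        spark.stop()
```

Calling main() starts a local session and prints each DataFrame. Reading from HDFS, S3, or another remote store would use the same spark.read calls with a different path, provided the cluster has the matching connector and credentials configured.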