Introduction to Big Data .edu

Spark Developed at UC Berkely, Spark is considered the next generation of distributed programming. It is useful for performing ad-hoc analysis of HDFS data, and includes support for a variety of libraries such as data frames (in memory tables), data streaming, machine learning, and graphs, making this platform well suited for a variety of tasks. ................
................