Scala and the JVM for Big Data: Lessons from Spark

talks dean.wampler@ @deanwampler

© Dean Wampler 2014-2019, All Rights Reserved

Spark

A Distributed Computing Engine on the JVM

[Figure: a cluster of nodes, each node holding a partition of an RDD]

Resilient Distributed Datasets

Productivity?

Very concise, elegant, functional APIs.

• Scala, Java
• Python, R
• ... and SQL!
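
To give a feel for the functional style the slide refers to, here is a minimal word-count sketch in Scala. It is an illustration, not code from the talk: it uses only standard Scala collections so that it runs without a Spark dependency or a cluster, and the object name `WordCountSketch` and the sample `lines` are hypothetical. Spark's RDD API offers similarly named combinators (`flatMap`, `filter`, `map`, and `reduceByKey` for the per-key aggregation), applied to data spread across partitions rather than to a local `Seq`.

```scala
// Word count with plain Scala collection combinators (no Spark dependency).
// `lines` is hypothetical sample input standing in for a distributed dataset.
object WordCountSketch {
  val lines: Seq[String] = Seq(
    "Spark is a distributed computing engine",
    "Spark runs on the JVM"
  )

  val counts: Map[String, Int] =
    lines
      .flatMap(_.toLowerCase.split("\\s+"))  // split every line into lowercase words
      .filter(_.nonEmpty)                    // drop empty tokens
      .groupBy(identity)                     // word -> all of its occurrences
      .map { case (word, occurrences) => (word, occurrences.size) }

  def main(args: Array[String]): Unit =
    counts.toSeq
      .sortBy { case (word, n) => (-n, word) } // most frequent first, then alphabetical
      .foreach { case (word, n) => println(s"$word\t$n") }
}
```

The whole pipeline is one chained expression with no mutable state. On a local collection the grouping happens in memory; an engine like Spark would instead aggregate per key across partitions.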
