AN INTRODUCTION TO SPARK AND TO ITS …

[Pages:30]AN INTRODUCTION TO SPARK AND TO ITS PROGRAMMINGMODEL

2

Introduction toApache Spark

? Fast, expressive cluster computing system compatible with Apache Hadoop

? It is much faster and much easier than Hadoop MapReduce to use due its rich APIs

? Large community

? Goes far beyond batch applications to support a variety of workloads:

? including interactive queries, streaming, machine learning, and graph processing

3

Introduction toApache Spark

? General-purpose cluster in-memory computing system ? Provides high-level APIs in Java, Scala, python

4

5

Uses Cases

6

Uses Cases

7

Real Time Data Architecture for analyzing tweets - Twitter Sentiment Analysis

8

Spark Ecosystem

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download