Dask - FOSDEM
Dask
extending Python data tools for parallel and distributed computing
Joris Van den Bossche - FOSDEM 2017
1 / 29
Python's scientific/data tools ecosystem
## Thanks to Jake VanderPlas for the figure
2 / 29
3 / 29
3 / 29
Provides high-performance, easy-to-use data structures and tools Widely used for doing practical data analysis in Python Suited for tabular data (e.g. column data, spread-sheets, databases)
import pandas as pd df = pd.read_csv("myfile.csv") subset = df[df['value'] > 0] subset.groupby('key').mean()
4 / 29
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- scale independent data analysis with database backed
- gpus for data science rapids
- lecture 4 dask github pages
- 126 proc of the 14th python in science conf scipy
- scaling rapids with dask nvidia
- scalable machine learning with dask
- dask processing and analytics for large datasets
- magpie python at speed and scale using cloud backends
- dask fosdem
- gpu accelerated data analytics in python