Summary - Europa

The aggregate function frees us from the constraint of having the return be the same type as the RDD which we are working on. With aggregate, like fold, we supply an initial zero value of the type we want to return. ... Python create pair RDD using the first word as the key. input.map(lambda x: (x.split(" ")[0], x)) ................
................