Spark - IPT

• Operations on a PySpark DataFrame run in parallel across the nodes of a cluster, whereas pandas operations run on a single machine.
• PySpark DataFrame operations are lazy: transformations are only executed once an action is called. In pandas, the result is available as soon as an operation is applied.
• The pandas API supports more operations than the PySpark DataFrame API. Still, the pandas API is more ................
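
The eager versus lazy distinction in the bullets above can be sketched as follows. This is a minimal illustration and is not taken from the original notes: the function names, the column names `x` and `doubled`, the app name, and the `local[2]` master setting are all assumptions chosen for the example. The Spark half needs PySpark and a Java runtime installed.

```python
import pandas as pd


def pandas_eager():
    """pandas evaluates each operation immediately, in a single process."""
    pdf = pd.DataFrame({"x": [1, 2, 3]})
    pdf["doubled"] = pdf["x"] * 2  # the result is materialized right here
    return pdf["doubled"].tolist()


def spark_lazy():
    """PySpark only records transformations until an action is called."""
    # Imported inside the function so the pandas half runs without Spark installed.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    # Hypothetical local session with two worker threads; a real cluster
    # would use a different master URL.
    spark = (
        SparkSession.builder.master("local[2]").appName("lazy-demo").getOrCreate()
    )
    sdf = spark.createDataFrame([(1,), (2,), (3,)], ["x"])

    # Transformation: this only extends the query plan, nothing is computed yet.
    plan = sdf.withColumn("doubled", F.col("x") * 2)

    # Action: collect() triggers the job and brings the rows back to the driver.
    rows = plan.collect()
    spark.stop()
    return sorted(row["doubled"] for row in rows)
```

Calling `pandas_eager()` returns the doubled values straight away. In `spark_lazy()`, no work is done until `collect()` runs, which is the point at which Spark distributes the computation across its workers.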