Massimiliano Figini
Only My Notes

Only My Notes

Follow
Follow
homeAbout Me
Tag

spark

#spark

More content

Read more stories on Hashnode


Articles with this tag

Intro to Spark SQL in Python

Nov 21, 20211 min read

Spark SQL is a component of Apache Spark that works with tabular data. # Load data from file df = spark.read.csv("trains.csv", header=True) # Create...

Intro to Spark SQL in Python

PySpark Fundamentals

Oct 17, 20214 min read

Spark = tool for doing parallel computation with large datasets. Spark lets you spread data and computations over clusters with multiple nodes.pyspark...

PySpark Fundamentals