Follow

Follow

Tag

data structures

#data-structures

Read more stories on Hashnode

Articles with this tag

PySpark Fundamentals

Oct 17, 20214 min read

Spark = tool for doing parallel computation with large datasets. Spark lets you spread data and computations over clusters with multiple nodes.pyspark...

R: dplyr

May 21, 20161 min read

Package to interact easily with tables install.packages('dplyr') library(dplyr) tab <- tbl_df(originaltable) Five basic functions: select(),...