Topics on Spark, PySpark and Databricks
Table of contents
- Spark
- Spark Architecture
- Narrow vs Wide Transformation
- Spark DB-Tables-Metastore-Catalogs
- persist and cache
- Broadcast Variables
- Data Skew in Spark
- dropna and fillna
- Removing Duplicates - PySpark
- Partition And Bucket
- RDD-DataFrame-Dataset
- Scala Cheatsheet
- Spark Interview Questions
- Shuffle in Spark
- Install-PySpark-Windows
- PySpark Concepts I
- Q&A
- Spark-Hive-Delta Connection