- Introduction
- How is this course structured
- Introduction to our development environment
- Introduction to our dataset & dataframes
- Latest Config Code
- Environment configuration code (latest code in downloadable file)
- Ingesting & Cleaning Data
- Answering our scenario questions
- Bringing data into dataframes
- Inspecting A Dataframe
- Handling Null & Duplicate Values
- Selecting & Filtering Data
- Applying Multiple Filters
- Running SQL on Dataframes
- Adding Calculated Columns
- Group By And Aggregation
- Writing Dataframe To Files
- Challenge Overview
- Challenge Solution
- Thanks for joining me to learn PySpark!