This course is for data scientists (or aspiring data scientists) who want to get PRACTICAL training in PySpark (Python for Apache Spark) using REAL WORLD datasets and APPLICABLE coding knowledge that you ll use everyday as a data scientist! By enrolling in this course, you ll gain access to over 100 lectures, hundreds of example problems and quizzes and over 100,000 lines of code!
I m going to provide the essentials for what you need to know to be an expert in Pyspark by the end of this course, that I ve designed based on my EXTENSIVE experience consulting as a data scientist for clients like the IRS, the US Department of Labor and United States Veterans Affairs.
I ve structured the lectures and coding exercises for real world application, so you can understand how PySpark is actually used on the job. We are also going to dive into my custom functions that I wrote MYSELF to get you up and running in the MLlib API fast and make getting started building machine learning models a breeze! We will also touch on MLflow which will help us manage and track our model training and evaluation process in a custom user interface that will make you even more competitive on the job market!
Courses : 1
Specification: PySpark Essentials for Data Scientists (Big Data + Python)
13 reviews for PySpark Essentials for Data Scientists (Big Data + Python)