Taught by a 4-person team including 2 Stanford-educated ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience working with Java and with billions of rows of data.
Get your data to fly using Spark for analytics, machine learning and data science
Let’s parse that.
What’s Spark? If you are an analyst or a data scientist, you’re used to having multiple systems for working with data: SQL, Python, R, Java, etc. With Spark, you have a single engine where you can explore and play with large amounts of data, run machine learning algorithms and then use the same system to productionize your code.
Analytics: Using Spark and Python, you can analyze and explore your data in an interactive environment with fast feedback. The course will show how to leverage the power of RDDs and DataFrames to manipulate data with ease.
Machine Learning and Data Science: Spark’s core functionality and built-in libraries make it easy to implement complex algorithms like Recommendations with very few lines of code. We’ll cover a variety of datasets and algorithms including PageRank, MapReduce and Graph datasets.
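To give a rough feel for one of the algorithms named above, here is a minimal PageRank sketch in plain Python, written as a map step followed by a reduce step. It is not the course's code and does not use Spark; the function name `pagerank`, the damping factor of 0.85, the iteration count and the tiny example graph are all assumptions made for illustration.

```python
from collections import defaultdict


def pagerank(links, damping=0.85, iterations=20):
    """Iterative PageRank over a dict mapping page -> list of outgoing links.

    Hypothetical helper for illustration; parameter defaults are assumptions.
    """
    # Every page that appears as a source or as a link target gets a rank.
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    ranks = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # "Map" step: each page emits (target, share of its rank) pairs.
        contributions = [
            (target, ranks[page] / len(targets))
            for page, targets in links.items()
            if targets
            for target in targets
        ]

        # "Reduce" step: sum the contributions received by each target.
        totals = defaultdict(float)
        for target, share in contributions:
            totals[target] += share

        # Pages with no outgoing links spread their rank evenly over all pages.
        dangling = sum(ranks[page] for page in pages if not links.get(page))

        ranks = {
            page: (1 - damping) / n + damping * (totals[page] + dangling / n)
            for page in pages
        }
    return ranks


if __name__ == "__main__":
    # Made-up four-page graph, purely for demonstration.
    example = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
    for page, rank in sorted(pagerank(example).items()):
        print(page, round(rank, 4))
```

In Spark, the same emit-then-sum pattern would typically be distributed across a cluster instead of running in a single Python loop.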
What’s Covered:
Lots of cool stuff ..
.. and of course all the basic and advanced Spark features:
Instructor Details
Courses: 23
Specification: From 0 to 1 : Spark for Data Science with Python
9 reviews for From 0 to 1 : Spark for Data Science with Python
| Specification | Details |
|---|---|
| Price | $12.99 |
| Provider | |
| Duration | 8.5 hours |
| Year | 2018 |
| Level | All |
| Language | English |
| Certificate | Yes |
| Quizzes | No |
Price: $12.99 (list price $94.99)
Jaydip Parmar –
Very clear and deep understanding of Spark
Tejus S Rao –
Exactly what I needed, the knowledge transfer fits like a glove!
Joy B –
Precise And Effective
Ithapu Pavan Kumar –
Very nice explanation.
Tiago Fernandes –
The classes are great. The examples are detailed and well explained. 5 stars, no doubt.
Sreekanth Payyavula –
The articulation needed to understand a complex topic is good. The tutor goes to great lengths to explain the concepts, and the examples chosen are interesting and relevant.
Subramonia Pillai –
Keep up the good work.
Amit Gupta –
One of the best courses I have come across on Spark and, in general, on big data and Hadoop. Starts with the theory and basics before building up to practical usage.
David Henderson –
It is simply outdated and was abandoned in 2018, teaching RDDs rather than DataFrames and using Python 2.7 rather than a variant of 3. I have no doubt that at the time of release it was a good course, and my rating reflects its relevance to the topic of PySpark today in 2020, not the work of the authors.