Latest Courses
ISTQB Artificial Intelligence Tester Sample ExamsCheck course
JAVA Programming Online Practice ExamCheck course
Programming for Kids and Beginners: Learn to Code in PythonCheck course
Practice Exams | Codeigniter 4 developer certificationCheck course
WordPress Practice Tests & Interview Questions (Basic/Adv)Check course
Git &Github Practice Tests & Interview Questions (Basic/Adv)Check course
Machine Learning and Deep Learning for Interviews & ResearchCheck course
Laravel | Build Pizza E-commerce WebsiteCheck course
101 - F5 CERTIFICATION EXAMCheck course
Master Python by Practicing 100 QuestionCheck course
ISTQB Artificial Intelligence Tester Sample ExamsCheck course
JAVA Programming Online Practice ExamCheck course
Programming for Kids and Beginners: Learn to Code in PythonCheck course
Practice Exams | Codeigniter 4 developer certificationCheck course
WordPress Practice Tests & Interview Questions (Basic/Adv)Check course
Distributed Machine Learning with Apache Spark

Distributed Machine Learning with Apache Spark

FREE

Add your review
Add to wishlistAdded to wishlistRemoved from wishlist 0
Add to compare
8.2/10 (Our Score)
Product is rated as #219 in category Machine Learning

Machine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability and optimization. Learning algorithms enable a wide range of applications, from everyday tasks such as product recommendations and spam filtering to bleeding edge applications like self–driving cars and personalized medicine. In the age of ’big data’, with datasets rapidly growing in size and complexity and cloud computing becoming more pervasive, machine learning techniques are fast becoming a core component of large–scale data processing pipelines. This statistics and data analysis course introduces the underlying statistical and algorithmic principles required to develop scalable real–world machine learning pipelines. We present an integrated view of data processing by highlighting the various components of these pipelines, including exploratory data analysis, feature extraction, supervised learning, and model evaluation. You will gain hands–on experience applying these principles using Spark, a cluster computing system well–suited for large–scale machine learning tasks, and its packages spark.ml and spark.mllib. You will implement distributed algorithms for fundamental statistical models (linear regression, logistic regression, principal component analysis) while tackling key problems from domains such as online advertising and cognitive neuroscience.

Instructor Details

Ameet Talwalkar is an assistant professor of Computer Science at UCLA and a technical advisor for Databricks. His research addresses scalability and ease-of-use issues in the field of statistical machine learning, with applications in computational genomics. He led the initial development of the MLlib project in Apache Spark and is a co-author of the graduate-level textbook 'Foundations of Machine Learning' (2012, MIT Press). Prior to UCLA, he was an NSF post-doctoral fellow in the AMPLab at UC Berkeley. He obtained a B.S. from Yale University and a Ph.D. from the Courant Institute at NYU.

Specification: Distributed Machine Learning with Apache Spark

Duration

30 hours

Year

2021

Level

Intermediate

Certificate

Yes

Quizzes

No

1 review for Distributed Machine Learning with Apache Spark

4.0 out of 5
0
1
0
0
0
Write a review
Show all Most Helpful Highest Rating Lowest Rating
  1. Martijn Onderwater

    I really enjoyed taking this course! Initially, I was a bit annoyed with the various registrations that need to be filled out prior to the course (at edX, at Piazza, databricks, and one of the notebooks). But after this, everything was smooth sailing. The teachers explain the concepts well and they speak clearly. During the course, we touched various subjects that I am interested in: machine learning, spark, map reduce, python, and MLib. The lab exercices were at the right level for me (I have a solid background in math and software development) and took me about six hours each. I only got stuck at points where I had not read the instructions properly, and the active forum helped me through that. All in all I can recommend others to take this course!

    /Martijn

    and of course: many thanks to the people at Berkeley for providing the class

    Helpful(0) Unhelpful(0)You have already voted this

    Add a review

    Your email address will not be published. Required fields are marked *

    This site uses Akismet to reduce spam. Learn how your comment data is processed.

    Distributed Machine Learning with Apache Spark
    Distributed Machine Learning with Apache Spark

    Price tracking

    Java Code Geeks
    Logo
    Register New Account
    Compare items
    • Total (0)
    Compare