Latest Courses
Java A-Z : Learn java within 8 hoursCheck course
Learn React JS and Web API by creating a Full Stack Web AppCheck course
Angular 2 Master Class with Alejandro RangelCheck course
The Comple JavaScript From Beginner To AdvancedCheck course
The Complete Android Oreo and Nougat App TutorialsCheck course
C Programming Skills Test With ExplanationCheck course
Modern .NET Ecosystem and .NET CoreCheck course
Python Programming Bible: Hands-On Python 3 with 10 ProjectsCheck course
Introduction to Cloud Computing on Amazon AWS for BeginnersCheck course
Learn basics of Redux in React Native in 2 hours!Check course
Java A-Z : Learn java within 8 hoursCheck course
Learn React JS and Web API by creating a Full Stack Web AppCheck course
Angular 2 Master Class with Alejandro RangelCheck course
The Comple JavaScript From Beginner To AdvancedCheck course
The Complete Android Oreo and Nougat App TutorialsCheck course
Scalable Machine Learning on Big Data using Apache Spark

Scalable Machine Learning on Big Data using Apache Spark

FREE

Add your review
Add to wishlistAdded to wishlistRemoved from wishlist 0
Add to compare
8.4/10 (Our Score)
Product is rated as #27 in category Big Data

This course will empower you with the skills to scale data science and machine learning (ML) tasks on Big Data sets using Apache Spark. Most real world machine learning work involves very large data sets that go beyond the CPU, memory and storage limitations of a single computer. Apache Spark is an open source framework that leverages cluster computing and distributed storage to process extremely large data sets in an efficient and cost effective manner. Therefore an applied knowledge of working with Apache Spark is a great asset and potential differentiator for a Machine Learning engineer. After completing this course, you will be able to: – gain a practical understanding of Apache Spark, and apply it to solve machine learning problems involving both small and big data – understand how parallel code is written, capable of running on thousands of CPUs. – make use of large scale compute clusters to apply machine learning algorithms on Petabytes of data using Apache SparkML Pipelines. – eliminate out–of–memory errors generated by traditional machine learning frameworks when data doesn’t fit in a computer’s main memory – test thousands of different ML models in parallel to find the best performing one – a technique used by many …

Instructor Details

Romeo Kienzler holds a M. Sc. (ETH) in Information Systems, Bioinformatics & Applied Statistics (Swiss Federal Institute of Technology). He has nearly two decades of experience in Software Enineering, Database Administration and Information Integration. Since 2012 he works as a Data Scientist for IBM. He published several works in the field with international publishers and on conferences. His current research focus is on massive parallel data processing architectures. Romeo also contributes to various open source projects.

Specification: Scalable Machine Learning on Big Data using Apache Spark

Duration

9 hours

Year

2019

Level

Intermediate

Certificate

Yes

Quizzes

Yes

73 reviews for Scalable Machine Learning on Big Data using Apache Spark

3.3 out of 5
19
16
16
8
14
Write a review
Show all Most Helpful Highest Rating Lowest Rating
  1. Avatar

    Philippe D

    The course is interesting but I think a few things could be improved. For instance, some code examples from the videos are outdated because of a newer spark version. The video was edited to mention that the github repo was updated but I was unable to find the updated code. One (maybe more?) of the videos was done in a car; It makes the whole thing feel unprofessional even though the teacher’s skills far exceed the requirements for teaching this course. As others have mentioned, the teacher’s accent can be a bit difficult to understand at times but to me, this does not affect the quality of the course. The teacher always seems interested and is smiling most of the time which might seem unimportant but it still sets a positive mood for the lectures which is great. All in all, the course is interesting and it provides a good introduction to Machine Learning using Apache Spark.

    Helpful(3) Unhelpful(0)You have already voted this
  2. Avatar

    Yasser E H

    Really interesting content Unclear coding explanations Limitations with the free access in IBM Watson Studio

    Helpful(3) Unhelpful(0)You have already voted this
  3. Avatar

    Suresh C

    There should be more details about Apache spark and some examples

    Helpful(3) Unhelpful(0)You have already voted this
  4. Avatar

    Farrukh N A

    Course can be improved by focusing more on ML algorithms…. Explanation of GBT and Random Forest was not provided. But they were used.

    Helpful(1) Unhelpful(0)You have already voted this
  5. Avatar

    Ruslan I M V

    Apache spark is great and powerful but the lectures are not clear and long.

    Helpful(6) Unhelpful(0)You have already voted this
  6. Avatar

    Ujjwal G

    For a intorductory course it is very good. Do not expect anything too advanced.

    Helpful(2) Unhelpful(0)You have already voted this
  7. Avatar

    Lewis m

    So far the questions and quizes seem unrelated to machine learning. The videos are poorly set out, with breif explanations and the whole thing seems rushed.

    Helpful(7) Unhelpful(0)You have already voted this
  8. Avatar

    Gherbi H

    A very good course and will recommend it for anyone who has Apache Spark experience and wants to get an introduction to ML lib and machine learning in Apache Spark, the assignment submissions need some work but other than that a very good introductive course.

    Helpful(1) Unhelpful(0)You have already voted this
  9. Avatar

    Nicolas F

    Trully disappointing!! Waste of time and money. Really poor video material and exercises. I definetelly dont recommend this course!!

    Helpful(3) Unhelpful(0)You have already voted this
  10. Avatar

    Mohamad S

    Video is outdated. The course material is not structured properly. Lack of explanation on the code. The code reference is confusing.

    Helpful(3) Unhelpful(0)You have already voted this
  11. Avatar

    Jered W

    Good instructor. We should probably have more spark coding assignments.

    Helpful(0) Unhelpful(0)You have already voted this
  12. Avatar

    charles l

    Really really REALLY enjoyed this course! The instructor does a masterful job of going from simple examples and building up complexity in a very logical and thorough way.

    Helpful(0) Unhelpful(0)You have already voted this
  13. Avatar

    Justin M

    The Videos/tutorials need to be updated for the new pyspark. Some are still using python 2 which reaches end of life in a month!

    Helpful(1) Unhelpful(0)You have already voted this
  14. Avatar

    Mainak B

    Thanks a lot for helping me. I would suggest that the data storage in IBM cloud should be described in detail.

    Helpful(0) Unhelpful(0)You have already voted this
  15. Avatar

    Lucas I S

    Like the format of this course, which seems more laid back. Having said that, some of the assignments had some confusing portion, but need to acknowledge this is an intermediate course and not a beginner one. I also missed the part of the explanation that Apache Spark has its own tools vs. using Python’s SciKit

    Helpful(0) Unhelpful(0)You have already voted this
  16. Avatar

    Gao S

    Instructor accent is strong

    Helpful(0) Unhelpful(0)You have already voted this
  17. Avatar

    Ahmet

    There should be more practice notebooks and questions

    Helpful(0) Unhelpful(0)You have already voted this
  18. Avatar

    Narendra b O

    .

    Helpful(0) Unhelpful(0)You have already voted this
  19. Avatar

    Jesus M G G

    Some videos seem outdated, and one of them doesn’t have all subtitles. The exercises sometimes uses some models or functions not covered in the videos I had some issues connecting to the Spark Kernel (it was working before and then stop working. It fixed it self after a few days)

    Helpful(0) Unhelpful(0)You have already voted this
  20. Avatar

    Kishore K P

    Good Course who looks for practicals, not for theory classes.

    Helpful(0) Unhelpful(0)You have already voted this
  21. Avatar

    Moez B

    Good course. Beginner level, it starts slow and gets better in weeks 3 and 4. Instructor is very helpful.

    Helpful(0) Unhelpful(0)You have already voted this
  22. Avatar

    kexin

    A lot of errors in lecture.

    Helpful(0) Unhelpful(0)You have already voted this
  23. Avatar

    Miele W

    Again a nice course that introduce you on Apache Spark Usage. Just a little suggestion, if you could insert a little tweak on how pass from spark to pandas and vice versa. Enjoy 🙂

    Helpful(0) Unhelpful(0)You have already voted this
  24. Avatar

    Christopher G

    Very good and concise introduction to the topic. Thank you very much!

    Helpful(0) Unhelpful(0)You have already voted this
  25. Avatar

    Pietro D

    The course is based on a previous version of IBM Watson platform that makes too many slides outdated. Too much time is dedicated to the definition and computation of basic statistical moments. The same information about Apache Spark is published on the project’s website.

    Helpful(0) Unhelpful(0)You have already voted this
  26. Avatar

    Denis U

    Very primitive tasks

    Helpful(1) Unhelpful(0)You have already voted this
  27. Avatar

    ycey

    Need to be more organized course items

    Helpful(0) Unhelpful(0)You have already voted this
  28. Avatar

    PARITOSH P

    Good course

    Helpful(0) Unhelpful(0)You have already voted this
  29. Avatar

    Yassine E

    Awesome 🙂

    Helpful(0) Unhelpful(0)You have already voted this
  30. Avatar

    Theodore G

    This deeply need a much more detailed course on Apache Spark. You need far more than this course to actually get into PySpark.

    Helpful(0) Unhelpful(0)You have already voted this
  31. Avatar

    Petr J

    Obsolete presentations with low quality of explanations. I would not recommend this course to anyone.

    Helpful(1) Unhelpful(0)You have already voted this
  32. Avatar

    Jochen G

    Cool course with a slow paced start and then interesting examples to work with Apache Spark ML.

    Helpful(0) Unhelpful(0)You have already voted this
  33. Avatar

    Ratnakar M

    Content was ok , IBM has better course production than this , sorry to say , i m very grateful for the effort tutor took . Thanks

    Helpful(0) Unhelpful(0)You have already voted this
  34. Avatar

    Utkarsh B

    There should be some more exercises for practice.

    Helpful(0) Unhelpful(0)You have already voted this
  35. Avatar

    Billy

    focus too much on practical skills than the balance of concepts and implementations confusing to follow

    Helpful(0) Unhelpful(0)You have already voted this
  36. Avatar

    Vladimir G

    Good day whoever reads this! First of all I’d like to say thank you for the course. This topic is pretty interesting for me and I move through this course with interest. IF you would update videos according python 3.6++ as in notebooks it would be much easier to learn and get into things. Also final assignment seems a too easy. Also quality of sound and videos varies from week to week, and sometimes even from lesson to lesson during one week. Good day and best luck!

    Helpful(0) Unhelpful(0)You have already voted this
  37. Avatar

    Eric

    The instructor breezed through course preparation and gave lectures from his car. Very easy course that doesn’t challenge you or hold you to any standard of learning. One would be better off learning from YouTube and Apache Spark guides. Don’t pay for this course!

    Helpful(0) Unhelpful(0)You have already voted this
  38. Avatar

    Samuel K

    This course has a lot of room for improvement (not to say plainly it’s a waste of time). The video lectures are useless. They consist on the instructor coding some lines to show basic commands in Apache Spark. An introductory course on Apache Spark would be much more useful than this one. The basic stuff on regression and classification methods is really poor as well. The Quizzes and practice exercises only teach some basic Spark functionalities, which could be the only somewhat useful elements of the course. Avoid this unit if you can, I just took it because I enrolled in the AI Professional certificate on Data Engineering.

    Helpful(0) Unhelpful(0)You have already voted this
  39. Avatar

    Stefan W

    Course was nice and avoided peer graded assignments (which I appreciate) but code was written in Python 2 which led to un maintained code.

    Helpful(0) Unhelpful(0)You have already voted this
  40. Avatar

    Aleksei K

    Hard to listen video without subtitles. It be better to show how create a notebook in the watson on the first lecture.

    Helpful(0) Unhelpful(0)You have already voted this
  41. Avatar

    Batyr A

    Demotivating course, the worst I studied on Coursera. If not “AI Engineering” path I would have dropped it

    Helpful(1) Unhelpful(0)You have already voted this
  42. Avatar

    Darnesha C

    I really enjoyed this course! it made learning fun!

    Helpful(0) Unhelpful(0)You have already voted this
  43. Avatar

    Esteban H E

    Not clear at all. A lot of things are not explained, or explained in a confusing way. I learn more by researching what things meant than from lectures

    Helpful(0) Unhelpful(0)You have already voted this
  44. Avatar

    Wei J ( T

    I should not comment the way how this lecture has been carried out… HOWEVER, it does bring actual tooling skills and makes it interesting to use those instruments for real life situation. Highly recommended. Make your model maybe can be your crystal ball.

    Helpful(0) Unhelpful(0)You have already voted this
  45. Avatar

    Shrinivas S

    Great Hands On activity, good pace so you are not lost and complete the learning objectives.

    Helpful(0) Unhelpful(0)You have already voted this
  46. Avatar

    Valerio R

    Please less math calculus in the quitzzes

    Helpful(0) Unhelpful(0)You have already voted this
  47. Avatar

    Paruhang K

    Explanations could be a little more detailed. Felt like I was missing chunks of information while watching videos.

    Helpful(0) Unhelpful(0)You have already voted this
  48. Avatar

    Michael T E

    There was not enough learning about how to use ApacheSpark, it was more of a show what it can do.

    Helpful(0) Unhelpful(0)You have already voted this
  49. Avatar

    Jason A

    more hands on would be nice, rather than having so much of the code pre written

    Helpful(0) Unhelpful(0)You have already voted this
  50. Avatar

    Branly F L

    This course needs more spark towards the student. Thanks.

    Helpful(0) Unhelpful(0)You have already voted this
  51. Avatar

    Brandon S C

    I found this course incredibly beneficial. Moving forward, I would like to see a bit more explanation of concepts and few extra workable examples.

    Helpful(0) Unhelpful(0)You have already voted this
  52. Avatar

    Daxkumar J

    This course gives you a basic idea behind the pyspark. If you are a beginner so this course for you.

    Helpful(0) Unhelpful(0)You have already voted this
  53. Avatar

    shashank s

    for the last assignment we should have got the opportunity to code in the notebook instead of just running it and reporting results.

    Helpful(0) Unhelpful(0)You have already voted this
  54. Avatar

    Kovacs R

    Low quality videos, not relevant quiz questions. I can set up the same course in one day. Waste of time.

    Helpful(0) Unhelpful(0)You have already voted this
  55. Avatar

    lorenzo a

    It really feels like the course was prepared in a bit of a rushed fashion.

    Helpful(0) Unhelpful(0)You have already voted this
  56. Avatar

    leonardo d

    There are some issues with the normalization of the distribution moments. Everything else is good material to learn how to use apache spark for the first time.

    Helpful(0) Unhelpful(0)You have already voted this
  57. Avatar

    Oussama B

    I am really disapointed by this chapter!!!! I learnd nothing, week 2 evaluation is ridiculous I did it with Excel….

    Helpful(0) Unhelpful(0)You have already voted this
  58. Avatar

    JULIAN S

    After completing this course you will be able to use Apache Spark to build ML models (e.g., Linear Regression, Gaussian Mixture Model, etc.).

    Helpful(0) Unhelpful(0)You have already voted this
  59. Avatar

    Scott P

    The course material was clear but we are never really given any challenging practice exercises to do. The “project” at the end was litterely just running prewritten code it would have been better if we got to write the code on our own.

    Helpful(0) Unhelpful(0)You have already voted this
  60. Avatar

    Raul S C

    muy malo por eso me di de baja de todos los cursos ni si quiera estan las traducciones

    Helpful(0) Unhelpful(0)You have already voted this
  61. Avatar

    Fahad T A

    It was challenging and very informative

    Helpful(0) Unhelpful(0)You have already voted this
  62. Avatar

    Mriam A

    Useless course, waste of time, but i had to take it for the certificate.

    Helpful(0) Unhelpful(0)You have already voted this
  63. Avatar

    Michel G E H

    Amazing course! Thank you!

    Helpful(0) Unhelpful(0)You have already voted this
  64. Avatar

    Nils N

    Maybe I do not have knowledge about Python, but a lot of things were not understandable for me. In addition, parts of the course are still shown in an older, out of fashion version of Watson. The shown code is not working in todays version

    Helpful(0) Unhelpful(0)You have already voted this
  65. Avatar

    Ait A O M

    Overly, It is a well structured and oriented course, especially the practice part. However, the lectures could have been improved and made clearer. Thank you for all your efforts.

    Helpful(0) Unhelpful(0)You have already voted this
  66. Avatar

    Ever A B V

    excellent course

    Helpful(0) Unhelpful(0)You have already voted this
  67. Avatar

    ANGEL G C

    Excellent course! All the explanations are quite clear, a lot of good quality information provided from amazing teacher. Additionally, response times for any question is very fast.

    Helpful(0) Unhelpful(0)You have already voted this
  68. Avatar

    Mohammad S H

    i like very much the Machine Learning, but the course was focusing to cover the whole functions,methods,logarithms… but i was preferred to focus on few concepts and do more practicing on to understand more the course and to make it more beneifal in our job carrier.

    Helpful(0) Unhelpful(0)You have already voted this
  69. Avatar

    Giuseppe C

    very bad teacher

    Helpful(0) Unhelpful(0)You have already voted this
  70. Avatar

    Itamar A T

    I found difficult to understand the concepts, for sure I must have to review the class. Thanks for the dedication in helping us. Itamar

    Helpful(0) Unhelpful(0)You have already voted this
  71. Avatar

    Timur U

    The course Scalable Machine Learning on Big Data using Apache Spark has too complicated instructions for Labs. This course should be more properly organized like another part of IBM Professional Certificate, Intermediate Level Machine Learning with Python. I don’t understand why this course was included to this certificate. I’m not motivated to continue this course.

    Helpful(0) Unhelpful(0)You have already voted this
  72. Avatar

    Bo T

    Some of the content could have been presented more clearly and recorded in the same manner consistently, few items seemed to repeat also while other are not covered well. The code walk thought should have been better explained and with less errors/clarifications which are later explained through video quiz or overlays.

    Helpful(0) Unhelpful(0)You have already voted this
  73. Avatar

    Sadabrata K

    Good content and the programming assignments really help in applying the concepts.

    Helpful(0) Unhelpful(0)You have already voted this

    Add a review

    Your email address will not be published. Required fields are marked *

    This site uses Akismet to reduce spam. Learn how your comment data is processed.

    Scalable Machine Learning on Big Data using Apache Spark
    Scalable Machine Learning on Big Data using Apache Spark

    Price tracking

    Register New Account
    Reset Password
    Compare items
    • Total (0)
    Compare