Fundamentals of Scalable Data Science

FREE

(55 customer reviews)
8.5/10 (Our Score)
Product is rated as #137 in category Data Science

Apache Spark is the de facto standard for large-scale data processing. This is the first course in a series leading to the IBM Advanced Data Science Specialization. We strongly believe that it is crucial for success to start by learning a scalable data science platform, since memory and CPU constraints are the most limiting factors when it comes to building advanced machine learning models. In this course we teach you the fundamentals of Apache Spark using Python and PySpark. We’ll introduce Apache Spark in the first two weeks and learn how to apply it to basic exploratory and data pre-processing tasks in the last two weeks. Through this exercise you’ll also be introduced to the most fundamental statistical measures and data visualization technologies. This gives you enough knowledge to take over the role of a data engineer in any modern environment. It also gives you the basis for advancing your career towards data science. Please have a look at the full specialization curriculum: https://www.coursera.org/specializations/advanced-data-science-ibm If you choose to take this course and earn the Coursera course certificate, you will also earn an IBM digital badge. To find out more about IBM digital badges, follow the link ibm.biz/badging. After completing this …

Instructor Details

Romeo Kienzler holds an M.Sc. (ETH) in Information Systems, Bioinformatics & Applied Statistics (Swiss Federal Institute of Technology). He has nearly two decades of experience in Software Engineering, Database Administration and Information Integration. He has worked as a Data Scientist for IBM since 2012. He has published several works in the field with international publishers and at conferences. His current research focus is on massively parallel data processing architectures. Romeo also contributes to various open source projects.

Specification: Fundamentals of Scalable Data Science

Duration: 12 hours
Year: 2017
Level: Beginner
Certificate: Yes
Quizzes: Yes

55 reviews for Fundamentals of Scalable Data Science

4.3 out of 5
5 stars: 30
4 stars: 12
3 stars: 12
2 stars: 0
1 star: 1
  1. Suyash

    There are a lot of glitches with the assignments; I hope they get fixed soon.

  2. Tamer M

    Most of the video’s subtitles need to be synced, it was hard to fully understand the Indian accent without subtitles.

  3. Dhinson G D

    I love the course content. Simple but very informative and provides good practical exercises.

  4. PV R K

    excellent experience

  5. Eleni K

    I was really looking forward to this specialization, but from the very first course I am really disappointed. The videos refer to various pieces of outdated information, and then suddenly we are expected to do an assignment that was not explained at all in the course. I am not saying it is difficult or not achievable, but to be honest, until now (week 2) it feels mostly like a waste of time. Really sorry for this review.

  6. Charles-Antoine d T

    very good

  7. Charles Antoine d T

    very good

  8. Octavio A T N

    This is a very good Data Science course. It helped me a lot to think in realistic application of Data Analysis. Impressive !!!

  9. Saman S

    that’s wonderful

  10. Prithvi S

    Great course, I really liked Romeo’s explanation and learned a lot.

  11. Joshua A

    Gave excellent starting point for the course

  12. Madan K

    Excellent course

  13. Ivan J M

    There are a lot of outdated sections. Sometimes it confuses me, because in some videos he talks about how we will use Node-RED, but then we don’t use it.

  14. Oritseweyinmi H A

    Strong introduction into parallel computing and big data processing. Romeo’s expertise on the subject matter, combined with his love for teaching was on show during this course. He did a great job explaining the theoretical aspects, and slowly but surely introducing us into the practical aspects as well, through the programming exercises. All in all, this has proved to be a high quality introduction into this space and I’m excited to take the next step, learn more and apply the fundamentals I have picked up here.

  15. Tony H

    I felt that, for a course labelled as ‘Advanced’, there were too many trivial questions in the quizzes and too much hand holding in the programming assignments. That being said I did enjoy the course and learned quite a lot and look forward to the next one in the specialisation.

  16. Ankur S

    One of the Best Courses available here on Coursera.

  17. Gouri K

    Good overall, and the instructor was very good, but I feel more examples could be used, especially when explaining multidimensional vector spaces and the basics of graphs.

  18. praveen k

    First time I got the chance to work on cloud data (big data). Thanks to IBM.

  19. Jorge S A M B

    Very good, up-to-date content and excellent teaching!

  20. Muyanja S Z

    This has been an interesting and intellectually nourishing course

  21. Muntakimur R

    Very Informative course, thanks IBM for this course

  22. BAUDRY S

    The functions we need to complete look quite messy; it’s a little bit overwhelming, especially for people who are starting out with Spark.

  23. Bikash R

    PCA part was fun!

  24. Tom V

    Nice course covering the basics. Not very difficult though.

  25. Ahmad R J

    I liked the course because it introduced me to new topics, but it did not really go as far as expected from an advanced specialization. Maybe when I have finished the other courses, I will find that it prepared me well for the rest. However, please provide more sample datasets, similar questions, and generally more practice.

  26. Gautham N

    It’s a very good course

  27. Pierre-Matthieu P

    I’ve gained plenty of interesting information and valuable hands on experience. I had to work for it a little more than I should have, however. The lecturer has a strong accent, speaks very fast and the subtitles are mostly useless as they are wrong more often than not. If you take this course, be prepared to take plenty of notes and watch the videos several times.

  28. Pierre Matthieu P

    I’ve gained plenty of interesting information and valuable hands on experience. I had to work for it a little more than I should have, however. The lecturer has a strong accent, speaks very fast and the subtitles are mostly useless as they are wrong more often than not. If you take this course, be prepared to take plenty of notes and watch the videos several times.

  29. Ankit M

    good

  30. Lucas M

    It would be great if they updated the video content to reflect the current version of the system and of Python; in theory, however, the content left nothing to be desired.

  31. Francesco d C

    the assignments could have left more freedom to the student.

  32. Leire A

    Low level

  33. Marcos P L

    As an introductory course on data science and manipulation of large data sets, the course proved to be quite comprehensive and technically capable of leading the student to an understanding of all content.

  34. Tee H L

    I really like this learning method from IBM especially the instant quiz just to make sure I understand the important points.

  35. daniel b

    This class made me confident in using Apache Spark for data projects that I may need. I really enjoyed how simple and effective it was. Very practical, easy to follow, high-level course. Cannot wait until the next course. You should probably have some experience with data frames and lambda expressions before coming into this class.

  36. Giovani F M

    Great course to learn basic knowledge in spark!

  37. Carlos F C d S e S

    This course changed my life!

  38. Feng L

    too simple not advanced

  39. Phuoc H L

    Love the content, simple and clear are the best.

  40. Mike H

    Not well structured in my opinion. Difficulty of content not well balanced. Outdated presentations and content…

  41. Xuan H N

    More coding, please. One doesn’t learn much just by filling in a couple of words.

  42. srinivasareddy c

    Simply put, the course has very different and amazing nuances of learning.

  43. Jeffrey G D

    Some of the courses have out of date instructions, or the methods recommended are deprecated.

  44. Adamya

    A very nice introduction to Apache Spark and its environment. As a bonus, it’s also a very nice refresher on your basic statistics! Great course!

  45. George H

    Analytically very simple, and fails to explain much of the syntax needed for the assignments.

  46. Arseniy T

    I want to put things into perspective: I recently completed a one year data science course at Flatiron School which covered all aspects of data science: Python, SQL, data mining, statistics, probability, linear regression, classification, decision trees, deep neural networks and everything in between. You name it, I’ve studied it. If you want to learn data science don’t take this course. Few videos about central limit theorem + several graphs in matplotlib wouldn’t leave you confident enough about how to actually do analysis. Also, assignments for this course were mostly about how to extract data with SQL, pretty easy if you know the basics. The entire course took me less than a day to complete and I’m still confused about how actually spark works under the hood. Some people complain about old videos and the thick accent of the teacher. For me it wasn’t the problem, the code was running smoothly and I understood everything the teacher said. My suggestion would be to give a more detailed explanation of the cloud/parallel computing, how it’s structured, how to set up servers, etc.

  47. Nicole Z K

    Outdated content, with corrections as annotations in the videos. Not very engaging and has just a little of spark content.

  48. Jeremie B

    Nice introduction, not too difficult without being so easy that you learn nothing. Sometimes outdated contents, but I always find solutions quickly to make everything work. In fact it is better to have realistic examples and to use up to date technologies, even it is of course harder to maintain. Therefore my remark is not a complaint. Actually Mr Kienzler does a good job to keep things working and the learners informed.

  49. Ulugbek D

    General intro into how to deal with large data using Apache Spark.

  50. Mohamed A T

    The course was great, the material and the assignments. IBM Watson platform was easy to use. But I can’t see how this course is included in the “advanced” data science specialization. Honestly I was expecting a more advanced course. But we’ll see with the next ones.

  51. Nabin R P

    well explained with relevant examples

  52. PRABAKARAN C

    Great course

  53. Edi W

    Nicely arranged course. However, both assignments in week 4 should be rechecked to make sure that they can be run as exercises by students. Also, please make sure that the videos are up to date and contain fewer errors.

  54. Kaiwalya

    The course content is amazing but the instructor’s accent is very difficult to understand and in some videos subtitles in English weren’t available.

  55. Robert H

    Nice subjects; the notebooks could be more in depth.

