Latest Courses
COMPLETE PERL Programming 2020Check course
Advanced Automation framework with Appium C# - 2020 UpdatedCheck course
Build 10 JavaScript Projects in less than 6 Hours .Check course
Learning Analytics Through RCheck course
Build three custom, functional websites in Django 3.0Check course
JavaScript for beginners For StudentsCheck course
Selenium Mastery: Apply What You Learn Here Today By RicherUCheck course
Django Tutorial: Build Your First App Fast & Free!Check course
Getting Started with LESS - Beginner Crash CourseCheck course
C++ Working with Files - fstream I/O libraryCheck course
COMPLETE PERL Programming 2020Check course
Advanced Automation framework with Appium C# - 2020 UpdatedCheck course
Build 10 JavaScript Projects in less than 6 Hours .Check course
Learning Analytics Through RCheck course
Build three custom, functional websites in Django 3.0Check course
Big Data Analysis with Scala and Spark

Big Data Analysis with Scala and Spark

FREE

Add your review
Add to wishlistAdded to wishlistRemoved from wishlist 0
Add to compare
8.9/10 (Our Score)
Product is rated as #7 in category Big Data

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in–memory distributed collections framework written in Scala. In this course, we’ll see how the data parallel paradigm can be extended to the distributed case, using Spark throughout. We’ll cover Spark’s programming model in detail, being careful to understand how and when it differs from familiar programming models, like shared–memory parallel collections or sequential Scala collections. Through hands–on examples in Spark and Scala, we’ll learn when important issues related to distribution like latency and network communication should be considered and how they can be addressed effectively for improved performance. Learning Outcomes. By the end of this course you will be able to: – read data from persistent storage and load it into Apache Spark, – manipulate data with Spark and Scala, – express algorithms for data analysis in a functional style, – recognize how to avoid shuffles and recomputation in Spark, Recommended background: You should have at least one year programming experience. Proficiency with Java or C# is ideal, …

Instructor Details

Heather Miller is an assistant professor in Carnegie Mellon University's School of Computer Science. Previously, she was a research scientist at EPFL, and the co-founder and executive director of the Scala Center.

Specification: Big Data Analysis with Scala and Spark

Duration 15 hours
Year 2017
Certificate Yes
Quizzes Yes

48 reviews for Big Data Analysis with Scala and Spark

4.6 out of 5
34
10
3
0
1
Write a review
Show all Most Helpful Highest Rating Lowest Rating
  1. Avatar

    Light0617

    wonderful!!!

    Helpful(0) Unhelpful(0)You have already voted this
  2. Avatar

    Rodion G

    Course is good to have some practice in spark and scala. However it seems to be long forsaken by staff and some issues with assignments require doing archaeology in the forums. Also it is quite unpleasant to see that while specialization emphasizes functional programming, some auxiliary code in assignments is written in the worst manner of imperative programming… Why use scala then?

    Helpful(1) Unhelpful(0)You have already voted this
  3. Avatar

    BOUDRAHEM

    Everything was excellent. This was one of the best courses I have attended so far

    Helpful(0) Unhelpful(0)You have already voted this
  4. Avatar

    Fabio A R

    Excellent course, very handful for whoever is starting his studies in spark.

    Helpful(0) Unhelpful(0)You have already voted this
  5. Avatar

    Ronald C M

    The best way to get to know Spark Fundamentals and Spark Core libraries

    Helpful(0) Unhelpful(0)You have already voted this
  6. Avatar

    AOC

    Very engaging and tutor showed excellent delivery. thanks to Coursera.

    Helpful(0) Unhelpful(0)You have already voted this
  7. Avatar

    Joel V

    The course presented the concepts in a clear and exemplified way. Tough, it was a bit too verbose. The exercises were not too hard and thorough enough to cover the concepts presented in class. I wished there were more exercises as Spark’s API is huge and sometimes a bit tedious to use thus being hard to quickly get confident. I hope that the capstone project will help for that.

    Helpful(0) Unhelpful(0)You have already voted this
  8. Avatar

    Kyle J

    Pretty good, but one of the assignments was poorly set up. Some of the provided code was broken and it was very hard to debug.

    Helpful(0) Unhelpful(0)You have already voted this
  9. Avatar

    Valter F

    I love the indepth aproach at the RDDs. I’d say DataFrames and DataSets required a bit more examples and testing material though.

    Helpful(0) Unhelpful(0)You have already voted this
  10. Avatar

    CLAUDIO A

    Excellent explanations by Heather Miller. She really knows how to explain a topic, and also makes the lectures a lot of fun to listen !

    Helpful(0) Unhelpful(0)You have already voted this
  11. Avatar

    Daniele M

    Great Introduction to spark. Programming assignments helped me to improve my skills. Thank you very much.

    Helpful(0) Unhelpful(0)You have already voted this
  12. Avatar

    Beibit

    Very nice course. Good level of explanations, good examples and pace.

    Helpful(0) Unhelpful(0)You have already voted this
  13. Avatar

    Carsten I

    Excellent class, very well stuctured, clear and informative.

    Helpful(0) Unhelpful(0)You have already voted this
  14. Avatar

    Kristoffer V

    Sometimes the labs included techniques not fully covered by the lectures.

    Helpful(0) Unhelpful(0)You have already voted this
  15. Avatar

    Robin B

    Very good introduction to RDDs and DataFrames/Dataset along with valuable insight into performance considerations. I’d done some prior work with Hadoop/Pig in the past and more recently with Spark (mainly DataFrames/GraphFrames) this was really useful to round out my understanding of RDDs and optimisation. The assignment guidance in the code comments could be more complete to save having to refer back to the site (and maybe reference specific video lectures with the hints). Though it’s good that the assignment exercises aren’t tutorial grade, as that makes the experience more transferable to real projects.

    Helpful(0) Unhelpful(0)You have already voted this
  16. Avatar

    Marija N

    Absolutely fantastic!

    Helpful(0) Unhelpful(0)You have already voted this
  17. Avatar

    Alexey A

    The most interesting course in whole specialization.

    Helpful(0) Unhelpful(0)You have already voted this
  18. Avatar

    Rajesh B

    Very nice explanation, trainer has good knowledge, course materials are good, video quality is too good.

    Helpful(0) Unhelpful(0)You have already voted this
  19. Avatar

    Yury C

    This course was by far the best of 22 courses I’ve done on Coursera. Prof Miller has this rare ability of presenting material in concise and interesting way and yet going into nitty gritty aspects when needed (in another course on Spark technology, such intricacies weren’t covered). Thanks a lot for this course!

    Helpful(0) Unhelpful(0)You have already voted this
  20. Avatar

    Mohamed A T

    the theory is very clear and well explained. the practical assignments are a little bit ambiguous but they are overall very good and challenging. highly recommended!

    Helpful(0) Unhelpful(0)You have already voted this
  21. Avatar

    Bulent B

    Amazing technology, explained wonderfully. Note: familiarity with scala (take Martin’s course in coursera) would make your experience even better.

    Helpful(0) Unhelpful(0)You have already voted this
  22. Avatar

    Mykyta P

    The video lectures are good but code assignments are worse, seems like they were written by students instead of professor or something. Sometimes code doesn’t follow Scala and FP conventions. And the output of the grader doesn’t really provide any helpful information besides the name of the faulty function. But overall it’s a good course and I think the newcomers without any previous experience with Spark will learn a lot.

    Helpful(0) Unhelpful(0)You have already voted this
  23. Avatar

    Mohammad T

    such a beautiful course design for a bigData devlopers

    Helpful(0) Unhelpful(0)You have already voted this
  24. Avatar

    Sanjeev R

    Excellent

    Helpful(0) Unhelpful(0)You have already voted this
  25. Avatar

    Manish M D

    Excellent course.

    Helpful(0) Unhelpful(0)You have already voted this
  26. Avatar

    Santiago A

    Awesome course and awesome teacher! Nevertheless, to grasp the most of this course, you should do the previous 3 courses of the “Functional Programming in Scala” specialization.

    Helpful(0) Unhelpful(0)You have already voted this
  27. Avatar

    Joaquin D R

    Incredible tutorial!!!!!!!!!! I love it

    Helpful(0) Unhelpful(0)You have already voted this
  28. Avatar

    Khaled J A

    this is an outstanding course to learn BIg data with scala and spark

    Helpful(0) Unhelpful(0)You have already voted this
  29. Avatar

    Sergio R P

    The assignments are very confusing and unexplained. They do not take long to reply to the forum.

    Helpful(0) Unhelpful(0)You have already voted this
  30. Avatar

    Zeb S

    I worked with PySpark professionally, and this helped add some depth to my knowledge of Spark as well as give me a chance to translate those skills to Scala.

    Helpful(0) Unhelpful(0)You have already voted this
  31. Avatar

    YEHOUENOU

    This course allows me to learn so many things about data analysys and Big data modeling.

    Helpful(0) Unhelpful(0)You have already voted this
  32. Avatar

    Kyle A

    Very real examples for the exercises and a great way to learn Spark!

    Helpful(0) Unhelpful(0)You have already voted this
  33. Avatar

    Yuri R

    Liked the course. Wasn’t familiar with spark before

    Helpful(0) Unhelpful(0)You have already voted this
  34. Avatar

    TAZI E G

    really good material, well explained with many examples. maybe more information or precisions should be added to the assignments but good material and explanations

    Helpful(0) Unhelpful(0)You have already voted this
  35. Avatar

    Zhenhua w

    The lecture is well organized and excellent

    Helpful(0) Unhelpful(0)You have already voted this
  36. Avatar

    Harshad H

    Best Course for Big Data Learning in the World

    Helpful(0) Unhelpful(0)You have already voted this
  37. Avatar

    Nikhil R

    Helpful for anyoe who wants to start with basics of Spark

    Helpful(0) Unhelpful(0)You have already voted this
  38. Avatar

    Bill P

    Excellent overview of Spark, including exercises that solidify what you learn during the lectures. The development environment setup tutorials were also very helpful, as I had not yet worked with sbt.

    Helpful(0) Unhelpful(0)You have already voted this
  39. Avatar

    LEBRAT O

    Very well explained, a very well teacher

    Helpful(0) Unhelpful(0)You have already voted this
  40. Avatar

    Jeni R

    It felt like the course material skipped over a great deal of syntax and how to. It was useful for concepts; but I found that I had to dig a great deal to be able to complete the assignments and that there is a lot of folklore in stackOverflow that potentially send you in a wrong direction.

    Helpful(0) Unhelpful(0)You have already voted this
  41. Avatar

    Aaron

    A good course to know about how spark works but not how to use it, I like this!

    Helpful(0) Unhelpful(0)You have already voted this
  42. Avatar

    Jose F O

    It is a great course however the exercises make you waste time trying to figure out how the grader works. You really need to read the instructions word by word then go to the discussions to figure out from others questions the pitfalls.

    Helpful(0) Unhelpful(0)You have already voted this
  43. Avatar

    aknin k

    Bien mais j’aurais aime avoir plus d’exercices pour s’entrainer sur les dataframes / datasets

    Helpful(0) Unhelpful(0)You have already voted this
  44. Avatar

    ANAND B

    Very Good Course For College students who completed and wants to start professionally.

    Helpful(0) Unhelpful(0)You have already voted this
  45. Avatar

    Andrejs A

    Specially the last lectures where useful. but there is quite some gap between the lectures and practical task.

    Helpful(0) Unhelpful(0)You have already voted this
  46. Avatar

    Patrik

    The videos had focus issues every now and again. It was still possible to see everything shown

    Helpful(0) Unhelpful(0)You have already voted this
  47. Avatar

    Charmy G

    Loved it all!! Thanks for the good information on Spark.

    Helpful(0) Unhelpful(0)You have already voted this
  48. Avatar

    Yogen R

    Excellent course materials and concepts explanation by Heather!

    Helpful(0) Unhelpful(0)You have already voted this

    Add a review

    Your email address will not be published. Required fields are marked *

    This site uses Akismet to reduce spam. Learn how your comment data is processed.

    Price tracking

    Register New Account
    Reset Password
    Compare items
    • Total (0)
    Compare