Big Data Analysis with Scala and Spark
FREE
Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we’ll see how the data parallel paradigm can be extended to the distributed case, using Spark throughout. We’ll cover Spark’s programming model in detail, being careful to understand how and when it differs from familiar programming models, like shared-memory parallel collections or sequential Scala collections. Through hands-on examples in Spark and Scala, we’ll learn when important issues related to distribution, like latency and network communication, should be considered and how they can be addressed effectively for improved performance.

Learning Outcomes. By the end of this course you will be able to:
- read data from persistent storage and load it into Apache Spark,
- manipulate data with Spark and Scala,
- express algorithms for data analysis in a functional style,
- recognize how to avoid shuffles and recomputation in Spark.

Recommended background: You should have at least one year of programming experience. Proficiency with Java or C# is ideal, …
Specification: Big Data Analysis with Scala and Spark
48 reviews for Big Data Analysis with Scala and Spark
Price | Free |
---|---|
Provider | |
Duration | 15 hours |
Year | 2017 |
Language | English |
Certificate | Yes |
Quizzes | Yes |
Light0617 –
wonderful!!!
Rodion G –
The course is good for getting some practice in Spark and Scala. However, it seems to be long forsaken by staff, and some issues with assignments require doing archaeology in the forums. Also, it is quite unpleasant to see that while the specialization emphasizes functional programming, some auxiliary code in assignments is written in the worst manner of imperative programming… Why use Scala then?
BOUDRAHEM –
Everything was excellent. This was one of the best courses I have attended so far
Fabio A R –
Excellent course, very handy for whoever is starting his studies in Spark.
Ronald C M –
The best way to get to know Spark Fundamentals and Spark Core libraries
AOC –
Very engaging, and the tutor showed excellent delivery. Thanks to Coursera.
Joel V –
The course presented the concepts in a clear and exemplified way, though it was a bit too verbose. The exercises were not too hard and thorough enough to cover the concepts presented in class. I wish there were more exercises, as Spark’s API is huge and sometimes a bit tedious to use, which makes it hard to get confident quickly. I hope that the capstone project will help with that.
Kyle J –
Pretty good, but one of the assignments was poorly set up. Some of the provided code was broken and it was very hard to debug.
Valter F –
I love the in-depth approach to RDDs. I’d say DataFrames and Datasets required a few more examples and more testing material, though.
CLAUDIO A –
Excellent explanations by Heather Miller. She really knows how to explain a topic, and also makes the lectures a lot of fun to listen to!
Daniele M –
Great introduction to Spark. Programming assignments helped me to improve my skills. Thank you very much.
Beibit –
Very nice course. Good level of explanations, good examples and pace.
Carsten I –
Excellent class, very well structured, clear and informative.
Kristoffer V –
Sometimes the labs included techniques not fully covered by the lectures.
Robin B –
Very good introduction to RDDs and DataFrames/Datasets along with valuable insight into performance considerations. I’d done some prior work with Hadoop/Pig in the past and more recently with Spark (mainly DataFrames/GraphFrames); this was really useful to round out my understanding of RDDs and optimisation. The assignment guidance in the code comments could be more complete to save having to refer back to the site (and maybe reference specific video lectures with the hints). Though it’s good that the assignment exercises aren’t tutorial grade, as that makes the experience more transferable to real projects.
Marija N –
Absolutely fantastic!
Alexey A –
The most interesting course in the whole specialization.
Rajesh B –
Very nice explanation, trainer has good knowledge, course materials are good, video quality is too good.
Yury C –
This course was by far the best of 22 courses I’ve done on Coursera. Prof Miller has this rare ability of presenting material in a concise and interesting way and yet going into the nitty-gritty aspects when needed (in another course on Spark technology, such intricacies weren’t covered). Thanks a lot for this course!
Mohamed A T –
The theory is very clear and well explained. The practical assignments are a little bit ambiguous, but they are overall very good and challenging. Highly recommended!
Bulent B –
Amazing technology, explained wonderfully. Note: familiarity with Scala (take Martin’s course on Coursera) would make your experience even better.
Mykyta P –
The video lectures are good, but the code assignments are worse; it seems like they were written by students instead of a professor or something. Sometimes the code doesn’t follow Scala and FP conventions. And the output of the grader doesn’t really provide any helpful information besides the name of the faulty function. But overall it’s a good course, and I think newcomers without any previous experience with Spark will learn a lot.
Mohammad T –
Such a beautiful course design for big data developers.
Sanjeev R –
Excellent
Manish M D –
Excellent course.
Santiago A –
Awesome course and awesome teacher! Nevertheless, to grasp the most of this course, you should do the previous 3 courses of the “Functional Programming in Scala” specialization.
Joaquin D R –
Incredible tutorial!!!!!!!!!! I love it
Khaled J A –
This is an outstanding course to learn big data with Scala and Spark.
Sergio R P –
The assignments are very confusing and unexplained. They do not take long to reply to the forum.
Zeb S –
I worked with PySpark professionally, and this helped add some depth to my knowledge of Spark as well as give me a chance to translate those skills to Scala.
YEHOUENOU –
This course allowed me to learn so many things about data analysis and big data modeling.
Kyle A –
Very real examples for the exercises and a great way to learn Spark!
Yuri R –
Liked the course. Wasn’t familiar with Spark before.
TAZI E G –
Really good material, well explained with many examples. Maybe more information or detail should be added to the assignments, but good material and explanations.
Zhenhua w –
The lecture is well organized and excellent
Harshad H –
Best Course for Big Data Learning in the World
Nikhil R –
Helpful for anyone who wants to start with the basics of Spark.
Bill P –
Excellent overview of Spark, including exercises that solidify what you learn during the lectures. The development environment setup tutorials were also very helpful, as I had not yet worked with sbt.
LEBRAT O –
Very well explained, a very good teacher.
Jeni R –
It felt like the course material skipped over a great deal of syntax and how-to. It was useful for concepts, but I found that I had to dig a great deal to be able to complete the assignments, and that there is a lot of folklore on Stack Overflow that can potentially send you in the wrong direction.
Aaron –
A good course to know about how spark works but not how to use it, I like this!
Jose F O –
It is a great course; however, the exercises make you waste time trying to figure out how the grader works. You really need to read the instructions word by word, then go to the discussions to figure out the pitfalls from others’ questions.
aknin k –
Good, but I would have liked more exercises to practice with DataFrames / Datasets.
ANAND B –
Very good course for college students who have completed their studies and want to start professionally.
Andrejs A –
Especially the last lectures were useful, but there is quite a gap between the lectures and the practical tasks.
Patrik –
The videos had focus issues every now and again. It was still possible to see everything shown.
Charmy G –
Loved it all!! Thanks for the good information on Spark.
Yogen R –
Excellent course materials and concepts explanation by Heather!