Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game “Catch the Pink Flamingo”. During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark’s MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine–generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership. UC San Diego is an academic powerhouse and economic …
Instructor Details
Courses : 6
Specification: Big Data – Capstone Project
|
50 reviews for Big Data – Capstone Project
Add a review Cancel reply
This site uses Akismet to reduce spam. Learn how your comment data is processed.
FREE
ZIKIF H L –
Excellent total review of all courses
Assaduzzaman N –
This is a very good course in most aspects.
Ricardo L C T –
the chat part (graphs) was hard to finish. the bar is very high for this capstone. anyway very good course.
Ankur K –
Loved the final Capstone Project.
Jose A B A –
It was a very useful project for my learning.
Samuel S –
Some of the instructions for the tools are not clear enough. It’s better to have some more suggested further readings like statistics with KNIME, operating Spark, etc. For the cases, it is very straight forward and with beautiful data; likely far away from real application.
Samuel C –
The project is really helpful to sum up the whole process of the 5 previous courses, but there is a bit problem with the week 4 assignment.
Kevin M –
Excellent course for introduction to Big Data. This course covers some of current software that is used in big data processing such as Splunk, Knime, Spark and Neo4j. The course curriculum is designed to help you learn the materials and complete the certification successfully. Great job Professors Ilkay Altintas, Amarnath Gupta, Mai Nguyen. Thank You. Kevin Murali
Helder V –
In general the course is very good. There is a lack of support, especially when some exercises are proposed and then the final solution to the problem is not available, which requires a lot of search for solutions on the web. Even so, at the end of the course it is possible to get a good knowledge about the complete big data process.
davide g –
Thank you
Nicolas G –
Excellent course, 100% recommend to people want to learn big data fundamentals.
Christoph R –
This is all you can expect from an online course, imho: all the stuff you learned in the courses belonging to the Big Data specialization coming together in hands on exercises, that not only test your knowledge but also are fun to do and give you a real taste of a Data Scientists work. In the beginning, it’s a bit overwhelming, how many new technologies and programming languages you are introduced to and have to exercise, but it is definitely worth it. And the course material is well designed, that you do not get lost. You get a lot of help in the instructions, that even beginners should be able to succeed. But there is also enough room to explore and try out the matter on your own. Like they said: happy learning! Thanks alot Ilkay, Amarnath, Mai, the guy from the hands on instruction videos and all the others who contributed, affiliates.. Eglence! 🙂
Gabriel T –
Very engaging course. Well designed and delivered. I also liked the breadth and depth of the course. Liked it and continue use the material as reference
Nameer A –
A b s u l o t e l y r e c o m m e n d e d
Sabawoon S –
Very simple but a very informative specialization. This is an excellent introduction, you will need to study some topics like machine learning further to get a better understanding. If this was to be extended to a 10 course Specialization I would love to see more on model selection, model interpretation, regression and regularized regression, error analysis and inferential statistics, model assembling and free text analysis.
donald m –
What a challenge, I came into this course as a London Black Cab Taxi Driver, I thought the knowledge was hard but this capstone was a challenge more intense than the Knowledge of London!!!
Anup K M –
very good
Rambabu A –
This has been excellent Learning experience.Instructor and fellow members shared their valuable information during the course of the Learning and Capstone Project phase.
Venkata B P T –
Big Thank you….
Edorta –
Really disappointed with the Virtual Machine related exercises throughout the whole course, not only the capstone, where literally I have spent more time trying to figure out how to make it work and do not get any error message than actually learning big data. Very very frustrating.For sure not going to recommend this course to any friend.
Juan J R M –
We can actually take a look in some other methods, when we finalize the course, you can send some bibliography or something else
Pijush K G –
Good course.
Mohamed H –
PERFECT
Santiago A G –
Good and very practical challenge
Caio S –
Congratulations for the course preparation and thank you all. It was a very exciting investiment I have performed and I have also learned how to think differently on problem analysis after taking this course.
Alexandre V –
Great course that uses a project to apply what you learned in the others courses. I enjoyed it a lot. The only point is the technical issues in the installation, compatibility of the cloudera version and softwares to be updated. It is necessary to give more support to the technical issues detected, else as a learner, you spend a lot of your time solving them. Anyway, I definitely recommend this course if you want to keep learning a lot, It’s worth the effort!
Vincent R –
The Big Data Capstone Project is a great course that is very challenging and requires a significant amount of work hours for a big data first timer. The great experience could be enhanced by fixing some bugs in software and typos in guidance document.
P. H –
Interesting final project to complete this course, didn’t get 5 stars because: Clarity could be improved in the instructions and solutions of the assignment However this is counter balanced by the amount of plagiarism (if you provide more detailed solutions, some people unfortunately will just copy/paste them). So I am still wondering how the final results in week 4 were obtained….
Jose A R N –
My name is Jose Antonio from Brazil. I am looking for a new Data Scientist career (https://www.linkedin.com/in/joseantonio11). I did this course to complete my CV in Big Data and better understand the technology. The course was excellent and the classes well taught by teachers. Thank you for the support, course quality and great classes. Regards. Jose Antonio.
Payal –
This is very helpful project where i have applied all learning through ouot journey of this course.Though it was time consuming but worth to invest time, which benefits to upskill my knowledge
Krati S –
Thank you Coursera and instructors for creating this course. The structure is very good. Looking forward for completing other specializations too. Thank you!!
Jamiil T A –
waoh.. it’s incredible.. .. I strongly recommend this Capstone Project. Be sure to put on frank effort. T H A K Y O U S O M U C H
Rafael F d N –
Very nice project! Uses a lot of the knowledge acquired in a funny way!!
NARESH D –
All the sessions were very informative and provided the required knowledge from basics.
Tejprakash U –
Course is well designed and structured.
Raunak S –
nice final course to close out the whole specialization program.
Samir V M –
This is great platform to enhance your skills with periodic learning even from busy schedule and make yourself in pace with new IT.
Miguel T –
The course is really good, but some exercises are difficult to be done without technical support.
Shimon A –
Working with Splunk is impossible. Taking this course, my intention was not to learn SPLUNK!!! My intention was to perform an intensive, deep and meaningful EDA. However, I’ve spent 2 days (!!!) for learning…Splunk which is a complex tool of extremely poor usability. This is why I prefer to quit the course and the project (which I really wanted to participate in.
To P H –
Not Bad
Mahamat N A M –
it’s really useful to practice what you’ve learned in the previous course.
ISLAM K –
its very good course , here its aggregating all knowledge and information learned in previous courses
Vadim P –
I only realized how good this specialization was when I took a course from another university.
Luis M M R –
very good
Fernando M –
Excellent opportunity to practice what I learned during the Specialization.
Swapnil D –
Excellent project
Manik S –
Unnecessarily prevented me from completing the specialization when I had time to make me pay extra. I had to wait 2 months for the capstone project to start, enough time to make me forget how I dealt with the excessive number of problems in this outdated course. Trivial assignments. Low quality lectures. Wrong, conflicting and obsolete instructions. Discussion forums are barely moderated and are filled with spam.
Steven Z –
I learned a lot about applying the big data knowledge gained in the previous courses. Thank you!
Emilio M –
very good course
Chandrakanth B –
It was wonderful experience while doing this project as hands on