CS190.1x: Scalable Machine Learning
University of California, Berkeley via edX
-
420
-
- Write review
Overview
This course introduces the underlying statistical and algorithmic principles required to develop scalable real-world machine learning pipelines. We present an integrated view of data processing by highlighting the various components of these pipelines, including exploratory data analysis, feature extraction, supervised learning, and model evaluation. You will gain hands-on experience applying these principles using Apache Spark, a cluster computing system well-suited for large-scale machine learning tasks. You will implement scalable algorithms for fundamental statistical models (linear regression, logistic regression, matrix factorization, principal component analysis) while tackling key problems from domains such as online advertising and cognitive neuroscience.
This self-assessment document provides a short quiz, as well as online resources that review the relevant background material.
Taught by
Ameet Talwalkar
Charts
- #3 in Subjects / Machine Learning / Supervised Learning
Tags
Related Courses
-
Machine Learning
Georgia Institute of Technology, Brown University
4.3 -
Machine Learning
Stanford University
4.7 -
Machine Learning
Columbia University
4.0 -
Scalable Machine Learning on Big Data using Apache Spark
IBM
-
Machine Learning Fundamentals
University of California, San Diego
4.0 -
Principles of Machine Learning
Microsoft
5.0
Reviews
4.5 rating, based on 31 reviews
-
Gregory J Hamel ( Life Is Study) completed this course and found the course difficulty to be medium.
Scalable Machine Learning is a 5-week distributed machine learning course offered by UC Berkeley through the edX platform. It is a follow up to another UC Berkely course: Introduction to Big Data with Apache Spark. Although the first course is not a... -
Martin Strandbygaard completed this course, spending 4 hours a week on it and found the course difficulty to be medium.
Overall a good course, that is worthwhile spending the time on, if you want to get a basic introduction to solving machine learning problems using Apache Spark. As with the precursor, CS100.1x, the lecture videos and quizzes are pretty light on actual... -
Anonymous is taking this course right now.
The machine learning algorithms are explained in reasonably granular level, and easy to follow. The labs are the highlight. I learnt a lot from doing. Thanks for putting this course together. -
Gaurabh completed this course, spending 5 hours a week on it and found the course difficulty to be medium.
Very well explained machine learning using Spark from scratch. Therefore a good introductory course. Not too many details covered, probably due to time limitation. Hope they make a sequel. -
Maurits Doorn is taking this course right now.
-
Igor Subbotin is taking this course right now.
-
Tabish Sada is taking this course right now.
-
Vlad Podgurschi completed this course, spending 6 hours a week on it and found the course difficulty to be medium.
-
Anonymous completed this course.
-
Lace Lofranco is taking this course right now.
-
C M Chan completed this course.
-
Prakhar Srivastav completed this course.
-
Dmitry Nikulin completed this course.
-
Liang Lu is taking this course right now.
-
Sergiy Matusevych completed this course.
-
V M completed this course.
-
Shuang Wu completed this course.
-
Peter Mosoni completed this course.
-
Rogier Werschkull completed this course.
-
Sauro Grandi completed this course.