Classbaze

Disclosure: when you buy through links on our site, we may earn an affiliate commission.

Machine Learning: Clustering & Retrieval

Case Studies: Finding Similar DocumentsA reader is interested in a specific news article and you want to find similar articles to recommend. What is the rig...
4.6
4.6/5
(2,223 reviews)
84,605 students
Created by

8.9

Classbaze Grade®

N/A

Freshness

8.4

Popularity

9.0

Material

Machine Learning: Clustering & Retrieval
Platform: Coursera
Video: 7h 32m
Language: English

Best Data Analysis classes:

Classbaze Rating

Classbaze Grade®

8.9 / 10

CourseMarks Score® helps students to find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.

Freshness

Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.

Popularity

8.4 / 10
We analyzed factors such as the rating (4.6/5) and the ratio between the number of reviews and the number of students, which is a great signal of student commitment.

New courses are hard to evaluate because there are no or just a few student ratings, but Student Feedback Score helps you find great courses even with fewer reviews.

Material

9.0 / 10
Video Score: 8.7 / 10
The course includes 7h 32m video content. Courses with more videos usually have a higher average rating. We have found that the sweet spot is 16 hours of video, which is long enough to teach a topic comprehensively, but not overwhelming. Courses over 16 hours of video gets the maximum score.
The average video length is 4 hours 49 minutes of 559 Data Analysis courses on Coursera.
Detail Score: 8.3 / 10

The top online course contains a detailed description of the course, what you will learn and also a detailed description about the instructor.

Extra Content Score: 9.8 / 10

Tests, exercises, articles and other resources help students to better understand and deepen their understanding of the topic.

This course contains:

18 articles.
0 resource.
0 exercise.
15 tests or quizzes.

In this page

About the course

Case Studies: Finding Similar Documents

A reader is interested in a specific news article and you want to find similar articles to recommend. What is the right notion of similarity? Moreover, what if there are millions of other documents? Each time you want to a retrieve a new document, do you need to search through all other documents? How do you group similar documents together? How do you discover new, emerging topics that the documents cover?

In this third case study, finding similar documents, you will examine similarity-based algorithms for retrieval. In this course, you will also examine structured representations for describing the documents in the corpus, including clustering and mixed membership models, such as latent Dirichlet allocation (LDA). You will implement expectation maximization (EM) to learn the document clusterings, and see how to scale the methods using MapReduce.

Learning Outcomes: By the end of this course, you will be able to:
-Create a document retrieval system using k-nearest neighbors.
-Identify various similarity metrics for text data.
-Reduce computations in k-nearest neighbor search by using KD-trees.
-Produce approximate nearest neighbors using locality sensitive hashing.
-Compare and contrast supervised and unsupervised learning tasks.
-Cluster documents by topic using k-means.
-Describe how to parallelize k-means using MapReduce.
-Examine probabilistic clustering approaches using mixtures models.
-Fit a mixture of Gaussian model using expectation maximization (EM).
-Perform mixed membership modeling using latent Dirichlet allocation (LDA).
-Describe the steps of a Gibbs sampler and how to use its output to draw inferences.
-Compare and contrast initialization techniques for non-convex optimization objectives.
-Implement these techniques in Python.

What can you learn from this course?

What you need to start the course?

The course creator has not defined the requirements for this course.

Who is this course is made for?

The course creator hasn’t defined the level of this course.

Are there coupons or discounts for Machine Learning: Clustering & Retrieval ? What is the current price?

Access to most course materials is FREE in audit mode on Coursera. If you wish to earn a certificate and access graded assignments, you must purchase the certificate experience during or after your audit.

If the course does not offer the audit option, you can still take a free 7-day trial.
The average price is $8.1 of 559 Data Analysis courses. So this course is 100% cheaper than the average Data Analysis course on Coursera.

Will I be refunded if I'm not satisfied with the Machine Learning: Clustering & Retrieval course?

Coursera offers a 7-day free trial for subscribers.

Are there any financial aid for this course?

YES, you can get a scholarship or Financial Aid for Coursera courses. The first step is to fill out an application about your educational background, career goals, and financial circumstances. Learn more about financial aid on Coursera.

Who will teach this course? Can I trust Emily Fox?

Emily Fox has created 6 courses that got 69 reviews which are generally positive. Emily Fox has taught 413,241 students and received a 4.8 average review out of 69 reviews. Depending on the information available, we think that Emily Fox is an instructor that you can trust.
Statistics
University of Washington
Emily Fox is an assistant professor and the Amazon Professor of Machine Learning in the Statistics Department at the University of Washington. She was formerly at the Wharton Statistics Department at the University of Pennsylvania. Emily is a recipient of the Sloan Research Fellowship, a US Office of Naval Research Young Investigator award, and a National Science Foundation CAREER award. Her research interests are in large-scale Bayesian dynamic modeling and computations.

8.9

Classbaze Grade®

N/A

Freshness

8.4

Popularity

9.0

Material

Platform: Coursera
Video: 7h 32m
Language: English

Classbaze recommendations for you