Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Online Course

Text Retrieval and Search Engines

University of Illinois at Urbana-Champaign via Coursera

359
  • Provider Coursera
  • Cost Free Online Course (Audit)
  • Session In progress
  • Language English
  • Certificate Paid Certificate Available
  • Effort 4-6 hours a week
  • Duration 6 weeks long
  • Learn more about MOOCs

Taken this course? Share your experience with other students. Write review

Overview

Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text.

This course will cover search engine technologies, which play an important role in any data mining applications involving text data for two reasons. First, while the raw data may be large for any particular problem, it is often a relatively small subset of the data that are relevant, and a search engine is an essential tool for quickly discovering a small subset of relevant text data in a large text collection. Second, search engines are needed to help analysts interpret any patterns discovered in the data by allowing them to examine the relevant original text data to make sense of any discovered pattern. You will learn the basic concepts, principles, and the major techniques in text retrieval, which is the underlying science of search engines.

Syllabus

Orientation
-You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course.

Week 1
-During this week's lessons, you will learn of natural language processing techniques, which are the foundation for all kinds of text-processing applications, the concept of a retrieval model, and the basic idea of the vector space model.

Week 2
-In this week's lessons, you will learn how the vector space model works in detail, the major heuristics used in designing a retrieval function for ranking documents with respect to a query, and how to implement an information retrieval system (i.e., a search engine), including how to build an inverted index and how to score documents quickly for a query.

Week 3
-In this week's lessons, you will learn how to evaluate an information retrieval system (a search engine), including the basic measures for evaluating a set of retrieved results and the major measures for evaluating a ranked list, including the average precision (AP) and the normalized discounted cumulative gain (nDCG), and practical issues in evaluation, including statistical significance testing and pooling.

Week 4
-In this week's lessons, you will learn probabilistic retrieval models and statistical language models, particularly the detail of the query likelihood retrieval function with two specific smoothing methods, and how the query likelihood retrieval function is connected with the retrieval heuristics used in the vector space model.

Week 5
-In this week's lessons, you will learn feedback techniques in information retrieval, including the Rocchio feedback method for the vector space model, and a mixture model for feedback with language models. You will also learn how web search engines work, including web crawling, web indexing, and how links between web pages can be leveraged to score web pages.

Week 6
-In this week's lessons, you will learn how machine learning can be used to combine multiple scoring factors to optimize ranking of documents in web search (i.e., learning to rank), and learn techniques used in recommender systems (also called filtering systems), including content-based recommendation/filtering and collaborative filtering. You will also have a chance to review the entire course.

Taught by

ChengXiang Zhai

Tags

Help Center

Most commonly asked questions about Coursera

Reviews for Coursera's Text Retrieval and Search Engines Based on 14 reviews

  • 5 stars 14%
  • 4 stars 43%
  • 3 stars 14%
  • 2 stars 14%
  • 1 stars 14%

Did you take this course? Share your experience with other students.

Write a review
  • 1
Mike R
Mike completed this course.
Was this review helpful to you? Yes
Gregory S
by Gregory completed this course and found the course difficulty to be medium.
Text Retrieval and Search Engines is the second course in Coursera's new data mining specialization offered by the University of Illinois at Urbana-Champaign. The course covers a variety of topics in text data mining and natural language processing including text retrieval, query ranking and evaluation methods, methods and the basics of recommender systems. Grading is based entirely on 4 weekly quizzes comprised of 10 multiple choice questions. You only get 1 attempt on the quizzes.

The weekly content in Text Retrieval and Search Engines consists of around 10 video lectures that r…
6 people found
this review helpful
Was this review helpful to you? Yes
Marianne C
by Marianne completed this course.
I've taken a number of courses on Coursera and have thoroughly enjoyed some of them, but it's clear that the quality varies. I was very disappointed in this course. Having applied to the University of Illinois' Master of Computer Science - Data Science, I thought it'd be a good idea to take some of their Coursera courses to get a sense of the quality of their education. I probably should have taken their classes first and then applied, saving me the trouble. If this is the type of instruction I can expect in the Masters program, I think I'll save myself the $19k in tuition.

The pr…
Was this review helpful to you? Yes
Anonymous
Anonymous is taking this course right now.
I was initially excited for this course as it seemed a good dive into unstructured text data. But now I'd say: *skip this course*. I think the instructor is okay and presents the material in a sufficient enough manner to get a decent grasp of it.

The reason I'd say skip this course is that the exercises are pretty bad. The class is only graded on quizzes and the optional programming assignments use an obscure text mining/analysis tool called MeTA which is time consuming to setup unless you're experienced in navigating the mess that open source C++ libraries are. Once you've s…
Was this review helpful to you? Yes
Anonymous
Anonymous completed this course.
Great class with a nice mix of theoretical and practical lessons. There was a competition at the end of the course which pushed us to come up with new ideas.
1 person found
this review helpful
Was this review helpful to you? Yes
Anonymous
Anonymous completed this course.
Precise and clear explanation about the concepts .This course completes focuses on text retrieval concepts with strong strong intro on what is text retrieval , what are the challenges faced and further gives an insight on various models and improvement in this field .Therefore, this course is mostly only for people more interested in an area in information retrieval.

Was this review helpful to you? Yes
Anonymous
Anonymous completed this course.
pretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty goodpretty good
Was this review helpful to you? Yes
Lien B
Lien completed this course, spending 2 hours a week on it and found the course difficulty to be medium.
The course is not very organised and even though they share a lot of information, it's not really very useful for someone who wants to get his/her hands dirty and really learn NLP/Text retrieval.

(+ Instructor is sometimes very hard to understand)
Was this review helpful to you? Yes
Kristina Š
by Kristina completed this course and found the course difficulty to be medium.
I'm encouraging more programming assignments dealing with NLP, and a bit smaller focus on C++ and more R/Py support. It was a fun experience, and I hope that the theoretical approach will slowly turn into a combination of theory and practice.

Was this review helpful to you? Yes
Anonymous
Anonymous completed this course.
It's not complete, but a good start point for who want to learn more about information retrieval. Great course. I recommend.
Was this review helpful to you? Yes
Colin K
by Colin completed this course.
0 person found
this review helpful
Was this review helpful to you? Yes
Basil R
Basil completed this course.
Was this review helpful to you? Yes
Deepak J
by Deepak is taking this course right now and found the course difficulty to be medium.
2 people found
this review helpful
Was this review helpful to you? Yes
Rafael P
Rafael completed this course.
0 person found
this review helpful
Was this review helpful to you? Yes
  • 1

Class Central

Get personalized course recommendations, track subjects and courses with reminders, and more.

Sign up for free

Never stop learning Never Stop Learning!

Get personalized course recommendations, track subjects and courses with reminders, and more.

Sign up for free