University of Oxford Logo University of OxfordDepartment of Computer Science - Home
Linked in
Linked in
Follow us on twitter
Twitter
On Facebook
Facebook
Instagram
Instagram

Information Retrieval:  2013-2014

Information

Overview

Modern internet search engines form the primary interface for most users interacting with the World Wide Web. The dramatic increase in the amount of data that is available on the Web, in recent years, means that automatic methods of Information Retrieval (IR) have acquired greater significance. For the purpose of this course, IR will mainly mean the study of the indexing, processing, storage and querying of textual data. The aim of the course is to provide an introduction to the core principles and techniques used in IR, and to demonstrate how statistical models of language can be used to solve document indexing and retrieval problems. In addition, we will look at the issues involved in indexing the entire web and the creative solutions to this problem currently deployed by large scale online search providers.

Learning outcomes

On completion of the course students will be expected to:

Prerequisites

Prior knowledge of elementary linear algebra would be helpful but is not required for this course. The practical side of this course has a relatively in-depth programming component. Students will build a vector space based information retrieval system from scratch using a programming language of their choice. Students should be familiar with object oriented programming, simple data structures such as hash maps, and text processing.

Synopsis

 

Syllabus

Boolean Model and Vector Space Model, evaluation in information retrieval, text representation and processing, relevance feedback and query expansion, index construction and compression, language models and smoothing techniques, document clustering, text classification, dimensionality reduction and semantic similarity, IR on the Web (Page Rank, HITS), web usage mining, other IR topics and challenges.

Reading list

Primary Texts

Secondary Texts