University of Oxford Logo University of OxfordDepartment of Computer Science - Home

Information Retrieval:  2011-2012

Information

Lecturer

Degrees

Schedule C1Honour School of Computer Science

Schedule C1Honour School of Mathematics and Computer Science

Schedule CMSc in Computer Science

Term

Overview

Modern internet search engines form the primary interface for most users interacting with the World Wide Web. The dramatic increase in the amount of data that is available on the Web, in recent years, means that automatic methods of Information Retrieval (IR) have acquired greater significance. For the purpose of this course IR is the study of the indexing, processing, storage and querying of textual data. The aim of the course is to provide an introduction to the core principles and techniques used in IR, and to demonstrate how statistical models of language can be used to solve the document indexing and retrieval problems. In addition, we will look at the issues involved in indexing the entire web and the creative solutions to this problem currently deployed by large scale online search providers.

Learning outcomes

On completion of the course students will be expected to:

Prerequisites

Prior knowledge of elementary linear algebra would be helpful but is not required for this course. The practical side of this course has a relatively in depth programming component. Students will build a vector space based information retrieval system from scratch using a programming language of their choice. Students should be familiar with object oriented programming, simple data structures such as hash maps, and text processing.

Synopsis

Reading list

Primary Texts

Secondary Texts