University of Oxford, Department of Computer Science

Car Intelligence: Domain Analysis and Intelligent Services

Supervisor

Suitable for

Abstract

In the DIADEM project, we aim to extract data at scale from specific domains such as real estate or used cars. This project will carry out a first analysis of the used-car domain and formalise both the domain concepts and their phenomenology (how they appear) on current web sites. It will also investigate, and design prototypes of, value-added services in the car domain based on the extracted structured data. The work will be done in the context of the large ERC project DIADEM (Domain-centric Intelligent Automated Data Extraction Methodology), whose goal is to automate web data extraction in specific application domains such as real estate, restaurants, and so on. Ultimately, we want to construct a system that can automatically navigate web pages within a given application domain and extract the relevant data from those pages. The output should be a highly structured XML file that conforms to a pre-defined schema.
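To make the intended output more concrete, the following minimal sketch shows how extracted used-car records might be serialised as structured XML. It is an illustration only: the element names (`cars`, `car`, `make`, `model`, `price`, `mileage`), the helper functions, and the sample record are all assumptions made for this example, and the real DIADEM schema is not specified here.

```python
import xml.etree.ElementTree as ET

# Hypothetical fields for a used-car record; the actual schema is not given
# in this project description.
REQUIRED_FIELDS = ("make", "model", "price", "mileage")


def car_to_xml(record: dict) -> ET.Element:
    """Turn one extracted record into a <car> element, checking required fields."""
    missing = [field for field in REQUIRED_FIELDS if field not in record]
    if missing:
        raise ValueError(f"record is missing required fields: {missing}")
    car = ET.Element("car")
    for field in REQUIRED_FIELDS:
        child = ET.SubElement(car, field)
        child.text = str(record[field])
    return car


def records_to_xml(records) -> str:
    """Wrap all records in a single <cars> root and return the XML as text."""
    root = ET.Element("cars")
    for record in records:
        root.append(car_to_xml(record))
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Invented sample data, standing in for the result of an extraction run.
    sample = [{"make": "Ford", "model": "Focus", "price": 4995, "mileage": 62000}]
    print(records_to_xml(sample))
```

The simple check for required fields stands in for schema validation; a real system would more likely validate the output against the pre-defined schema itself.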

This project is co-supervised by Dr Tim Furche and Dr Christian Schallhart.