University of Oxford, Department of Computer Science

Visual Annotation of Web Objects using OXPath

Supervisor

Suitable for

Abstract

Background: The work will be done in the context of the large ERC project DIADEM (Domain-centric Intelligent Automated Data Extraction Methodology), whose goal is to automate web data extraction in specific application domains such as real estate and restaurants.

Principal goal of the MSc or Honour School project

This proposal involves designing and implementing a web browser plugin for manually (and visually) annotating objects on a web page with concepts of a given ontology.

Such annotations will be expressed by linking object instances to OXPath expressions, which can be produced semi-automatically. The plugin will offer features for modifying these expressions. It will also be designed as the front end of a system for querying web pages for objects using query languages such as SPARQL.
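
As a rough illustration of what "semi-automatically produced" could mean, the sketch below derives a positional path expression for a selected node and pairs it with an ontology concept. It is only a sketch under stated assumptions: it emits plain XPath (which OXPath extends) rather than any OXPath-specific syntax, it works on a toy tree of plain objects instead of a real browser DOM, and the names `el`, `xpathFor`, `annotate`, and the concept "Price" are hypothetical and not part of DIADEM or OXPath.

```javascript
// Hypothetical helper: build a toy DOM-like node and wire up parent links.
function el(tagName, ...children) {
  const node = { tagName, parent: null, children };
  for (const child of children) child.parent = node;
  return node;
}

// Hypothetical helper: walk from the node up to the root, recording each
// step as tag[position] among same-tag siblings (XPath positions are 1-based).
function xpathFor(node) {
  const steps = [];
  for (let n = node; n; n = n.parent) {
    if (!n.parent) {
      steps.unshift(n.tagName); // root element needs no position
    } else {
      const sameTag = n.parent.children.filter((c) => c.tagName === n.tagName);
      steps.unshift(`${n.tagName}[${sameTag.indexOf(n) + 1}]`);
    }
  }
  return "/" + steps.join("/");
}

// Hypothetical annotation record: an ontology concept linked to an expression.
function annotate(concept, node) {
  return { concept, expression: xpathFor(node) };
}

// Toy page: the node the user "clicked" is the second span in the second div.
const price = el("span");
const page = el("html", el("body", el("div"), el("div", el("span"), price)));

const annotation = annotate("Price", price);
console.log(annotation);
```

A generated expression like this would only be a starting point; the user could then generalise it in the plugin, for example by dropping a positional predicate so that it matches every similar object on the page.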

Skills Needed: This project requires good analytical and software engineering skills, and involves programming languages and web technologies such as JavaScript, XPath, and CSS.

Supervision: The project is co-supervised by Dr. Giorgio Orsi.