Bootstrapping document annotation with Schema.org
The recently launched Schema.org initiative, backed by the major search engine providers, aims to foster semantic annotation across the Web. Semi-automatic annotation of natural language documents is a long-standing research problem. The goal of this project is to apply state-of-the-art annotation techniques to a large corpus, using the Schema.org vocabulary as the semantic model.
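To make the target output concrete, the following is a minimal sketch of what annotating text against the Schema.org model could look like. It is not the method the project prescribes: the gazetteer lookup stands in for a real annotation technique, and the names `GAZETTEER` and `annotate` as well as the sample entries are assumptions made for illustration. Only the type names `Person`, `Organization`, and `Place` come from the Schema.org vocabulary.

```python
import json

# Hypothetical gazetteer: surface string -> Schema.org type (illustrative only).
GAZETTEER = {
    "Tim Berners-Lee": "Person",
    "CERN": "Organization",
    "Geneva": "Place",
}


def annotate(text):
    """Return (start, end, json_ld) triples for gazetteer entries found in text.

    Only the first occurrence of each entry is reported; a real annotator
    would handle repeated mentions, ambiguity, and overlapping spans.
    """
    found = []
    for surface, schema_type in GAZETTEER.items():
        start = text.find(surface)
        if start == -1:
            continue
        json_ld = {
            "@context": "https://schema.org",
            "@type": schema_type,
            "name": surface,
        }
        found.append((start, start + len(surface), json_ld))
    # Order annotations by their position in the text.
    return sorted(found, key=lambda item: item[0])


if __name__ == "__main__":
    sentence = "Tim Berners-Lee worked at CERN near Geneva."
    for start, end, json_ld in annotate(sentence):
        print(start, end, json.dumps(json_ld))
```

Keeping the character offsets outside the JSON-LD object is a deliberate choice in this sketch, since offsets are not Schema.org properties; in a semi-automatic setting a human reviewer could confirm or reject each proposed span.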
Some familiarity with topics from Computational Linguistics and Knowledge Representation and Reasoning is expected.