Title |
Extracting Information for Automatic Indexing of Multimedia Material |
Authors |
Horacio Saggion (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) Hamish Cunningham (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) Diana Maynard (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) Kalina Bontcheva (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) Oana Hamza (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) Christian Ursu (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) Yorick Wilks (Department of Computer Science, University of Sheffield, Regent Court 211 Portobello Street S1 4DP - Sheffield - England, UK) |
Session |
MMO3: Collection & Indexing Of Multimodal LR |
Abstract |
This paper discusses our work on information extraction (IE) from multi-lingual, multi-media, multi-genre Language Resources, in a domain where there are many different event types. This work is being carried out in the context of MUMIS, an EU-funded project that aims at the development of basic technology for the creation of a composite index from multiple and multi-lingual sources. Our approach to IE relies on a finite state machinery provided by GATE, a General Architecture for Text Engineering, pipelined with full syntactic analysis and discourse interpretation implemented in Prolog. |
Keywords |
Automatic indexing, Multimedia material |
Full Paper |