LREC 2000 2nd International Conference on Language Resources & Evaluation
 


Title Acquisition of Linguistic Patterns for Knowledge-based Information Extraction
Authors Harabagiu Sanda M. (Department of Computer Science and Engineering, Southern Methodist University, Dallas, TX 75275-0122, U.S.A., sanda@renoir.seas.smu.edu)
Maiorano Steven J. (Department of Computer Science and Engineering, Southern Methodist University, Dallas, TX 75275-0122, U.S.A., steve@renoir.seas.smu.edu)
Keywords  
Session Session WO11 - Mono-Multilingual Lexicon Acquisition and Building
Full Paper 347.ps, 347.pdf
Abstract In this paper we present a new method for the automatic acquisition of linguistic patterns for Information Extraction, as implemented in the CICERO system. Our approach combines lexico-semantic information available from the WordNet database with collocation data extracted from training corpora. Because the WordNet information is open-domain and large collections of texts are immediately available, our method can be easily ported to open-domain Information Extraction.