Title |
LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation |
Authors |
Romaric Besançon, Gaël de Chalendar, Olivier Ferret, Faiza Gara, Olivier Mesnard, Meriama Laïb and Nasredine Semmar |
Abstract |
The increasing amount of available textual information makes necessary the use of Natural Language Processing (NLP) tools. These tools have to be used on large collections of documents in different languages. But NLP is a complex task that relies on many processes and resources. As a consequence, NLP tools must be both configurable and efficient: specific software architectures must be designed for this purpose. We present in this paper the LIMA multilingual analysis platform, developed at CEA LIST. This configurable platform has been designed to develop NLP based industrial applications while keeping enough flexibility to integrate various processes and resources. This design makes LIMA a linguistic analyzer that can handle languages as different as French, English, German, Arabic or Chinese. Beyond its architecture principles and its capabilities as a linguistic analyzer, LIMA also offers a set of tools dedicated to the test and the evaluation of linguistic modules and to the production and the management of new linguistic resources. |
Topics |
Tools, systems, applications, LR Infrastructures and Architectures, Multilinguality |
Full paper |
LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation |
Slides |
- |
Bibtex |
@InProceedings{BESANON10.537,
author = {Romaric Besançon and Gaël de Chalendar and Olivier Ferret and Faiza Gara and Olivier Mesnard and Meriama Laïb and Nasredine Semmar}, title = {LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |