Title |
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. |
Authors |
Alessio Bosca, Luca Dini, Milen Kouylekov and Marco Trevisan |
Abstract |
In order to handle the increasing amount of textual information today available on the web and exploit the knowledge latent in this mass of unstructured data, a wide variety of linguistic knowledge and resources (Language Identification, Morphological Analysis, Entity Extraction, etc.). is crucial. In the last decade LRaas (Language Resource as a Service) emerged as a novel paradigm for publishing and sharing these heterogeneous software resources over the Web. In this paper we present an overview of Linguagrid, a recent initiative that implements an open network of linguistic and semantic Web Services for the Italian language, as well as a new approach for enabling customizable corpus-based linguistic services on Linguagrid LRaaS infrastructure. A corpus ingestion service in fact allows users to upload corpora of documents and to generate classification/clustering models tailored to their needs by means of standard machine learning techniques applied to the textual contents and metadata from the corpora. The models so generated can then be accessed through proper Web Services and exploited to process and classify new textual contents. |
Topics |
LR Infrastructures and Architectures, Web Services, Tools, systems, applications |
Full paper |
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. |
Bibtex |
@InProceedings{BOSCA12.867,
author = {Alessio Bosca and Luca Dini and Milen Kouylekov and Marco Trevisan}, title = {Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |