Title |
Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema |
Authors |
Simon Krek, Vojko Gorjanc and pela Arhar |
Abstract |
The paper describes the project whose main purpose is the creation of the Slovene terminology web portal, funded by the Slovene Research Agency and the Amebis software company. It focuses on the DTD/schema used for the unification of different terminology resources in different input formats into one database available on the web. Two projects involving unification DTD/schemas were taken as the model for the resulting DTD/schema: the CONCEDE project and the TMF project. The final DTD/schema was tested on twenty different specialized dictionaries, both monolingual and bilingual, in various formats either without any existing markup or with complex XML structure. The result of the project will be an on-line terminology resource for Slovenian which will also include didactic material on terminology and free tools for uploading domain-specific text collections to be processed with NLP software, including a term extractor. |
Language |
Single language |
Topics |
LR national/international projects, organizational/policy issues, LR web services, LR Infrastructures and Architectures |
Full paper |
Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema |
Slides |
- |
Bibtex |
@InProceedings{KREK08.553,
author = {Simon Krek, Vojko Gorjanc and pela Arhar},
title = {Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |