Title |
Word Sketches for Turkish |
Authors |
Bharat Ram Ambati, Siva Reddy and Adam Kilgarriff |
Abstract |
Word sketches are one-page, automatic, corpus-based summaries of a word's grammatical and collocational behaviour. In this paper we present word sketches for Turkish. Until now, word sketches have been generated using a purpose-built finite-state grammars. Here, we use an existing dependency parser. We describe the process of collecting a 42 million word corpus, parsing it, and generating word sketches from it. We evaluate the word sketches in comparison with word sketches from a language independent sketch grammar on an external evaluation task called topic coherence, using Turkish WordNet to derive an evaluation set of coherent topics. |
Topics |
Tools, systems, applications, Lexicon, lexical database, Web Services |
Full paper |
Word Sketches for Turkish |
Bibtex |
@InProceedings{AMBATI12.585,
author = {Bharat Ram Ambati and Siva Reddy and Adam Kilgarriff}, title = {Word Sketches for Turkish}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |