Title |
sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks |
Authors |
Darja Fišer, Aleš Tavčar and Tomaž Erjavec |
Abstract |
The paper presents sloWCrowd, a simple tool developed to facilitate crowdsourcing lexicographic tasks, such as error correction in automatically generated wordnets and semantic annotation of corpora. The tool is open-source, language-independent and can be adapted to a broad range of crowdsourcing tasks. Since volunteers who participate in our crowdsourcing tasks are not trained lexicographers, the tool has been designed to obtain multiple answers to the same question and compute the majority vote, making sure individual unreliable answers are discarded. We also make sure unreliable volunteers, who systematically provide unreliable answers, are not taken into account. This is achieved by measuring their accuracy against a gold standard, the questions from which are posed to the annotators on a regular basis in between the real question. We tested the tool in an extensive crowdsourcing task, i.e. error correction of the Slovene wordnet, the results of which are encouraging, motivating us to use the tool in other annotation tasks in the future as well. |
Topics |
Lexicon, Lexical Database, Word Sense Disambiguation |
Full paper |
sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks |
Bibtex |
@InProceedings{FIER14.1106,
author = {Darja Fišer and Aleš Tavčar and Tomaž Erjavec}, title = {sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |