Summary of the paper

Title Publishing DGS corpus data: Different Formats for Different Needs
Authors Elena Jahn, Reiner Konrad, Gabriele Langer, Sven Wagner and Thomas Hanke
Abstract In 2010-2012, the DGS-Korpus project collected a large corpus of German Sign Language (DGS). Now, a substantial subset of the data is published, namely the Public DGS Corpus. We describe the considerations and decisions taken regarding what part of the data is to be made public, the necessary quality assurance measures to the data preparation as well as the formats of the published data. The corpus is published in three different ways in order to fulfil the needs of a variety of different users. First of all, the data is made available to the language community whose members allowed us to share their recorded language. In addition, we hope that a large number of non-scientific users with various backgrounds will find the data useful. Last but not least, we aim to make the data attractive for users with a scientific background and provide the possibility to conduct studies based on it, irrespective of whether they are familiar with DGS or not.
Full paper Publishing DGS corpus data: Different Formats for Different Needs
Bibtex @InProceedings{JAHN18.18018,
  author = {Elena Jahn ,Reiner Konrad ,Gabriele Langer ,Sven Wagner and Thomas Hanke},
  title = {Publishing DGS corpus data: Different Formats for Different Needs},
  booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
  year = {2018},
  month = {may},
  date = {7-12},
  location = {Miyazaki, Japan},
  editor = {Mayumi Bono and Eleni Efthimiou and Stavroula-Evita Fotinea and Thomas Hanke and Julie Hochgesang and Jette Kristoffersen and Johanna Mesch and Yutaka Osugi},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {979-10-95546-01-6},
  language = {english}
  }
Powered by ELDA © 2018 ELDA/ELRA