Title |
German Today: a really extensive Corpus of Spoken Standard German |
Authors |
Caren Brinckmann, Stefan Kleiner, Ralf Knöbl and Nina Berend |
Abstract |
The research project German Today aims to determine the amount of regional variation in (near-)standard German spoken by young and older educated adults and to identify and locate regional features. To this end, we compile an areally extensive corpus of read and spontaneous German speech. Secondary school students and 50-to-60-year-old locals are recorded in 160 cities throughout the German speaking area of Europe. All participants read a number of short texts and a word list, name pictures, translate words and sentences from English, answer questions in a sociobiographic interview, and take part in a map task experiment. The resulting corpus comprises over 1,000 hours of speech, which is transcribed orthographically. Automatically derived broad phonetic transcriptions, selective manual narrow phonetic transcriptions, and variationalist annotations are added. Focussing on phonetic variation we aim to show to what extent national or regional standards exist in spoken German. Furthermore, the linguistic variation due to different contextual styles (read vs. spontaneous speech) shall be analysed. Finally, the corpus enables us to investigate whether linguistic change has occurred in spoken (near-)standard German. |
Language |
Single language |
Topics |
Corpus (creation, annotation, etc.), Speech resource/database, Phonetic Databases, Phonology |
Full paper |
German Today: a really extensive Corpus of Spoken Standard German |
Slides |
- |
Bibtex |
@InProceedings{BRINCKMANN08.806,
author = {Caren Brinckmann, Stefan Kleiner, Ralf Knöbl and Nina Berend},
title = {German Today: a really extensive Corpus of Spoken Standard German},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |