SUMMARY: Session O34-WE Morphology & Tagging

 

Title Part-of-Speech Tagging of Transcribed Speech
Authors M. Mieskes, M. Strube
Abstract We used four Part-of-Speech taggers, which are available for research purposes and were originally trained on text, to tag a corpus of transcribed multiparty spoken dialogues. The assigned tags were then manually corrected. The corrected tags were first used to evaluate the four taggers and then to retrain them. Despite limited resources in time, money and annotators, we reached results comparable to those reported for the taggers on text. Based on our experience, we present guidelines for producing reliably POS-tagged corpora for new domains.
Keywords ICSI Meeting Recorder Project, multiparty dialogues
Full paper Part-of-Speech Tagging of Transcribed Speech