LREC 2000 2nd International Conference on Language Resources & Evaluation | ||||||
Title | NaniTrans: a Speech Labelling Tool |
Authors | Portabella David (Universitat Politècnica de Catalunya, Jordi Girona 1-3 08034 Barcelona, SPAIN, http://gps-tsc.upc.es/veu) Febrer Albert (Universitat Politècnica de Catalunya, Jordi Girona 1-3 08034 Barcelona, SPAIN, http://gps-tsc.upc.es/veu) Moreno Asunción (Universitat Politècnica de Catalunya, Jordi Girona 1-3 08034 Barcelona, SPAIN, http://gps-tsc.upc.es/veu, asuncion@tsc.upc.es) |
Keywords | Annotation Tools, Labelling |
Session | Session SP4 - Tools for Evaluation and Processing of Spoken Language Resources |
Full Paper | 345.ps, 345.pdf |
Abstract | This paper deals with a description of NaniTrans, a tool for segmentation and labeling of speech. The tool is programmed to work on the MATLAB application interface, in any of the supported platforms (Unix, Windows, Macintosh). The tool has been designed to annotate large speech databases, which can be also partially preprocessed (but require manual supervision). It supports the definition of an environment of annotation: set of annotation levels (orthographic, phonetic, etc.), display mode (how to show information), graphic representation (waveform, spectrogram), keyboard short-cuts, etc. This configuration is then used on a speech database. A safe file locking system allows many annotators to work concurrently on the same speech database. The tool is very friendly and easy to use by non experienced annotators, and it is designed to optimize speed using both keyboard and mouse. New options or speech processing tools can be easily added by using any MATLAB or user defined function. |