Paper |
Paper Title |
Authors |
1 |
The Cost258 Signal Generation Test Array |
Gérard Bailly, Eduardo R. Banga, Alex Monaghan, Erhard Rank |
2 |
Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - |
Kosho Shudo, Masahito Takahashi, Yasuo Koyama, Kenji Yoshimura |
5 |
Enhancing the TDT Tracking Evaluation |
Amit Bagga |
7 |
GREEK ToBI: A System for the Annotation of Greek Speech Corpora |
Amalia Arvaniti, Mary Baltazani |
8 |
English Senseval: Report and Results |
Adam Kilgarriff, Joseph Rosenzweig |
10 |
SALA: SpeechDat across Latin America. Results of the First Phase |
Asunción Moreno, Robrecht Comeyne, Keith Haslam, Henk van den Heuvel, Harald Höge, Sabine Horbach, Giorgio Micca |
11 |
Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging |
Dan Tufiş |
12 |
TransSearch: A Free Translation Memory on the World Wide Web |
Elliott Macklovitch, Michel Simard, Philippe Langlais |
13 |
Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language |
Bolette Sandford Pedersen, Sanni Nimb |
14 |
A Comparison of Summarization Methods Based on Task-based Evaluation |
Mochizuki Hajime, Okumura Manabu |
15 |
A Word Sense Disambiguation Method Using Bilingual Corpus |
Zheng Jie, Mao Yuhang |
16 |
Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering |
Stavroula-Evita Fotinea, Ioannis Dologlou, Stylianos Bakamidis, Gregory Stainhaouer, George Carayannis |
17 |
Terms Specification and Extraction within a Linguistic-based Intranet Service |
Sandro Pedrazzini, Elisabeth Maier, Dierk König |
18 |
Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level |
Eva Hajičová, Petr Sgall |
19 |
Coreference in Annotating a Large Corpus |
Eva Hajičová, Jarmila Panenová, Petr Sgall |
20 |
Designing a Tool for Exploiting Bilingual Comparable Corpora |
Peter Bennison, Lynne Bowker |
22 |
Creating and Using Domain-specific Ontologies for Terminological Applications |
Diana Maynard, Sophia Ananiadou |
26 |
The TREC-8 Question Answering Track |
Ellen M. Voorhees, Dawn M. Tice |
27 |
IREX: IR & IE Evaluation Project in Japanese |
Satoshi Sekine, Hitoshi Isahara |
28 |
Towards A Universal Tool For NLP Resource Acquisition |
Svetlana Sheremetyeva, Sergei Nirenburg |
29 |
The Multi-layer Language Knowledge Base of Chinese NLP |
Hu Junfeng, Yu Shiwen |
31 |
With WORLDTREK Family, Create, Update and Browse your Terminological World |
Yasmina Abbas, Marie-Luce Picard |
32 |
Etude et Evaluation de la Di-Syllabe comme Unité Acoustique pour le Système de Synthèse Arabe PARADIS |
N. Chenfour, A. Benabbou, A. Mouradi |
33 |
Dialogue Annotation for Language Systems Evaluation |
Marcela Charfuelán, José Relaño Gil, M. Carmen Rogríguez Gancedo, Daniel Tapias Merino, Luis Hernández Gómez |
34 |
Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures |
Philippe Langlais, Sébastien Sauvé, George Foster, Elliott Macklovitch, Guy Lapalme |
35 |
Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs |
Gerardo Sierra, John McNaught |
36 |
Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm |
Jean-Yves Antoine, Jacques Siroux, Jean Caelen, Jeanne Villaneau, Jérôme Goulian, Mohamed Ahafhaf |
37 |
MHATLex: Lexical Resources for Modelling the French Pronunciation |
Guy Pérennou, Martine De Calmès |
38 |
Dialogue and Prompting Strategies Evaluation in the DEMON System |
Carine-Alexia Lavelle, Martine De Calmès, Guy Pérennou |
39 |
SLR Validation: Present State of Affairs and Prospects |
Henk van den Heuvel, Lou Boves, Khalid Choukri, Simo Goddijn, Eric Sanders |
41 |
EULER: an Open, Generic, Multilingual and Multi-platform Text-to-Speech System |
Thierry Dutoit, Michel Bagein, Fabrice Malfrère, Vincent Pagel, Alain Ruelle, Nawfal Tounsi, Dominique Wynsberghe |
43 |
On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems |
Marc Swerts, Emiel Krahmer |
44 |
A Word-level Morphosyntactic Analyzer for Basque |
I. Aduriz, E. Agirre, I. Aldezabal, X. Arregi, J. M. Arriola, X. Artola, K. Gojenola, A. Maritxalar, K. Sarasola, M. Urkia |
45 |
The EUDICO Project, Multi Media Annotation over the Internet |
Albert Russel, Hennie Brugman, Daan Broeder, Peter Wittenburg |
47 |
Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon |
Anna Braasch, Sussi Olsen |
48 |
Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek |
Stavroula-Evita Fotinea, Athanassios Protopapas, Dimitris Dimitriadis, George Carayannis |
52 |
Creation of Spoken Hebrew Databases |
Tami Rannon, Ofra Golani, Anat Goren, Sherrie Shammass, Ami Moyal |
53 |
PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists |
Damjan Vlaj, Janez Kaiser, Ralph Wilhelm, Ute Ziegenhain |
55 |
Use of Greek and Latin Forms for Term Detection |
Rosa Estopà, Jordi Vivaldi, M. Teresa Cabré |
56 |
Methods and Metrics for the Evaluation of Dictation Systems: a Case Study |
Maria Canelli, Daniele Grasso, Margaret King |
58 |
Cairo: An Alignment Visualization Tool |
Noah A. Smith, Michael E. Jahr |
59 |
An XML-based Representation Format for Syntactically Annotated Corpora |
Andreas Mengel, Wolfgang Lezius |
60 |
An Experiment of Lexical-Semantic Tagging of an Italian Corpus |
Ornella Corazzari, Nicoletta Calzolari, Antonio Zampolli |
61 |
SIMPLE: A General Framework for the Development of Multilingual Lexicons |
Nuria Bel, Federica Busa, Nicoletta Calzolari, Elisabetta Gola, Alessandro Lenci, Monica Monachini, Antoine Ogonowski, Ivonne Peters, Wim Peters, Nilda Ruimy, Marta Villegas, Antonio Zampolli |
62 |
Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX |
Zygmunt Vetulani |
63 |
SPEECON - Speech Data for Consumer Devices |
Rainer Siemund, Harald Höge, Siegfried Kunzmann, Krzysztof Marasek |
66 |
A Treebank of Spanish and its Application to Parsing |
Antonio Moreno, Ralph Grishman, Susana López, Fernando Sánchez, Satoshi Sekine |
67 |
End-to-End Evaluation of Machine Interpretation Systems: A Graphical Evaluation Tool |
Susanne J. Jekat, Lorenzo Tessiore |
68 |
A Proposal for the Integration of NLP Tools using SGML-Tagged Documents |
X. Artola, A. Díaz de Ilarraza, N. Ezeiza, K. Gojenola, A. Maritxalar, A. Soroa |
69 |
A Bilingual Electronic Dictionary for Frame Semantics |
Thierry Fontenelle |
70 |
The Evaluation of Systems for Cross-language Information Retrieval |
Martin Braschler, Donna Harman, Michael Hess, Michael Kluck, Carol Peters, Peter Schäuble |
71 |
Spoken Portuguese: Geographic and Social Varieties |
José Bettencourt Gonçalves, Rita Veloso |
72 |
Portuguese Corpora at CLUL |
Maria Fernanda Bacelar do Nascimento, Luisa Pereira, João Saramago |
74 |
Reusing the Mikrokosmos Ontology for Concept-based Multilingual Terminology Databases |
Antonio Moreno, Chantal Pérez |
75 |
Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation |
Kimura Kazuhiro, Hirakawa Hideki |
76 |
Will Very Large Corpora Play For Semantic Disambiguation The Role That Massive Computing Power Is Playing For Other AI-Hard Problems? |
Alessandro Cucchiarelli, Enrico Faggioli, Paola Velardi |
77 |
Guidelines for Japanese Speech Synthesizer Evaluation |
Shuichi Itahashi |
78 |
Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts |
Masumi Narita |
79 |
Extraction of Unknown Words Using the Probability of Accepting the Kanji Character Sequence as One Word |
Hiroyuki Shinnou, Masanori Ikeya |
80 |
Automatic Speech Segmentation in High Noise Condition |
Rosen Ivanov |
81 |
Open Ended Computerized Overview of Controlled Languages |
Elisa Gavieiro-Villatte, Laurent Spaggiari |
82 |
Shallow Parsing and Functional Structure in Italian Corpora |
Rodolfo Delmonte |
84 |
Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon |
Dimitrios Kokkinakis, Maria Toporowska Gronostaj, Karin Warmenius |
85 |
Providing Internet Access to Portuguese Corpora: the AC/DC Project |
Diana Santos, Eckhard Bick |
86 |
Turkish Electronic Living Lexicon (TELL): A Lexical Database |
Sharon Inkelas, Aylin Küntay, C. Orhan Orgun, Ronald Sprouse |
87 |
Orthographic Transcription of the Spoken Dutch Corpus |
Wim Goedertier, Simo Goddijn, Jean-Pierre Martens |
90 |
Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers |
Giulia Bernardis, Hervé Bourlard, Martin Rajman, Jean-Cédric Chappelier |
91 |
An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser |
Guillermo Rojo, Maria Concepción Álvarez, Pilar Alvariño, Adelaida Gil, María Paula Santalla, Susana Sotelo |
92 |
Target Suites for Evaluating the Coverage of Text Generators |
John A. Bateman, Anthony F. Hartley |
93 |
LT TTT - A Flexible Tokenisation Tool |
Claire Grover, Colin Matheson, Andrei Mikheev, Marc Moens |
94 |
Perception and Analysis of a Reiterant Speech Paradigm: a Functional Diagnostic of Synthetic Prosody |
Albert Rilliard, Véronique Aubergé |
95 |
Development and Evaluation of an Italian Broadcast News Corpus |
Marcello Federico, Dimitri Giordani, Paolo Coletti |
96 |
Multilingual Linguistic Resources: From Monolingual Lexicons to Bilingual Interrelated Lexicons |
Marta Villegas, Nuria Bel, Alessandro Lenci, Nicoletta Calzolari, Nilda Ruimy, Antonio Zampolli, Teresa Sadurní, Joan Soler |
98 |
Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation |
Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli, Claudia Soria |
99 |
Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation |
Paolo Allegrini, Simonetta Montemagni, Vito Pirrelli |
100 |
Coreference Annotation: Whither? |
Rodger Kibble, Kees van Deemter |
101 |
Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control |
R. López-Cózar, A.J. Rubio, J.E. Díaz Verdejo, A. De la Torre |
104 |
MDWOZ: A Wizard of Oz Environment for Dialog Systems Development |
Cosmin Munteanu, Marian Boldea |
105 |
A Web-based Text Corpora Development System |
Dan Bohuş, Marian Boldea |
106 |
Term-based Identification of Sentences for Text Summarisation |
Byron Georgantopoulos, Stelios Piperidis |
107 |
Morphemic Analysis and Morphological Tagging of Latvian Corpus |
Kristīne Levāne, Andrejs Spektors |
109 |
Textual Information Retrieval Systems Test: The Point of View of an Organizer and Corpuses Provider |
Patrick Kremer, Laurent Schmitt |
110 |
The Spoken Dutch Corpus. Overview and First Evaluation |
Nelleke Oostdijk |
111 |
A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing |
Toni Badia, Àngels Egea |
112 |
Producing LRs in Parallel with Lexicographic Description: the DCC project |
Joan Soler i Bou |
113 |
A Novelty-based Evaluation Method for Information Retrieval |
Atsushi Fujii, Tetsuya Ishikawa |
115 |
Towards More Comprehensive Evaluation in Anaphora Resolution |
Ruslan Mitkov |
116 |
Galaxy-II as an Architecture for Spoken Dialogue Evaluation |
Joseph Polifroni, Stephanie Seneff |
119 |
Building the Croatian-English Parallel Corpus |
Marko Tadić |
122 |
Lexical and Translation Equivalence in Parallel Corpora |
Tamás Váradi |
125 |
Towards a Standard for Meta-descriptions of Language Resources |
D. Broeder, H. Brugman, A. Russel, R. Skiba, P. Wittenburg |
128 |
Object-oriented Access to the Estonian Phonetic Database |
Einar Meister, Arvo Eek, Toomas Altosaar, Martti Vainio |
129 |
ItalWordNet: a Large Semantic Database for Italian |
Adriana Roventini, Antonietta Alonge, Nicoletta Calzolari, Bernardo Magnini, Francesca Bertagna |
130 |
FAST - Towards a Semi-automatic Annotation of Corpora |
Cătălina Barbu |
131 |
Coreference Resolution Evaluation Based on Descriptive Specificity |
François Trouilleux, Eric Gaussier, Gabriel G. Bès, Annie Zaenen |
132 |
A Text->Meaning->Text Dictionary and Process |
Dominique Dutoit |
133 |
A French Phonetic Lexicon with Variants for Speech and Language Processing |
Philippe Boula de Mareüil, Christophe d'Alessandro, François Yvon, Véronique Aubergé, Jacqueline Vaissière, Angélique Amelot |
134 |
Annotating Communication Problems Using the MATE Workbench |
Laila Dybkjær, Morten Baun Møller, Niels Ole Bernsen, Michael Grosse, Martin Olsen, Amanda Schiffrin |
135 |
A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components |
Niels Ole Bernsen, Laila Dybkjær |
136 |
Evaluating Translation Quality as Input to Product Development |
Niamh Bohan, Elisabeth Breidt, Martin Volk |
137 |
Evaluation of Word Alignment Systems |
Lars Ahrenberg, Magnus Merkel, Anna Sågvall Hein, Jörg Tiedemann |
138 |
How To Evaluate and Compare Tagsets? A Proposal |
Hervé Déjean |
139 |
Determining the Tolerance of Text-handling Tasks for MT Output |
John White, Jennifer Doyon, Susan Talbott |
140 |
A Parallel Corpus of Italian/German Legal Texts |
Johann Gamper |
141 |
Integrating Seed Names and ngrams for a Named Entity List and Classifier |
Sabine Buchholz, Antal van den Bosch |
142 |
Automatically Expansion of Thesaurus Entries with a Different Thesaurus |
Hideki Kashioka, Satosi Shirai |
145 |
Learning Verb Subcategorization from Corpora: Counting Frame Subsets |
Daniel Zeman, Anoop Sarkar |
146 |
Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets |
Sašo Džeroski, Tomaž Erjavec, Jakub Zavrel |
147 |
Cross-lingual Interpolation of Speech Recognition Models |
Giorgio Micca, Alessandra Frasca, Maria Gabriella Di Benedetto |
148 |
Lexicalised Systematic Polysemy in WordNet |
Wim Peters, Ivonne Peters |
151 |
Experiences of Language Engineering Algorithm Reuse |
Björn Gambäck, Fredrik Olsson |
153 |
Derivation in the Czech National Corpus |
Jana Klímová, Jan Kocek |
155 |
Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers |
Jakub Zavrel, Walter Daelemans |
156 |
The Context (not only) for Humans |
Barbora Hladká |
158 |
Something Borrowed, Something Blue: Rule-based Combination of POS Taggers |
Lars Borin |
159 |
Screffva: A Lexicographer's Workbench |
Jon Mills |
161 |
A Step toward Semantic Indexing of an Encyclopedic Corpus |
Philippe Alcouffe, Nicolas Gacon, Claude Roux, Frédérique Segond |
162 |
Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project |
Thomas Brey, Gerhard Hanrieder, Paul Heisterkamp, Ludwig Hitzenberger, Peter Regel-Brietzmann |
163 |
Evaluating Summaries for Multiple Documents in an Interactive Environment |
Gees C. Stein, Tomek Strzalkowski, G. Bowden Wise, Amit Bagga |
164 |
Grammarless Bracketing in an Aligned Bilingual Corpus |
Jorge Kinoshita |
165 |
A Semi-automatic System for Conceptual Annotation, its Application to Resource Construction and Evaluation |
W.J. Black, J. McNaught, G.P. Zarri, A. Persidis, A. Brasher, L. Gilardoni, E. Bertino, G. Semeraro, P. Leo |
166 |
The MATE Workbench Annotation Tool, a Technical Description |
Amy Isard, David McKelvie, Andreas Mengel, Morten Baun Møller |
167 |
Recruitment Techniques for Minority Language Speech Databases: Some Observations |
Rhys James Jones, John S. Mason, Louise Helliker, Mark Pawlewski |
168 |
Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation |
Charles L. Wayne |
169 |
PoS Disambiguation and Partial Parsing Bidirectional Interaction |
Montserrat Marimon Felipe, Jordi Porta Zamorano |
170 |
Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis |
Hamish Cunnigham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks |
172 |
XCES: An XML-based Encoding Standard for Linguistic Corpora |
Nancy Ide, Patrice Bonhomme, Laurent Romary |
173 |
Named Entity Recognition in Greek Texts |
Iason Demiros, Sotiris Boutsis, Voula Giouli, Maria Liakata, Harris Papageorgiou, Stelios Piperidis |
174 |
A Robust Parser for Unrestricted Greek Text |
Sotiris Boutsis, Prokopis Prokopidis, Voula Giouli, Stelios Piperidis |
175 |
A Computational Platform for Development of Morphologic and Phonetic Lexica |
Matej Rojc, Zdravko Kačič |
176 |
An Open Architecture for the Construction and Administration of Corpora |
Constantin Orăsan, Ramesh Krishnamurthy |
177 |
Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System |
Matej Rojc, Zdravko Kačič |
179 |
CLinkA A Coreferential Links Annotator |
Constantin Orăsan |
180 |
What's in a Thesaurus? |
Adam Kilgarriff, Colin Yallop |
181 |
A Unified POS Tagging Architecture and its Application to Greek |
Harris Papageorgiou, Prokopis Prokopidis, Voula Giouli, Stelios Piperidis |
182 |
Resources for Lexicalized Tree Adjoining Grammars and XML Encoding: TagML |
Patrice Bonhomme, Patrice Lopez |
183 |
Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers |
Andreas Witt, Harald Lüngen, Dafydd Gibbon |
184 |
ATLAS: A Flexible and Extensible Architecture for Linguistic Annotation |
Steven Bird, David Day, John Garofolo, John Henderson, Christophe Laprun, Mark Liberman |
185 |
Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches |
Pavel Skrelin, Tatiana Sherstinova |
186 |
Some Technical Aspects about Aligning Near Languages |
Lluís de Yzaguirre, Marta Ribas, Jordi Vivaldi, M. Teresa Cabré |
187 |
Corpus Resources and Minority Language Engineering |
Tony McEnery, Paul Baker, Lou Burnard |
189 |
CDB - A Database of Lexical Collocations |
Brigitte Krenn |
191 |
Evaluation for Darpa Communicator Spoken Dialogue Systems |
Marilyn Walker, Lynette Hirschman, John Aberdeen |
192 |
Transcribing with Annotation Graphs |
Edouard Geoffrois, Claude Barras, Steven Bird, Zhibiao Wu |
193 |
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results |
Massimo Poesio |
194 |
Towards a Query Language for Annotation Graphs |
Steven Bird, Peter Buneman, Wang-Chiew Tan |
196 |
The American National Corpus: A Standardized Resource for American English |
Catherine Macleod, Nancy Ide, Ralph Grishman |
197 |
Semantic Tagging for the Penn Treebank |
Martha Palmer, Hoa Trang Dang, Joseph Rosenzweig |
199 |
Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions |
Kiril Ribarov |
200 |
The (Un)Deterministic Nature of Morphological Context |
Kiril Ribarov |
201 |
A Framework for Cross-Document Annotation |
David Day, Alan Goldschen, John Henderson |
202 |
Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents |
Peggy Cadel, Hélène Ledouble |
203 |
How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done |
Eric J. Breck, John D. Burger, Lisa Ferro, Lynette Hirschman, David House, Marc Light, Inderjeet Mani |
205 |
What are Transcription Errors and Why are They made? |
Daniela Oppermann, Susanne Burger, Karl Weilhammer |
206 |
On the Usage of Kappa to Evaluate Agreement on Coding Tasks |
Barbara Di Eugenio |
208 |
Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora |
Sun Le, Jin Youbing, Du Lin, Sun Yufang |
209 |
Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium |
Christopher Cieri, Mark Liberman |
210 |
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts |
Christopher Cieri, David Graff, Mark Liberman, Nii Martey, Stephanie Strassel |
211 |
Using Machine Learning Methods to Improve Quality of Tagged Corpora and Learning Models |
Yuji Matsumoto, Tatsuo Yamashita |
212 |
Quality Control in Large Annotation Projects Involving Multiple Judges: The Case of the TDT Corpora |
Stephanie Strassel, David Graff, Nii Martey, Christopher Cieri |
213 |
Learning Preference of Dependency between Japanese Subordinate Clauses and its Evaluation in Parsing |
Takehito Utsuro |
214 |
Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era |
Lin-Shan Lee, Lee-Feng Chien |
215 |
Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation |
Lori Levin, Boris Bartlog, Ariadna Font Llitjos, Donna Gates, Alon Lavie, Dorcas Wallace, Taro Watanabe, Monika Woszczyna |
216 |
Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus |
Frank Van Eynde, Jakub Zavrel, Walter Daelemans |
217 |
The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora |
Karl Weilhammer, Daniela Oppermann, Susanne Burger |
218 |
Automatic Assignment of Grammatical Relations |
Leonardo Lesmo, Vincenzo Lombardo |
219 |
Integrating Subject Field Codes into WordNet |
Bernardo Magnini, Gabriela Cavaglià |
220 |
Building a Treebank for Italian: a Data-driven Annotation Schema |
Cristina Bosco, Vincenzo Lombardo, Daniela Vassallo, Leonardo Lesmo |
221 |
Typographical and Orthographical Spelling Error Correction |
Kyongho Min, William H. Wilson, Yoo-Jin Moon |
223 |
Application of WordNet ILR in Czech Word-formation |
Jana Klímová, Karel Pala |
224 |
POSCAT: A Morpheme-based Speech Corpus Annotation Tool |
Byeongchang Kim, Jin-seok Lee, Jeongwon Cha, Geunbae Lee |
226 |
A Flexible Infrastructure for Large Monolingual Corpora |
Uwe Quasthoff, Christian Wolff |
227 |
Automatic Transliteration and Back-transliteration by Decision Tree Learning |
Byung-Ju Kang, Key-Sun Choi |
228 |
Shallow Discourse Genre Annotation in CallHome Spanish |
Klaus Ries, Lori Levin, Liza Valle, Alon Lavie, Alex Waibel |
230 |
Building a Treebank for French |
Anne Abeillé, Lionel Clément, Alexandra Kinyon |
233 |
Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task |
Paola Merlo, Suzanne Stevenson |
234 |
Layout Annotation in a Corpus of Patient Information Leaflets |
Nadjet Bouayad-Agha |
235 |
A New Methodology for Speech Corpora Definition from Internet Documents |
D. Vaufreydaz, C. Bergamini, J.F. Serignat, L. Besacier, M. Akbar |
236 |
Coping with Lexical Gaps when Building Aligned Multilingual Wordnets |
Luisa Bentivogli, Emanuele Pianta, Fabio Pianesi |
237 |
Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus |
Young-Soog Chae, Key-Sun Choi |
239 |
Introduction of KIBS (Korean Information Base System) Project |
Young-Soog Chae, Key-Sun Choi |
241 |
Resources for Multilingual Text Generation in Three Slavic Languages |
John Bateman, Elke Teich, Geert-Jan Kruijff, Ivanna Kruijff-Korbayová, Serge Sharoff, Hana Skoumalová |
243 |
A Multi-view Hyperlexicon Resource for Speech and Language System Development |
Dafydd Gibbon, Thorsten Trippel |
244 |
Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture |
Lynne Cahill, Christy Doran, Roger Evans, Rodger Kibble, Chris Mellish, D. Paiva, Mike Reape, Donia Scott, Neil Tipper |
246 |
Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language |
Zdravko Kačič, Bogomir Horvat, Aleksandra Zögling |
247 |
ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora |
Christophe Jouis, ARC A3 |
248 |
A Parallel English-Japanese Query Collection for the Evaluation of On-Line Help Systems |
Richard F. E. Sutcliffe, Sadao Kurohashi |
249 |
Principled Hidden Tagset Design for Tiered Tagging of Hungarian |
Dan Tufiş, Péter Dienes, Csaba Oravecz, Tamás Váradi |
250 |
Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine |
Felisa Verdejo, Julio Gonzalo, Anselmo Peñas, Fernando López, David Fernández |
251 |
An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages |
Dafydd Gibbon, Ana Paula Quirino Simões, Martin Matthiesen |
252 |
Sublanguage Dependent Evaluation: Toward Predicting NLP performances |
Gabriel Illouz |
253 |
The Universal XML Organizer: UXO |
Jan-Torsten Milde, Markus Reinsch |
254 |
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation |
Helka Folch, Serge Heiden, Benoît Habert, Serge Fleury, Gabriel Illouz, Pierre Lafon, Julien Nioche, Sophie Prévost |
256 |
An Approach to Lexical Development for Inflectional Languages |
Davide Turcato, Janine Toole, Stavroula Tsiplakou, Trude Heift, Paul McFetridge |
257 |
Some Language Resources and Tools for Computational Processing of Portuguese at INESC |
Luzia Wittmann, Ricardo Daniel Ribeiro, Tânia Pêgo, Fernando Batista |
258 |
Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation |
Takehito Utsuro, Manabu Sassano |
259 |
Evaluation of a Generic Lexical Semantic Resource in Information Extraction |
Joyce Yue Chai |
260 |
The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting |
Jim Talley |
261 |
IPA Japanese Dictation Free Software Project |
Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kasuya Takeda, Atsushi Yamada, Akinori Itou, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee |
262 |
Spontaneous Speech Corpus of Japanese |
Kikuo Maekawa, Hanae Koiso, Sadaoki Furui, Hitoshi Isahara |
263 |
Annotating Resources for Information Extraction |
Sean Boisen, Michael R. Crystal, Richard Schwartz, Rebecca Stone, Ralph Weischedel |
267 |
The New Edition of the Natural Language Software Registry (an Initiative of ACL hosted at DFKI) |
Thierry Declerck, Alexander Werner Jachmann, Hans Uszkoreit |
269 |
Design Methodology for Bilingual Pronunciation Dictionary |
Jong-mi Kim |
271 |
LEXIPLOIGISSI: An Educational Platform for the Teaching of Terminology in Greece |
Constandina Economou, Spyros Raptis, Gregory Stainhaouer |
272 |
An HPSG-Annotated Test Suite for Polish |
Malgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupść, Adam Przepiórkowski |
274 |
The COST 249 SpeechDat Multilingual Reference Recogniser |
Finn Tore Johansen, Narada Warakagoda, Børge Lindberg, Gunnar Lehtinen, Zdravko Kačič, Andreh Žgank, Kjell Elenius, Gampiero Salvi |
275 |
Terminology Encoding in View of Multifunctional NLP Resources |
Marianna Katsoyannou, Eleni Efthimiou |
276 |
Terminology in Korea: KORTERM |
Key-Sun Choi, Young-Soog Chae |
277 |
Morphological Tagging to Resolve Morphological Ambiguities |
Gaëlle Birocheau |
278 |
An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research |
Sonja Nießen, Franz Josef Och, Gregor Leusch, Hermann Ney |
279 |
GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources |
Fiammetta Namer, Georgette Dal |
281 |
Le Programme Compalex (COMPAraison LEXicale) |
Josué Ndamba, Jean Silence Bayamboussa |
282 |
Many Uses, Many Annotations for Large Speech Corpora: Switchboard and TDT as Case Studies |
David Graff, Steven Bird |
283 |
Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future |
Gerhard Budin, Alan K. Melby |
285 |
Using a Formal Approach to Evaluate Grammars |
Bilel Gargouri, Mohamed Jmaiel, Abdelmajid Ben Hamadou |
286 |
Design Issues in Text-Independent Speaker Recognition Evaluation |
Alvin Martin, Mark Przybocki |
287 |
Developing Guidelines and Ensuring Consistency for Chinese Text Annotation |
Fei Xia, Martha Palmer, Nianwen Xue, Mary Ellen Okurowski, John Kovarik, Fu-Dong Chiou, Shizhe Huang, Tony Kroch, Mitch Marcus |
288 |
Corpora of Slovene Spoken Language for Multi-lingual Applications |
Jerneja Gros, France Mihelič, Simon Dobrišek, Tomaž Erjavec, Mario Žganec |
289 |
GRUHD: A Greek database of Unconstrained Handwriting |
E. Kavallieratou, N. Liolios, E. Koutsogeorgos, N. Fakotakis, G. Kokkinakis |
292 |
Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS |
France Mihelič, Jerneja Gros, Elmar Nöth, Volker Warnke |
294 |
NL-Translex: Machine Translation for Dutch |
Catia Cucchiarini, Johan Van Hoorde, Elizabeth D'Halleweyn |
295 |
Rarity of Words in a Language and in a Corpus |
Jaroslava Hlaváčová |
297 |
Language Resources Development at the Spanish Royal Academy |
Ángel Martín Municio, Guillermo Rojo, Fernando Sánchez León, Octavio Pinillos |
298 |
Reusability as Easy Adaptability: A Substantial Advance in NL Technology |
Irina Prodanof, Amedeo Cappelli, Lorenzo Moretti |
299 |
Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking |
Andrew Bredenkamp, Berthold Crysmann, Mirela Petrea |
300 |
The Bank of Swedish |
Martin Gellerstam, Yvonne Cederholm, Torgny Rasmark |
301 |
Automatic Style Categorisation of Corpora in the Greek Language |
George Tambouratzis, Stella Markantonatou, Nikolaos Hairetakis, George Carayannis |
302 |
Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts |
Aristomenis Thanopoulos, Nikos Fakotakis, George Kokkinakis |
303 |
Predictive Performance of Dialog Systems |
H. Bonneau-Maynard, L. Devillers, S. Rosset |
306 |
Automatic Generation of Dictionary Definitions from a Computational Lexicon |
Penny Labropoulou, Elena Mantzari, Harris Papageorgiou, Maria Gavrilidou |
307 |
Regional Pronunciation Variants for Automatic Segmentation |
Nicole Beringer, Marcia Neff |
310 |
SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties |
Mario Refice, Michelina Savino, Marco Altieri, Roberto Altieri |
312 |
Automotive Speech-Recognition - Success Conditions Beyond Recognition Rates |
Klaus Bengler |
313 |
The ISLE Corpus of Non-Native Spoken English |
Wolfgang Menzel, Eric Atwell, Patrizia Bonaventura, Daniel Herron, Peter Howarth, Rachel Morton, Clive Souter |
314 |
A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora |
Kallirroi Georgila, Nikos Fakotakis, George Kokkinakis |
315 |
The PAROLE Program |
Georges Vignaux |
316 |
For a Repository of NLP Tools |
Stéphane Chaudiron, Khalid Choukri, Audrey Mance, Valérie Mapelli |
317 |
Survey of Language Engineering Needs: a Language Resources Perspective |
Jeffrey Allen, Khalid Choukri |
319 |
Interarbora and Thistle - Delivering Linguistic Structure by the Internet |
Jo Calder |
320 |
Automatically Augmenting Terminological Lexicons from Untagged Text |
George Demetriou, Robert Gaizauskas |
321 |
Annotating Events and Temporal Information in Newswire Texts |
Andrea Setzer, Robert Gaizauskas |
327 |
Chinese-English Semantic Resource Construction |
Bonnie J. Dorr, Gina-Anne Levow, Dekang Lin, Scott Thomas |
328 |
Production of NLP-oriented Bilingual Language Resources from Human-oriented dictionaries |
Vera Fluhr-Semenova, Christian Fluhr, Stéphanie Brisson |
329 |
Developing a Multilingual Telephone Based Information System in African Languages |
J.C. Roux, E.C. Botha, J.A. du Preez |
330 |
Tuning Lexicons to New Operational Scenarios |
Roberto Basili, Maria Teresa Pazienza, Michele Vindigni, Fabio Massimo Zanzotto |
331 |
SpeechDat-Car Fixed Platform |
José A.R. Fonollosa, Asunción Moreno |
333 |
Inter-annotator Agreement for a German Newspaper Corpus |
Thorsten Brants |
334 |
Interactive Corpus Annotation |
Thorsten Brants, Oliver Plaehn |
335 |
The Concede Model for Lexical Databases |
Tomaž Erjavec, Roger Evans, Nancy Ide, Adam Kilgarriff |
336 |
Design and Implementation of the Online ILSP Greek Corpus |
Nick Hatzigeorgiu, Maria Gavrilidou, Stelios Piperidis, George Carayannis, Anastasia Papakostopoulou, Athanassia Spiliotopoulou, Anna Vacalopoulou, Penny Labropoulou, Elena Mantzari, Harris Papageorgiou, Iason Demiros |
337 |
A Software Toolkit for Sharing and Accessing Corpora Over the Internet |
Saturnino Luz |
338 |
Tools for the Generation of Morphological Entries in Dictionaries |
Ülle Viks |
340 |
Improving Lexical Databases with Collocational Information: Data from Portuguese |
Paula Guerreiro |
341 |
Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model |
Kiyoaki Shirai, Hozumi Tanaka, Takenobu Tokunaga |
342 |
Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications |
Marilyn Mason |
345 |
NaniTrans: a Speech Labelling Tool |
David Portabella, Albert Febrer, Asunción Moreno |
347 |
Acquisition of Linguistic Patterns for Knowledge-based Information Extraction |
Sanda M. Harabagiu, Steven J. Maiorano |
348 |
A Platform for Dutch in Human Language Technologies |
Elisabeth D'Halleweyn, Erwin Dewallef, Jeannine Beeken |
349 |
Developing and Testing General Models of Spoken Dialogue System Peformance |
Marilyn Walker, Candace Kamm, Julie Boland |
350 |
Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation |
Claude de Loupy, Marc El-Bèze |
351 |
Modern Greek Corpus Taxonomy |
George Mikros, George Carayannis |
353 |
Language Resources as by-Product of Evaluation: The MULTITAG Example |
Patrick Paroubek |
355 |
Evaluation of Computational Linguistic Techniques for Identifying Significant Topics for Browsing Applications |
Judith L. Klavans, Nina Wacholder, David K. Evans |
356 |
Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition |
Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Takanobu Nishiura, Takeshi Yamada |
357 |
Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling |
George Demetriou, Eric Atwell, Clive Souter |
358 |
Annotation of a Multichannel Noisy Speech Corpus |
L. Cristoforetti, M. Matassoni, M. Omologo, P. Svaizer, E. Zovato |
360 |
ARISTA Generative Lexicon for Compound Greek Medical Terms |
John Kontos, Ioanna Malagardi, Spyros Fountoukis |
362 |
A Self-Expanding Corpus Based on Newspapers on the Web |
Knut Hofland |
363 |
A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts |
Janne Bondi Johannessen, Anders Nøklestad, Kristin Hagen |
364 |
COCOSDA - a Progress Report |
Nick Campbell |
366 |
The Treatment of Adjectives in SIMPLE: Theoretical Observations |
Ivonne Peters, Wim Peters |
367 |
Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process |
Christine Michel |
368 |
Evaluating Multi-party Multi-modal Systems |
Laurie E. Damianos, Jill Drury, Tari Fanderclai, Lynette Hirschman, Jeff Kurtz, Beatrice Oshika |
369 |
Extension and Use of GermaNet, a Lexical-Semantic Database |
Claudia Kunze |
370 |
Russian Monitor Corpora: Composition, Linguistic Encoding and Internet Publication |
Serge A.Yablonsky |
371 |
An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG |
Ann Copestake, Dan Flickinger |
372 |
Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus |
Sun Maosong, Sun Honglin, Huang Changning, Zhang Pu, Xing Hongbing, Zhou Qiang |
373 |
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments |
Asunción Moreno, Børge Lindberg, Christoph Draxler, Gaël Richard, Khalid Choukri, Stephan Euler, Jeffrey Allen |
374 |
Addizionario: an Interactive Hypermedia Tool for Language Learning |
Giovanna Turrini, Laura Cignoni, Alessandro Paccosi |
377 |
Recent Developments within the European Language Resources Association (ELRA) |
Khalid Choukri, Audrey Mance, Valérie Mapelli |