| Paper |
Paper Title |
Authors |
| 1 |
The Cost258 Signal Generation Test Array |
Gerard Bailly, Eduardo R. Banga, Alex Monaghan, Erhard Rank |
| 2 |
Collocations as Word Co-occurrence Restriction Data - An Application to Japanese Word Processor -
Kosho Shudo, Masahito Takahashi, Yasuo Koyama, Kenji Yoshimura |
| 5 |
Enhancing the TDT Tracking Evaluation |
Amit Bagga |
| 7 |
GREEK ToBI: A System for the Annotation of Greek Speech Corpora |
Amalia Arvaniti, Mary Baltazani |
| 8 |
English Senseval: Report and Results |
Adam Kilgarriff, Joseph Rosenzweig |
| 10 |
SALA: SpeechDat across Latin America. Results of the First Phase |
Asuncion Moreno, Robrecht Comeyne, Keith Haslam, Henk van den Heuvel, Harald Hoge, Sabine Horbach, Giorgio Micca |
| 11 |
Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging |
Dan Tufis |
| 12 |
TransSearch: A Free Translation Memory on the World Wide Web |
Elliott Macklovitch, Michel Simard, Philippe Langlais |
| 13 |
Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language |
Bolette Sandford Pedersen, Sanni Nimb |
| 14 |
A Comparison of Summarization Methods Based on Task-based Evaluation |
Mochizuki Hajime, Okumura Manabu |
| 15 |
A Word Sense Disambiguation Method Using Bilingual Corpus |
Zheng Jie, Mao Yuhang |
| 16 |
Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering |
Stavroula-Evita Fotinea, Ioannis Dologlou, Stylianos Bakamidis, Gregory Stainhaouer, George Carayannis |
| 17 |
Terms Specification and Extraction within a Linguistic-based Intranet Service |
Sandro Pedrazzini, Elisabeth Maier, Dierk Konig |
| 18 |
Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level |
Eva Hajicova, Petr Sgall |
| 19 |
Coreference in Annotating a Large Corpus |
Eva Hajicova, Jarmila Panenova, Petr Sgall |
| 20 |
Designing a Tool for Exploiting Bilingual Comparable Corpora |
Peter Bennison, Lynne Bowker |
| 22 |
Creating and Using Domain-specific Ontologies for Terminological Applications |
Diana Maynard, Sophia Ananiadou |
| 26 |
The TREC-8 Question Answering Track |
Ellen M. Voorhees, Dawn M. Tice |
| 27 |
IREX: IR & IE Evaluation Project in Japanese |
Satoshi Sekine, Hitoshi Isahara |
| 28 |
Towards A Universal Tool For NLP Resource Acquisition |
Svetlana Sheremetyeva, Sergei Nirenburg |
| 29 |
The Multi-layer Language Knowledge Base of Chinese NLP |
Hu Junfeng, Yu Shiwen |
| 31 |
With WORLDTREK Family, Create, Update and Browse your Terminological World |
Yasmina Abbas, Marie-Luce Picard |
| 32 |
Etude et Evaluation de la Di-Syllabe comme Unite Acoustique pour le Systeme de Synthese Arabe PARADIS |
N. Chenfour, A. Benabbou, A. Mouradi |
| 33 |
Dialogue Annotation for Language Systems Evaluation |
Marcela Charfuelan, Jose Relano Gil, M. Carmen Rodriguez Gancedo, Daniel Tapias Merino, Luis Hernandez Gomez
| 34 |
Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures |
Philippe Langlais, Sebastien Sauve, George Foster, Elliott Macklovitch, Guy Lapalme |
| 35 |
Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs |
Gerardo Sierra, John McNaught |
| 36 |
Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm |
Jean-Yves Antoine, Jacques Siroux, Jean Caelen, Jeanne Villaneau, Jerome Goulian, Mohamed Ahafhaf |
| 37 |
MHATLex: Lexical Resources for Modelling the French Pronunciation |
Guy Perennou, Martine De Calmes |
| 38 |
Dialogue and Prompting Strategies Evaluation in the DEMON System |
Carine-Alexia Lavelle, Martine De Calmes, Guy Perennou |
| 39 |
SLR Validation: Present State of Affairs and Prospects |
Henk van den Heuvel, Lou Boves, Khalid Choukri, Simo Goddijn, Eric Sanders |
| 41 |
EULER: an Open, Generic, Multilingual and Multi-platform Text-to-Speech System |
Thierry Dutoit, Michel Bagein, Fabrice Malfrere, Vincent Pagel, Alain Ruelle, Nawfal Tounsi, Dominique Wynsberghe |
| 43 |
On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems |
Marc Swerts, Emiel Krahmer |
| 44 |
A Word-level Morphosyntactic Analyzer for Basque |
I. Aduriz, E. Agirre, I. Aldezabal, X. Arregi, J. M. Arriola, X. Artola, K. Gojenola, A. Maritxalar, K. Sarasola, M. Urkia |
| 45 |
The EUDICO Project, Multi Media Annotation over the Internet |
Albert Russel, Hennie Brugman, Daan Broeder, Peter Wittenburg |
| 47 |
Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon |
Anna Braasch, Sussi Olsen |
| 48 |
Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek |
Stavroula-Evita Fotinea, Athanassios Protopapas, Dimitris Dimitriadis, George Carayannis |
| 52 |
Creation of Spoken Hebrew Databases |
Tami Rannon, Ofra Golani, Anat Goren, Sherrie Shammass, Ami Moyal |
| 53 |
PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists |
Damjan Vlaj, Janez Kaiser, Ralph Wilhelm, Ute Ziegenhain |
| 55 |
Use of Greek and Latin Forms for Term Detection |
Rosa Estopa, Jordi Vivaldi, M. Teresa Cabre |
| 56 |
Methods and Metrics for the Evaluation of Dictation Systems: a Case Study |
Maria Canelli, Daniele Grasso, Margaret King |
| 58 |
Cairo: An Alignment Visualization Tool |
Noah A. Smith, Michael E. Jahr |
| 59 |
An XML-based Representation Format for Syntactically Annotated Corpora |
Andreas Mengel, Wolfgang Lezius |
| 60 |
An Experiment of Lexical-Semantic Tagging of an Italian Corpus |
Ornella Corazzari, Nicoletta Calzolari, Antonio Zampolli |
| 61 |
SIMPLE: A General Framework for the Development of Multilingual Lexicons |
Nuria Bel, Federica Busa, Nicoletta Calzolari, Elisabetta Gola, Alessandro Lenci, Monica Monachini, Antoine Ogonowski, Ivonne Peters, Wim Peters, Nilda Ruimy, Marta Villegas, Antonio Zampolli |
| 62 |
Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX |
Zygmunt Vetulani |
| 63 |
SPEECON - Speech Data for Consumer Devices |
Rainer Siemund, Harald Hoge, Siegfried Kunzmann, Krzysztof Marasek |
| 66 |
A Treebank of Spanish and its Application to Parsing |
Antonio Moreno, Ralph Grishman, Susana Lopez, Fernando Sanchez, Satoshi Sekine |
| 67 |
End-to-End Evaluation of Machine Interpretation Systems: A Graphical Evaluation Tool |
Susanne J. Jekat, Lorenzo Tessiore |
| 68 |
A Proposal for the Integration of NLP Tools using SGML-Tagged Documents |
X. Artola, A. Diaz de Ilarraza, N. Ezeiza, K. Gojenola, A. Maritxalar, A. Soroa |
| 69 |
A Bilingual Electronic Dictionary for Frame Semantics |
Thierry Fontenelle |
| 70 |
The Evaluation of Systems for Cross-language Information Retrieval |
Martin Braschler, Donna Harman, Michael Hess, Michael Kluck, Carol Peters, Peter Schauble |
| 71 |
Spoken Portuguese: Geographic and Social Varieties |
Jose Bettencourt Goncalves, Rita Veloso |
| 72 |
Portuguese Corpora at CLUL |
Maria Fernanda Bacelar do Nascimento, Luisa Pereira, Joao Saramago |
| 74 |
Reusing the Mikrokosmos Ontology for Concept-based Multilingual Terminology Databases |
Antonio Moreno, Chantal Perez |
| 75 |
Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation |
Kimura Kazuhiro, Hirakawa Hideki |
| 76 |
Will Very Large Corpora Play For Semantic Disambiguation The Role That Massive Computing Power Is Playing For Other AI-Hard Problems? |
Alessandro Cucchiarelli, Enrico Faggioli, Paola Velardi |
| 77 |
Guidelines for Japanese Speech Synthesizer Evaluation |
Shuichi Itahashi |
| 78 |
Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts |
Masumi Narita |
| 79 |
Extraction of Unknown Words Using the Probability of Accepting the Kanji Character Sequence as One Word |
Hiroyuki Shinnou, Masanori Ikeya |
| 80 |
Automatic Speech Segmentation in High Noise Condition |
Rosen Ivanov |
| 81 |
Open Ended Computerized Overview of Controlled Languages |
Elisa Gavieiro-Villatte, Laurent Spaggiari |
| 82 |
Shallow Parsing and Functional Structure in Italian Corpora |
Rodolfo Delmonte |
| 84 |
Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon |
Dimitrios Kokkinakis, Maria Toporowska Gronostaj, Karin Warmenius |
| 85 |
Providing Internet Access to Portuguese Corpora: the AC/DC Project |
Diana Santos, Eckhard Bick |
| 86 |
Turkish Electronic Living Lexicon (TELL): A Lexical Database |
Sharon Inkelas, Aylin Kuntay, C. Orhan Orgun, Ronald Sprouse |
| 87 |
Orthographic Transcription of the Spoken Dutch Corpus |
Wim Goedertier, Simo Goddijn, Jean-Pierre Martens |
| 90 |
Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers |
Giulia Bernardis, Herve Bourlard, Martin Rajman, Jean-Cedric Chappelier |
| 91 |
An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser |
Guillermo Rojo, Maria Concepcion Alvarez, Pilar Alvarino, Adelaida Gil, Maria Paula Santalla, Susana Sotelo |
| 92 |
Target Suites for Evaluating the Coverage of Text Generators |
John A. Bateman, Anthony F. Hartley |
| 93 |
LT TTT - A Flexible Tokenisation Tool |
Claire Grover, Colin Matheson, Andrei Mikheev, Marc Moens |
| 94 |
Perception and Analysis of a Reiterant Speech Paradigm: a Functional Diagnostic of Synthetic Prosody |
Albert Rilliard, Veronique Auberge |
| 95 |
Development and Evaluation of an Italian Broadcast News Corpus |
Marcello Federico, Dimitri Giordani, Paolo Coletti |
| 96 |
Multilingual Linguistic Resources: From Monolingual Lexicons to Bilingual Interrelated Lexicons |
Marta Villegas, Nuria Bel, Alessandro Lenci, Nicoletta Calzolari, Nilda Ruimy, Antonio Zampolli, Teresa Sadurni, Joan Soler |
| 98 |
Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation |
Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli, Claudia Soria |
| 99 |
Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation |
Paolo Allegrini, Simonetta Montemagni, Vito Pirrelli |
| 100 |
Coreference Annotation: Whither? |
Rodger Kibble, Kees van Deemter |
| 101 |
Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control |
R. Lopez-Cozar, A.J. Rubio, J.E. Diaz Verdejo, A. De la Torre |
| 104 |
MDWOZ: A Wizard of Oz Environment for Dialog Systems Development |
Cosmin Munteanu, Marian Boldea |
| 105 |
A Web-based Text Corpora Development System |
Dan Bohus, Marian Boldea |
| 106 |
Term-based Identification of Sentences for Text Summarisation |
Byron Georgantopoulos, Stelios Piperidis |
| 107 |
Morphemic Analysis and Morphological Tagging of Latvian Corpus |
Kristine Levane, Andrejs Spektors |
| 109 |
Textual Information Retrieval Systems Test: The Point of View of an Organizer and Corpuses Provider |
Patrick Kremer, Laurent Schmitt |
| 110 |
The Spoken Dutch Corpus. Overview and First Evaluation |
Nelleke Oostdijk |
| 111 |
A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing |
Toni Badia, Angels Egea |
| 112 |
Producing LRs in Parallel with Lexicographic Description: the DCC project |
Joan Soler i Bou |
| 113 |
A Novelty-based Evaluation Method for Information Retrieval |
Atsushi Fujii, Tetsuya Ishikawa |
| 115 |
Towards More Comprehensive Evaluation in Anaphora Resolution |
Ruslan Mitkov |
| 116 |
Galaxy-II as an Architecture for Spoken Dialogue Evaluation |
Joseph Polifroni, Stephanie Seneff |
| 119 |
Building the Croatian-English Parallel Corpus |
Marko Tadic |
| 122 |
Lexical and Translation Equivalence in Parallel Corpora |
Tamas Varadi |
| 125 |
Towards a Standard for Meta-descriptions of Language Resources |
D. Broeder, H. Brugman, A. Russel, R. Skiba, P. Wittenburg |
| 128 |
Object-oriented Access to the Estonian Phonetic Database |
Einar Meister, Arvo Eek, Toomas Altosaar, Martti Vainio |
| 129 |
ItalWordNet: a Large Semantic Database for Italian |
Adriana Roventini, Antonietta Alonge, Nicoletta Calzolari, Bernardo Magnini, Francesca Bertagna |
| 130 |
FAST - Towards a Semi-automatic Annotation of Corpora |
Catalina Barbu |
| 131 |
Coreference Resolution Evaluation Based on Descriptive Specificity |
Francois Trouilleux, Eric Gaussier, Gabriel G. Bes, Annie Zaenen |
| 132 |
A Text->Meaning->Text Dictionary and Process |
Dominique Dutoit |
| 133 |
A French Phonetic Lexicon with Variants for Speech and Language Processing |
Philippe Boula de Mareuil, Christophe d'Alessandro, Francois Yvon, Veronique Auberge, Jacqueline Vaissiere, Angelique Amelot |
| 134 |
Annotating Communication Problems Using the MATE Workbench |
Laila Dybkjær, Morten Baun Moller, Niels Ole Bernsen, Michael Grosse, Martin Olsen, Amanda Schiffrin
| 135 |
A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components |
Niels Ole Bernsen, Laila Dybkjær
| 136 |
Evaluating Translation Quality as Input to Product Development |
Niamh Bohan, Elisabeth Breidt, Martin Volk |
| 137 |
Evaluation of Word Alignment Systems |
Lars Ahrenberg, Magnus Merkel, Anna Sagvall Hein, Jorg Tiedemann |
| 138 |
How To Evaluate and Compare Tagsets? A Proposal |
Herve Dejean |
| 139 |
Determining the Tolerance of Text-handling Tasks for MT Output |
John White, Jennifer Doyon, Susan Talbott |
| 140 |
A Parallel Corpus of Italian/German Legal Texts |
Johann Gamper |
| 141 |
Integrating Seed Names and ngrams for a Named Entity List and Classifier |
Sabine Buchholz, Antal van den Bosch |
| 142 |
Automatically Expansion of Thesaurus Entries with a Different Thesaurus |
Hideki Kashioka, Satosi Shirai |
| 145 |
Learning Verb Subcategorization from Corpora: Counting Frame Subsets |
Daniel Zeman, Anoop Sarkar |
| 146 |
Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets |
Saso Dzeroski, Tomaz Erjavec, Jakub Zavrel |
| 147 |
Cross-lingual Interpolation of Speech Recognition Models |
Giorgio Micca, Alessandra Frasca, Maria Gabriella Di Benedetto |
| 148 |
Lexicalised Systematic Polysemy in WordNet |
Wim Peters, Ivonne Peters |
| 151 |
Experiences of Language Engineering Algorithm Reuse |
Bjorn Gamback, Fredrik Olsson |
| 153 |
Derivation in the Czech National Corpus |
Jana Klimova, Jan Kocek |
| 155 |
Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers |
Jakub Zavrel, Walter Daelemans |
| 156 |
The Context (not only) for Humans |
Barbora Hladka |
| 158 |
Something Borrowed, Something Blue: Rule-based Combination of POS Taggers |
Lars Borin |
| 159 |
Screffva: A Lexicographer's Workbench |
Jon Mills |
| 161 |
A Step toward Semantic Indexing of an Encyclopedic Corpus |
Philippe Alcouffe, Nicolas Gacon, Claude Roux, Frederique Segond |
| 162 |
Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project |
Thomas Brey, Gerhard Hanrieder, Paul Heisterkamp, Ludwig Hitzenberger, Peter Regel-Brietzmann |
| 163 |
Evaluating Summaries for Multiple Documents in an Interactive Environment |
Gees C. Stein, Tomek Strzalkowski, G. Bowden Wise, Amit Bagga |
| 164 |
Grammarless Bracketing in an Aligned Bilingual Corpus |
Jorge Kinoshita |
| 165 |
A Semi-automatic System for Conceptual Annotation, its Application to Resource Construction and Evaluation |
W.J. Black, J. McNaught, G.P. Zarri, A. Persidis, A. Brasher, L. Gilardoni, E. Bertino, G. Semeraro, P. Leo |
| 166 |
The MATE Workbench Annotation Tool, a Technical Description |
Amy Isard, David McKelvie, Andreas Mengel, Morten Baun Moller |
| 167 |
Recruitment Techniques for Minority Language Speech Databases: Some Observations |
Rhys James Jones, John S. Mason, Louise Helliker, Mark Pawlewski |
| 168 |
Multilingual Topic Detection and Tracking: Successful Research Enabled by Corpora and Evaluation |
Charles L. Wayne |
| 169 |
PoS Disambiguation and Partial Parsing Bidirectional Interaction |
Montserrat Marimon Felipe, Jordi Porta Zamorano |
| 170 |
Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis |
Hamish Cunningham, Kalina Bontcheva, Valentin Tablan, Yorick Wilks
| 172 |
XCES: An XML-based Encoding Standard for Linguistic Corpora |
Nancy Ide, Patrice Bonhomme, Laurent Romary |
| 173 |
Named Entity Recognition in Greek Texts |
Iason Demiros, Sotiris Boutsis, Voula Giouli, Maria Liakata, Harris Papageorgiou, Stelios Piperidis |
| 174 |
A Robust Parser for Unrestricted Greek Text |
Sotiris Boutsis, Prokopis Prokopidis, Voula Giouli, Stelios Piperidis |
| 175 |
A Computational Platform for Development of Morphologic and Phonetic Lexica |
Matej Rojc, Zdravko Kacic |
| 176 |
An Open Architecture for the Construction and Administration of Corpora |
Constantin Orasan, Ramesh Krishnamurthy |
| 177 |
Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System |
Matej Rojc, Zdravko Kacic |
| 179 |
CLinkA: A Coreferential Links Annotator
Constantin Orasan |
| 180 |
What's in a Thesaurus? |
Adam Kilgarriff, Colin Yallop |
| 181 |
A Unified POS Tagging Architecture and its Application to Greek |
Harris Papageorgiou, Prokopis Prokopidis, Voula Giouli, Stelios Piperidis |
| 182 |
Resources for Lexicalized Tree Adjoining Grammars and XML Encoding: TagML |
Patrice Bonhomme, Patrice Lopez |
| 183 |
Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers |
Andreas Witt, Harald Lungen, Dafydd Gibbon |
| 184 |
ATLAS: A Flexible and Extensible Architecture for Linguistic Annotation |
Steven Bird, David Day, John Garofolo, John Henderson, Christophe Laprun, Mark Liberman |
| 185 |
Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches |
Pavel Skrelin, Tatiana Sherstinova |
| 186 |
Some Technical Aspects about Aligning Near Languages |
Lluis de Yzaguirre, Marta Ribas, Jordi Vivaldi, M. Teresa Cabre |
| 187 |
Corpus Resources and Minority Language Engineering |
Tony McEnery, Paul Baker, Lou Burnard |
| 189 |
CDB - A Database of Lexical Collocations |
Brigitte Krenn |
| 191 |
Evaluation for Darpa Communicator Spoken Dialogue Systems |
Marilyn Walker, Lynette Hirschman, John Aberdeen |
| 192 |
Transcribing with Annotation Graphs |
Edouard Geoffrois, Claude Barras, Steven Bird, Zhibiao Wu |
| 193 |
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results |
Massimo Poesio |
| 194 |
Towards a Query Language for Annotation Graphs |
Steven Bird, Peter Buneman, Wang-Chiew Tan |
| 196 |
The American National Corpus: A Standardized Resource for American English |
Catherine Macleod, Nancy Ide, Ralph Grishman |
| 197 |
Semantic Tagging for the Penn Treebank |
Martha Palmer, Hoa Trang Dang, Joseph Rosenzweig |
| 199 |
Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions |
Kiril Ribarov |
| 200 |
The (Un)Deterministic Nature of Morphological Context |
Kiril Ribarov |
| 201 |
A Framework for Cross-Document Annotation |
David Day, Alan Goldschen, John Henderson |
| 202 |
Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents |
Peggy Cadel, Helene Ledouble |
| 203 |
How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done |
Eric J. Breck, John D. Burger, Lisa Ferro, Lynette Hirschman, David House, Marc Light, Inderjeet Mani |
| 205 |
What are Transcription Errors and Why are They Made?
Daniela Oppermann, Susanne Burger, Karl Weilhammer |
| 206 |
On the Usage of Kappa to Evaluate Agreement on Coding Tasks |
Barbara Di Eugenio |
| 208 |
Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora |
Sun Le, Jin Youbing, Du Lin, Sun Yufang |
| 209 |
Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium |
Christopher Cieri, Mark Liberman |
| 210 |
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts |
Christopher Cieri, David Graff, Mark Liberman, Nii Martey, Stephanie Strassel |
| 211 |
Using Machine Learning Methods to Improve Quality of Tagged Corpora and Learning Models |
Yuji Matsumoto, Tatsuo Yamashita |
| 212 |
Quality Control in Large Annotation Projects Involving Multiple Judges: The Case of the TDT Corpora |
Stephanie Strassel, David Graff, Nii Martey, Christopher Cieri |
| 213 |
Learning Preference of Dependency between Japanese Subordinate Clauses and its Evaluation in Parsing |
Takehito Utsuro |
| 214 |
Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era |
Lin-Shan Lee, Lee-Feng Chien |
| 215 |
Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation |
Lori Levin, Boris Bartlog, Ariadna Font Llitjos, Donna Gates, Alon Lavie, Dorcas Wallace, Taro Watanabe, Monika Woszczyna |
| 216 |
Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus |
Frank Van Eynde, Jakub Zavrel, Walter Daelemans |
| 217 |
The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora |
Karl Weilhammer, Daniela Oppermann, Susanne Burger |
| 218 |
Automatic Assignment of Grammatical Relations |
Leonardo Lesmo, Vincenzo Lombardo |
| 219 |
Integrating Subject Field Codes into WordNet |
Bernardo Magnini, Gabriela Cavaglia |
| 220 |
Building a Treebank for Italian: a Data-driven Annotation Schema |
Cristina Bosco, Vincenzo Lombardo, Daniela Vassallo, Leonardo Lesmo |
| 221 |
Typographical and Orthographical Spelling Error Correction |
Kyongho Min, William H. Wilson, Yoo-Jin Moon |
| 223 |
Application of WordNet ILR in Czech Word-formation |
Jana Klimova, Karel Pala |
| 224 |
POSCAT: A Morpheme-based Speech Corpus Annotation Tool |
Byeongchang Kim, Jin-seok Lee, Jeongwon Cha, Geunbae Lee |
| 226 |
A Flexible Infrastructure for Large Monolingual Corpora |
Uwe Quasthoff, Christian Wolff |
| 227 |
Automatic Transliteration and Back-transliteration by Decision Tree Learning |
Byung-Ju Kang, Key-Sun Choi |
| 228 |
Shallow Discourse Genre Annotation in CallHome Spanish |
Klaus Ries, Lori Levin, Liza Valle, Alon Lavie, Alex Waibel |
| 230 |
Building a Treebank for French |
Anne Abeille, Lionel Clement, Alexandra Kinyon |
| 233 |
Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task |
Paola Merlo, Suzanne Stevenson |
| 234 |
Layout Annotation in a Corpus of Patient Information Leaflets |
Nadjet Bouayad-Agha |
| 235 |
A New Methodology for Speech Corpora Definition from Internet Documents |
D. Vaufreydaz, C. Bergamini, J.F. Serignat, L. Besacier, M. Akbar |
| 236 |
Coping with Lexical Gaps when Building Aligned Multilingual Wordnets |
Luisa Bentivogli, Emanuele Pianta, Fabio Pianesi |
| 237 |
Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus |
Young-Soog Chae, Key-Sun Choi |
| 239 |
Introduction of KIBS (Korean Information Base System) Project |
Young-Soog Chae, Key-Sun Choi |
| 241 |
Resources for Multilingual Text Generation in Three Slavic Languages |
John Bateman, Elke Teich, Geert-Jan Kruijff, Ivanna Kruijff-Korbayova, Serge Sharoff, Hana Skoumalova |
| 243 |
A Multi-view Hyperlexicon Resource for Speech and Language System Development |
Dafydd Gibbon, Thorsten Trippel |
| 244 |
Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture |
Lynne Cahill, Christy Doran, Roger Evans, Rodger Kibble, Chris Mellish, D. Paiva, Mike Reape, Donia Scott, Neil Tipper |
| 246 |
Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language |
Zdravko Kacic, Bogomir Horvat, Aleksandra Zogling |
| 247 |
ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora |
Christophe Jouis, ARC A3 |
| 248 |
A Parallel English-Japanese Query Collection for the Evaluation of On-Line Help Systems |
Richard F. E. Sutcliffe, Sadao Kurohashi |
| 249 |
Principled Hidden Tagset Design for Tiered Tagging of Hungarian |
Dan Tufis, Peter Dienes, Csaba Oravecz, Tamas Varadi |
| 250 |
Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine |
Felisa Verdejo, Julio Gonzalo, Anselmo Penas, Fernando Lopez, David Fernandez |
| 251 |
An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages |
Dafydd Gibbon, Ana Paula Quirino Simoes, Martin Matthiesen |
| 252 |
Sublanguage Dependent Evaluation: Toward Predicting NLP performances |
Gabriel Illouz |
| 253 |
The Universal XML Organizer: UXO |
Jan-Torsten Milde, Markus Reinsch |
| 254 |
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation |
Helka Folch, Serge Heiden, Benoit Habert, Serge Fleury, Gabriel Illouz, Pierre Lafon, Julien Nioche, Sophie Prevost |
| 256 |
An Approach to Lexical Development for Inflectional Languages |
Davide Turcato, Janine Toole, Stavroula Tsiplakou, Trude Heift, Paul McFetridge |
| 257 |
Some Language Resources and Tools for Computational Processing of Portuguese at INESC |
Luzia Wittmann, Ricardo Daniel Ribeiro, Tania Pego, Fernando Batista |
| 258 |
Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation |
Takehito Utsuro, Manabu Sassano |
| 259 |
Evaluation of a Generic Lexical Semantic Resource in Information Extraction |
Joyce Yue Chai |
| 260 |
The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting |
Jim Talley |
| 261 |
IPA Japanese Dictation Free Software Project |
Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Itou, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee
| 262 |
Spontaneous Speech Corpus of Japanese |
Kikuo Maekawa, Hanae Koiso, Sadaoki Furui, Hitoshi Isahara |
| 263 |
Annotating Resources for Information Extraction |
Sean Boisen, Michael R. Crystal, Richard Schwartz, Rebecca Stone, Ralph Weischedel |
| 267 |
The New Edition of the Natural Language Software Registry (an Initiative of ACL hosted at DFKI) |
Thierry Declerck, Alexander Werner Jachmann, Hans Uszkoreit |
| 269 |
Design Methodology for Bilingual Pronunciation Dictionary |
Jong-mi Kim |
| 271 |
LEXIPLOIGISSI: An Educational Platform for the Teaching of Terminology in Greece |
Constandina Economou, Spyros Raptis, Gregory Stainhaouer |
| 272 |
An HPSG-Annotated Test Suite for Polish |
Malgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupsc, Adam Przepiorkowski |
| 274 |
The COST 249 SpeechDat Multilingual Reference Recogniser |
Finn Tore Johansen, Narada Warakagoda, Borge Lindberg, Gunnar Lehtinen, Zdravko Kacic, Andrej Zgank, Kjell Elenius, Giampiero Salvi
| 275 |
Terminology Encoding in View of Multifunctional NLP Resources |
Marianna Katsoyannou, Eleni Efthimiou |
| 276 |
Terminology in Korea: KORTERM |
Key-Sun Choi, Young-Soog Chae |
| 277 |
Morphological Tagging to Resolve Morphological Ambiguities |
Gaelle Birocheau |
| 278 |
An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research |
Sonja Nießen, Franz Josef Och, Gregor Leusch, Hermann Ney
| 279 |
GeDeriF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources |
Fiammetta Namer, Georgette Dal |
| 281 |
Le Programme Compalex (COMPAraison LEXicale) |
Josue Ndamba, Jean Silence Bayamboussa |
| 282 |
Many Uses, Many Annotations for Large Speech Corpora: Switchboard and TDT as Case Studies |
David Graff, Steven Bird |
| 283 |
Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future |
Gerhard Budin, Alan K. Melby |
| 285 |
Using a Formal Approach to Evaluate Grammars |
Bilel Gargouri, Mohamed Jmaiel, Abdelmajid Ben Hamadou |
| 286 |
Design Issues in Text-Independent Speaker Recognition Evaluation |
Alvin Martin, Mark Przybocki |
| 287 |
Developing Guidelines and Ensuring Consistency for Chinese Text Annotation |
Fei Xia, Martha Palmer, Nianwen Xue, Mary Ellen Okurowski, John Kovarik, Fu-Dong Chiou, Shizhe Huang, Tony Kroch, Mitch Marcus |
| 288 |
Corpora of Slovene Spoken Language for Multi-lingual Applications |
Jerneja Gros, France Mihelic, Simon Dobrisek, Tomaz Erjavec, Mario Zganec |
| 289 |
GRUHD: A Greek database of Unconstrained Handwriting |
E. Kavallieratou, N. Liolios, E. Koutsogeorgos, N. Fakotakis, G. Kokkinakis |
| 292 |
Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS |
France Mihelic, Jerneja Gros, Elmar Noth, Volker Warnke |
| 294 |
NL-Translex: Machine Translation for Dutch |
Catia Cucchiarini, Johan Van Hoorde, Elisabeth D'Halleweyn
| 295 |
Rarity of Words in a Language and in a Corpus |
Jaroslava Hlavacova |
| 297 |
Language Resources Development at the Spanish Royal Academy |
Angel Martin Municio, Guillermo Rojo, Fernando Sanchez Leon, Octavio Pinillos |
| 298 |
Reusability as Easy Adaptability: A Substantial Advance in NL Technology |
Irina Prodanof, Amedeo Cappelli, Lorenzo Moretti |
| 299 |
Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking |
Andrew Bredenkamp, Berthold Crysmann, Mirela Petrea |
| 300 |
The Bank of Swedish |
Martin Gellerstam, Yvonne Cederholm, Torgny Rasmark |
| 301 |
Automatic Style Categorisation of Corpora in the Greek Language |
George Tambouratzis, Stella Markantonatou, Nikolaos Hairetakis, George Carayannis |
| 302 |
Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts |
Aristomenis Thanopoulos, Nikos Fakotakis, George Kokkinakis |
| 303 |
Predictive Performance of Dialog Systems |
H. Bonneau-Maynard, L. Devillers, S. Rosset |
| 306 |
Automatic Generation of Dictionary Definitions from a Computational Lexicon |
Penny Labropoulou, Elena Mantzari, Harris Papageorgiou, Maria Gavrilidou |
| 307 |
Regional Pronunciation Variants for Automatic Segmentation |
Nicole Beringer, Marcia Neff |
| 310 |
SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties |
Mario Refice, Michelina Savino, Marco Altieri, Roberto Altieri |
| 312 |
Automotive Speech-Recognition - Success Conditions Beyond Recognition Rates |
Klaus Bengler |
| 313 |
The ISLE Corpus of Non-Native Spoken English |
Wolfgang Menzel, Eric Atwell, Patrizia Bonaventura, Daniel Herron, Peter Howarth, Rachel Morton, Clive Souter |
| 314 |
A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora |
Kallirroi Georgila, Nikos Fakotakis, George Kokkinakis |
| 315 |
The PAROLE Program |
Georges Vignaux |
| 316 |
For a Repository of NLP Tools |
Stephane Chaudiron, Khalid Choukri, Audrey Mance, Valerie Mapelli |
| 317 |
Survey of Language Engineering Needs: a Language Resources Perspective |
Jeffrey Allen, Khalid Choukri |
| 319 |
Interarbora and Thistle - Delivering Linguistic Structure by the Internet |
Jo Calder |
| 320 |
Automatically Augmenting Terminological Lexicons from Untagged Text |
George Demetriou, Robert Gaizauskas |
| 321 |
Annotating Events and Temporal Information in Newswire Texts |
Andrea Setzer, Robert Gaizauskas |
| 327 |
Chinese-English Semantic Resource Construction |
Bonnie J. Dorr, Gina-Anne Levow, Dekang Lin, Scott Thomas |
| 328 |
Production of NLP-oriented Bilingual Language Resources from Human-oriented dictionaries |
Vera Fluhr-Semenova, Christian Fluhr, Stephanie Brisson |
| 329 |
Developing a Multilingual Telephone Based Information System in African Languages |
J.C. Roux, E.C. Botha, J.A. du Preez |
| 330 |
Tuning Lexicons to New Operational Scenarios |
Roberto Basili, Maria Teresa Pazienza, Michele Vindigni, Fabio Massimo Zanzotto |
| 331 |
SpeechDat-Car Fixed Platform |
Jose A.R. Fonollosa, Asuncion Moreno |
| 333 |
Inter-annotator Agreement for a German Newspaper Corpus |
Thorsten Brants |
| 334 |
Interactive Corpus Annotation |
Thorsten Brants, Oliver Plaehn |
| 335 |
The Concede Model for Lexical Databases |
Tomaz Erjavec, Roger Evans, Nancy Ide, Adam Kilgarriff |
| 336 |
Design and Implementation of the Online ILSP Greek Corpus |
Nick Hatzigeorgiu, Maria Gavrilidou, Stelios Piperidis, George Carayannis, Anastasia Papakostopoulou, Athanassia Spiliotopoulou, Anna Vacalopoulou, Penny Labropoulou, Elena Mantzari, Harris Papageorgiou, Iason Demiros |
| 337 |
A Software Toolkit for Sharing and Accessing Corpora Over the Internet |
Saturnino Luz |
| 338 |
Tools for the Generation of Morphological Entries in Dictionaries |
Ulle Viks |
| 340 |
Improving Lexical Databases with Collocational Information: Data from Portuguese |
Paula Guerreiro |
| 341 |
Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model |
Kiyoaki Shirai, Hozumi Tanaka, Takenobu Tokunaga |
| 342 |
Issues from Corpus Analysis That Have Influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications |
Marilyn Mason |
| 345 |
NaniTrans: a Speech Labelling Tool |
David Portabella, Albert Febrer, Asuncion Moreno |
| 347 |
Acquisition of Linguistic Patterns for Knowledge-based Information Extraction |
Sanda M. Harabagiu, Steven J. Maiorano |
| 348 |
A Platform for Dutch in Human Language Technologies |
Elisabeth D'Halleweyn, Erwin Dewallef, Jeannine Beeken |
| 349 |
Developing and Testing General Models of Spoken Dialogue System Performance |
Marilyn Walker, Candace Kamm, Julie Boland |
| 350 |
Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation |
Claude de Loupy, Marc El-Beze |
| 351 |
Modern Greek Corpus Taxonomy |
George Mikros, George Carayannis |
| 353 |
Language Resources as by-Product of Evaluation: The MULTITAG Example |
Patrick Paroubek |
| 355 |
Evaluation of Computational Linguistic Techniques for Identifying Significant Topics for Browsing Applications |
Judith L. Klavans, Nina Wacholder, David K. Evans |
| 356 |
Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition |
Satoshi Nakamura, Kazuo Hiyane, Futoshi Asano, Takanobu Nishiura, Takeshi Yamada |
| 357 |
Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling |
George Demetriou, Eric Atwell, Clive Souter |
| 358 |
Annotation of a Multichannel Noisy Speech Corpus |
L. Cristoforetti, M. Matassoni, M. Omologo, P. Svaizer, E. Zovato |
| 360 |
ARISTA Generative Lexicon for Compound Greek Medical Terms |
John Kontos, Ioanna Malagardi, Spyros Fountoukis |
| 362 |
A Self-Expanding Corpus Based on Newspapers on the Web |
Knut Hofland |
| 363 |
A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts |
Janne Bondi Johannessen, Anders Noklestad, Kristin Hagen |
| 364 |
COCOSDA - a Progress Report |
Nick Campbell |
| 366 |
The Treatment of Adjectives in SIMPLE: Theoretical Observations |
Ivonne Peters, Wim Peters |
| 367 |
Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process |
Christine Michel |
| 368 |
Evaluating Multi-party Multi-modal Systems |
Laurie E. Damianos, Jill Drury, Tari Fanderclai, Lynette Hirschman, Jeff Kurtz, Beatrice Oshika |
| 369 |
Extension and Use of GermaNet, a Lexical-Semantic Database |
Claudia Kunze |
| 370 |
Russian Monitor Corpora: Composition, Linguistic Encoding and Internet Publication |
Serge A. Yablonsky |
| 371 |
An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG |
Ann Copestake, Dan Flickinger |
| 372 |
Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus |
Sun Maosong, Sun Honglin, Huang Changning, Zhang Pu, Xing Hongbing, Zhou Qiang |
| 373 |
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments |
Asuncion Moreno, Borge Lindberg, Christoph Draxler, Gael Richard, Khalid Choukri, Stephan Euler, Jeffrey Allen |
| 374 |
Addizionario: an Interactive Hypermedia Tool for Language Learning |
Giovanna Turrini, Laura Cignoni, Alessandro Paccosi |
| 377 |
Recent Developments within the European Language Resources Association (ELRA) |
Khalid Choukri, Audrey Mance, Valerie Mapelli |