Keyword |
Paper Title |
Paper |
Accessibility | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
Acoustic Adaptation | SPEECON - Speech Data for Consumer Devices | 63 |
Acquisition Tools | SpeechDat-Car Fixed Platform | 331 |
Adjuncts | Automatic Assignment of Grammatical Relations | 218 |
African Languages | Developing a Multilingual Telephone Based Information System in African Languages | 329 |
Agglutinative Languages | A Word-level Morphosyntactic Analyzer for Basque | 44 |
Alignment | Building the Croatian-English Parallel Corpus | 119 |
Grammarless Bracketing in an Aligned Bilingual Corpus | 164 |
The Bank of Swedish | 300 |
Analogy-Based Aquisition | Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation | 99 |
Analysis-Modification-Synthesis systems | The Cost258 Signal Generation Test Array | 1 |
Analytic Tools | Introduction of KIBS (Korean Information Base System) Project | 239 |
Analytical Functions | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
Anaphora | Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
Anaphora Resolution | CLinkA A Coreferential Links Annotator | 179 |
Annotated Corpora | On the Usage of Kappa to Evaluate Agreement on Coding Tasks | 206 |
Annotated Corpus | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
Annotated Dialogue Corpora | Dialogue Annotation for Language Systems Evaluation | 33 |
Annotation | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
The EUDICO Project, Multi Media Annotation over the Internet | 45 |
An XML-based Representation Format for Syntactically Annotated Corpora | 59 |
The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
Annotating Communication Problems Using the MATE Workbench | 134 |
The MATE Workbench Annotation Tool, a Technical Description | 166 |
Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
Annotating Resources for Information Extraction | 263 |
Annotation Errors | What are Transcription Errors and Why are They made? | 205 |
Annotation frameworks | Dialogue Annotation for Language Systems Evaluation | 33 |
Annotation Graphs | Transcribing with Annotation Graphs | 192 |
Annotation Guidelines | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
Annotation Schema | Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
Annotation tool | FAST - Towards a Semi-automatic Annotation of Corpora | 130 |
A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
Annotation Tools | Dialogue Annotation for Language Systems Evaluation | 33 |
Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
Interactive Corpus Annotation | 334 |
NaniTrans: a Speech Labelling Tool | 345 |
Annotation of a Multichannel Noisy Speech Corpus | 358 |
Applications | Portuguese Corpora at CLUL | 72 |
Architecture | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
Architectures | Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture | 244 |
Argument Structure | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
ASR | MHATLex: Lexical Resources for Modelling the French Pronunciation | 37 |
Regional Pronunciation Variants for Automatic Segmentation | 307 |
Assesment Techniques | COCOSDA - a Progress Report | 364 |
Assessment and Evaluation | For a Repository of NLP Tools | 316 |
Automated Style Categorisation | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
Automatic annotation | Target Suites for Evaluating the Coverage of Text Generators | 92 |
Automatic Evaluation | Evaluation of Word Alignment Systems | 137 |
How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done | 203 |
Automatic Generation of Constructed Words | GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
Automatic Lexical Acquisition | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
Automatic Term Extraction | Term-based Identification of Sentences for Text Summarisation | 106 |
Automatic Text Summarization | A Comparison of Summarization Methods Based on Task-based Evaluation | 14 |
Automatically Category Code Assignment | Automatically Expansion of Thesaurus Entries with a Different Thesaurus | 142 |
Batch Download | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
Benchmark | Dialogue Annotation for Language Systems Evaluation | 33 |
Best Practice | A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
Bilingual Comparable Corpora | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
Bilingual Corpora | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
Bilingual Corpora Processing | Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora | 208 |
Bilingual Corpus | Grammarless Bracketing in an Aligned Bilingual Corpus | 164 |
Bilingual Dictionary | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
Bilingual Lexicon Extraction | Evaluation of Word Alignment Systems | 137 |
Bilingual Pronunciation Dictionary | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
Binational Policies | A Platform for Dutch in Human Language Technologies | 348 |
Bi-textual Database | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
BNC2 | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
Bootstrapping Methods | Automatically Augmenting Terminological Lexicons from Untagged Text | 320 |
Bracketed Corpus (Treebank) | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
CALL | Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts | 78 |
Call-Center Applications | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
CAPI | SpeechDat-Car Fixed Platform | 331 |
Car environment | SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
Case Frame | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
Cenario | Evaluating Multi-party Multi-modal Systems | 368 |
CES | A Parallel Corpus of Italian/German Legal Texts | 140 |
Chinese Information Processing | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
Chinese Language Processing | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
Chinese-English Lexicons | Chinese-English Semantic Resource Construction | 327 |
Chunks | PoS Disambiguation and Partial Parsing Bidirectional Interaction | 169 |
Classified Listing of NLP Software | The New Edition of the Natural Language Software Registry (an Initiative of ACL hosted at DFKI) | 267 |
Client-Server | An Open Architecture for the Construction and Administration of Corpora | 176 |
Client-Server Model | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
Clustering | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
Coding Schemes | Annotating Communication Problems Using the MATE Workbench | 134 |
Collaborative System | Evaluating Multi-party Multi-modal Systems | 368 |
Collocation | Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - | 2 |
Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
Collocations | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
A Flexible Infrastructure for Large Monolingual Corpora | 226 |
Combination of Automatic and Manual Annotation | Interactive Corpus Annotation | 334 |
Combined Classifiers | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
Combining Systems | Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
Complements | Automatic Assignment of Grammatical Relations | 218 |
Completeness | A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
Compound Nouns | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
Compounding | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
Computational Lexicography | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
Computational Lexicon | CDB - A Database of Lexical Collocations | 189 |
Computational Lexicons | SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
An Approach to Lexical Development for Inflectional Languages | 256 |
Computational Psycholinguistics | The Treatment of Adjectives in SIMPLE: Theoretical Observations | 366 |
Computer-Assisted Translation Tools | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
Computer-assisted Writing Tool | Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts | 78 |
Concept Abstraction | Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
Concept Accuracy | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
Concept Description | Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
Concept Hierarchy | Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
Conceptual Representation | Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents | 202 |
Constraint Grammar | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
Construction of Linguistic Resources | Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation | 98 |
Consumer Electronics | SPEECON - Speech Data for Consumer Devices | 63 |
Context | The Context (not only) for Humans | 156 |
The (Un)Deterministic Nature of Morphological Context | 200 |
Context-Based Method | The (Un)Deterministic Nature of Morphological Context | 200 |
Continuous Speech | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
Continuous Speech Recognition over the Telephone | Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
Controlled Languages | Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking | 299 |
Copyright | An Open Architecture for the Construction and Administration of Corpora | 176 |
Coreference | Coreference in Annotating a Large Corpus | 19 |
Coreference Annotation: Whither? | 100 |
Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
Coreference Annotation | CLinkA A Coreferential Links Annotator | 179 |
Coreferential Chain | CLinkA A Coreferential Links Annotator | 179 |
Corpora | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
Evaluating Translation Quality as Input to Product Development | 136 |
The MATE Workbench Annotation Tool, a Technical Description | 166 |
An Open Architecture for the Construction and Administration of Corpora | 176 |
SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
Corpora Workbench | The Bank of Swedish | 300 |
Corporate Usage of HLR | The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting | 260 |
Corpus | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
Coreference in Annotating a Large Corpus | 19 |
FAST - Towards a Semi-automatic Annotation of Corpora | 130 |
Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
The American National Corpus: A Standardized Resource for American English | 196 |
Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
Application of WordNet ILR in Czech Word-formation | 223 |
Introduction of KIBS (Korean Information Base System) Project | 239 |
ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
Language Resources Development at the Spanish Royal Academy | 297 |
Design and Implementation of the Online ILSP Greek Corpus | 336 |
A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
Corpus Administration | An Open Architecture for the Construction and Administration of Corpora | 176 |
Corpus Analysis | Modern Greek Corpus Taxonomy | 351 |
Corpus Annotation | FAST - Towards a Semi-automatic Annotation of Corpora | 130 |
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
Building a Treebank for French | 230 |
Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
Inter-annotator Agreement for a German Newspaper Corpus | 333 |
Interactive Corpus Annotation | 334 |
Corpus Architecture | The American National Corpus: A Standardized Resource for American English | 196 |
Corpus Based Linguistics | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
Corpus Building | An Open Architecture for the Construction and Administration of Corpora | 176 |
Corpus Design | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
Corpus Encoding | A Parallel Corpus of Italian/German Legal Texts | 140 |
XCES: An XML-based Encoding Standard for Linguistic Corpora | 172 |
Corpus Linguistics | Building the Croatian-English Parallel Corpus | 119 |
Building a Treebank for French | 230 |
Corpus Preparation | LT TTT - A Flexible Tokenisation Tool | 93 |
Corpus Processing | Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
Corpus-Based Techniques | An Approach to Lexical Development for Inflectional Languages | 256 |
Correctness | A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
co-Training | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
Creativity | Addizionario: an Interactive Hypermedia Tool for Language Learning | 374 |
Croatian | Building the Croatian-English Parallel Corpus | 119 |
Cross-language Information Retrieval | The Evaluation of Systems for Cross-language Information Retrieval | 70 |
Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine | 250 |
Cross-Lingual | Cross-lingual Interpolation of Speech Recognition Models | 147 |
Czech National Corpus | Derivation in the Czech National Corpus | 153 |
Danish verbs | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
Data Architectures | XCES: An XML-based Encoding Standard for Linguistic Corpora | 172 |
Data Centers | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
Data Collection | SPEECON - Speech Data for Consumer Devices | 63 |
MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
Data Collection and Distribution | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
Data interchange | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
Data Set | IREX: IR & IE Evaluation Project in Japanese | 27 |
Data Visualization | Evaluating Multi-party Multi-modal Systems | 368 |
Database | Design and Implementation of the Online ILSP Greek Corpus | 336 |
Database Access | Object-oriented Access to the Estonian Phonetic Database | 128 |
The Universal XML Organizer: UXO | 253 |
Databases | Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
Data-Collection | What are Transcription Errors and Why are They made? | 205 |
DCR Evaluation | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
Decision List Learning | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
Definitions | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Deletions | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
Dependency | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
Coreference in Annotating a Large Corpus | 19 |
Dependency Format | Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
Dependency Syntax | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
Dependency Tree | Automatic Assignment of Grammatical Relations | 218 |
Derivation | Derivation in the Czech National Corpus | 153 |
Application of WordNet ILR in Czech Word-formation | 223 |
Derivational Morphology | GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
Descriptive Specificity | Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
Design Methodology | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
Development Environment | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
Diacritic Characters Restoration | A Web-based Text Corpora Development System | 105 |
Dialog Interaction Model | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
Dialog System | Predictive Performance of Dialog Systems | 303 |
Dialogue | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
Dialogue Corpora | The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
Dictation Free Software | IPA Japanese Dictation Free Software Project | 261 |
Dictation Systems | Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
Dictionaries | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Dictionary | The Concede Model for Lexical Databases | 335 |
Dictionary Definitions | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
Dictionary Formats | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
Diphone Inventory | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
Discourse | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
Discourse Annotation | Annotating Events and Temporal Information in Newswire Texts | 321 |
Discriminant Function Analysis | Modern Greek Corpus Taxonomy | 351 |
Distributed Corpora | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
Distributed Processing | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
Distribution | Recent Developments within the European Language Resources Association (ELRA) | 377 |
Document Routing | An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
Domains | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Dry sound source | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
DSSSL | Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
Dutch | Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
NL-Translex: Machine Translation for Dutch | 294 |
Dutch (spoken) | The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
Dynamic Corpora | Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era | 214 |
EAGLES | Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
E-J Parallel Corpus | Constructing a Tagged E-J Parallel Corpus for Assisting Japanese Software Engineers in Writing English Abstracts | 78 |
ELDA | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Recent Developments within the European Language Resources Association (ELRA) | 377 |
Electronic Dictionaries | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
Coping with Lexical Gaps when Building Aligned Multilingual Wordnets | 236 |
ELRA | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Recent Developments within the European Language Resources Association (ELRA) | 377 |
Empirical Methods | Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
Encyclopedia | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
End-User Environments | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
English | Building the Croatian-English Parallel Corpus | 119 |
Morphological Tagging to Resolve Morphological Ambiguities | 277 |
English and Korean | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
English Pronunciation | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
Environment sound recognition | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
Error-Spotting | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
Evaluation | Enhancing the TDT Tracking Evaluation | 5 |
English Senseval: Report and Results | 8 |
Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
IREX: IR & IE Evaluation Project in Japanese | 27 |
Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
Target Suites for Evaluating the Coverage of Text Generators | 92 |
Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
Galaxy-II as an Architecture for Spoken Dialogue Evaluation | 116 |
Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
Evaluating Translation Quality as Input to Product Development | 136 |
Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets | 146 |
Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
CLinkA A Coreferential Links Annotator | 179 |
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
The (Un)Deterministic Nature of Morphological Context | 200 |
ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
Evaluation of a Generic Lexical Semantic Resource in Information Extraction | 259 |
An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research | 278 |
Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
Predictive Performance of Dialog Systems | 303 |
Inter-annotator Agreement for a German Newspaper Corpus | 333 |
A Platform for Dutch in Human Language Technologies | 348 |
Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process | 367 |
Evaluation (predictive) | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
Evaluation Criteria | Using a Formal Approach to Evaluate Grammars | 285 |
Evaluation in Written Language Processing | NL-Translex: Machine Translation for Dutch | 294 |
Evaluation Methodology | Evaluating Multi-party Multi-modal Systems | 368 |
Evaluation Metrics | Evaluation of Word Alignment Systems | 137 |
Evaluation of Language Resources and Tools | Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine | 250 |
Evaluation Protocols | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
Event Recognition | Tuning Lexicons to New Operational Scenarios | 330 |
Events | Annotating Events and Temporal Information in Newswire Texts | 321 |
Exercise(s) | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
Expanding Entry | Automatically Expansion of Thesaurus Entries with a Different Thesaurus | 142 |
Extraction | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
ExTrECC | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
Feature Structures | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
Finite State Technologies | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
Finite State Transducers | A Robust Parser for Unrestricted Greek Text | 174 |
Finite-state tools | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
Formal Approach | Using a Formal Approach to Evaluate Grammars | 285 |
Formal Descriptions | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
Formalism | Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process | 367 |
Format Independent | The EUDICO Project, Multi Media Annotation over the Internet | 45 |
Formation of Company (Motorola) Organization | The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting | 260 |
Frames | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
Framework | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
French Market | For a Repository of NLP Tools | 316 |
French Phonetic Lexicon | A French Phonetic Lexicon with Variants for Speech and Language Processing | 133 |
Frequency | Rarity of Words in a Language and in a Corpus | 295 |
Functional Annotation | Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation | 99 |
Generation | Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
Generative Lexicon | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
Genericity | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
Genre | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
German Dialectal Regions | Regional Pronunciation Variants for Automatic Segmentation | 307 |
Gold Standard | Evaluation of Word Alignment Systems | 137 |
Grammar Acquisition | A Treebank of Spanish and its Application to Parsing | 66 |
Grammar Checking | Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking | 299 |
Grammars | Using a Formal Approach to Evaluate Grammars | 285 |
Language Resources Development at the Spanish Royal Academy | 297 |
An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG | 371 |
Grammatical coverage | Target Suites for Evaluating the Coverage of Text Generators | 92 |
Grammatical Data | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
Grammatical Dictionary | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
Grammatical Relations | Automatic Assignment of Grammatical Relations | 218 |
Grammatical Rules | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
Grammatical Tagging | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
Grapheme-Phoneme Conversion | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
Grapheme-to-Phoneme Conversion | A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System | 177 |
Graphic User Interface | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
Graphical | A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
Greek | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
Named Entity Recognition in Greek Texts | 173 |
A Robust Parser for Unrestricted Greek Text | 174 |
A Unified POS Tagging Architecture and its Application to Greek | 181 |
Greek Characters | GRUHD: A Greek database of Unconstrained Handwriting | 289 |
Greek Language | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
Design and Implementation of the Online ILSP Greek Corpus | 336 |
GSM signals | SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
Haitian Creole | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
Hands-free speech recognition | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
Hebrew | Creation of Spoken Hebrew Databases | 52 |
HPSG | An HPSG-Annotated Test Suite for Polish | 272 |
HTML-to-Text Conversion | A Web-based Text Corpora Development System | 105 |
Human Assessors | The TREC-8 Question Answering Track | 26 |
Human Intervention | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
Human-Computer Interaction | Evaluating Multi-party Multi-modal Systems | 368 |
Hyperlexicon | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
Idiom | Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - | 2 |
Implicit Relations | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
Impulse response | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
In-Car Speech Data | Annotation of a Multichannel Noisy Speech Corpus | 358 |
Induction | Tuning Lexicons to New Operational Scenarios | 330 |
Industrial Human Language Resource Center | The Establishment of Motorola's Human Language Data Resource Center: Addressing the Criticality of Language Resources in the Industrial Setting | 260 |
Inflectional Languages | An Approach to Lexical Development for Inflectional Languages | 256 |
Information Base | Introduction of KIBS (Korean Information Base System) Project | 239 |
Information Dissemination | Recent Developments within the European Language Resources Association (ELRA) | 377 |
Information Extraction | IREX: IR & IE Evaluation Project in Japanese | 27 |
LT TTT - A Flexible Tokenisation Tool | 93 |
Coreference Annotation: Whither? | 100 |
Named Entity Recognition in Greek Texts | 173 |
A Flexible Infrastructure for Large Monolingual Corpora | 226 |
Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
Evaluation of a Generic Lexical Semantic Resource in Information Extraction | 259 |
Annotating Resources for Information Extraction | 263 |
Automatically Augmenting Terminological Lexicons from Untagged Text | 320 |
Annotating Events and Temporal Information in Newswire Texts | 321 |
Information Retrieval | A Comparison of Summarization Methods Based on Task-based Evaluation | 14 |
IREX: IR & IE Evaluation Project in Japanese | 27 |
Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
Information Society | SPEECON - Speech Data for Consumer Devices | 63 |
In-situs Evaluation | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
Integration of NLP Tools | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
Interactive Machine Translation | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
Inter-Annotator Agreement | Semantic Tagging for the Penn Treebank | 197 |
Inter-annotator Agreement for a German Newspaper Corpus | 333 |
Intercoder Reliability | On the Usage of Kappa to Evaluate Agreement on Coding Tasks | 206 |
Inter-Judge Agreement | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
Internet | The EUDICO Project, Multi Media Annotation over the Internet | 45 |
Towards a Standard for Meta-descriptions of Language Resources | 125 |
Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era | 214 |
Design and Implementation of the Online ILSP Greek Corpus | 336 |
Intonaion Labeling | Spontaneous Speech Corpus of Japanese | 262 |
Intonation | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
ISDN | SpeechDat-Car Fixed Platform | 331 |
ISO | Methods and Metrics for the Evaluation of Dictation Systems: a Case Study | 56 |
Italian | Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
Iterative Learning | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
Japanese | IREX: IR & IE Evaluation Project in Japanese | 27 |
Spontaneous Speech Corpus of Japanese | 262 |
Japanese Dictation | IPA Japanese Dictation Free Software Project | 261 |
Japanese Named Entity Recognition | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
Japanese Text Analysis | Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
Java | Cairo: An Alignment Visualization Tool | 58 |
Annotation of a Multichannel Noisy Speech Corpus | 358 |
Kana-to-Kanji Conversion | Collocations as Word Co-ocurrence Restriction Data - An Application to Japanese Word Processor - | 2 |
Knowledge | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
Terminology in Korea: KORTERM | 276 |
Knowledge Acquisition | The Treatment of Adjectives in SIMPLE: Theoretical Observations | 366 |
Knowledge Base | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
Knowledge Elicitation | Towards A Universal Tool For NLP Resource Acquisition | 28 |
Knowledge Extraction | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
Knowledge-Rich NLP | Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
Korean Pronunciation | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
Labeling | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
Labelling | NaniTrans: a Speech Labelling Tool | 345 |
Language Bank | The Bank of Swedish | 300 |
Language Corpus | Rarity of Words in a Language and in a Corpus | 295 |
Language Engineering | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
Terminology in Korea: KORTERM | 276 |
Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Developing a Multilingual Telephone Based Information System in African Languages | 329 |
Language Knowledge | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
Language Learning | Addizionario: an Interactive Hypermedia Tool for Language Learning | 374 |
Language Modelling | Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
Language Processing Resource Reusability | Experiences of Language Engineering Algorithm Reuse | 151 |
Language Requirements | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Language Resource | Towards A Universal Tool For NLP Resource Acquisition | 28 |
CDB - A Database of Lexical Collocations | 189 |
Language Resource Creation | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
Language Resources | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
Language Resources Infrastructure | Software Infrastructure for Language Resources: a Taxonomy of Previous Work and a Requirements Analysis | 170 |
Language Teaching | Spoken Portuguese: Geographic and Social Varieties | 71 |
Language Variation | Modern Greek Corpus Taxonomy | 351 |
Language-Independent | A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
Languages Resources | Recent Developments within the European Language Resources Association (ELRA) | 377 |
Large-Scale Corpus | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
Latin America | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
LE Platforms | Experiences of Language Engineering Algorithm Reuse | 151 |
Legal Issues | A Platform for Dutch in Human Language Technologies | 348 |
Recent Developments within the European Language Resources Association (ELRA) | 377 |
Lemma and Part-of-Speech Based Aligment | Some Technical Aspects about Aligning Near Languages | 186 |
Lexica | A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
Lexical Acquisition | Chinese-English Semantic Resource Construction | 327 |
Tuning Lexicons to New Operational Scenarios | 330 |
Lexical Database | The Bank of Swedish | 300 |
The Concede Model for Lexical Databases | 335 |
Lexical Databases | Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
Lexical Description | Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
Lexical Function | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
Lexical Gaps | Coping with Lexical Gaps when Building Aligned Multilingual Wordnets | 236 |
Lexical Resources | Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
MHATLex: Lexical Resources for Modelling the French Pronunciation | 37 |
ItalWordNet: a Large Semantic Database for Italian | 129 |
GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
Lexical Semantics | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
The Treatment of Adjectives in SIMPLE: Theoretical Observations | 366 |
Lexical Tuning | Tuning Lexicons to New Operational Scenarios | 330 |
Lexical-Semantic Database | A Bilingual Electronic Dictionary for Frame Semantics | 69 |
Lexicography | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
Language Resources Development at the Spanish Royal Academy | 297 |
Lexicon | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
Language Resources Development at the Spanish Royal Academy | 297 |
Tuning Lexicons to New Operational Scenarios | 330 |
Lexicon Design | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
Lexicon Implementation | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
Lexicon Specification | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
Lexicon-Cum-Corpus | CDB - A Database of Lexical Collocations | 189 |
Linguistic Annotation | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
Linguistic Data Acquisition | The Universal XML Organizer: UXO | 253 |
Linguistic Database | CDB - A Database of Lexical Collocations | 189 |
Linguistic Resources | Towards a Standard for Meta-descriptions of Language Resources | 125 |
Linguistic Software | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
Lingware Development | Using a Formal Approach to Evaluate Grammars | 285 |
Listening and Understanding | Spoken Portuguese: Geographic and Social Varieties | 71 |
Live Lexicon | Live Lexicons and Dynamic Corpora Adapted to the Network Resources for Chinese Spoken Language Processing Applications in an Internet Era | 214 |
Local grammars | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
Log files | Dialogue Annotation for Language Systems Evaluation | 33 |
Low Bit Rate Speech Compression | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
LR Domains | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
LR Tools | An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG | 371 |
LVCSR | IPA Japanese Dictation Free Software Project | 261 |
LVCSR Workbench | IPA Japanese Dictation Free Software Project | 261 |
Machine Learning | Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
Machine Readable Dictionaries | Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
Machine Translation | A Word Sense Disambiguation Method Using Bilingual Corpus | 15 |
Evaluating Translation Quality as Input to Product Development | 136 |
Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research | 278 |
NL-Translex: Machine Translation for Dutch | 294 |
Chinese-English Semantic Resource Construction | 327 |
Maintenance | A Platform for Dutch in Human Language Technologies | 348 |
MARTIF | A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
Masking-and-Matching Technique | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
Maturity | For a Repository of NLP Tools | 316 |
MAUS | Regional Pronunciation Variants for Automatic Segmentation | 307 |
Medical Terms | ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
Membership | Recent Developments within the European Language Resources Association (ELRA) | 377 |
Meta-Data | Towards a Standard for Meta-descriptions of Language Resources | 125 |
Method | ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
Metrics | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
Evaluating Multi-party Multi-modal Systems | 368 |
Microphone array | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
Minimally Supervised Approach | Minimally Supervised Japanese Named Entity Recognition: Resources and Evaluation | 258 |
Minority Languages | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
Modular Programming | An Open Architecture for the Construction and Administration of Corpora | 176 |
Monolingual Corpora | A Flexible Infrastructure for Large Monolingual Corpora | 226 |
Monolingual LR | An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG | 371 |
Morpheme-Based Annotation | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
Morphological ambiguities | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
Morphological Analysis | Language Resources Development at the Spanish Royal Academy | 297 |
Morphological Processing | Automatic Style Categorisation of Corpora in the Greek Language | 301 |
Morphological tagging | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
Morphology | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
A Word-level Morphosyntactic Analyzer for Basque | 44 |
A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
An Approach to Lexical Development for Inflectional Languages | 256 |
Morphophonology | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
Morpho-Syntactic Annotation | A Web-based Text Corpora Development System | 105 |
Morphosyntax | A Word-level Morphosyntactic Analyzer for Basque | 44 |
Morpohological Annotation vs Tagging | The Context (not only) for Humans | 156 |
MRD | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
MUC-7 | CLinkA A Coreferential Links Annotator | 179 |
Multi-Channel Databases | Annotation of a Multichannel Noisy Speech Corpus | 358 |
Multi-Criteria Diagnosis | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
Multidimensional Statistics | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
Multi-Document Summarization | Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
Multilingual | SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
Multi-Lingual | Cross-lingual Interpolation of Speech Recognition Models | 147 |
Multilingual Alignment | Lexical and Translation Equivalence in Parallel Corpora | 122 |
Multilingual Corpora | Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
Multilingual Information Schemes | Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents | 202 |
Multilingual LR | Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
Multilingual Terminological Resources | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
Multilingual Test Collections | The Evaluation of Systems for Cross-language Information Retrieval | 70 |
Multilingual Tool | CLinkA A Coreferential Links Annotator | 179 |
Multilingual Wordnets | Coping with Lexical Gaps when Building Aligned Multilingual Wordnets | 236 |
Multi-Linguality | The COST 249 SpeechDat Multilingual Reference Recogniser | 274 |
Multimedia | Addizionario: an Interactive Hypermedia Tool for Language Learning | 374 |
Multi-Media | The EUDICO Project, Multi Media Annotation over the Internet | 45 |
Multi-Media and Multi-Modal Language Resources | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Named Entity Extraction | Annotating Resources for Information Extraction | 263 |
Named Entity Recognition | LT TTT - A Flexible Tokenisation Tool | 93 |
Named Entity Recognition in Greek Texts | 173 |
Named Entity Tagging | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
Natural Language Generation | Target Suites for Evaluating the Coverage of Text Generators | 92 |
Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture | 244 |
Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
Natural Language Processing | A Word Sense Disambiguation Method Using Bilingual Corpus | 15 |
Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Natural Language Processing and Information Retrieval | Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine | 250 |
Natural Language Processing Systems | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
Neo-Classical Compounds | Use of Greek and Latin Forms for Term Detection | 55 |
Neural Networks | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
Newspapers | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
NIST | Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
NLG | Target Suites for Evaluating the Coverage of Text Generators | 92 |
NLP | Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
Morphological Tagging to Resolve Morphological Ambiguities | 277 |
Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
NLP Applications | For a Repository of NLP Tools | 316 |
NLP Offer | For a Repository of NLP Tools | 316 |
NLP Tools | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
Language Resources Development at the Spanish Royal Academy | 297 |
NLP-Lexicon | Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
Non-Linear | The (Un)Deterministic Nature of Morphological Context | 200 |
Non-Native Speech | The ISLE Corpus of Non-Native Spoken English | 313 |
Non-Uniform Units | Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System | 177 |
Objective evaluation | The Cost258 Signal Generation Test Array | 1 |
Object-Oriented Approach | Object-oriented Access to the Estonian Phonetic Database | 128 |
OCR | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
OCR Database | GRUHD: A Greek database of Unconstrained Handwriting | 289 |
On-Line Evaluation | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
Ontologies | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Ontology | Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
Oral Corpora | Portuguese Corpora at CLUL | 72 |
Oral Databases | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
Organisational Issues | A Platform for Dutch in Human Language Technologies | 348 |
Orthographic Labeling | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
Orthographic Transcription | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
Orthography Conversion | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
Paradigmatic and Semantic Properties of Suffixes | Derivation in the Czech National Corpus | 153 |
Parallel Corpora | Lexical and Translation Equivalence in Parallel Corpora | 122 |
Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
Parallel Corpus | Building the Croatian-English Parallel Corpus | 119 |
A Parallel Corpus of Italian/German Legal Texts | 140 |
Parallel Multilingual Evaluation Materials | A Parallel English-Japanese Query Collection for the Evaluation of On-Line Help Systems | 248 |
Parallel Text | Evaluation of Word Alignment Systems | 137 |
Parallell Corpora | The Bank of Swedish | 300 |
Parametric | A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
PAROLE | Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
Parsing | A Treebank of Spanish and its Application to Parsing | 66 |
Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
Building a Treebank for French | 230 |
Parsing Evaluation | Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation | 98 |
Partial Parsing | PoS Disambiguation and Partial Parsing Bidirectional Interaction | 169 |
A Robust Parser for Unrestricted Greek Text | 174 |
Partnership | Recent Developments within the European Language Resources Association (ELRA) | 377 |
Part-of-Speech Tagging | A Web-based Text Corpora Development System | 105 |
The Bank of Swedish | 300 |
Perception | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
Perceptual Evaluation | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
Performance Variations | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
Performances Measures | Predictive Performance of Dialog Systems | 303 |
Phonetic Database | Object-oriented Access to the Estonian Phonetic Database | 128 |
Phonetic Knowledge | Object-oriented Access to the Estonian Phonetic Database | 128 |
Phonetics | Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
Phrasal verbs | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
Polish | An HPSG-Annotated Test Suite for Polish | 272 |
Polish Morphology | Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
Polysemy | What's in a Thesaurus? | 180 |
Portuguese Corpus | Spoken Portuguese: Geographic and Social Varieties | 71 |
Portuguese Resources | Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
Portuguese Varieties | Spoken Portuguese: Geographic and Social Varieties | 71 |
Portuguese Corpora at CLUL | 72 |
PoS Disambiguation | PoS Disambiguation and Partial Parsing Bidirectional Interaction | 169 |
POS Tagging | Something Borrowed, Something Blue: Rule-based Combination of POS Taggers | 158 |
A Unified POS Tagging Architecture and its Application to Greek | 181 |
Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
POS1 | Morphological Tagging to Resolve Morphological Ambiguities | 277 |
Predicate-Argument Structure | Semantic Tagging for the Penn Treebank | 197 |
Priorities | A Platform for Dutch in Human Language Technologies | 348 |
Privileged co-Occurrence | Improving Lexical Databases with Collocational Information: Data from Portuguese | 340 |
Probabilistic Method | Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation | 350 |
Processing | A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing | 111 |
Productivity of Suffixes | Derivation in the Czech National Corpus | 153 |
Project Introduction | Introduction of KIBS (Korean Information Base System) Project | 239 |
Prompting Strategy | Dialogue and Prompting Strategies Evaluation in the DEMON System | 38 |
Pronunciation | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
Pronunciation Dictionary | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
Pronunciation Lexica Editor | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
Pronunciation Lexicon | PLEDIT - A New Efficient Tool for Management of Multilingual Pronunciation Lexica and Batchlists | 53 |
Pronunciation Model | MHATLex: Lexical Resources for Modelling the French Pronunciation | 37 |
Pronunciation Training | The ISLE Corpus of Non-Native Spoken English | 313 |
Pronunciation Variants | A French Phonetic Lexicon with Variants for Speech and Language Processing | 133 |
Prosodic Boundaries | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
Prosody | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
Quality Control | Developing Guidelines and Ensuring Consistency for Chinese Text Annotation | 287 |
Query Language | The MATE Workbench Annotation Tool, a Technical Description | 166 |
Question Answering | The TREC-8 Question Answering Track | 26 |
How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done | 203 |
Question-Answering | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
Questionnaires | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Rarity | Rarity of Words in a Language and in a Corpus | 295 |
Rating Scales | Evaluating Translation Quality as Input to Product Development | 136 |
Reading Comprehension | How to Evaluate Your Question Answering System Every Day ... and Still Get Real Work Done | 203 |
Real environments | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
Recruitment | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
Reduced Frequency | Rarity of Words in a Language and in a Corpus | 295 |
Reference Systems | The COST 249 SpeechDat Multilingual Reference Recogniser | 274 |
Referential Expressions | Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results | 193 |
Regional Pronunciation Rules | Regional Pronunciation Variants for Automatic Segmentation | 307 |
Relational Model and Database for Lexical Collocations | CDB - A Database of Lexical Collocations | 189 |
Research Center | Terminology in Korea: KORTERM | 276 |
Resource Discovery | Towards a Standard for Meta-descriptions of Language Resources | 125 |
Resources | SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX | 62 |
Reusability as Easy Adaptability: A Substantial Advance in NL Technology | 298 |
Reusability | Enabling Resource Sharing in Language Generation: an Abstract Reference Architecture | 244 |
Reusability as Easy Adaptability: A Substantial Advance in NL Technology | 298 |
Reuse | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
Reuse of Resources | Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
Reusing LR | Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
Rexical Semantic Networks | ItalWordNet: a Large Semantic Database for Italian | 129 |
Robust Parsing | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
Rule-Based | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
Rule-Based Morphology | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
Rule-Based Stemming | GéDériF: Automatic Generation and Analysis of Morphologically Constructed Lexical Resources | 279 |
SALT | Accessibility of Multilingual Terminological Resources - Current Problems and Prospects for the Future | 283 |
Satellited-Framed Language | Semantic Encoding of Danish Verbs in SIMPLE - Adapting a Verb Framed Model to a Satellite-framed Language | 13 |
Scheme-Scripting | The Universal XML Organizer: UXO | 253 |
Scientific and Cultural Research | Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
Segmentation | Annotation of a Multichannel Noisy Speech Corpus | 358 |
Segmentation Tools | SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
Semantic Classification | Controlled Bootstrapping of Lexico-semantic Classes as a Bridge between Paradigmatic and Syntagmatic Knowledge: Methodology and Evaluation | 99 |
Semantic Lexicon | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
Semantic Lexicons | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
Semantic Network | A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
Semantic Resource Construction | Chinese-English Semantic Resource Construction | 327 |
Semantic Similarity | What's in a Thesaurus? | 180 |
Semantic Tagging | An Experiment of Lexical-Semantic Tagging of an Italian Corpus | 60 |
Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
Semantics | Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Semetic Language SR | Creation of Spoken Hebrew Databases | 52 |
Sense Discrimination | Lexical and Translation Equivalence in Parallel Corpora | 122 |
Sense Distinctions | Semantic Tagging for the Penn Treebank | 197 |
Sense Tagging | An Experiment of Lexical-Semantic Tagging of an Italian Corpus | 60 |
SENSEVAL | English Senseval: Report and Results | 8 |
Sentence Aligment | Some Technical Aspects about Aligning Near Languages | 186 |
Sentence Alignment | Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora | 208 |
Sentence Extraction | Term-based Identification of Sentences for Text Summarisation | 106 |
Server | The Cost258 Signal Generation Test Array | 1 |
SGML | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
Shallow Parsing | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
Shallow Processing | Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking | 299 |
Shalow Parsing | Building a Treebank for French | 230 |
Signal-Level Annotation | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
Similarity Measure | Cardinal, Nominal or Ordinal Similarity Measures in Comparative Evaluation of Information Retrieval Process | 367 |
SIMPLE | Annotating, Disambiguating & Automatically Extending the Coverage of the Swedish SIMPLE Lexicon | 84 |
SIMPLE Computational Lexicon | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
Size-Limited Resources | Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation | 350 |
SLDSs Evaluation Procedures | Dialogue Annotation for Language Systems Evaluation | 33 |
Slovene Langauge | Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets | 146 |
Software Architectur | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
SORIDA | Design Methodology for Bilingual Pronunciation Dictionary | 269 |
Sound Scene | Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition | 356 |
Spanish | A Treebank of Spanish and its Application to Parsing | 66 |
Language Resources Development at the Spanish Royal Academy | 297 |
Spanish and Portuguese | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
Speaker Recognition | Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
Speech | The Cost258 Signal Generation Test Array | 1 |
Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
Models of Russian Text/Speech Interactive Databases for Supporting of Scientific, Practical and Cultural Researches | 185 |
Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
Speech Act | Shallow Discourse Genre Annotation in CallHome Spanish | 228 |
Speech Corpora | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
A Graphical Parametric Language-Independent Tool for the Annotation of Speech Corpora | 314 |
Speech Corpus | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
Speech Corpus Annotation | The ISLE Corpus of Non-Native Spoken English | 313 |
Speech Corpus Annotation Tool | POSCAT: A Morpheme-based Speech Corpus Annotation Tool | 224 |
Speech Corpus Design | The ISLE Corpus of Non-Native Spoken English | 313 |
Speech Data Annotation | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
Speech Database | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
Speech Databases | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
Developing a Multilingual Telephone Based Information System in African Languages | 329 |
SpeechDat-Car Fixed Platform | 331 |
COCOSDA - a Progress Report | 364 |
Speech Dictation Task | Issues in Design and Collection of Large Telephone Speech Corpus for Slovenian Language | 246 |
Speech Generation | Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
Speech Input/Output | COCOSDA - a Progress Report | 364 |
Speech Lexica | Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
Speech Processing | Object-oriented Access to the Estonian Phonetic Database | 128 |
Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Speech Recognition | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
Creation of Spoken Hebrew Databases | 52 |
SPEECON - Speech Data for Consumer Devices | 63 |
Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
Cross-lingual Interpolation of Speech Recognition Models | 147 |
Spontaneous Speech Corpus of Japanese | 262 |
The ISLE Corpus of Non-Native Spoken English | 313 |
Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling | 357 |
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments | 373 |
Speech Synthesis | Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
Speech Transcription Tool | Transcribing with Annotation Graphs | 192 |
Speech Understanding | Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
SpeechDat | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
SpeechDat-Car | SpeechDat-Car Fixed Platform | 331 |
Speech-to-Speech Machine Translation | Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation | 215 |
Spoken Commands | Corpora of Slovene Spoken Language for Multi-lingual Applications | 288 |
Spoken Corpora | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
Spoken Corpus | Language Resources Development at the Spanish Royal Academy | 297 |
Spoken Database | Creation of Spoken Hebrew Databases | 52 |
Spoken Dialog Systems | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
Spoken Dialogue Architectures | Galaxy-II as an Architecture for Spoken Dialogue Evaluation | 116 |
Spoken Dialogue Dystems | Dialogue and Prompting Strategies Evaluation in the DEMON System | 38 |
Galaxy-II as an Architecture for Spoken Dialogue Evaluation | 116 |
Spoken Dialogue Systems | On the Use of Prosody for On-line Evaluation of Spoken Dialogue Systems | 43 |
Evaluation of a Dialogue System Based on a Generic Model that Combines Robust Speech Understanding and Mixed-initiative Control | 101 |
A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components | 135 |
Spoken Dutch | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
Spoken Language Corpora | The Spoken Dutch Corpus. Overview and First Evaluation | 110 |
Spoken Language Dialogue Systems | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
Spoken Language Resources | Orthographic Transcription of the Spoken Dutch Corpus | 87 |
Annotating Communication Problems Using the MATE Workbench | 134 |
Spoken Language Understanding | Obtaining Predictive Results with an Objective Evaluation of Spoken Dialogue Systems: Experiments with the DCR Assessment Paradigm | 36 |
Spoken Language Varieties | SegWin: a Tool for Segmenting, Annotating, and Controlling the Creation of a Database of Spoken Italian Varieties | 310 |
Spoken Portuguese | Spoken Portuguese: Geographic and Social Varieties | 71 |
Spontaneous Speech | What are Transcription Errors and Why are They made? | 205 |
The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
Spontaneous Speech Corpus of Japanese | 262 |
Standardisation | Annotating Communication Problems Using the MATE Workbench | 134 |
COCOSDA - a Progress Report | 364 |
Standardization | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
Towards a Standard for Meta-descriptions of Language Resources | 125 |
Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
Standards | The American National Corpus: A Standardized Resource for American English | 196 |
Standards and Tools | Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium | 209 |
Statistical Language Model | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
Statistical Linguistics | Modern Greek Corpus Taxonomy | 351 |
Statistical Machine Translation | Evaluation of TRANSTYPE, a Computer-aided Translation Typing System: A Comparison of a Theoretical- and a User-oriented Evaluation Procedures | 34 |
Cairo: An Alignment Visualization Tool | 58 |
Statistical NLP | Term-based Identification of Sentences for Text Summarisation | 106 |
Statistics | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Stress | Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
Structure Driven Processing | Interarbora and Thistle - Delivering Linguistic Structure by the Internet | 319 |
Stylesheet | The MATE Workbench Annotation Tool, a Technical Description | 166 |
Stylistic Analysis | Modern Greek Corpus Taxonomy | 351 |
Subband Approach | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
Subcategorization | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
Automatic Assignment of Grammatical Relations | 218 |
Subject Field Codes | Integrating Subject Field Codes into WordNet | 219 |
Sublanguages | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
ARISTA Generative Lexicon for Compound Greek Medical Terms | 360 |
Success Rate | Issues in the Evaluation of Spoken Dialogue Systems - Experience from the ACCeSS Project | 162 |
Surface realization | Target Suites for Evaluating the Coverage of Text Generators | 92 |
Surveys | For a Repository of NLP Tools | 316 |
Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Recent Developments within the European Language Resources Association (ELRA) | 377 |
SVD | Perceptual Evaluation of a New Subband Low Bit Rate Speech Compression System based on Waveform Vector Quantization and SVD Postfiltering | 16 |
Syntactic Analysis | A Strategy for the Syntactic Parsing of Corpora: from Constraint Grammar Output to Unification-based Processing | 111 |
Syntactic and Semantic Constraints | Extraction of Concepts and Multilingual Information Schemes from French and English Economics Documents | 202 |
Syntactic Annotation of Corpora | Where Opposites Meet. A Syntactic Meta-scheme for Corpus Annotation and Parsing Evaluation | 98 |
Syntactic Normalization | An Architecture for Document Routing in Spanish: Two Language Components, Pre-processor and Parser | 91 |
Syntactic Patterns | Producing LRs in Parallel with Lexicographic Description: the DCC project | 112 |
Syntax | Semantico-syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level | 18 |
Coreference in Annotating a Large Corpus | 19 |
Towards A Universal Tool For NLP Resource Acquisition | 28 |
An XML-based Representation Format for Syntactically Annotated Corpora | 59 |
A Treebank of Spanish and its Application to Parsing | 66 |
Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
Syntax-Semantics Linking | SIMPLE: A General Framework for the Development of Multilingual Lexicons | 61 |
Synthesizer | Guidelines for Japanese Speech Synthesizer Evaluation | 77 |
System Evaluation Methodologies | The Evaluation of Systems for Cross-language Information Retrieval | 70 |
Tag Set for Chinese | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
Tagged Corpus | Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
Tagger Evaluation | Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
Tagging | Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets | 146 |
Bootstrapping a Tagged Corpus through Combination of Existing Heterogeneous Taggers | 155 |
Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
Building a Treebank for French | 230 |
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
Tagset | Rule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions | 199 |
Tagset Design | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
Part of Speech Tagging and Lemmatisation for the Spoken Dutch Corpus | 216 |
Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
Tagset Reduction | Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
Target suites | Target Suites for Evaluating the Coverage of Text Generators | 92 |
Task-Based Evaluation | A Comparison of Summarization Methods Based on Task-based Evaluation | 14 |
Lessons Learned from a Task-based Evaluation of Speech-to-Speech Machine Translation | 215 |
TDT | Enhancing the TDT Tracking Evaluation | 5 |
TEI | The Concede Model for Lexical Databases | 335 |
TEI-Conformant Feature Structures | A Proposal for the Integration of NLP Tools using SGML-Tagged Documents | 68 |
Telephone Speech | SALA: SpeechDat across Latin America. Results of the First Phase | 10 |
Telephony Applications | Creation of Spoken Hebrew Databases | 52 |
Telephony Speech Databases | The COST 249 SpeechDat Multilingual Reference Recogniser | 274 |
Temporal Information | Annotating Events and Temporal Information in Newswire Texts | 321 |
Term Detection | Use of Greek and Latin Forms for Term Detection | 55 |
Term Extraction | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
Automatic Extraction of English-Chinese Term Lexicons from Noisy Bilingual Corpora | 208 |
Terminological Resources | Term-based Identification of Sentences for Text Summarisation | 106 |
Terminology | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
Creating and Using Domain-specific Ontologies for Terminological Applications | 22 |
Extraction of Semantic Clusters for Terminological Information Retrieval from MRDs | 35 |
Use of Greek and Latin Forms for Term Detection | 55 |
Terminology in Korea: KORTERM | 276 |
Terminology Extraction | ARC A3: A Method for Evaluating Term Extracting Tools and/or Semantic Relations between Terms from Corpora | 247 |
Terminology Lexicons | Automatically Augmenting Terminological Lexicons from Untagged Text | 320 |
Terminology Resource | Terminology in Korea: KORTERM | 276 |
Test Suite | An HPSG-Annotated Test Suite for Polish | 272 |
Test suites | Target Suites for Evaluating the Coverage of Text Generators | 92 |
Text | Resources for Multilingual Text Generation in Three Slavic Languages | 241 |
Text Analysis | Terms Specification and Extraction within a Linguistic-based Intranet Service | 17 |
Text Categorization | Modern Greek Corpus Taxonomy | 351 |
Text Classification | Integrating Subject Field Codes into WordNet | 219 |
Text Corpora | A Web-based Text Corpora Development System | 105 |
Text Processing | A Computational Platform for Development of Morphologic and Phonetic Lexica | 175 |
Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System | 177 |
Text Summarisation | Term-based Identification of Sentences for Text Summarisation | 106 |
Text Technology | Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
Text Typology | TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation | 254 |
Text Understanding | Coreference Resolution Evaluation Based on Descriptive Specificity | 131 |
Text-Handling tasks | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
Text-independence | Design Issues in Text-Independent Speaker Recognition Evaluation | 286 |
Text-To-Speech | Perceptual Evaluation of Text-to-Speech Implementation of Enclitic Stress in Greek | 48 |
Textual Typology | Sublanguage Dependent Evaluation: Toward Predicting NLP performances | 252 |
Thematic Roles | Chinese-English Semantic Resource Construction | 327 |
Thesaurus | Automatically Expansion of Thesaurus Entries with a Different Thesaurus | 142 |
What's in a Thesaurus? | 180 |
Tiered Tagging | Using a Large Set of EAGLES-compliant Morpho-syntactic Descriptors as a Tagset for Probabilistic Tagging | 11 |
Principled Hidden Tagset Design for Tiered Tagging of Hungarian | 249 |
ToBI | GREEK ToBI: A System for the Annotation of Greek Speech Corpora | 7 |
Tokenisation | LT TTT - A Flexible Tokenisation Tool | 93 |
Tool Directory | For a Repository of NLP Tools | 316 |
Tool Typology | For a Repository of NLP Tools | 316 |
Tools | Portuguese Corpora at CLUL | 72 |
Annotating Communication Problems Using the MATE Workbench | 134 |
Reusability as Easy Adaptability: A Substantial Advance in NL Technology | 298 |
Tools for Grammar Evaluation | An HPSG-Annotated Test Suite for Polish | 272 |
Topic Detection | Evaluating Summaries for Multiple Documents in an Interactive Environment | 163 |
Topic Detection and Tracking | Large, Multilingual, Broadcast News Corpora for Cooperative Research in Topic Detection and Tracking: The TDT-2 and TDT-3 Corpus Efforts | 210 |
Topic Tracking | Enhancing the TDT Tracking Evaluation | 5 |
Topic(s) of Interest | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
Traditional Dictionaries | Tools for the Generation of Morphological Entries in Dictionaries | 338 |
Trained Systems | Annotating Resources for Information Extraction | 263 |
Training Data | Semantic Tagging for the Penn Treebank | 197 |
Transcription | What are Transcription Errors and Why are They made? | 205 |
Transcription Tool | The Universal XML Organizer: UXO | 253 |
Transformation Based Learning | A Unified POS Tagging Architecture and its Application to Greek | 181 |
Translation Equivalents | Designing a Tool for Exploiting Bilingual Comparable Corpora | 20 |
Translation Memory | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
Translation Quality | Evaluating Translation Quality as Input to Product Development | 136 |
Translation Resource | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
Translation Studies | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
Translational English | A Software Toolkit for Sharing and Accessing Corpora Over the Internet | 337 |
Tree Bank | Introduction of KIBS (Korean Information Base System) Project | 239 |
Tree-Annotated Coprpus | Semi-automatic Construction of a Tree-annotated Corpus Using an Iterative Learning Statistical Language Model | 341 |
Treebank | A Treebank of Spanish and its Application to Parsing | 66 |
Building a Treebank for Italian: a Data-driven Annotation Schema | 220 |
Building a Treebank for French | 230 |
Turn-Taking | The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
Unconstrained Speech Recognition | Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers | 90 |
Unconstrained Writing | GRUHD: A Greek database of Unconstrained Handwriting | 289 |
Up-Translation | The Concede Model for Lexical Databases | 335 |
Usability | Dialogue and Prompting Strategies Evaluation in the DEMON System | 38 |
Evaluating Multi-party Multi-modal Systems | 368 |
Usage | For a Repository of NLP Tools | 316 |
User Friendly Search System | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
User Interface | CLinkA A Coreferential Links Annotator | 179 |
User Needs | Survey of Language Engineering Needs: a Language Resources Perspective | 317 |
Users | Determining the Tolerance of Text-handling Tasks for MT Output | 139 |
Valency | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
Validation | The TREC-8 Question Answering Track | 26 |
Recent Developments within the European Language Resources Association (ELRA) | 377 |
Variants | Regional Pronunciation Variants for Automatic Segmentation | 307 |
Variation | The Influence of Scenario Constraints on the Spontaneity of Speech. A Comparison of Dialogue Corpora | 217 |
Verb | Learning Verb Subcategorization from Corpora: Counting Frame Subsets | 145 |
Design and Construction of Knowledge base for Verb using MRD and Tagged Corpus | 237 |
Verb Classification | Establishing the Upper Bound and Inter-judge Agreement of a Verb Classification Task | 233 |
Vernacular Languages | Issues from Corpus Analysis that have influenced the On-going Development of Various Haitian Creole Text- and Speech-based NLP Systems and Applications | 342 |
Visualization | Cairo: An Alignment Visualization Tool | 58 |
Interarbora and Thistle - Delivering Linguistic Structure by the Internet | 319 |
Web | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
Web Application | Design and Implementation of the Online ILSP Greek Corpus | 336 |
Web Interface | A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts | 363 |
Web Interfaces | Providing Internet Access to Portuguese Corpora: the AC/DC Project | 85 |
Web Search | A Flexible Infrastructure for Large Monolingual Corpora | 226 |
Web-Based Concordance | A Self-Expanding Corpus Based on Newspapers on the Web | 362 |
Welsh | Recruitment Techniques for Minority Language Speech Databases: Some Observations | 167 |
Wizard of Oz Simulation | MDWOZ: A Wizard of Oz Environment for Dialog Systems Development | 104 |
Word Accent | Labeling of Prosodic Events in Slovenian Speech Database GOPOLIS | 292 |
Word Alignment | Cairo: An Alignment Visualization Tool | 58 |
Evaluation of Word Alignment Systems | 137 |
Word Clustering | Automatic Extraction of Semantic Similarity of Words from Raw Technical Texts | 302 |
Word Combinations | Towards a Strategy for a Representation of Collocations - Extending the Danish PAROLE-lexicon | 47 |
Word Formation | Application of WordNet ILR in Czech Word-formation | 223 |
Word Formation Rules | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
Word Segmentation | The Multi-layer Language Knowledge Base of Chinese NLP | 29 |
Word Segmentation and Part-of-Speech Tagging | Hua Yu: A Word-segmented and Part-Of-Speech Tagged Chinese Corpus | 372 |
Word Sense Disambiguation | English Senseval: Report and Results | 8 |
Abstraction of the EDR Concept Classification and its Effectiveness in Word Sense Disambiguation | 75 |
Tuning Lexicons to New Operational Scenarios | 330 |
Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation | 350 |
Word Sense Tagging | A Word Sense Disambiguation Method Using Bilingual Corpus | 15 |
WordNet | Integrating Subject Field Codes into WordNet | 219 |
Application of WordNet ILR in Czech Word-formation | 223 |
Evaluation of a Generic Lexical Semantic Resource in Information Extraction | 259 |
Word-Sense Disambiguation | An Experiment of Lexical-Semantic Tagging of an Italian Corpus | 60 |
World Wide Web | TransSearch: A Free Translation Memory on the World Wide Web | 12 |
Written Corpora | Portuguese Corpora at CLUL | 72 |
Written Language Resources | NL-Translex: Machine Translation for Dutch | 294 |
XCES | Building the Croatian-English Parallel Corpus | 119 |
xfst | An Optimised FS Pronunciation Resource Generator for Highly Inflecting Languages | 251 |
XML | An XML-based Representation Format for Syntactically Annotated Corpora | 59 |
Building the Croatian-English Parallel Corpus | 119 |
A Step toward Semantic Indexing of an Encyclopedic Corpus | 161 |
The MATE Workbench Annotation Tool, a Technical Description | 166 |
XCES: An XML-based Encoding Standard for Linguistic Corpora | 172 |
A Unified POS Tagging Architecture and its Application to Greek | 181 |
Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers | 183 |
A Multi-view Hyperlexicon Resource for Speech and Language System Development | 243 |
The Concede Model for Lexical Databases | 335 |
XML Mark-Up | LT TTT - A Flexible Tokenisation Tool | 93 |
XML Representation | Automatic Generation of Dictionary Definitions from a Computational Lexicon | 306 |
XML-Editor | The Universal XML Organizer: UXO | 253 |