Keyword |
title |
A |
|
Abbreviation |
Clustering acronyms in biomedical text for disambiguation
|
Accent restoration |
A Grapheme-Based Approach for Accent Restoration in Gikuyu
|
Access by meaning |
Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
|
Acquisition |
Building a network of topical relations from a corpus
Next Generation Language Resources using Grid
|
Acronym |
Clustering acronyms in biomedical text for disambiguation
|
Acronym recognition |
Towards a terminological resource for biomedical text mining
|
Acronyms and definitions |
Recognizing Acronyms and their Definitions in Swedish Medical Texts
|
Act-topic annotation |
Act-Topic Patterns for Automatically Checking Dialogue Models
|
Actantial structure |
A Methodology for Developing Multilingual Resources for Terminology
|
Adaptation |
Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
|
Adequacy |
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
X-Score: Automatic Evaluation of Machine Translation Grammaticality
|
Adjudication |
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
|
Adverbs |
LEXADV - a multilingual semantic Lexicon for Adverbs
|
Agglutinative |
Morphological annotation of Korean with Directly Maintainable Resources
|
Agreement |
Annotating Emotions in Meetings
|
Alignment |
Aligning Multilingual Thesauri
Czech-English Word Alignment
|
Alternation |
Valency Lexicon of Czech Verbs: Alternation-Based Model
|
Anagram-key spelling correction |
Corpus-Induced Corpus Clean-up
|
Analysis |
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
|
Anaphora |
Coreference Resolution with and without Linguistic Knowledge
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
|
DUC |
Automated Summarization Evaluation with Basic Elements.
|
Annotated corpora |
HAREM: An Advanced NER Evaluation Contest for Portuguese
|
Annotation |
A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
An Annotated Corpus Management Tool: ChaKi
An Introduction to NLP-based Textual Anonymisation
Annotating Emotions in Meetings
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Dependency-structure Annotation to Corpus of Spontaneous Japanese
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
H. C. Andersen Conversation Corpus
KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
Lexicon Development for Varieties of Spoken Colloquial Arabic
Linguistic Resources for Speech Parsing
Morphological annotation of Korean with Directly Maintainable Resources
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
NPs for Events: Experiments in Coreference Annotation
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
The MULINCO corpus and corpus platform
UAM Text Tools - a flexible NLP architecture
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Annotation of Temporal Relations with Tango
|
Annotation evaluation |
Fear-type emotions of the SAFE Corpus: annotation issues
|
Annotation guidelines |
An Annotated Corpus of Typical Durations of Events
NPs for Events: Experiments in Coreference Annotation
Annotating COMPARA, a Grammar-aware Parallel Corpus
|
Annotation mining |
A BLARK extension for temporal annotation mining
Discourse functions of duration in Mandarin: resource design and implementation
|
Annotation of emotion |
Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
|
Annotation scheme |
Annotating Bridging Anaphors in Italian: in Search of Reliability
NPs for Events: Experiments in Coreference Annotation
|
Annotation tool |
Dependency-structure Annotation to Corpus of Spontaneous Japanese
|
Anonymisation |
An Introduction to NLP-based Textual Anonymisation
|
Answer selection |
Exploiting Multiple Semantic Resources for Answer Selection
|
Answer Time-Bank |
An Answer Bank for Temporal Inference
|
Appraisal |
Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
|
Arabic |
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Developing and Using a Pilot Dialectal Arabic Treebank
Lexicon Development for Varieties of Spoken Colloquial Arabic
Parallel Syntactic Annotation of Multiple Languages
What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
|
Argument structure |
Romanian Valence Dictionary in XML Format
Valency Lexicon of Czech Verbs: Alternation-Based Model
|
Artificial intelligence |
MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
|
Asia |
Oriental COCOSDA: Past, Present and Future
|
Augmentation of lexicons |
Moving to dynamic computational lexicons with LeXFlow
|
Augmentative and Alternative Communication |
Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
|
Automatic alignment |
Building a Swedish-Turkish Parallel Corpus
|
Automatic generation of web portals |
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
|
Automatic lexical acquisition |
A Large Subcategorization Lexicon for Natural Language Processing Applications
|
Automatic MT evaluation |
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
X-Score: Automatic Evaluation of Machine Translation Grammaticality
|
Automatic phonetic transcription |
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
|
Automatic segmentation |
Creation and analysis of a Polish speech database for use in unit selection synthesis
|
Automatic Speech Recognition (ASR) |
Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign
|
Automatic Text Summarization |
Towards Holistic Summarization -- Selecting Summaries, Not Sentences
Computer-aided summarisation -- what the user really wants
|
Average reduced frequency |
New Approach to Frequency Dictionaries - Czech Example
|
B |
|
Back translation |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
|
Basic Elements |
Automated Summarization Evaluation with Basic Elements.
Summarizing Answers for Complicated Questions
|
Bilingual |
Data-driven Amharic-English Bilingual Lexicon Acquisition
Bilingual speech corpus in two phonetically similar languages
Bilingual Machine-Aided Indexing
|
Bilingual corpora |
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
|
Bilingual lexicon acquisition |
Automatic Detection of Orthographic Cues for Cognate Recognition
|
Bio-NLP |
Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
|
Bioinformatics |
Linguistic and Biological Annotations of Biological Interaction Events
|
Biomedical corpora |
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
|
Biomedical literature |
Clustering acronyms in biomedical text for disambiguation
|
Biomedicine |
Towards a terminological resource for biomedical text mining
|
Bitext |
Building a Swedish-Turkish Parallel Corpus
|
BLARK |
A BLARK extension for temporal annotation mining
|
BLAST |
A BLARK extension for temporal annotation mining
|
Blind Evaluation |
Blind Evaluation for Thai Search Engines
|
Bracket scores |
SParseval: Evaluation Metrics for Parsing Speech
|
Bridging |
Annotating Bridging Anaphors in Italian: in Search of Reliability
|
Broadcast news |
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
SINOD - Slovenian non-native speech database
|
C |
|
Case frame |
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Case Frame Compilation from the Web using High-Performance Computing
|
Catalan |
Bilingual speech corpus in two phonetically similar languages
|
Categorial ambiguity |
Annotating COMPARA, a Grammar-aware Parallel Corpus
|
Centering |
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
|
CESTA evaluation |
Creating a Large-Scale Arabic to French Statistical Machine Translation System
|
CGN |
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
|
Character semantics |
Feature-based Encoding and Querying Language Resources with Character Semantics
|
Chat language |
Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
|
Chemical compounds |
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
|
Children |
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
|
Chinese collocation |
The Design and Construction of A Chinese Collocation Bank
|
Chinese Segmentation |
Towards Unified Chinese Segmentation Algorithm
|
Chinese Sketch Engine |
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
|
Chinese thesaurus |
Toward a Pan-Chinese Thesaurus
|
Chunking |
Rule-Based Chunking and Reusability
|
Class-based language models |
Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
|
Classification |
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
|
CLEF |
The Multilingual Question Answering Track at CLEF
|
CLIR |
Extraction of Cross Language Term Correspondences
|
Clustering |
Clustering acronyms in biomedical text for disambiguation
Finding representative sets of dialect words for geographical regions
|
Co-occurrence |
BACO - A large database of text and co-occurrences
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
|
COCOSDA |
Oriental COCOSDA: Past, Present and Future
|
Cognate detection |
Automatic Detection of Orthographic Cues for Cognate Recognition
|
Cognitive Theory of Metaphor |
The Italian Metaphor Database
|
Collocation bank |
The Design and Construction of A Chinese Collocation Bank
|
Collocation extraction |
Extraction tools for collocations and their morphosyntactic specificities
Semi-automatic Building of Swedish Collocation Lexicon
|
Collocations |
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Extraction tools for collocations and their morphosyntactic specificities
|
Comparability and enrichment |
Exploring opportunities for Comparability and Enrichment by Linking lexical databases
|
Compounds |
Query Expansion on Compounds
|
Computational Lexicography |
Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
|
Computational dialectology |
Dialectal resources on-line: the ALT-Web experience
|
Computational lexicon |
The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
A Large Subcategorization Lexicon for Natural Language Processing Applications
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
|
Computational morphology |
Morphological Tools for Six Small Uralic Languages
|
Computational Semantics |
Towards Holistic Summarization -- Selecting Summaries, Not Sentences
|
Computational Terminology |
Extraction of Cross Language Term Correspondences
|
Computer-aided summarization |
Computer-aided summarisation -- what the user really wants
|
Computer-assisted language learning |
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
|
Concept Set |
Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
|
Conceptual Interlingua |
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
|
Conceptual markers |
Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
|
Conceptual Metaphor |
The Italian Metaphor Database
|
Conceptual relations |
Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
|
Conceptual Vectors |
Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
|
Confidence measure |
Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
|
Confusables |
Corpus-Induced Corpus Clean-up
|
Conjunction Disambiguation |
Named Entity Extraction with Conjunction Disambiguation
|
Constraint Grammar |
Turning a Dependency Treebank into a PSG-style Constituent Treebank
|
Construction of lexical resources |
Dialectal resources on-line: the ALT-Web experience
|
Content selection |
Learning Database Content for Spoken Dialogue System Design
|
Context-based evaluation |
A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
|
Context-independent spelling correction |
A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Spelling Error Patterns in Spanish for Word Processing Applications
|
Contributing factor |
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
|
Controlled language |
Predicting MT Quality as a Function of the Source Language
|
Conversational speech |
Linguistic Resources for Speech Parsing
|
Conversational telephone speech |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
|
Conversion |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Cooperative answers |
Language Challenges for Data Fusion in Question-Answering
|
CoreLex |
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
|
Coreference |
Coreference Resolution with and without Linguistic Knowledge
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Towards Natural Interactive Question Answering
|
Corpora |
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Development of the First LRs for Macedonian: Current Projects
EQueR: the French Evaluation campaign of Question-Answering Systems
I-CAB: the Italian Content Annotation Bank
Oriental COCOSDA: Past, Present and Future
SALTO - A Versatile Multi-Level Annotation Tool
Structure, Annotation and Tools in the Basque ZT Corpus
The Multilingual Question Answering Track at CLEF
|
Corpus |
An Annotated Corpus Management Tool: ChaKi
An Annotated Corpus of Typical Durations of Events
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Case Frame Compilation from the Web using High-Performance Computing
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Discourse functions of duration in Mandarin: resource design and implementation
Evaluation of multimodal components within CHIL: The evaluation packages and results
Extraction tools for collocations and their morphosyntactic specificities
Gathering a corpus of multimodal computer-mediated meetings
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
The MULINCO corpus and corpus platform
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
|
Corpus acquisition and preprocessing |
Creating a Large-Scale Arabic to French Statistical Machine Translation System
|
Corpus analysis tools |
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
|
Corpus annotation |
A methodology for the joint development of the Basque WordNet and Semcor
Annotating COMPARA, a Grammar-aware Parallel Corpus
Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Corpus Annotation as a Test of a Linguistic Theory
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
|
Corpus annotation tool |
A new approach to syntactic annotation
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
|
Corpus Based Synthesis |
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
|
Corpus construction |
KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
|
Corpus design |
User requirements analysis for the design of a reference corpus of written Dutch
|
Corpus development |
A Uniform Interface to Large-Scale Linguistic Resources
Linguistic Resources for Speech Parsing
|
Corpus processing |
UAM Text Tools - a flexible NLP architecture
|
Corpus query interface |
A Uniform Interface to Large-Scale Linguistic Resources
|
Corpus search |
A Corpus Search System Utilizing Lexical Dependency Structure
Annotating COMPARA, a Grammar-aware Parallel Corpus
|
Corpus size |
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
|
Corpus Spoken Dutch |
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
|
Corpus-cleanup |
Corpus-Induced Corpus Clean-up
|
Corpus-induced |
Corpus-Induced Corpus Clean-up
|
Coreference |
Transferring Coreference Chains through Word Alignment
|
Correlation |
Automated Summarization Evaluation with Basic Elements.
|
Cross-language information retrieval |
The Impact of Evaluation on Multilingual Information Retrieval System Development
|
Cross-level queries |
Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
|
Cross-lingual and trans-lingual techniques |
The pragmatic combination of different crosslingual resources
|
Cross-lingual information retrieval |
A Deep Linguistic Analysis for Cross-language Information Retrieval
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
|
Crosslingual |
Bilingual Machine-Aided Indexing
|
Czech |
Towards a Slovene Dependency Treebank
|
D |
|
Danish |
LEXADV - a multilingual semantic Lexicon for Adverbs
Query Expansion on Compounds
|
Data centers |
More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Integrated Linguistic Resources for Language Exploitation Technologies
|
Data collection |
Data for question answering: The case of why
|
Data sparsity |
A Closer Look at Skip-gram Modelling
|
Data-driven methods |
MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
|
Database |
On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Spanish Synthesis Corpora
|
Database creation |
Creation and analysis of a Polish speech database for use in unit selection synthesis
|
Database Design |
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
|
Data integration |
Language Challenges for Data Fusion in Question-Answering
|
Decision points |
An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
|
Decoder |
MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
|
Deep lexical acquisition |
Automated Deep Lexical Acquisition for Robust Open Texts Processing
|
Deep parsing |
A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
Preprocessing and Tokenisation Standards in DELPH-IN Tools
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
|
Dependency grammar |
Towards a Slovene Dependency Treebank
Turning a Dependency Treebank into a PSG-style Constituent Treebank
|
Dependency parsing |
MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Dependency-Based Phrase Alignment
|
Dependency relations |
Corpus Annotation as a Test of a Linguistic Theory
|
Dependency scores |
SParseval: Evaluation Metrics for Parsing Speech
|
Dependency structure |
A Corpus Search System Utilizing Lexical Dependency Structure
An Annotated Corpus Management Tool: ChaKi
Dependency-structure Annotation to Corpus of Spontaneous Japanese
|
Dependency tree |
From Natural Language to Databases via Ontologies
|
Diacritic placement |
A Grapheme-Based Approach for Accent Restoration in Gikuyu
|
Diagrams |
Semantic Descriptors: The Case of Reflexive Verbs
|
Dialect |
A Unified Structure for Dutch Dialect Dictionary Data
Finding representative sets of dialect words for geographical regions
|
Dialectal Arabic |
Developing and Using a Pilot Dialectal Arabic Treebank
Lexicon Development for Varieties of Spoken Colloquial Arabic
|
Dialectal databases |
Dialectal resources on-line: the ALT-Web experience
|
Dialectal Treebank |
Developing and Using a Pilot Dialectal Arabic Treebank
|
Dialectometry |
Finding representative sets of dialect words for geographical regions
|
Dialog |
The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
|
Dialogue |
Towards Natural Interactive Question Answering
|
Dialogue annotation |
TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
|
Dialogue model |
Act-Topic Patterns for Automatically Checking Dialogue Models
|
Dialogue system user speech |
Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
|
Dialogue systems |
Learning Database Content for Spoken Dialogue System Design
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
|
Dictionary |
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
A Unified Structure for Dutch Dialect Dictionary Data
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
|
Dictionary creation |
Developing and Using a Pilot Dialectal Arabic Treebank
|
Dictionary interfaces |
Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
|
Dictionary model |
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
|
Digital libraries |
Natural Language Processing: A Terminological and Statistical Approach
|
Dimensions |
Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
|
Disambiguation |
On Automatic Assignment of Verb Valency Frames in Czech
|
Discourse phenomena |
Discourse functions of duration in Mandarin: resource design and implementation
|
Disfluencies |
Linguistic Resources for Speech Parsing
|
Disparate languages |
Data-driven Amharic-English Bilingual Lexicon Acquisition
|
Distillation |
Integrated Linguistic Resources for Language Exploitation Technologies
|
Distributed databases |
The Information Commons Gazetteer
|
Distributed index |
The Information Commons Gazetteer
|
Doctor-patient dialogue |
Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
|
Document categorization |
Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
|
DOLCE |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
|
Domain ontology |
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Using Core Ontology for Domain Lexicon Structuring
|
Domain specific knowledge |
Structuring a Domain Vocabulary in a General Knowledge Environment
|
Domain-specific corpora |
Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
|
Dutch |
User requirements analysis for the design of a reference corpus of written Dutch
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
|
Dynamic lexicons |
Moving to dynamic computational lexicons with LeXFlow
|
E |
|
ECESS |
ECESS Inter-Module Interface Specification for Speech Synthesis
|
EContent users |
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
|
Editing |
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
|
Elderly |
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
|
Electronic dictionaries |
Development of the First LRs for Macedonian: Current Projects
|
Ellipsis |
Towards Natural Interactive Question Answering
|
EMMA |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
|
Emotion |
Annotating Emotions in Meetings
Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
|
Emotion recognition |
Improving Automatic Emotion Recognition from Speech via Gender Differentiation
|
Emotional database |
Fear-type emotions of the SAFE Corpus: annotation issues
|
Emotional valence |
Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
|
EngValLex |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
English |
Data-driven Amharic-English Bilingual Lexicon Acquisition
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
General and Task-Specific Corpus Resources for Polish Adult Learners of English
Parallel Syntactic Annotation of Multiple Languages
The English-Slovene ACQUIS corpus
|
English inclusions |
Integrating Language Knowledge Resources to Extend the English Inclusion
|
EQUER |
Question Answering Evaluation Survey
|
Error mining |
Automated Deep Lexical Acquisition for Robust Open Texts Processing
|
EU legislation |
The English-Slovene ACQUIS corpus
|
Eurovoc |
Aligning Multilingual Thesauri
|
Evaluation |
A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
A joint prosody evaluation of French text-to-speech synthesis systems
Act-Topic Patterns for Automatically Checking Dialogue Models
Annotating Bridging Anaphors in Italian: in Search of Reliability
Automated Summarization Evaluation with Basic Elements.
Automatic Evaluation and Composition of NLP Pipelines with Web Services
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Czech-English Word Alignment
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
EQueR: the French Evaluation campaign of Question-Answering Systems
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign
Evaluation of multilingual text alignment systems: the ARCADE II project
Evaluation of multimodal components within CHIL: The evaluation packages and results
HAREM: An Advanced NER Evaluation Contest for Portuguese
Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
Predicting MT Quality as a Function of the Source Language
Question Answering Evaluation Survey
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics
Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
The Multilingual Question Answering Track at CLEF
|
Evaluation contest |
HAREM: An Advanced NER Evaluation Contest for Portuguese
|
Evaluation programs |
Integrated Linguistic Resources for Language Exploitation Technologies
|
Evaluation resources |
HAREM: An Advanced NER Evaluation Contest for Portuguese
|
Evaluation tools |
A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
|
Event annotation |
Adding multi-layer semantics to the Greek Dependency Treebank
|
Event coreference |
NPs for Events: Experiments in Coreference Annotation
|
Event durations |
An Annotated Corpus of Typical Durations of Events
|
Event taxonomy |
Adding multi-layer semantics to the Greek Dependency Treebank
|
Extractive Summarization |
Multilingual Multidocument Summarization Tools and Evaluation
|
F |
|
Fear detection |
Fear-type emotions of the SAFE Corpus: annotation issues
|
Feature extraction |
Improving Automatic Emotion Recognition from Speech via Gender Differentiation
|
Feature structures |
Feature-based Encoding and Querying Language Resources with Character Semantics
|
Features |
Linguistic features modeling based on Partial New Cache
|
Features-Cache model |
Linguistic features modeling based on Partial New Cache
|
FGD |
Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
|
Figurative Language |
Proper Names and Linguistic Dynamics
|
Field Recording |
SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
|
Filtering data |
Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
|
Finite-state transducers |
The importance of precise tokenizing for deep grammars
|
Fluency |
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
X-Score: Automatic Evaluation of Machine Translation Grammaticality
|
Focus |
Annotating Information Structure in a Corpus of Spoken Danish
|
Forensic |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
|
Frame |
On Automatic Assignment of Verb Valency Frames in Czech
|
FrameNet |
Construction of a FrameNet Labeler for Swedish Text
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
|
Framework |
GAIA: Common Framework for the Development of Speech Translation Technologies
|
Free software |
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
|
French |
Integrating Language Knowledge Resources to Extend the English Inclusion
Parallel Syntactic Annotation of Multiple Languages
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
|
French Question Answering Campaign |
Question Answering Evaluation Survey
|
Frequency |
FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
New Approach to Frequency Dictionaries - Czech Example
|
Functional Generative Description |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Fusion |
The Information Commons Gazetteer
|
G |
|
GALE |
Integrated Linguistic Resources for Language Exploitation Technologies
|
Gazetteer |
The Information Commons Gazetteer
|
Gender detection from speech |
Improving Automatic Emotion Recognition from Speech via Gender Differentiation
|
General Lexical Resources |
The Italian Metaphor Database
|
Generation |
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
|
German |
Integrating Language Knowledge Resources to Extend the English Inclusion
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
The importance of precise tokenizing for deep grammars
|
German parsers |
Improving coverage and parsing quality of a large-scale LFG for German
|
Gold standard |
An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
|
Grammar |
A Lexicalized Tree-Adjoining Grammar for Vietnamese
|
Granularity |
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
|
Grapheme to phoneme conversion |
Creation and analysis of a Polish speech database for use in unit selection synthesis
|
Grapheme-based approach |
A Grapheme-Based Approach for Accent Restoration in Gikuyu
|
Greedy algorithm |
Tools and resources for speech synthesis arising from a Welsh TTS project
|
Grid |
Next Generation Language Resources using Grid
|
H |
|
Hand-crafted grammars |
Improving coverage and parsing quality of a large-scale LFG for German
The importance of precise tokenizing for deep grammars
|
Head gesture |
Tangible Objects for the Acquisition of Multimodal Interaction Patterns
|
Hebrew |
A Computational Lexicon of Contemporary Hebrew
|
Hesitation |
Discourse functions of duration in Mandarin: resource design and implementation
|
Heterogeneous resources |
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
|
High-performance computing |
Case Frame Compilation from the Web using High-Performance Computing
|
Hindi |
Parallel Syntactic Annotation of Multiple Languages
|
Historical corpus |
Building a historical corpus for Classical Portuguese: some technological aspects
|
Human associations |
Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
|
Human error |
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
|
Human-computer communication |
Creation and analysis of a Polish speech database for use in unit selection synthesis
|
Human-machine interaction |
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
|
I |
|
ICSI Meeting Recorder Project |
Part-of-Speech Tagging of Transcribed Speech
|
Idioms |
Extraction tools for collocations and their morphosyntactic specificities
|
Image processing |
Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
|
Inconsistencies |
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
|
Information extraction |
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Constructing a Named Entity Ontology from Web Corpora
I-CAB: the Italian Content Annotation Bank
Named Entity Extraction with Conjunction Disambiguation
Linguistic and Biological Annotations of Biological Interaction Events
Ontology-based Information Extraction with SOBA
|
Information commons |
The Information Commons Gazetteer
|
Information retrieval |
Query Expansion on Compounds
Development of Linguistic Ontology on Natural Sciences and Technology
|
Information structure |
Annotating Information Structure in a Corpus of Spoken Danish
Corpus Annotation as a Test of a Linguistic Theory
|
Information theory |
Evaluation of Stop Word Lists in Chinese Language
|
Infrastructure |
Automatic Evaluation and Composition of NLP Pipelines with Web Services
A Unified Structure for Dutch Dialect Dictionary Data
|
Integration |
GAIA: Common Framework for the Development of Speech Translation Technologies
|
Intelligibility |
Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
|
Inter-annotator agreement |
An Annotated Corpus of Typical Durations of Events
|
Interaction |
Towards Natural Interactive Question Answering
|
Interface |
Computer-aided summarisation -- what the user really wants
ECESS Inter-Module Interface Specification for Speech Synthesis
|
Interlanguages |
“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
|
Internationalization |
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
|
Internet |
A Uniform Interface to Large-Scale Linguistic Resources
|
Internet search |
Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
|
ISO-DIS-24610-1 |
Feature-based Encoding and Querying Language Resources with Character Semantics
|
J |
|
Japanese |
Parallel Syntactic Annotation of Multiple Languages
|
K |
|
Knowledge Engineering |
Ontology Driven K-Portal Construction and K-Service Provision
|
Knowledge extraction |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
|
Knowledge Modeling |
A Framework to Integrate Ubiquitous Knowledge Modeling
|
Knowledge Portal |
Ontology Driven K-Portal Construction and K-Service Provision
|
Knowledge representation |
The LOIS Project
|
Knowledge Services Provision |
Ontology Driven K-Portal Construction and K-Service Provision
|
Knowledge sharing |
A new approach to syntactic annotation
|
Korean |
Morphological annotation of Korean with Directly Maintainable Resources
Parallel Syntactic Annotation of Multiple Languages
|
L |
|
L2S |
Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
|
Labelling |
Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
|
Language Engineering |
Ontology Driven K-Portal Construction and K-Service Provision
|
Language identification |
Reconsidering Language Identification for Written Language Resources
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
|
Language interaction |
Extending the Wizard of Oz Methodology for Multimodal Language-enabled Systems
|
Language modeling |
Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
|
Language modelling |
A Closer Look at Skip-gram Modelling
|
Language Models |
Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
|
Language resource interoperability |
Moving to dynamic computational lexicons with LeXFlow
|
Language resource |
A new approach to syntactic annotation
Development of the First LRs for Macedonian: Current Projects
Integrating Language Knowledge Resources to Extend the English Inclusion
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
LexikoNet - a lexical database based on type and role hierarchies
On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community
Syntactic Lexicon of Polish Predicative Nouns
TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
Corpus Portal for Search in Monolingual Corpora
Morphological annotation of Korean with Directly Maintainable Resources
BACO - A large database of text and co-occurrences
|
Language resources production models |
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
|
Language service |
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
|
Language technology |
Query Expansion on Compounds
|
Language typology |
Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
|
LDC |
More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
|
Learning by Emergence |
Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
|
Legal Ontologies |
A Development Tool For Multilingual Ontology-based Conceptual
|
Legitimate translation variation |
Using collocations from comparable corpora to find translation equivalents
|
Lemmatiser |
Open Source Corpus Analysis Tools for Malay
|
Lemmatization |
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
|
Lemmatizer |
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
|
Less widely digitally available languages |
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
|
Letter-to-Sound |
Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
|
Letter-to-sound rules |
Tools and resources for speech synthesis arising from a Welsh TTS project
|
Levantine |
Developing and Using a Pilot Dialectal Arabic Treebank
|
Lexica |
A Unified Structure for Dutch Dialect Dictionary Data
|
Lexical acquisition |
The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
|
Lexical anaphora |
Annotating Bridging Anaphors in Italian: in Search of Reliability
|
Lexical association measures |
Semi-automatic Building of Swedish Collocation Lexicon
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
|
Lexical database |
Building a network of topical relations from a corpus
Natural Language Processing: A Terminological and Statistical Approach
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Proper Names and Linguistic Dynamics
Using Core Ontology for Domain Lexicon Structuring
|
Lexical entailment |
Mixing WordNet, VerbNet and PropBank for studying verb relations
|
Lexical ontology |
LexikoNet - a lexical database based on type and role hierarchies
|
Lexical resource |
Merging two Ontology-based Lexical Resources
Structuring a Domain Vocabulary in a General Knowledge Environment
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
|
Lexical semantics |
The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Mixing WordNet, VerbNet and PropBank for studying verb relations
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
|
Lexical variation |
Finding representative sets of dialect words for geographical regions
|
Lexical workflow |
Moving to dynamic computational lexicons with LeXFlow
|
Lexical-semantic knowledge bases |
A methodology for the joint development of the Basque WordNet and Semcor
|
Lexicon |
A Computational Lexicon of Contemporary Hebrew
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Building a network of topical relations from a corpus
Data-driven Amharic-English Bilingual Lexicon Acquisition
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Elaborating the parameterized Equivalence Class Method for Dutch
LEXADV - a multilingual semantic Lexicon for Adverbs
Lexicon Development for Varieties of Spoken Colloquial Arabic
Morphological annotation of Korean with Directly Maintainable Resources
Natural Language Processing: A Terminological and Statistical Approach
The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Valency Lexicon of Czech Verbs: Alternation-Based Model
|
Lexicon Architecture |
Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
|
Lexicon Construction |
Word Knowledge Acquisition for Computational Lexicon Construction
|
Lexicon extension |
Automated Deep Lexical Acquisition for Robust Open Texts Processing
|
Lexicon to Ontology Mapping |
Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
|
Lexicon-grammar |
Syntactic Lexicon of Polish Predicative Nouns
|
LFG |
Improving coverage and parsing quality of a large-scale LFG for German
The importance of precise tokenizing for deep grammars
|
Linguistic analysis |
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
|
Linguistic annotations |
The English-Slovene ACQUIS corpus
|
Linguistic databases |
Tregex and Tsurgeon: tools for querying and manipulating tree data structures
|
Linguistic ontology |
Development of Linguistic Ontology on Natural Sciences and Technology
|
Linguistic resources |
Integrated Linguistic Resources for Language Exploitation Technologies
Linguistic Resources for Speech Parsing
|
Linkage discovery |
Dependency-Based Phrase Alignment
|
Linking |
Merging two Ontology-based Lexical Resources
|
Linking of lexical databases |
Exploring opportunities for Comparability and Enrichment by Linking lexical databases
|
Localization |
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
|
Location-aware computing |
The Information Commons Gazetteer
|
Lombard speech database |
Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
|
LR Infrastructures and Architectures |
Techno-langue: The French National Initiative for Human Language Technologies (HLT)
|
LR national/international projects |
Techno-langue: The French National Initiative for Human Language Technologies (HLT)
|
LTAG |
A Lexicalized Tree-Adjoining Grammar for Vietnamese
|
M |
|
Machine |
Multilingual parallel treebanking: a lean and flexible approach
|
Machine learning |
A Grapheme-Based Approach for Accent Restoration in Gikuyu
An Introduction to NLP-based Textual Anonymisation
Learning Database Content for Spoken Dialogue System Design
On Automatic Assignment of Verb Valency Frames in Czech
Recognizing Acronyms and their Definitions in Swedish Medical Texts
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
|
Machine translatability |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
|
Machine translation |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Dependency-Based Phrase Alignment
Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
Predicting MT Quality as a Function of the Source Language
CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
X-Score: Automatic Evaluation of Machine Translation Grammaticality
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign
METIS-II: Machine Translation for Low Resource Languages
|
Machine translation evaluation |
A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
|
Machine-Aided Indexing |
Bilingual Machine-Aided Indexing
|
Machine-readable lexicons |
Towards machine-readable lexicons for South African Bantu languages
|
Malay |
Open Source Corpus Analysis Tools for Malay
|
Manual annotation |
Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
|
Manual word-alignment |
Annotation Guidelines for Czech-English Word Alignment
|
Mark-up tools |
Rule-Based Chunking and Reusability
|
Markup |
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
|
MDE |
Linguistic Resources for Speech Parsing
|
Meaning |
Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
|
Meaning elicitation |
The role of lexical resources in matching classification schemas
|
Medical corpus |
Recognizing Acronyms and their Definitions in Swedish Medical Texts
|
Meeting |
Gathering a corpus of multimodal computer-mediated meetings
Annotating Emotions in Meetings
|
Meeting browsers |
TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
|
Meeting corpora |
Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
|
Meeting data |
Evaluation of multimodal components within CHIL: The evaluation packages and results
|
Memory based learning |
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
|
Merging |
Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
|
Metamodel |
Methodological Aspects of Semantic Annotation
|
Metaphor Database |
The Italian Metaphor Database
|
Methodology |
A Preliminary Study for Building the Basque PropBank
|
Minority and endangered languages |
Morphological Tools for Six Small Uralic Languages
|
Mixed-lingual texts |
Integrating Language Knowledge Resources to Extend the English Inclusion
|
Mixer |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
|
Mobile and multimodal interaction |
The pragmatic combination of different crosslingual resources
|
Monolingual corpora |
Corpus Portal for Search in Monolingual Corpora
|
Morpheme |
Morphological annotation of Korean with Directly Maintainable Resources
|
Morphological analysis |
A Deep Linguistic Analysis for Cross-language Information Retrieval
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
New Approach to Frequency Dictionaries - Czech Example
Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Morphological Tools for Six Small Uralic Languages
|
Morphological annotation |
Developing and Using a Pilot Dialectal Arabic Treebank
|
Morphological Lexicon |
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
|
Morphology |
A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Creating Tools for Morphological Analysis of Sumerian
Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Extraction tools for collocations and their morphosyntactic specificities
HAREM: An Advanced NER Evaluation Contest for Portuguese
Lexicon Development for Varieties of Spoken Colloquial Arabic
Morphological annotation of Korean with Directly Maintainable Resources
|
Morphosyntax |
Extraction tools for collocations and their morphosyntactic specificities
|
Motorbike |
Bikers Accessing the Web: The SmartWeb Motorbike Corpus
|
Movement detection |
Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
|
MPEG-7 |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
|
Multi-Document Summarization |
Impact of Question Decomposition on the Quality of Answer Summaries
|
MultiNet |
Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
|
Multidocument |
Multilingual Multidocument Summarization Tools and Evaluation
|
Multilevel database |
Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
|
Multilingual |
|
Multilingual annotation |
Parallel Syntactic Annotation of Multiple Languages
|
Multilingual Corpora |
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
|
Multilingual database |
A Methodology for Developing Multilingual Resources for Terminology
|
Multilingual information services |
The pragmatic combination of different crosslingual resources
|
Multilingual parallel corpora |
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Evaluation of multilingual text alignment systems: the ARCADE II project
|
Multilingual Question Answering |
Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
The Multilingual Question Answering Track at CLEF
|
Multilinguality |
Aligning Multilingual Thesauri
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
|
Multimedia corpus |
The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
|
Multimedia Meeting Recordings |
Gathering a corpus of multimodal computer-mediated meetings
|
Multimodal |
Extending the Wizard of Oz Methodology for Multimodal Language-enabled Systems
|
Multimodal behavior |
Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
|
Multimodal corpora |
H. C. Andersen Conversation Corpus
TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
|
Multimodal interfaces |
Evaluation of multimodal components within CHIL: The evaluation packages and results
TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
|
Multimodality |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
|
Multiparty dialogues |
Part-of-Speech Tagging of Transcribed Speech
|
Multiword expressions |
Elaborating the parameterized Equivalence Class Method for Dutch
Using collocations from comparable corpora to find translation equivalents
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
|
Mutation rules |
Automatic Detection of Orthographic Cues for Cognate Recognition
|
N |
|
Named Entity Ontology |
Constructing a Named Entity Ontology from Web Corpora
|
Named Entity Recognition |
Named Entity Extraction with Conjunction Disambiguation
What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
HAREM: An Advanced NER Evaluation Contest for Portuguese
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
|
Named entities |
A Deep Linguistic Analysis for Cross-language Information Retrieval
|
Named entity annotation |
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
|
Named-Entity Tagging |
Identifying Named Entities in Text Databases from the Natural History Domain
|
National Project |
An observatory on Spoken Italian linguistic resources and descriptive standards.
|
Natural language access to database |
From Natural Language to Databases via Ontologies
|
Natural language engineering |
Automatic Evaluation and Composition of NLP Pipelines with Web Services
|
Natural Language Interfaces to GIS Databases |
Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
|
Natural Language Processing (NLP) |
Automatic Evaluation and Composition of NLP Pipelines with Web Services
Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Development of Linguistic Ontology on Natural Sciences and Technology
Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Turning a Dependency Treebank into a PSG-style Constituent Treebank
|
Naturalistic |
Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
|
NLTK |
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
|
Non-machine-translatable parts |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
|
Non-native speech |
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
SINOD - Slovenian non-native speech database
“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
|
Non-words |
Corpus-Induced Corpus Clean-up
|
Noncompositionality |
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
|
Normalized edit distance |
Automatic Detection of Orthographic Cues for Cognate Recognition
|
Norwegian |
FonDat1: A Speech Synthesis Corpus for Norwegian
|
NP coreference |
NPs for Events: Experiments in Coreference Annotation
|
Nursing Terminologies |
Features of Terms in Actual Nursing Activities
|
O |
|
Ontology |
A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
Feature-based Encoding and Querying Language Resources with Character Semantics
From Natural Language to Databases via Ontologies
LEXADV - a multilingual semantic Lexicon for Adverbs
Ontology-based Information Extraction with SOBA
Structuring a Domain Vocabulary in a General Knowledge Environment
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Aligning Multilingual Thesauri
The LOIS Project
|
Ontology acquisition |
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
|
Ontology Development from Text |
Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
|
Ontology Engineering |
Ontology Driven K-Portal Construction and K-Service Provision
|
Ontology learning |
Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
|
Ontology mapping |
Merging two Ontology-based Lexical Resources
|
Ontology Modelling |
Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
|
Ontology modelling for trans/cross-lingual dialogues |
The pragmatic combination of different crosslingual resources
|
Open Source |
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
|
Open-domain Q&A evaluation |
Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
|
Open-domain free text translation |
The pragmatic combination of different crosslingual resources
|
Optimality Theory |
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
|
Organizational/policy issues |
Techno-langue: The French National Initiative for Human Language Technologies (HLT)
|
Oriental |
Oriental COCOSDA: Past, Present and Future
|
Orthographic cues |
Automatic Detection of Orthographic Cues for Cognate Recognition
|
Orthography |
FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
|
P |
|
Pan-Chinese lexical resource |
Toward a Pan-Chinese Thesaurus
|
Parallel corpora |
Annotating COMPARA, a Grammar-aware Parallel Corpus
Building a Swedish-Turkish Parallel Corpus
Construction of a FrameNet Labeler for Swedish Text
Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
Multilingual parallel treebanking: a lean and flexible approach
The English-Slovene ACQUIS corpus
Transferring Coreference Chains through Word Alignment
|
Paraphrase |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
|
Parse scoring |
SParseval: Evaluation Metrics for Parsing Speech
|
Parser evaluation |
Improving coverage and parsing quality of a large-scale LFG for German
|
Parsers |
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
|
Parsing |
A Lexicalized Tree-Adjoining Grammar for Vietnamese
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Improving coverage and parsing quality of a large-scale LFG for German
MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Multilingual parallel treebanking: a lean and flexible approach
The importance of precise tokenizing for deep grammars
Developing and Using a Pilot Dialectal Arabic Treebank
|
Parsing Evaluation |
Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.
|
Part-of-speech tagging |
A Deep Linguistic Analysis for Cross-language Information Retrieval
|
Pattern generation |
A Corpus Search System Utilizing Lexical Dependency Structure
|
Pattern search |
UAM Text Tools - a flexible NLP architecture
|
Pattern transformation |
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
|
PDT |
Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
|
Phonetic distance |
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
|
Phonetic lexicons |
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
|
Phonetic transcriptions |
Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
|
Phonetics |
Feature-based Encoding and Querying Language Resources with Character Semantics
|
Phonology |
FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
|
Phrase alignment |
Dependency-Based Phrase Alignment
|
Phrase-based statistical machine translation |
Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries
|
Phrase-book translation |
The pragmatic combination of different crosslingual resources
|
Platforms |
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
|
PMI |
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
|
POS tagging |
An Annotated Corpus Management Tool: ChaKi
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
|
Polish |
General and Task-Specific Corpus Resources for Polish Adult Learners of English
|
Populated places |
The Information Commons Gazetteer
|
Portuguese |
Building a historical corpus for Classical Portuguese: some technological aspects
|
Portuguese African varieties |
The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
|
Prague-English Dependency Treebank |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Predicate – argument structure |
Adding multi-layer semantics to the Greek Dependency Treebank
|
Predicate-argument |
Building a resource for studying translation shifts
|
Predicate-argument structure |
Shalmaneser - A Toolchain For Shallow Semantic Parsing
|
Predicative nouns |
Syntactic Lexicon of Polish Predicative Nouns
|
Prefix |
Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.
|
Prepositions |
The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
|
Preprocessing |
Preprocessing and Tokenisation Standards in DELPH-IN Tools
|
Problem-solving language |
The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
|
Profile Creation |
Language Resources for Background Gathering
|
Projectivity |
Corpus Annotation as a Test of a Linguistic Theory
|
Pronoun Resolution |
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
|
Pronunciation lexicon |
SI-PRON: A Pronunciation Lexicon for Slovenian
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
|
Pronunciation of names |
“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
|
Pronunciation variation |
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
|
Proofing tools |
A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Spelling Error Patterns in Spanish for Word Processing Applications
|
PropBank |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Property Grammars |
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
|
Prosodic transplantation |
A joint prosody evaluation of French text-to-speech synthesis systems
|
Prosody |
A joint prosody evaluation of French text-to-speech synthesis systems
|
Prosody analysis |
Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
|
Protein names |
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
|
Pseudonymisation |
An Introduction to NLP-based Textual Anonymisation
|
Python |
PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
|
Q |
|
QA |
Question Answering Evaluation Survey
|
Quality model |
ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics
The Evolution of an Evaluation Framework for a Text Mining System
|
Query expansion |
Query Expansion on Compounds
Using collocations from comparable corpora to find translation equivalents
|
Query-interpretation |
Summarizing Answers for Complicated Questions
|
Question |
Language Resources for Background Gathering
|
Question Answering |
An Answer Bank for Temporal Inference
Data for question answering: The case of why
Impact of Question Decomposition on the Quality of Answer Summaries
Question Answering Evaluation Survey
Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons
Towards Natural Interactive Question Answering
Exploiting Multiple Semantic Resources for Answer Selection
EQueR: the French Evaluation campaign of Question-Answering Systems
The Multilingual Question Answering Track at CLEF
Summarizing Answers for Complicated Questions
Component Evaluation in a Question Answering System
|
Question Decomposition |
Impact of Question Decomposition on the Quality of Answer Summaries
|
R |
|
Random Indexing |
Towards Holistic Summarization -- Selecting Summaries, Not Sentences
|
Reader Model |
Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
|
Recording methodology |
Low-cost Customized Speech Corpus Creation for Speech Technology Applications
|
Regional bias |
Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
|
Regular changes in valency structure |
Valency Lexicon of Czech Verbs: Alternation-Based Model
|
Relation Annotation |
Shallow Semantic Annotation of Bulgarian
|
Relational database |
Lexicon Development for Varieties of Spoken Colloquial Arabic
|
Relative ordering |
ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics
|
Resource building |
The LOIS Project
|
Resource merging |
Exploiting Multiple Semantic Resources for Answer Selection
|
Resource-scarce languages |
A Grapheme-Based Approach for Accent Restoration in Gikuyu
|
Resource |
The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
|
Results |
EQueR: the French Evaluation campaign of Question-Answering Systems
The Multilingual Question Answering Track at CLEF
|
Reusability |
Rule-Based Chunking and Reusability
|
Rhythm |
A BLARK extension for temporal annotation mining
|
Roadmap |
Reconsidering Language Identification for Written Language Resources
|
Robust parsing |
Automated Deep Lexical Acquisition for Robust Open Texts Processing
|
Role labelling |
A Preliminary Study for Building the Basque PropBank
|
Romanian syllables |
On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
|
Romanized Pinyin |
Towards Unified Chinese Segmentation Algorithm
|
Rule patterns |
Act-Topic Patterns for Automatically Checking Dialogue Models
|
Rule-based |
Rule-Based Chunking and Reusability
|
S |
|
Scandinavian |
LEXADV - a multilingual semantic Lexicon for Adverbs
|
Search |
Query Expansion on Compounds
|
Search Engines |
Towards Unified Chinese Segmentation Algorithm
|
Second language speech corpus |
General and Task-Specific Corpus Resources for Polish Adult Learners of English
|
Securing interpretability |
Feature-based Encoding and Querying Language Resources with Character Semantics
|
Selectional restrictions |
Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
LexikoNet - a lexical database based on type and role hierarchies
|
Semantic |
A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
A Preliminary Study for Building the Basque PropBank
Shalmaneser - A Toolchain For Shallow Semantic Parsing
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
|
Semantic Analysis |
Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
|
Semantic Annotation |
SALTO - A Versatile Multi-Level Annotation Tool
Shallow Semantic Annotation of Bulgarian
Discriminant-Based MRS Banking
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
I-CAB: the Italian Content Annotation Bank
Methodological Aspects of Semantic Annotation
|
Semantic classification |
Semantic Descriptors: The Case of Reflexive Verbs
|
Semantic information |
Merging two Ontology-based Lexical Resources
|
Semantic lexicons |
Building Slovene WordNet
|
Semantic matching |
The role of lexical resources in matching classification schemas
|
Semantic network |
Structuring a Domain Vocabulary in a General Knowledge Environment
|
Semantic Parsing |
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Shalmaneser - A Toolchain For Shallow Semantic Parsing
|
Semantic pattern |
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
|
Semantic relations |
Structuring a Domain Vocabulary in a General Knowledge Environment
|
Semantic relations detection |
SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
|
Semantic representation |
Parallel Syntactic Annotation of Multiple Languages
Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
|
Semantic resources |
Exploiting Multiple Semantic Resources for Answer Selection
|
Semantic role labeling |
Adding multi-layer semantics to the Greek Dependency Treebank
Construction of a FrameNet Labeler for Swedish Text
|
Semantic roles |
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
|
Semantic Similarity |
Towards Holistic Summarization -- Selecting Summaries, Not Sentences
|
Semantic tagging |
Semantic Tag Extraction from WordNet Glosses
|
Semantic verb classes |
Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
|
Semantic Web |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Automatic Evaluation and Composition of NLP Pipelines with Web Services
NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
The role of lexical resources in matching classification schemas
|
Semantically related terms |
A Methodology for Developing Multilingual Resources for Terminology
|
Semantics |
A methodology for the joint development of the Basque WordNet and Semcor
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Building a network of topical relations from a corpus
HAREM: An Advanced NER Evaluation Contest for Portuguese
Linguistic and Biological Annotations of Biological Interaction Events
The Look and Feel of a Confident Entailer
|
Semi-automatic annotation |
KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
|
Sense clustering |
Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
|
Sense mapping |
Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
|
Sensitivity |
An Introduction to NLP-based Textual Anonymisation
|
Sentence alignment |
Acquis Communautaire Sentence Alignment using Support Vector Machines
|
Sentence tokeniser |
Open Source Corpus Analysis Tools for Malay
|
Sentiment analysis |
Semantic Tag Extraction from WordNet Glosses
|
Serbian WordNet |
Building Slovene WordNet
|
Shallow analysis |
Query Expansion on Compounds
|
Shallow Parsing |
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
The wraetlic NLP suite
|
Similar Event Search |
Language Resources for Background Gathering
|
Similarity |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
|
SIMPLE |
LEXADV - a multilingual semantic Lexicon for Adverbs
|
Situational Prompting |
Bikers Accessing the Web: The SmartWeb Motorbike Corpus
SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
|
Skip-gram |
A Closer Look at Skip-gram Modelling
|
SKOS |
A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
|
Slovene WordNet |
Building Slovene WordNet
|
Slovenian |
SI-PRON: A Pronunciation Lexicon for Slovenian
SINOD - Slovenian non-native speech database
The English-Slovene ACQUIS corpus
Towards a Slovene Dependency Treebank
|
SMILES strings |
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
|
Software |
GAIA: Common Framework for the Development of Speech Translation Technologies
UAM Text Tools - a flexible NLP architecture
|
Somali |
Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
|
Source code |
GAIA: Common Framework for the Development of Speech Translation Technologies
|
Source quality |
Predicting MT Quality as a Function of the Source Language
|
Spanish |
Parallel Syntactic Annotation of Multiple Languages
Spanish Synthesis Corpora
|
Spatial ontology |
Principles for annotating and reasoning with spatial information
|
Spatial semantics |
Principles for annotating and reasoning with spatial information
|
Speaker recognition |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
|
Special Terms Dictionaries |
Features of Terms in Actual Nursing Activities
|
Specifications |
TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
ECESS Inter-Module Interface Specification for Speech Synthesis
|
Speech |
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
Gathering a corpus of multimodal computer-mediated meetings
Oriental COCOSDA: Past, Present and Future
|
Speech and pointing gesture |
H. C. Andersen Conversation Corpus
|
Speech Corpus |
Bikers Accessing the Web: The SmartWeb Motorbike Corpus
SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Bilingual speech corpus in two phonetically similar languages
FonDat1: A Speech Synthesis Corpus for Norwegian
|
Speech corpus design |
Low-cost Customized Speech Corpus Creation for Speech Technology Applications
|
Speech database |
Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
SINOD - Slovenian non-native speech database
Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
|
Speech feedback |
Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
|
Speech parsing |
Linguistic Resources for Speech Parsing
|
Speech recognition |
Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
GAIA: Common Framework for the Development of Speech Translation Technologies
Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
|
Speech synthesis |
Creation and analysis of a Polish speech database for use in unit selection synthesis
Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
ECESS Inter-Module Interface Specification for Speech Synthesis
FonDat1: A Speech Synthesis Corpus for Norwegian
GAIA: Common Framework for the Development of Speech Translation Technologies
TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis
Tools and resources for speech synthesis arising from a Welsh TTS project
|
Speech timing |
A BLARK extension for temporal annotation mining
|
Speech translation |
GAIA: Common Framework for the Development of Speech Translation Technologies
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
|
Speech-to-Speech Translation (SST) |
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign
|
Spell checking |
A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Spelling Error Patterns in Spanish for Word Processing Applications
|
Spelling error detection |
A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Spelling Error Patterns in Spanish for Word Processing Applications
|
Spelling error patterns |
A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Spelling Error Patterns in Spanish for Word Processing Applications
|
Spoken Corpora |
Features of Terms in Actual Nursing Activities
|
Spoken dialogue systems |
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
A Framework to Integrate Ubiquitous Knowledge Modeling
|
Spoken Dutch Corpus |
Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
|
Spoken Italian corpus |
Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
|
Spoken Italian Linguistic Resources |
An observatory on Spoken Italian linguistic resources and descriptive standards.
|
Spoken Language Translation (SLT) |
Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR: Results from the first evaluation campaign
|
Spoken language |
Annotating Information Structure in a Corpus of Spoken Danish
|
Spoken language corpora |
“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
|
Spoken metadata |
Linguistic Resources for Speech Parsing
|
Spontaneous conversation |
Discourse functions of duration in Mandarin: resource design and implementation
|
Spontaneous speech corpus |
Dependency-structure Annotation to Corpus of Spontaneous Japanese
|
SSML |
ECESS Inter-Module Interface Specification for Speech Synthesis
|
Standard based archive |
Building a historical corpus for Classical Portuguese: some technological aspects
|
Standardisation |
A Unified Structure for Dutch Dialect Dictionary Data
|
Standardization |
A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
ECESS Inter-Module Interface Specification for Speech Synthesis
Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
|
Standards for lexicons |
Exploring opportunities for Comparability and Enrichment by Linking lexical databases
|
Standoff annotation |
Preprocessing and Tokenisation Standards in DELPH-IN Tools
|
Statistical Machine Translation |
Creating a Large-Scale Arabic to French Statistical Machine Translation System
MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Czech-English Word Alignment
|
Statistical language modeling |
Linguistic features modeling based on Partial New Cache
|
Statistical method Languages: Amharic |
Data-driven Amharic-English Bilingual Lexicon Acquisition
|
Statistical modeling |
Evaluation of Stop Word Lists in Chinese Language
|
Stochastic parsing |
Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
|
Stop word list |
Evaluation of Stop Word Lists in Chinese Language
|
Structural event detection |
Linguistic Resources for Speech Parsing
|
Stylistic alignment |
Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
|
Subcategorization |
A Large Subcategorization Lexicon for Natural Language Processing Applications
Automatic extraction of subcategorization frames for French
|
Sumerian |
Creating Tools for Morphological Analysis of Sumerian
|
Summarization |
Summarizing Answers for Complicated Questions
|
SUMO |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
|
Support vector machines |
Acquis Communautaire Sentence Alignment using Support Vector Machines
|
Support verbs |
Syntactic Lexicon of Polish Predicative Nouns
|
Surface patterns |
Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
|
Surface syntactic analysis |
Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
|
Surveillance application |
Fear-type emotions of the SAFE Corpus: annotation issues
|
Syllabus Project |
A Development Tool For Multilingual Ontology-based Conceptual
|
Synchronous corpus |
Toward a Pan-Chinese Thesaurus
|
Syntacs |
Linguistic and Biological Annotations of Biological Interaction Events
|
Syntactic analysis |
A Deep Linguistic Analysis for Cross-language Information Retrieval
Data for question answering: The case of why
|
Syntactic annotation |
Developing and Using a Pilot Dialectal Arabic Treebank
Parallel Syntactic Annotation of Multiple Languages
|
Syntactic lexicon |
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
|
Syntactic parsing |
Towards a Slovene Dependency Treebank
|
Syntactic structures |
A new approach to syntactic annotation
|
Syntax |
Multilingual parallel treebanking: a lean and flexible approach
The Look and Feel of a Confident Entailer
|
Synthesis |
Spanish Synthesis Corpora
|
System evaluation |
The Impact of Evaluation on Multilingual Information Retrieval System Development
|
System extension |
Integrating Language Knowledge Resources to Extend the English Inclusion
|
Systems |
EQueR: the French Evaluation campaign of Question-Answering Systems
|
T |
|
Tagging |
Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Morphological annotation of Korean with Directly Maintainable Resources
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
|
Tagging errors detection and correction |
Tagset Mapping and Statistical Training Data Cleaning-up
|
Tagsets mapping |
Tagset Mapping and Statistical Training Data Cleaning-up
|
TC-STAR |
ECESS Inter-Module Interface Specification for Speech Synthesis
|
Teaching |
The MULINCO corpus and corpus platform
|
Technical Term |
Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
|
Temporal Expression Identification and Normalisation |
If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”
|
Temporal Inference |
An Answer Bank for Temporal Inference
|
Temporal information |
An Annotated Corpus of Typical Durations of Events
|
Term associations |
Towards a terminological resource for biomedical text mining
|
Term extraction |
Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
|
Term identification |
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
|
Term variation |
Clustering acronyms in biomedical text for disambiguation
|
Terminological databases |
Towards a terminological resource for biomedical text mining
|
Terminology |
A Methodology for Developing Multilingual Resources for Terminology
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Using Core Ontology for Domain Lexicon Structuring
|
Terminology acquisition |
Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
|
Terminology Extraction |
Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
|
Terminology recognition |
Towards a terminological resource for biomedical text mining
|
Test suite |
A Lexicalized Tree-Adjoining Grammar for Vietnamese
|
Text |
Gathering a corpus of multimodal computer-mediated meetings
|
Text alignment |
Evaluation of multilingual text alignment systems: the ARCADE II project
|
Text Analysis |
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
|
Text Classification |
Bilingual Machine-Aided Indexing
|
Text Database |
BACO - A large database of text and co-occurrences
|
Text mining |
A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Natural Language Processing: A Terminological and Statistical Approach
The Evolution of an Evaluation Framework for a Text Mining System
|
Text Prediction |
Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
|
Text-to-speech synthesis |
A joint prosody evaluation of French text-to-speech synthesis systems
|
Textual coreference |
Corpus Annotation as a Test of a Linguistic Theory
|
Textual entailment |
Mixing WordNet, VerbNet and PropBank for studying verb relations
The Look and Feel of a Confident Entailer
Building a Large-Scale Repository of Textual Entailment Rules
|
Thai Search Engine |
Blind Evaluation for Thai Search Engines
|
The Chinese Gigaword Corpus |
Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
|
Thesauri |
Aligning Multilingual Thesauri
Development of Linguistic Ontology on Natural Sciences and Technology
|
Thesaurus browsing |
A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
|
Third language acquisition |
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
|
Time |
An Annotated Corpus of Typical Durations of Events
|
Tokenisation |
Preprocessing and Tokenisation Standards in DELPH-IN Tools
Tools and resources for speech synthesis arising from a Welsh TTS project
|
Tokenization |
The importance of precise tokenizing for deep grammars
|
Tool |
Act-Topic Patterns for Automatically Checking Dialogue Models
Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
|
Tool design |
An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
|
Toolkits |
The wraetlic NLP suite
|
Tools |
Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
SALTO - A Versatile Multi-Level Annotation Tool
The MULINCO corpus and corpus platform
|
Tools for annotation |
Structure, Annotation and Tools in the Basque ZT Corpus
|
Topic classification |
Next Generation Language Resources using Grid
|
Training corpora |
Tagset Mapping and Statistical Training Data Cleaning-up
|
Training in an unsupervised way |
Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
|
Transcription |
A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit
Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Integrated Linguistic Resources for Language Exploitation Technologies
|
Transcription evaluation |
Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
|
Transducer |
Morphological annotation of Korean with Directly Maintainable Resources
|
Translation |
Integrated Linguistic Resources for Language Exploitation Technologies
|
Translation aid |
Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
|
Translation equivalents |
Using collocations from comparable corpora to find translation equivalents
|
Translation shifts |
Building a resource for studying translation shifts
|
Translation studies |
Building a resource for studying translation shifts
The MULINCO corpus and corpus platform
|
Translation training |
A Uniform Interface to Large-Scale Linguistic Resources
|
Tree Edit Distance |
Building a Large-Scale Repository of Textual Entailment Rules
|
Tree query |
Tregex and Tsurgeon: tools for querying and manipulating tree data structures
|
Tree search |
Tregex and Tsurgeon: tools for querying and manipulating tree data structures
|
Tree transformation |
Tregex and Tsurgeon: tools for querying and manipulating tree data structures
|
Treebank |
Linguistic Resources for Speech Parsing
Shallow Semantic Annotation of Bulgarian
|
Treebanking |
Discriminant-Based MRS Banking
Multilingual parallel treebanking: a lean and flexible approach
|
Treebanks |
Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Towards a Slovene Dependency Treebank
Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Turning a Dependency Treebank into a PSG-style Constituent Treebank
|
Typology of Multiword Expressions |
COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
|
Typos |
Corpus-Induced Corpus Clean-up
|
U |
|
UBIQUITOUS DOMAIN MODEL |
A Framework to Integrate Ubiquitous Knowledge Modeling
|
UMTS |
Bikers Accessing the Web: The SmartWeb Motorbike Corpus
SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
|
Under-resourced languages |
Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
|
Underspecification |
Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
|
Unification grammar |
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
|
Unified Lexicon |
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
|
Unified Morphosyntactic Specifications |
Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
|
Unit selection |
Creation and analysis of a Polish speech database for use in unit selection synthesis
Tools and resources for speech synthesis arising from a Welsh TTS project
|
Universal databases |
The Information Commons Gazetteer
|
Unknown Word Interpretation |
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
|
Unknown words |
Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.
|
Unsupervised |
Corpus-Induced Corpus Clean-up
|
Unsupervised classifier |
Integrating Language Knowledge Resources to Extend the English Inclusion
|
Uralic languages |
Morphological Tools for Six Small Uralic Languages
|
User centred evaluation |
The Evolution of an Evaluation Framework for a Text Mining System
|
User-feedback |
Computer-aided summarisation -- what the user really wants
|
V |
|
Vagueness |
Annotating COMPARA, a Grammar-aware Parallel Corpus
|
Valence dictionary |
Romanian Valence Dictionary in XML Format
|
Valency |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Valency frames |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Validation of language resources |
Czech-English Word Alignment
|
Vallex |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Verb |
On Automatic Assignment of Verb Valency Frames in Czech
|
Verb classes |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
|
VerbNet |
Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
|
Verbal lexicon |
Automatic extraction of subcategorization frames for French
|
Verbal predicates |
Semantic Descriptors: The Case of Reflexive Verbs
|
Verbo-nominal collocations |
Syntactic Lexicon of Polish Predicative Nouns
|
Verbs |
From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
|
Vietnamese |
A Lexicalized Tree-Adjoining Grammar for Vietnamese
|
W |
|
Web Data |
Towards Unified Chinese Segmentation Algorithm
|
Web Mining |
Constructing a Named Entity Ontology from Web Corpora
Towards Unified Chinese Segmentation Algorithm
|
Web Services |
Automatic Evaluation and Composition of NLP Pipelines with Web Services
|
Web based language resource access |
Building a historical corpus for Classical Portuguese: some technological aspects
|
Web-based tool |
Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
|
Welsh |
Tools and resources for speech synthesis arising from a Welsh TTS project
|
Why-questions |
Data for question answering: The case of why
|
Wizard of Oz experiments |
Extending the Wizard of Oz Methodology for Multimodal Language-enabled Systems
|
Word alignment |
Non-probabilistic alignment of rare German and English nominal expressions
Dependency-Based Phrase Alignment
Transferring Coreference Chains through Word Alignment
Annotation Guidelines for Czech-English Word Alignment
|
Word sense disambiguation |
Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
|
WordNet |
Building Slovene WordNet
The Italian Metaphor Database
Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Semantic Tag Extraction from WordNet Glosses
The role of lexical resources in matching classification schemas
|
Words |
Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
|
Written Dutch |
KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
|
Written language resource |
User requirements analysis for the design of a reference corpus of written Dutch
|
X |
|
XML |
A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
ECESS Inter-Module Interface Specification for Speech Synthesis
Preprocessing and Tokenisation Standards in DELPH-IN Tools
Romanian Valence Dictionary in XML Format
Rule-Based Chunking and Reusability
|