|
TOPICS: Browse articles of the conference sorted by topic
A - C - D - E - G - H - I - K - L - M - N - O - P - Q - S - T - U - V - W
C |
Cognitive methods |
Exploring the Relationship between Semantic Spaces and Semantic Relations
LIPS: A Tool for Predicting the Lexical Isolation Point of a Word
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals
Cognitive Evaluation Approach for a Controlled Language Post-Editing Experiment
Evaluating Semantic Relations and Distances in the Associative Concept Dictionary using NIRS-imaging
Types of Nods. The Polysemy of a Social Signal
A Speech Corpus for Modeling Language Acquisition: CAREGIVER
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese
|
Controlled languages |
A Use Case for Controlled Languages as Interfaces to Semantic Web Applications
Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective
Assessing the Impact of English Language Skills and Education Level on PubMed Searches by Dutch-speaking Users
Semi-Automated Extension of a Specialized Medical Lexicon for French
Cognitive Evaluation Approach for a Controlled Language Post-Editing Experiment
Identification of the Question Focus: Combining Syntactic Analysis and Ontology-based Lookup through the User Interaction
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains
|
Corpus (creation, annotation, etc.) |
An Annotation Scheme and Gold Standard for Dutch-English Word Alignment
Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion
Model Summaries for Location-related Images
A Persian Part-Of-Speech Tagger Based on Morphological Analysis
Challenges in Building a Multilingual Alpine Heritage Corpus
Annotating Attribution Relations: Towards an Italian Discourse Treebank
The English-Swedish-Turkish Parallel Treebank
The Alborada-I3A Corpus of Disordered Speech
A Large List of Confusion Sets for Spellchecking Assessed Against a Corpus of Real-word Errors
Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues
Annotation Process Management Revisited
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction
Ad-hoc Evaluations Along the Lifecycle of Industrial Spoken Dialogue Systems: Heading to Harmonisation?
Construction of Text Summarization Corpus for the Credibility of Information on the Web
The JOS Linguistically Tagged Corpus of Slovene
Corpus-based Semantics of Concession: Where do Expectations Come from?
Multimodal Russian Corpus (MURCO): First Steps
Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment
Human Judgements on Causation in French Texts
Recent Developments in the National Corpus of Polish
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank
The Grande Grammaire du Français Project
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
A Typology of Near-Identity Relations for Coreference (NIDENT)
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch
TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text
A Morphologically-Analyzed CHILDES Corpus of Hebrew
Improvements in Parsing the Index Thomisticus Treebank. Revision, Combination and a Feature Model for Medieval Latin
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction
An Associative Concept Dictionary for Verbs and its Application to Elliptical Word Estimation
Resource Creation for Training and Testing of Transliteration Systems for Indian Languages
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus
Annotation of Discourse Relations for Conversational Spoken Dialogs
A Fact-aligned Corpus of Numerical Expressions
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering
Alignment-based Profiling of Europarl Data in an English-Swedish Parallel Corpus
Comparing the Influence of Different Treebank Annotations on Dependency Parsing
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
An Annotated Dataset for Extracting Definitions and Hypernyms from the Web
Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus
Towards a Balanced Named Entity Corpus for Dutch
A Resource and Tool for Super-sense Tagging of Italian Texts
Building the Basque PropBank
Automatic Detection of Syllable Boundaries in Spontaneous Speech
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
Acquisition and Annotation of Slovenian Lombard Speech Database
Paragraph Acquisition and Selection for List Question Using Amazon's Mechanical Turk
Construction of a Chinese Opinion Treebank
Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH)
Active Learning and Crowd-Sourcing for Machine Translation
How Certain are Clinical Assessments? Annotating Swedish Clinical Text for (Un)certainties, Speculations and Negations
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources
The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms
ADN-Classifier: Automatically Assigning Denotation Types to Nominalizations
NameDat: A Database of English Proper Names Spoken by Native Norwegians
The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish
The Design of Syntactic Annotation Levels in the National Corpus of Polish
A Database of Age and Gender Annotated Telephone Speech
DutchParl. The Parliamentary Documents in Dutch
Computer Assisted Semantic Annotation in the DutchSemCor Project
WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure
The Nijmegen Corpus of Casual Spanish
A Fully Annotated Corpus of Russian Speech
FAU IISAH Corpus - A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones
Morphological Annotation of Quranic Arabic
Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank
The Web Library of Babel: evaluating genre collections
The AVLaughterCycle Database
Towards a Large Parallel Corpus of Cleft Constructions
Evaluating Repetitions, or how to Improve your Multilingual ASR System by doing Nothing
Example-Based Automatic Phonetic Transcription
MACAQ: A Multi Annotated Corpus to Study how we Adapt Answers to Various Questions
Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns
Meta-Knowledge Annotation of Bio-Events
Enhancing Language Resources with Maps
Test Suite Design for Biomedical Ontology Concept Recognition Systems
Building a Textual Entailment Suite for the Evaluation of Automatic Content Scoring Technologies
Spontal-N: A Corpus of Interactional Spoken Norwegian
Bulgarian National Corpus Project
The Development of a Morphosyntactic Tagset for Afrikaans and its Use with Statistical Tagging
Emotion Cause Events: Corpus Construction and Analysis
The DAD Parallel Corpora and their Uses
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities
Descriptive Analysis of Negation Cues in Biomedical Texts
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation
The SignSpeak Project - Bridging the Gap Between Signers and Speakers
Automatic Term Recognition Based on the Statistical Differences of Relative Frequencies in Different Corpora
Corpus and Evaluation Measures for Automatic Plagiarism Detection
Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain
The Creagest Project: a Digitized and Annotated Corpus for French Sign Language (LSF) and Natural Gestural Languages
Design and Data Collection for the Accentological Corpus of the Russian Language
An Evolving eScience Environment for Research Data in Linguistics
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development
Resources for Speech Synthesis of Viennese Varieties
The Database of Catalan Adjectives
Querying Diverse Treebanks in a Uniform Way
Arabic Part of Speech Tagging
Twitter as a Corpus for Sentiment Analysis and Opinion Mining
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank
Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme
Statistical French Dependency Parsing: Treebank Conversion and First Results
Formal Description of Resources for Ontology-based Semantic Annotation
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems
Annotation of Morphological Meanings of Verbs Revisited
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content
Building a System for Emotions Detection from Speech to Control an Affective Avatar
The GeneReg Corpus for Gene Expression Regulation Events - An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability
Word-based Partial Annotation for Efficient Corpus Construction
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research
The Second Evaluation Campaign of PASSAGE on Parsing of French
Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia
Wizard of Oz Experiments for a Companion Dialogue System: Eliciting Companionable Conversation
A Database for the Exploration of Spanish Planning
Work on Spoken (Multimodal) Language Corpora in South Africa
Using NLP Methods for the Analysis of Rituals
Evaluating Complex Semantic Artifacts
Fred's Reusable Evaluation Device: Providing Support for Quick and Reliable Linguistic Annotation
The Creation of a Large-Scale LFG-Based Gold Parsebank
Tag Dictionaries Accelerate Manual Annotation
BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora
Hungarian Dependency Treebank
POS Multi-tagging Based on Combined Models
Heterogeneous Sensor Database in Support of Human Behaviour Analysis in Unrestricted Environments: The Audio Part
A Game-based Approach to Transcribing Images of Text
The RODRIGO Database
Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference
The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic
An Integrated Digital Tool for Accessing Language Resources
CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines
Annotation Tool for Extended Textual Coreference and Bridging Anaphora
A Resource for Investigating the Impact of Anaphora and Coreference on Inference
Mapping between Dependency Structures and Compositional Semantic Representations
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure
A Dataset for Assessing Machine Translation Evaluation Metrics
Quality Indicators of LSP Texts - Selection and Measurements Measuring the Terminological Usefulness of Documents for an LSP Corpus
Computational Linguistics for Mere Mortals - Powerful but Easy-to-use Linguistic Processing for Scientists in the Humanities
Heuristic Word Alignment with Parallel Phrases
The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception of Emotion
Fine-Grained Geographical Relation Extraction from Wikipedia
A Speech Corpus for Dyslexic Reading Training
Fine-grained Linguistic Evaluation of Question Answering Systems
A Corpus for Studying Full Answer Justification
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers
NP Alignment in Bilingual Corpora
The GIVE-2 Corpus of Giving Instructions in Virtual Environments
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition
Annotation and Representation of a Diachronic Corpus of Spanish
Saturnalia: A Latin-Catalan Parallel Corpus for Statistical MT
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation
Analysing Temporally Annotated Corpora with CAVaT
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus
ISO-TimeML: An International Standard for Semantic Annotation
Generating FrameNets of Various Granularities: The FrameNet Transformer
Sign Language Corpora for Analysis, Processing and Evaluation
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
Towards an ISO Standard for Dialogue Act Annotation
Empty Categories in a Hindi Treebank
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building
Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank
Building a Bilingual ValLex Using Treebank Token Alignment: First Observations
A Language Approach to Modeling Human Behaviors
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation
Learning Recursive Segments for Discourse Parsing
Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora
A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration
Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance
PASSAGE Syntactic Representation: a Minimal Common Ground for Evaluation
Towards Optimal TTS Corpora
Metaphor Corpus Annotated for Source - Target Domain Mappings
Efficiently Extract Recurring Tree Fragments from Large Treebanks
Vergina: A Modern Greek Speech Database for Speech Synthesis
A Japanese Particle Corpus Built by Example-Based Annotation
Idioms in Context: The IDIX Corpus
The PlayMancer Database: A Multimodal Affect Database in Support of Research and Development Activities in Serious Game Environment
C-3: Coherence and Coreference Corpus
A Pilot Arabic CCGbank
Porting an Ancient Greek and Latin Treebank
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
Corpora for the Conceptualisation and Zoning of Scientific Papers
FastKwic, an Intelligent Concordancer Using FASTR
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
Annotation Time Stamps - Temporal Metadata from the Linguistic Annotation Process
Annotating the Enron Email Corpus with Number Senses
Active Learning for Building a Corpus of Questions for Parsing
Speech Data Corpus for Verbal Intelligence Estimation
A Very Large Scale Mandarin Chinese Broadcast Corpus for GALE Project
Building a Generative Lexicon for Romanian
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese
Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions
Determining Reliability of Subjective and Multi-label Emotion Annotation through Novel Fuzzy Agreement Measure
Enriching Word Alignment with Linguistic Tags
The Indiana "Cooperative Remote Search Task" (CReST) Corpus
Principled Construction of Elicited Imitation Tests
A High Recall Error Identification Tool for Hindi Treebank Validation
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems
Towards Building Annotated Resources for Analyzing Opinions and Argumentation in News Editorials
Eslo: From Transcription to Speakers' Personal Information Annotation
The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment
Annotating Event Mentions in Text with Modality, Focus, and Source Information
English - Oromo Machine Translation: An Experiment Using a Statistical Approach
A Semi-supervised Type-based Classification of Adjectives: Distinguishing Properties and Relations
MultiUN: A Multilingual Corpus from United Nation Documents
FrAG, a Hybrid Constraint Grammar Parser for French
Multilingual Corpus Development for Opinion Mining
A Tool for Feature-Structure Stand-Off-Annotation on Transcriptions of Spoken Discourse
Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation
Capturing Coercions in Texts: a First Annotation Exercise
Exploring Co-Reference Chains for Concept Annotation of Domain Texts
The Ariadne System: A Flexible and Extensible Framework for the Modeling and Storage of Experimental Data in the Humanities
Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level
Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone
Constructing an Annotated Story Corpus: Some Observations and Issues
Towards a Learning Approach for Abbreviation Detection and Resolution
When CORDIAL Becomes Friendly: Endowing the CORDIAL Corpus with a Syntactic Annotation Layer
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank
The Brandeis Annotation Tool
ANC2Go: A Web Application for Customized Corpus Creation
Collection of Usage Information for Language Resources from Academic Articles
ProPOSEC: A Prosody and PoS Annotated Spoken English Corpus
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora
Data Issues in English-to-Hindi Machine Translation
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents
Typical Cases of Annotators' Disagreement in Discourse Annotations in Prague Dependency Treebank
A Recursive Annotation Scheme for Referential Information Status
A Large Coverage Verb Taxonomy for Arabic
The Kachna L1/L2 Picture Replication Corpus
Bank of Russian Constructions and Valencies
News Image Annotation on a Large Parallel Text-image Corpus
Annotating Event Anaphora: A Case Study
The POETICON Corpus: Capturing Language Use and Sensorimotor Experience in Everyday Interaction
The OTIM Formal Annotation Model: A Preliminary Step before Annotation Scheme
Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm
From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers
Greybeard Longitudinal Speech Study
A Corpus Factory for Many Languages
Building High Quality Databases for Minority Languages such as Galician
Mixer 6
How Complex is Discourse Structure?
Enhanced Infrastructure for Creation and Collection of Translation Resources
Annotating Event Chains for Carbon Sequestration Literature
A Positional Tagset for Russian
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
Building a Web Corpus of Czech
EAGLE: an Error-Annotated Corpus of Beginning Learner German
A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus
Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank
Collecting Voices from the Cloud
Partial Dependency Parsing for Irish
Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History
Towards Semi-Automated Annotation for Prepositional Phrase Attachment
Lexical Resources and Ontological Classifications for the Recognition of Proper Names Sense Extension
Building a Bank of Semantically Encoded Narratives
Spatial Role Labeling: Task Definition and Annotation Scheme
Discovering Polarity for Ambiguous and Objective Adjectives through Adverbial Modification
Transcription Methods for Consistency, Volume and Efficiency
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
Romanian Zero Pronoun Distribution: A Comparative Study
ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population
The TDIL Program and the Indian Language Corpora Initiative (ILCI)
Towards the Annotation of Named Entities in the National Corpus of Polish
Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence
The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers
Concept Mentions within KDD-2009 Abstracts (kdd09cma1) Linked to a KDD Ontology (kddo1)
DICIT: Evaluation of a Distant-talking Speech Interface for Television
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee
Inter-sentential Relations in Information Extraction Corpora
Sentiment Analysis in the News
Enriching a Treebank to Investigate Relative Clause Extraposition in German
A French Human Reference Corpus for Multi-Document Summarization and Sentence Compression
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish
The Problems of Language Identification within Hugely Multilingual Data Sets
Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation?
A Linguistic Resource for Semantic Parsing of Motion Events
For Standardised Amazigh Linguistic Resources
Annotation of Human Gesture using 3D Skeleton Controls
A Software Toolkit for Viewing Annotated Multimodal Data Interactively over the Web
Named Entity Recognition in Questions: Towards a Golden Collection
The NOMCO Multimodal Nordic Resource - Goals and Characteristics
Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese
|
L |
Language Identification |
A Study of the Influence of Speech Type on Automatic Language Recognition Performance
Language Identification of Short Text Segments with N-gram Models
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems
Learning Language Identification Models: A Comparative Analysis of the Distinctive Features of Names and Common Words
Automatic Identification of Arabic Dialects
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents
Determining the Origin and Structure of Person Names
Bigorna - A Toolkit for Orthography Migration Challenges
|
Language modelling |
Video Retrieval in Sign Language Videos: How to Model and Compare Signs?
An Evaluation of Predicate Argument Clustering using Pseudo-Disambiguation
Czech Information Retrieval with Syntax-based Language Models
Towards Improving English-Latvian Translation: A System Comparison and a New Rescoring Feature
Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System
Inter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
BAStat: New Statistical Resources at the Bavarian Archive for Speech Signals
Language Identification of Short Text Segments with N-gram Models
LIPS: A Tool for Predicting the Lexical Isolation Point of a Word
Descriptive Analysis of Negation Cues in Biomedical Texts
Efficient Spoken Dialogue Domain Representation and Interpretation
POS Multi-tagging Based on Combined Models
Sign Language Corpora for Analysis, Processing and Evaluation
A Language Approach to Modeling Human Behaviors
Open Source Graph Transducer Interpreter and Grammar Development Environment
The Semantic Atlas: an Interactive Model of Lexical Representation
A Morphological Lexicon for the Persian Language
The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French
A Multilayered Declarative Approach to Cope with Morphotactics and Allomorphy in Derivational Morphology
Using an Error-Annotated Learner Corpus to Develop an ESL/EFL Error Correction System
Efficient Minimal Perfect Hash Language Models
A Syntactic Lexicon for Arabic Verbs
|
Lexicon, lexical database |
A Derivational Rephrasing Experiment for Question Answering
A Freely Available Morphological Analyzer for Turkish
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction
Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources
A Differential Semantics Approach to the Annotation of Synsets in WordNet
Lexical Semantic Resources in a Terminological Network
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning
A General Method for Creating a Bilingual Transliteration Dictionary
Annotation Scheme and Gold Standard for Dutch Subjective Adjectives
Detection of Peculiar Examples using LOF and One Class SVM
Online Japanese Unknown Morpheme Detection using Orthographic Variation
Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia
Identifying Sources of Weakness in Syntactic Lexicon Extraction
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus
Study of Word Sense Disambiguation System that uses Contextual Features - Approach of Combining Associative Concept Dictionary and Corpus -
Merging Specialist Taxonomies and Folk Taxonomies in Wordnets - A Case Study of Plants, Animals and Foods in the Danish Wordnet
Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classification of Time, Location and Participants
Meaning Representation: From Continuity to Discreteness
Learning Subjectivity Phrases missing from Resources through a Large Set of Semantic Tests
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
English-Spanish Large Statistical Dictionary of Inflectional Forms
The MuLeXFoR Database: Representing Word-Formation Processes in a Multilingual Lexicographic Environment
Base Concepts in the African Languages Compared to Upper Ontologies and the WordNet Top Ontology
GernEdiT - The GermaNet Editing Tool
Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text
A Bilingual Dictionary Mexican Sign Language-Spanish/Spanish-Mexican Sign Language
Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon
Constructing and Using Broad-coverage Lexical Resource for Enhancing Morphological Analysis of Arabic
Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language Pairs
Exploring the Relationship between Semantic Spaces and Semantic Relations
Providing Multilingual, Multimodal Answers to Lexical Database Queries
Building an Italian FrameNet through Semi-automatic Corpus Analysis
NLP Resources for the Analysis of Patient/Therapist Interviews
A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons
Mining the Web for the Induction of a Dialectical Arabic Lexicon
Interpreting SentiWordNet for Opinion Classification
Term and Collocation Extraction by Means of Complex Linguistic Web Services
Automatic Acquisition of Chinese Novel Noun Compounds
Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain
A Lexicon of French Quotation Verbs for Automatic Quotation Extraction
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches
Facilitating Non-expert Users of the KYOTO Platform: the TMEKO Editing Protocol for Synset to Ontology Mappings
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus
Semi-Automated Extension of a Specialized Medical Lexicon for French
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
Error Correction for Arabic Dictionary Lookup
Morphological Analysis and Generation of Arabic Nouns: A Morphemic Functional Approach
A Modality Lexicon and its use in Automatic Tagging
The ConceptMapper Approach to Named Entity Recognition
Handling of Missing Values in Lexical Acquisition
Senso Comune
Combining Resources: Taxonomy Extraction from Multiple Dictionaries
FrameNet Translation Using Bilingual Dictionaries with Evaluation on the English-French Pair
SentiWS - A Publicly Available German-language Resource for Sentiment Analysis
A Tool for Linking Stems and Conceptual Fragments to Enhance Word Access
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure
Exploring Knowledge Bases for Similarity
Inferring Subcat Frames of Verbs in Urdu
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information
Generating FrameNets of Various Granularities: The FrameNet Transformer
The Role of Parallel Corpora in Bilingual Lexicography
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian
Building a Bilingual ValLex Using Treebank Token Alignment: First Observations
Linking Korean Words with an Ontology
The Semantic Atlas: an Interactive Model of Lexical Representation
Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance
Using High-Quality Resources in NLP: The Valency Dictionary of English as a Resource for Left-Associative Grammars
Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters
Aligning FrameNet and WordNet based on Semantic Neighborhoods
Al-Khalil: The Arabic Linguistic Ontology Project
An LMF-based Web Service for Accessing WordNet-type Semantic Lexicons
Automatically Identifying Changes in the Semantic Orientation of Words
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration
Building a Generative Lexicon for Romanian
The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment
Q-WordNet: Extracting Polarity from WordNet Senses
How FrameSQL Shows the Japanese FrameNet Data
A Context Sensitive Variant Dictionary for Supporting Variant Selection
A Morphological Lexicon for the Persian Language
The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French
Integrating a Large Domain Ontology of Species into WordNet
Lexicon and Grammar in Bulgarian FrameNet
Language Resource Management System for Asian WordNet Collaboration and Its Web Service Application
Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation
A Multilayered Declarative Approach to Cope with Morphotactics and Allomorphy in Derivational Morphology
Extensive Evaluation of a FrameNet-WordNet mapping resource
Building High Quality Databases for Minority Languages such as Galician
An Automatically Built Named Entity Lexicon for Arabic
eXtended WordFrameNet
Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus
The Impact of Grammar Enhancement on Semantic Resources Induction
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains
Lexical Resources and Ontological Classifications for the Recognition of Proper Names Sense Extension
REBECA: Turning WordNet Databases into "Ontolexicons"
Data-Driven and Ontological Analysis of FrameNet for Natural Language Reasoning
Constructing of an Ontology-based Lexicon for Bulgarian
Grammar Extraction from Treebanks for Hindi and Telugu
Corpus Based Analysis for Multilingual Terminology Entry Compounding
ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish
Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica
A Syntactic Lexicon for Arabic Verbs
The TDIL Program and the Indian Language Corpora Initiative (ILCI)
OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP
Lexical Resources for Noun Compounds in Czech, English and Zulu
A Recursive Treatment of Collocations
EcoLexicon: An Environmental TKB
Processing and Extracting Data from Dicionário Aberto
Bootstrapping Language Neutral Term Extraction
Word Sense Annotation of Polysemous Words by Multiple Annotators
Expanding the Lexicon for a Resource-Poor Language Using a Morphological Analyzer and a Web Crawler
The More the Better? Assessing the Influence of Wikipedia's Growth on Semantic Relatedness Measures
IndoWordNet
PolNet - Polish WordNet: Data and Tools
Lexicon Design for Transcription of Spontaneous Voice Messages
|
LR Infrastructures and Architectures |
Annotation Process Management Revisited
FreeLing 2.1: Five Years of Open-source Language Processing Tools
Diabase: Towards a Diachronic BLARK in Support of Historical Studies
A Data Category Registry- and Component-based Metadata Framework
Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar
Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH)
Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment
Virtual Language Observatory: The Portal to the Language Resources and Technology Universe
The Architecture of FunGramKB
U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library
Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities
An Evolving eScience Environment for Research Data in Linguistics
Term and Collocation Extraction by Means of Complex Linguistic Web Services
Preparing the field for an Open Resource Infrastructure: the role of the FLaReNet Network of Excellence
Querying Diverse Treebanks in a Uniform Way
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research
A Case Study on Interoperability for Language Resources and Applications
The South African Human Language Technologies Audit
A Game-based Approach to Transcribing Images of Text
A Corpus Representation Format for Linguistic Web Services: The D-SPIN Text Corpus Format and its Relationship with ISO Standards
Annotation and Representation of a Diachronic Corpus of Spanish
LIMA: A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation
ISO-TimeML: An International Standard for Semantic Annotation
Development and Use of an Evaluation Collection for Personalisation of Digital Newspapers
GIS Application Improvement with Multilingual Lexical and Terminological Resources
Language Technology Challenges of a Small Language (Catalan)
Aligning FrameNet and WordNet based on Semantic Neighborhoods
LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data
Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case
Building a Generative Lexicon for Romanian
Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure
Building a Node of the Accessible Language Technology Infrastructure
The CLARIN-NL Project
The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French
LT World: Ontology and Reference Information Portal
Towards an Integrated Architecture for Composite Language Services and Multiple Linguistic Processing Components
Learning Based Java for Rapid Development of NLP Systems
A Corpus Factory for Many Languages
Enhanced Infrastructure for Creation and Collection of Translation Resources
A Flexible Representation of Heterogeneous Annotation Data
Building a Web Corpus of Czech
Language Service Management with the Language Grid
Corpus Based Analysis for Multilingual Terminology Entry Compounding
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation
AutoTagTCG: A Framework for Automatic Thai CG Tagging
OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish
IndoWordNet
A Road Map for Interoperable Language Resource Metadata
|
LR national/international projects, organizational/policy issues |
Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective
Recent Developments in the National Corpus of Polish
The Grande Grammaire du Français Project
Merging Specialist Taxonomies and Folk Taxonomies in Wordnets - A Case Study of Plants, Animals and Foods in the Danish Wordnet
Towards a Balanced Named Entity Corpus for Dutch
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources
Human Language Technology and Communicative Disabilities: Requirements and Possibilities for the Future
Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network
Bulgarian National Corpus Project
The SignSpeak Project - Bridging the Gap Between Signers and Speakers
Preparing the field for an Open Resource Infrastructure: the role of the FLaReNet Network of Excellence
The LREC Map of Language Resources and Technologies
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank
The South African Human Language Technologies Audit
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora
GIS Application Improvement with Multilingual Lexical and Terminological Resources
Linking Korean Words with an Ontology
Development of Linguistic Resources and Tools for Providing Multilingual Solutions in Indian Languages - A Report on National Initiative
An Evaluation of Technologies for Knowledge Base Population
Building a Node of the Accessible Language Technology Infrastructure
The CLARIN-NL Project
Lexicon and Grammar in Bulgarian FrameNet
Flemish-Dutch HLT Policy: Evolving to New Forms of Collaboration
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining
Cooperation for Arabic Language Resources and Tools - The MEDAR Project
Language Service Management with the Language Grid
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks
The TDIL Program and the Indian Language Corpora Initiative (ILCI)
For Standardised Amazigh Linguistic Resources
A Road Map for Interoperable Language Resource Metadata
Adapting to Trends in Language Resource Development: A Progress Report on LDC Activities
|
S |
Semantics |
The JOS Linguistically Tagged Corpus of Slovene
Corpus-based Semantics of Concession: Where do Expectations Come from?
Lexical Semantic Resources in a Terminological Network
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch
TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text
Cultural Aspects of Spatiotemporal Analysis in Multilingual Applications
An Associative Concept Dictionary for Verbs and its Application to Elliptical Word Estimation
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus
An Annotated Dataset for Extracting Definitions and Hypernyms from the Web
Inducing Ontologies from Folksonomies using Natural Language Understanding
ADN-Classifier: Automatically Assigning Denotation Types to Nominalizations
Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text
The Architecture of FunGramKB
A Random Graph Walk based Approach to Computing Semantic Relatedness Using Knowledge from Wikipedia
Building an Italian FrameNet through Semi-automatic Corpus Analysis
Descriptive Analysis of Negation Cues in Biomedical Texts
Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL
A New Approach to Pseudoword Generation
Mining the Web for the Induction of a Dialectical Arabic Lexicon
Predictive Features for Detecting Indefinite Polar Sentences
Can Syntactic and Logical Graphs help Word Sense Disambiguation?
Syntactic Testsuites and Textual Entailment Recognition
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia
Evaluating Complex Semantic Artifacts
Fred's Reusable Evaluation Device: Providing Support for Quick and Reliable Linguistic Annotation
A Modality Lexicon and its use in Automatic Tagging
Feasibility of Automatically Bootstrapping a Persian WordNet
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora
Senso Comune
Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference
FrameNet Translation Using Bilingual Dictionaries with Evaluation on the English-French Pair
Automatic Annotation of Co-Occurrence Relations
Mapping between Dependency Structures and Compositional Semantic Representations
Semantic Feature Engineering for Enhancing Disambiguation Performance in Deep Linguistic Processing
Exploring Knowledge Bases for Similarity
Towards an ISO Standard for Dialogue Act Annotation
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building
A Database of Narrative Schemas
Learning Recursive Segments for Discourse Parsing
Extracting Product Features and Sentiments from Chinese Customer Reviews
The Semantic Atlas: an Interactive Model of Lexical Representation
Metaphor Corpus Annotated for Source - Target Domain Mappings
Semi-Automatic Domain Ontology Creation from Text Resources
Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters
Aligning FrameNet and WordNet based on Semantic Neighborhoods
Automatically Identifying Changes in the Semantic Orientation of Words
Annotating Event Mentions in Text with Modality, Focus, and Source Information
Q-WordNet: Extracting Polarity from WordNet Senses
How FrameSQL Shows the Japanese FrameNet Data
Lexicon and Grammar in Bulgarian FrameNet
Capturing Coercions in Texts: a First Annotation Exercise
Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone
Evaluating Machine Translation Utility via Semantic Role Labels
Bank of Russian Constructions and Valencies
Annotating Event Anaphora: A Case Study
Automatic Discovery of Semantic Relations using MindNet
Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm
Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus
The Impact of Grammar Enhancement on Semantic Resources Induction
Building a Bank of Semantically Encoded Narratives
Semantic Evaluation of Machine Translation
REBECA: Turning WordNet Databases into "Ontolexicons"
Discovering Polarity for Ambiguous and Objective Adjectives through Adverbial Modification
Constructing of an Ontology-based Lexicon for Bulgarian
Using Ontologies for Semi-automatic Linking VerbaLex with FrameNet
Lexical Resources for Noun Compounds in Czech, English and Zulu
A General Purpose FrameNet-based Shallow Semantic Parser
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee
Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation?
The More the Better? Assessing the Influence of Wikipedia's Growth on Semantic Relatedness Measures
A Linguistic Resource for Semantic Parsing of Motion Events
|
Semantic Web |
A Use Case for Controlled Languages as Interfaces to Semantic Web Applications
Ontology-Based Categorization of Web Services with Machine Learning
A General Methodology for Equipping Ontologies with Time
Classifying Action Items for Semantic Email
Question Answering Biographic Information and Social Network Powered by the Semantic Web
Al-Khalil: The Arabic Linguistic Ontology Project
Socially Driven Ontology Enrichment for eLearning
Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems
Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources
Speech Grammars for Textual Entailment Patterns in Multimodal Question Answering
Flexible Ontology Population from Text: The OwlExporter
|
Sign Language Recognition/Generation |
Video Retrieval in Sign Language Videos: How to Model and Compare Signs?
ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources
A Bilingual Dictionary Mexican Sign Language-Spanish/Spanish-Mexican Sign Language
The SignSpeak Project - Bridging the Gap Between Signers and Speakers
Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project
Sign Language Corpora for Analysis, Processing and Evaluation
Automatic Identification of Arabic Dialects
|
Speech Recognition/Understanding |
Transcriber Driving Strategies for Transcription Aid System
Examining the Effects of Rephrasing User Input on Two Mobile Spoken Language Systems
ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
Bootstrapping Named Entity Extraction for the Creation of Mobile Services
Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network
Evaluating Repetitions, or how to Improve your Multilingual ASR System by doing Nothing
Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
A Speech Corpus for Dyslexic Reading Training
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems
Comparison of Spectral Properties of Read, Prepared and Casual Speech in French
Collecting Voices from the Cloud
Efficient Minimal Perfect Hash Language Models
An Analysis of the Performances of the CasEN Named Entities Recognition System in the Ester2 Evaluation Campaign
DICIT: Evaluation of a Distant-talking Speech Interface for Television
An Evaluation Framework for Natural Language Understanding in Spoken Dialogue Systems
Lexicon Design for Transcription of Spontaneous Voice Messages
|
Speech resource/database |
The Alborada-I3A Corpus of Disordered Speech
Multimodal Russian Corpus (MURCO): First Steps
Acquisition and Annotation of Slovenian Lombard Speech Database
NameDat: A Database of English Proper Names Spoken by Native Norwegians
Construction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development
A Database of Age and Gender Annotated Telephone Speech
The Nijmegen Corpus of Casual Spanish
FAU IISAH Corpus -- A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones
BAStat: New Statistical Resources at the Bavarian Archive for Speech Signals
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network
Enhancing Language Resources with Maps
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals
Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain
Resources for Speech Synthesis of Viennese Varieties
Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding
Word Boundaries in French: Evidence from Large Speech Corpora
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems
Language Modeling Approach for Retrieving Passages in Lecture Audio Data
AhoTransf: A Tool for Multiband Excitation Based Speech Analysis and Modification
CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines
Towards Investigating Effective Affective Dialogue Strategies
The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception of Emotion
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
A Speech Corpus for Dyslexic Reading Training
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition
Partial Parsing of Spontaneous Spoken French
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation
A Speech Corpus for Modeling Language Acquisition: CAREGIVER
Towards Optimal TTS Corpora
Vergina: A Modern Greek Speech Database for Speech Synthesis
The PlayMancer Database: A Multimodal Affect Database in Support of Research and Development Activities in Serious Game Environment
Speech Data Corpus for Verbal Intelligence Estimation
A Very Large Scale Mandarin Chinese Broadcast Corpus for GALE Project
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems
Automatic Identification of Arabic Dialects
The Kachna L1/L2 Picture Replication Corpus
Greybeard Longitudinal Speech Study
Mixer 6
Collecting Voices from the Cloud
|
Speech Synthesis |
Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning.
A Fully Annotated Corpus of Russian Speech
Resources for Speech Synthesis of Viennese Varieties
TTS Evaluation Campaign with a Common Spanish Database
AhoTransf: A Tool for Multiband Excitation Based Speech Analysis and Modification
Towards Optimal TTS Corpora
Vergina: A Modern Greek Speech Database for Speech Synthesis
Multilingual Voice Creation Toolkit for the MARY TTS Platform
|
Standards for LRs |
Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora
Recent Developments in the National Corpus of Polish
The Creagest Project: a Digitized and Annotated Corpus for French Sign Language (LSF) and Natural Gestural Languages
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research
LAF/GrAF-grounded Representation of Dependency Structures
A Corpus Representation Format for Linguistic Web Services: The D-SPIN Text Corpus Format and its Relationship with ISO Standards
ISO-TimeML: An International Standard for Semantic Annotation
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information
An LMF-based Web Service for Accessing WordNet-type Semantic Lexicons
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration
A High Recall Error Identification Tool for Hindi Treebank Validation
Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure
The CLARIN-NL Project
The Brandeis Annotation Tool
ANC2Go: A Web Application for Customized Corpus Creation
A Flexible Representation of Heterogeneous Annotation Data
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units
Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems
Towards the Annotation of Named Entities in the National Corpus of Polish
Language Technology Resource Center
For Standardised Amazigh Linguistic Resources
|
Statistical and machine learning methods |
WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions
The Influence of the Utterance Length on the Recognition of Aged Voices
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
An Evaluation of Predicate Argument Clustering using Pseudo-Disambiguation
Studying Word Sketches for Russian
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction
A Resource and Tool for Super-sense Tagging of Italian Texts
Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System
Paragraph Acquisition and Selection for List Question Using Amazon's Mechanical Turk
Active Learning and Crowd-Sourcing for Machine Translation
Language Identification of Short Text Segments with N-gram Models
The Web Library of Babel: evaluating genre collections
Bootstrapping Named Entity Extraction for the Creation of Mobile Services
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal)
Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge
Statistical French Dependency Parsing: Treebank Conversion and First Results
Unsupervised Discovery of Collective Action Frames for Socio-Cultural Analysis
Word-based Partial Annotation for Efficient Corpus Construction
United we Stand: Improving Sentiment Analysis by Joining Machine Learning and Rule Based Methods
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus
Using Comparable Corpora to Adapt a Translation Model to Domains
Handling of Missing Values in Lexical Acquisition
Generic Ontology Learners on Application Domains
Heterogeneous Sensor Database in Support of Human Behaviour Analysis in Unrestricted Environments: The Audio Part
Automatic Annotation of Co-Occurrence Relations
A Dataset for Assessing Machine Translation Evaluation Metrics
Inferring Subcat Frames of Verbs in Urdu
Arabic Word Segmentation for Better Unit of Analysis
Saturnalia: A Latin-Catalan Parallel Corpus for Statistical MT
A Database of Narrative Schemas
A Language Approach to Modeling Human Behaviors
Automatic Grammar Rule Extraction and Ranking for Definitions
Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters
A Semi-supervised Type-based Classification of Adjectives: Distinguishing Properties and Relations
Evaluating the Impact of Some Linguistic Information on the Performances of a Similarity-based and Translation-oriented Word-Sense Disambiguation Method
Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali
Towards a Learning Approach for Abbreviation Detection and Resolution.
Learning Based Java for Rapid Development of NLP Systems
Extensive Evaluation of a FrameNet-WordNet mapping resource
Improved Statistical Measures to Assess Natural Language Parser Performance across Domains
mwetoolkit: a Framework for Multiword Expression Identification
There's no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
Inferring Syntactic Rules for Word Alignment through Inductive Logic Programming
|
Summarisation |
Model Summaries for Location-related Images
Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text
Construction of Text Summarization Corpus for the Credibility of Information on the Web
Cultural Aspects of Spatiotemporal Analysis in Multilingual Applications
Automatic Summarization Using Terminological and Semantic Resources
Analysis and Presentation of Results for Mobile Local Search
Modeling Wikipedia Articles to Enhance Encyclopedic Search
Constructing an Annotated Story Corpus: Some Observations and Issues
A French Human Reference Corpus for Multi-Document Summarization and Sentence Compression
|
T |
Text mining |
Resources for Calendar Expressions Semantic Tagging and Temporal Navigation through Texts
Evaluating a Text Mining Based Educational Search Portal
Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text
Construction of Text Summarization Corpus for the Credibility of Information on the Web
Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources
Annotation Scheme and Gold Standard for Dutch Subjective Adjectives
Detection of Peculiar Examples using LOF and One Class SVM
Comment Extraction from Blog Posts and Its Applications to Opinion Mining
Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classification of Time, Location and Participants.
New Tools for Web-Scale N-grams
Emotion Cause Events: Corpus Construction and Analysis
Can Syntactic and Logical Graphs help Word Sense Disambiguation?
Cultural Heritage: Knowledge Extraction from Web Documents
Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project
Text Cluster Trimming for Better Descriptions and Improved Quality
Extracting Product Features and Sentiments from Chinese Customer Reviews
Automatic Grammar Rule Extraction and Ranking for Definitions
Towards a Learning Approach for Abbreviation Detection and Resolution.
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet
The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers
Applying a Dynamic Bayesian Network Framework to Transliteration Identification
Sentiment Analysis in the News
Word Sense Annotation of Polysemous Words by Multiple Annotators
Exploiting Scope for Shallow Discourse Parsing
A Digital Archive of Research Papers in Computer Science
|
Textual Entailment and Paraphrasing |
A Random Graph Walk based Approach to Computing Semantic Relatedness Using Knowledge from Wikipedia
Building a Textual Entailment Suite for the Evaluation of Automatic Content Scoring Technologies
Syntactic Testsuites and Textual Entailment Recognition
Mining Wikipedia for Large-scale Repositories of Context-Sensitive Entailment Rules
Identifying Paraphrases between Technical and Lay Corpora
Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference
A Resource for Investigating the Impact of Anaphora and Coreference on Inference.
A Corpus for Studying Full Answer Justification
Generating FrameNets of Various Granularities: The FrameNet Transformer
Evaluating Lexical Substitution: Analysis and New Measures
Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank
Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History
Data-Driven and Ontological Analysis of FrameNet for Natural Language Reasoning
|
Tools, systems, applications |
Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion
Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning.
A Freely Available Morphological Analyzer for Turkish
The English-Swedish-Turkish Parallel Treebank
Evaluating a Text Mining Based Educational Search Portal
A Large List of Confusion Sets for Spellchecking Assessed Against a Corpus of Real-word Errors
WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions
Annotation Process Management Revisited
Top-Performing Robust Constituency Parsing of Portuguese: Freely Available in as Many Ways as you Can Get it
FreeLing 2.1: Five Years of Open-source Language Processing Tools
Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment
A Morphological Processor Based on Foma for Biscayan (a Basque dialect)
Diabase: Towards a Diachronic BLARK in Support of Historical Studies
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
A Data Category Registry- and Component-based Metadata Framework
Indexing Methods for Faster and More Effective Person Name Search
Online Japanese Unknown Morpheme Detection using Orthographic Variation
Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar
Appraise: An Open-Source Toolkit for Manual Phrase-Based Evaluation of Translations
Learning Subjectivity Phrases missing from Resources through a Large Set of Semantic Tests
Studying Word Sketches for Russian
A Resource and Tool for Super-sense Tagging of Italian Texts
Building the Basque PropBank
Automatic Detection of Syllable Boundaries in Spontaneous Speech
New Tools for Web-Scale N-grams
Construction of a Chinese Opinion Treebank
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System
ADN-Classifier: Automatically Assigning Denotation Types to Nominalizations
GernEdiT - The GermaNet Editing Tool
Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment
Computer Assisted Semantic Annotation in the DutchSemCor Project
WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure
Virtual Language Observatory: The Portal to the Language Resources and Technology Universe
Fine-Grain Morphological Analyzer and Part-of-Speech Tagger for Arabic Text
Towards an Improved Methodology for Automated Readability Prediction
Example-Based Automatic Phonetic Transcription
A Python Toolkit for Universal Transliteration
Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns
U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library
Providing Multilingual, Multimodal Answers to Lexical Database Queries
Bulgarian National Corpus Project
Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes
The DAD Parallel Corpora and their Uses
LIPS: A Tool for Predicting the Lexical Isolation Point of a Word
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities
Homographic Ideogram Understanding Using Contextual Dynamic Network
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0
A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons
FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser
Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge
An Evolving eScience Environment for Research Data in Linguistics
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development
Querying Diverse Treebanks in a Uniform Way
Facilitating Non-expert Users of the KYOTO Platform: the TMEKO Editing Protocol for Synset to Ontology Mappings
Word-based Partial Annotation for Efficient Corpus Construction
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus
A Case Study on Interoperability for Language Resources and Applications
Mining Wikipedia for Large-scale Repositories of Context-Sensitive Entailment Rules
Morphological Analysis and Generation of Arabic Nouns: A Morphemic Functional Approach
Fred's Reusable Evaluation Device: Providing Support for Quick and Reliable Linguistic Annotation
The ConceptMapper Approach to Named Entity Recognition
Tag Dictionaries Accelerate Manual Annotation
AhoTransf: A Tool for Multiband Excitation Based Speech Analysis and Modification
A Game-based Approach to Transcribing Images of Text
An Integrated Digital Tool for Accessing Language Resources
Annotation Tool for Extended Textual Coreference and Bridging Anaphora
Extending BART to Provide a Coreference Resolution System for German
Computational Linguistics for Mere Mortals - Powerful but Easy-to-use Linguistic Processing for Scientists in the Humanities
Maskkot ― An Entity-centric Annotation Platform
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices
LIMA: A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation
A Named Entity Labeler for German: Exploiting Wikipedia and Distributional Clusters
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation
Analysing Temporally Annotated Corpora with CAVaT
A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora
VenPro: A Morphological Analyzer for Venetan
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building
GIS Application Improvement with Multilingual Lexical and Terminological Resources
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm
Open Source Graph Transducer Interpreter and Grammar Development Environment
SINotas: the Evaluation of a NLG Application
Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora
Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance
The Spanish Resource Grammar
A Multilingual CALL Game Based on Speech Translation
Efficiently Extract Recurring Tree Fragments from Large Treebanks
Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages
LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data
FastKwic, an Intelligent Concordancer Using FASTR
NPCEditor: A Tool for Building Question-Answering Characters
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development
Principled Construction of Elicited Imitation Tests
Dialogues in Context: An Objective User-Oriented Evaluation Approach for Virtual Human Dialogue
Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure
FIDJI: Web Question-Answering at Quaero 2009
FrAG, a Hybrid Constraint Grammar Parser for French
An API for Multi-lingual Ontology Matching
An Open Source Process Engine Framework for Realtime Pattern Recognition and Information Fusion Tasks
English-Hindi Transliteration using Multiple Similarity Metrics
How FrameSQL Shows the Japanese FrameNet Data
A Context Sensitive Variant Dictionary for Supporting Variant Selection
Language Resource Management System for Asian WordNet Collaboration and Its Web Service Application
Extracting Surface Realisation Templates from Corpora
Multilingual Voice Creation Toolkit for the MARY TTS Platform
The Ariadne System: A Flexible and Extensible Framework for the Modeling and Storage of Experimental Data in the Humanities.
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level
Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone
Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank
The Brandeis Annotation Tool
Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification
Learning Based Java for Rapid Development of NLP Systems
Integrating a Rule-based with a Hierarchical Translation System
Cooperation for Arabic Language Resources and Tools ― The MEDAR Project
Achieving Domain Specificity in SMT without Overt Siloing
There's no Data like More Data? Revisiting the Impact of Data Size on a Classification Task
STeP-1: A Set of Fundamental Tools for Persian Text Processing
Hybrid Citation Extraction from Patents
Building a Web Corpus of Czech
A Crash Test with Linguistica in Modern Greek: The Case of Derivational Affixes and Bound Stems
Pr.A.Ti.D: A Coding Scheme for Pragmatic Annotation of Dialogues.
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation
Using Ontologies for Semi-automatic Linking VerbaLex with FrameNet
AutoTagTCG: A Framework for Automatic Thai CG Tagging
Mining the Correlation between Human and Automatic Evaluation at Sentence Level
OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP
A General Purpose FrameNet-based Shallow Semantic Parser
Bigorna -- A Toolkit for Orthography Migration Challenges
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee
Applying a Dynamic Bayesian Network Framework to Transliteration Identification
Flexible Ontology Population from Text: The OwlExporter
Language Technology Resource Center
Annotation of Human Gesture using 3D Skeleton Controls
|
Topic detection & tracking |
Building a Gold Standard for Event Detection in Croatian
|
Typological databases |
Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level
|