AFFILIATIONS: Browse articles of the conference sorted by institution

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z

A
Aalto University School of Science, FINLAND Comparing computer vision analysis of signed language video with motion capture recordings
Adam Mickiewicz University, Faculty of Mathematics and Computer Science, Department of Computer Linguistics and Artificial Intelligence, POLAND Wordnet Based Lexicon Grammar for Polish
Adam Mickiewicz University, Poznan, Institute of Linguistics, POLAND Developing and evaluating an emergency scenario dialogue corpus
Advocate, Member of the Johannesburg Bar, SOUTH AFRICA Aspects of a Legal Framework for Language Resource Management
Agence France-Presse, FRANCE Aleda, a free large-scale entity database for French
Alexandru Philippide Institute of Philology, Romanian Academy, the Iași, ROMANIA Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
Allameh Tabatabaiee University, Linguistic Department, IRAN, ISLAMIC REPUBLIC OF Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Alpage, INRIA Paris-Rocquencourt & Université Paris 7, FRANCE Applying cross-lingual WSD to wordnet development
Evaluating and improving syntactic lexica by plugging them within a parser
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
Amirkabir University of Technology, IRAN, ISLAMIC REPUBLIC OF A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
Applied Language Solutions Ltd., UNITED KINGDOM SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Arabic Textware, JORDAN Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Association Francophone de la Communication Parlée, FRANCE The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Athens Technology Center, GREECE Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
ATILF Nancy, FRANCE Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary
AT&T Labs ― Research, UNITED STATES Building Text-To-Speech Voices in the Cloud
Austrian Academy of Sciences, AUSTRIA Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies.
Austrian Research Institute for Artificial Intelligence, AUSTRIA Applying Random Indexing to Structured Data to Find Contextually Similar Words
Autónoma University Madrid, Computational Linguistics Laboratory, SPAIN Medical Term Extraction in an Arabic Medical Corpus

 

B
BAE Systems, UNITED STATES Identifying Nuggets of Information in GALE Distillation Evaluation
Barcelona Media Research Center, SPAIN Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
BUCEADOR, a multi-language search engine for digital libraries
Barcelona Media, SPAIN A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
BME MOKK, HUNGARY Rapid creation of large-scale corpora and frequency dictionaries
Brandeis University, Computer Science Department, UNITED STATES Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Brandeis University, UNITED STATES Word Sense Inventories by Non-Experts.
ATLIS: Identifying Locational Information in Text Automatically
The Role of Model Testing in Standards Development: The Case of ISO-Space
Annotating dropped pronouns in Chinese newswire text
The TARSQI Toolkit
Brigham Young University, UNITED STATES Item Development and Scoring for Japanese Oral Proficiency Testing
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Bulgarian Academy of Sciences, BULGARIA Bulgarian X-language Parallel Corpus
Expanding Parallel Resources for Medium-Density Languages for Free
Linguistic Analysis Processing Line for Bulgarian

 

C
Cairo University, Faculty of Arts, EGYPT Medical Term Extraction in an Arabic Medical Corpus
Campus de Espinardo, SPAIN A New Method for Evaluating Automatically Learned Terminological Taxonomies
Carnegie Mellon University, Language Technologies Institute, UNITED STATES Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
Carnegie Mellon University, UNITED STATES A Universal Part-of-Speech Tagset
Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
Morphosyntactic Analysis of the CHILDES and TalkBank Corpora
Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource
CASSIDIAN, FRANCE New language resources for the Pashto language
CEA - LIST, FRANCE Evaluation of Unsupervised Information Extraction
Evaluation of a Complex Information Extraction Application in Specific Domain
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
CEHUM/UM, PORTUGAL Págico: Evaluating Wikipedia-based information retrieval in Portuguese
CELCT, ITALY The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
CAT: the CELCT Annotation Tool
Evaluating Machine Reading Systems through Comprehension Tests
Chinese Whispers: Cooperative Paraphrase Acquisition
CELI, ITALY Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
Centro de Estudios Lingüísticos y Literarios, El Colegio de México, MEXICO Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Centro de Linguística da Universidade de Lisboa, PORTUGAL Project FLY: a multidisciplinary project within Linguistics
Modality in Text: a Proposal for Corpus Annotation
Centro de Linguística da Universidade Nova de Lisboa, PORTUGAL An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Dealing with unknown words in statistical machine translation
Centro Ramon Pineiro para a Investigacion en Humanidades. Xunta de Galicia, SPAIN A Galician Syntactic Corpus with Application to Intonation Modeling
Chalmers University, SWEDEN An Open Source Persian Computational Grammar
Charles University in Prague, CZECH REPUBLIC A database of semantic clusters of verb usages
Alternative Lexicalizations of Discourse Connectives in Czech
HamleDT: To Parse or Not to Parse?
Terra: a Collection of Translation Error-Annotated Corpora
Announcing Prague Czech-English Dependency Treebank 2.0
The Joy of Parallelism with CzEng 1.0
Getting more data -- Schoolkids as annotators
Building a learner corpus
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC Building a multilingual parallel corpus for human users
Charles University in Prague, Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, CZECH REPUBLIC Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component
Language Richness of the Web
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Prague Dependency Style Treebank for Tamil
Arabic Word Generation and Modelling for Spell Checking
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Charles University in Prague, Institute of the Czech National Corpus, CZECH REPUBLIC Balanced data repository of spontaneous spoken Czech
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC A High-Quality Web Corpus of Czech
Automatic MT Error Analysis: Hjerson Helping Addicter
IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus
Chiba Institute of Technology, JAPAN Method for Collection of Acted Speech Using Various Situation Scripts
Chiba University, JAPAN Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Annotation of anaphoric relations and topic continuity in Japanese conversation
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
Chinese University of Hong Kong, HONG KONG Quantising Opinions for Political Tweets Analysis
City University of New York, UNITED STATES Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
CLEO, Centre for Open Electronic Publishing, FRANCE Annotated Bibliographical Reference Corpora in Digital Humanities
CLiC-Universitat de Barcelona, SPAIN Annotating Near-Identity from Coreference Disagreements
CLUL, PORTUGAL Introducing the Reference Corpus of Contemporary Portuguese Online
CNAM-CRTD, FRANCE Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
CNR, ITALY Towards a User-Friendly Platform for Building Language Resources based on Web Services
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
CNRS, FRANCE DISLOG: A logic-based language for processing discourse structures
ANALEC: a New Tool for the Dynamic Annotation of Textual Data
Columbia University, UNITED STATES Annotations for Power Relations on Email Threads
The Dependency-Parsed FrameNet Corpus
AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis
Annotating Agreement and Disagreement in Threaded Discussion
Conventional Orthography for Dialectal Arabic
The MASC Word Sense Corpus
Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations
DramaBank: Annotating Agency in Narrative Discourse
Empirical Comparisons of MASC Word Sense Annotations
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
Computer Research Center of Islamic Sciences, IRAN, ISLAMIC REPUBLIC OF A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Copenhagen Business School, DENMARK Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
SMALLWorlds -- Multilingual Content-Controlled Monologues
Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research

 

D
DANS, Data Archiving & Networked Services, NETHERLANDS An Oral History Annotation Tool for INTER-VIEWs
De La Salle University, Center for Empathic Human-Computer Interactions, PHILIPPINES Building a Multimodal Laughter Database for Emotion Recognition
Delft University of Technology, NETHERLANDS The BladeMistress Corpus: From Talk to Action in Virtual Worlds
Creating a Data Collection for Evaluating Rich Speech Retrieval
Deluxe Digital Studios, UNITED KINGDOM SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH, SWEDEN 3rd party observer gaze as a continuous measure of dialogue flow
Deutsche Telekom Labs, GERMANY “You Seem Aggressive!” Monitoring Anger in a Practical Application
Fast Labeling and Transcription with the Speechalyzer Toolkit
DFKI GmbH, GERMANY Constructing a Question Corpus for Textual Semantic Relations
Using Verb Subcategorization for Word Sense Disambiguation
Involving Language Professionals in the Evaluation of Machine Translation
Automatic MT Error Analysis: Hjerson Helping Addicter
Joint Grammar and Treebank Development for Mandarin Chinese with HPSG
An Adaptive Framework for Named Entity Combination
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Terra: a Collection of Translation Error-Annotated Corpora
Evaluating the Impact of Phrase Recognition on Concept Tagging
Annotating Opinions in German Political News
MultiUN v2: UN Documents with Multilingual Alignments
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World
A Graphical Citation Browser for the ACL Anthology
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
CLIMB grammars: three projects using metagrammar engineering
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
The META-SHARE Metadata Schema for the Description of Language Resources
Direction Générale de l'Armement, FRANCE The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
The REPERE Corpus : a multimodal corpus for person recognition
New language resources for the Pashto language
Doshisha University, JAPAN Multimodal Corpus of Multi-party Conversations in Second Language
Dräger Safety AG & Co. KGaA, GERMANY Adapting and evaluating a generic term extraction tool
Dublin City University, IRELAND Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner
Irish Treebanking and Parsing: A Preliminary Evaluation
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Towards a User-Friendly Platform for Building Language Resources based on Web Services
Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
Creating a Data Collection for Evaluating Rich Speech Retrieval
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Duke University, UNITED STATES A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Duquesne University, EVL Lab, UNITED STATES Distractorless Authorship Verification

 

E
Eastern Michigan University, UNITED STATES ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
Ecole Polytechnique de Montréal, CANADA A disambiguation resource extracted from Wikipedia for semantic annotation
ELDA/ELRA, FRANCE The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Towards a User-Friendly Platform for Building Language Resources based on Web Services
ELRA in the heart of a cooperative HLT world
The REPERE Corpus : a multimodal corpus for person recognition
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
VERTa: Linguistic features in MT evaluation
The FLaReNet Strategic Language Resource Agenda
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
New language resources for the Pashto language
Using the International Standard Language Resource Number: Practical and Technical Aspects
An Analytical Model of Language Resource Sustainability
On the Way to a Legal Sharing of Web Applications in NLP
The META-SHARE Metadata Schema for the Description of Language Resources
Elhuyar Foundation, SPAIN PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
Elhuyar R&D, SPAIN Building a Basque-Chinese Dictionary by Using English as Pivot
Eolaistriu Technologies, IRELAND Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Erasmus Medical Center / Linguamatics, NETHERLANDS Centroids: Gold standards with distributional variation
Erasmus University Medical Centre, NETHERLANDS CALBC: Releasing the Final Corpora
European Bioinformatics Institute, UNITED KINGDOM Reclassifying subcategorization frames for experimental analysis and stimulus generation
Annotating progressive aspect constructions in the spoken section of the British National Corpus
Centroids: Gold standards with distributional variation
CALBC: Releasing the Final Corpora
European Commission, Directorate General for Translation (DGT), LUXEMBOURG MultiUN v2: UN Documents with Multilingual Alignments
DGT-TM: A freely available Translation Memory in 22 languages
European Commission, Joint Research Centre (JRC), ITALY DGT-TM: A freely available Translation Memory in 22 languages
JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
Euroscript Deutschland, GERMANY Involving Language Professionals in the Evaluation of Machine Translation
Euskal Herriko Unibertsitatea, Informatika Fakultatea, SPAIN Multilingual Central Repository version 3.0
A proposal for improving WordNet Domains
Euskal Herriko Unibertsitatea, SPAIN Matching Cultural Heritage items to Wikipedia
Versatile Speech Databases for High Quality Synthesis for Basque
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
Strategies to Improve a Speaker Diarisation Tool
Using an ASR database to design a pronunciation evaluation system in Basque

 

F
Fachhochschule Potsdam, University of Applied Science, GERMANY Is it Useful to Support Users with Lexical Resources? A User Study.
Federal University of Rio Grande do Sul, BRAZIL A large scale annotated child language construction database
Florida Institute of Human and Machine Cognition, UNITED STATES LIE: Leadership, Influence and Expertise
Fondazione Bruno Kessler, Trento, ITALY Buildind a Resource of Patterns Using Semantic Types
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction
A treebank-based study on the influence of Italian word order on parsing performance
Brand Pitt: A Corpus to Explore the Art of Naming
A Parallel Corpus of Music and Lyrics Annotated with Emotions
NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Chinese Whispers: Cooperative Paraphrase Acquisition
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
The KnowledgeStore: an Entity-Based Storage System
Fondazione Ugo Bordoni, ITALY Intelligibility assessment in forensic applications
Fordham University, Computer and Information Science Department, UNITED STATES Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing
Fraunhofer-Heinrich Hertz Institute, GERMANY AVATecH ― automated annotation through audio and video analysis
Fraunhofer IAIS Institute, GERMANY Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel
AVATecH ― automated annotation through audio and video analysis
Freelance translator, CROATIA BLEU Evaluation of Machine-Translated English-Croatian Legislation
FreeLing project developer, RUSSIAN FEDERATION FreeLing 3.0: Towards Wider Multilinguality
Freie Universität Berlin, GERMANY Building Large Corpora from the Web Using a New Efficient Tool Chain
From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach
Freie Universität Berlin, Web Based Systems Group, GERMANY Evaluating the Impact of Phrase Recognition on Concept Tagging
DBpedia: A Multilingual Cross-domain Knowledge Base
Friedrich-Schiller Universität, Jena, GERMANY Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
FTW Telecommunications Research Center Vienna, AUSTRIA Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis
Fundació Barcelona Media, SPAIN Spell Checking in Spanish: The Case of Diacritic Accents
VERTa: Linguistic features in MT evaluation
Fundación CTIC, SPAIN The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends

 

G
GEIE ERCIM, FRANCE The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Georgetown University, UNITED STATES A New Twitter Verb Lexicon for Natural Language Processing
Using semi-experts to derive judgments on word sense alignment: a pilot study
German Research Center for Artificial Intelligence, GERMANY Evaluating the Impact of Phrase Recognition on Concept Tagging
Goethe Universität Frankfurt am Main, GERMANY Customization of the Europarl Corpus for Translation Studies
”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
Google Inc., UNITED STATES A Universal Part-of-Speech Tagset
A light way to collect comparable corpora from the Web
Google Research, Google Inc., UNITED STATES A Cross-Lingual Dictionary for English Wikipedia Concepts
Goss Interactive Limited, UNITED KINGDOM Evaluating the Impact of Phrase Recognition on Concept Tagging
Gothenburg University, SWEDEN An Open Source Persian Computational Grammar
Cloud Logic Programming for Integrating Language Technology Resources
Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
The Rocky Road towards a Swedish FrameNet - Creating SweFN
Combining Language Resources Into A Grammar-Driven Swedish Parser
Semantic Role Labeling with the Swedish FrameNet
Feedback in Nordic First-Encounters: a Comparative Study
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Greedy Intelligence Ltd., CHINA Constructing a Question Corpus for Textual Semantic Relations
Griffith University, AUSTRALIA The Australian National Corpus: National Infrastructure for Language Resources

 

H
Heinrich-Heine-Universität, GERMANY Arabic Word Generation and Modelling for Spell Checking
Hiroshima City University, JAPAN Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
HITS gGmbH, GERMANY Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
Hochschule Augsburg, GERMANY Annotation Facilities for the Reliable Analysis of Human Motion
Hogeschool Gent, BELGIUM Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Beyond SoNaR: towards the facilitation of large corpus building efforts
Hogeschool Gent, LT3, BELGIUM Collection of a corpus of Dutch SMS
Hogeschool Gent, LT3 / Universiteit Gent, BELGIUM Evaluating automatic cross-domain Dutch semantic role annotation
Hokkaido University, JAPAN Two Database Resources for Processing Social Media English Text
Holzweg e-Commerce Solutions, AUSTRIA Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
Hong Kong Polytechnic University, Department of Computing, HONG KONG A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, HONG KONG Assigning Connotation Values to Events
Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong University of Science and Technology, HONG KONG A Multilingual Natural Stress Emotion Database
Hong Kong University of Science and Technology, Human Language Technology Center, Department of Computer Science and Engineering, HONG KONG A Mandarin-English Code-Switching Corpus
Human Speech and Language Technologies, Vicomtech-IK4, SPAIN Using DiAML and ANVIL for multimodal dialogue annotations
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
The coding and annotation of multimodal dialogue acts
Highlighting relevant concepts from Topic Signatures
Humboldt-Universität Berlin, GERMANY Multi-Layer Discourse Annotation of a Dutch Text Corpus
Hungarian Academy of Sciences, HUNGARY Light Verb Constructions in the SzegedParalellFX English--Hungarian Parallel Corpus
Rapid creation of large-scale corpora and frequency dictionaries
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
Hungarian Academy of Sciences, Research Group on Artificial Intelligence, HUNGARY HunOr: A Hungarian―Russian Parallel Corpus
Automatically Generated Online Dictionaries

 

I
Ibaraki University, JAPAN Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples
IBM Research India, INDIA A Study of Word-Classing for MT Reordering
ICLTT, AUSTRIA Federated Search: Towards a Common Search Infrastructure
ICSI, UNITED STATES The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
ID GmbH & Co. KgaA, GERMANY SemScribe: Natural Language Generation for Medical Reports
IDIAP Research Institute, SWITZERLAND Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
IICT-BAS, Linguistic Modelling Department, BULGARIA A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian
The Political Speech Corpus of Bulgarian
IIT Bombay, INDIA Experiences in Resource Generation for Machine Translation through Crowdsourcing
Re-ordering Source Sentences for SMT
Cost and Benefit of Using WordNet Senses for Sentiment Analysis
IIT - CNR, ITALY Visualizing word senses in WordNet Atlas
ILC ― CNR, ITALY Assigning Connotation Values to Events
Customizable SCF Acquisition in Italian
The Language Library: supporting community effort for collective resource production
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
In the same boat and other idiomatic seafaring expressions
Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
From medical language processing to BioNLP domain
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser
Creation of a bottom-up corpus-based ontology for Italian Linguistics
The LRE Map. Harmonising Community Descriptions of Resources
The FLaReNet Strategic Language Resource Agenda
Enriching the ISST-TANL Corpus with Semantic Frames
The META-SHARE Metadata Schema for the Description of Language Resources
ILSP - Athena RC, GREECE The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions
Standardizing a Component Metadata Infrastructure
The FLaReNet Strategic Language Resource Agenda
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
Collecting and Using Comparable Corpora for Statistical Machine Translation
The META-SHARE Metadata Schema for the Description of Language Resources
ILTEC, PORTUGAL The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
IMC AG, GERMANY An Adaptive Framework for Named Entity Combination
IMMI - CNRS, FRANCE The FLaReNet Strategic Language Resource Agenda
Indiana University, UNITED STATES A finite-state morphological transducer for Kyrgyz
Using semi-experts to derive judgments on word sense alignment: a pilot study
Annotating Errors in a Hungarian Learner Corpus
INESC-ID/L2F, PORTUGAL An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Extending a wordnet framework for simplicity and scalability
INESC-IST, PORTUGAL Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
INIST - CNRS, FRANCE Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
Annotating Football Matches: Influence of the Source Medium on Manual Annotation
The Quaero Evaluation Initiative on Term Extraction
INRIA, FRANCE Aleda, a free large-scale entity database for French
Cleaning noisy wordnets
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources
INRIA / LORIA / Université Henri Poincaré, FRANCE CoALT: A Software for Comparing Automatic Labelling Tools
INRIA-Nancy, FRANCE Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Insitute for System Analysis, Siberian State Aerospace University, RUSSIAN FEDERATION Speech and Language Resources for LVCSR of Russian
Institute for Balkan Studies of Serbian Academy of Sciences and Arts, SERBIA Multimedia database of the cultural heritage of the Balkans
Institute for Information Industry, TAIWAN, PROVINCE OF CHINA NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Institute for Research in Fundamental Sciences, IRAN, ISLAMIC REPUBLIC OF Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Institute for Specialised Communication and Multilingualism, European Academy of Bozen/Bolzano, ITALY Unsupervised acquisition of concatenative morphology
Institute for System Analysis of RAS, RUSSIAN FEDERATION Speech and Language Resources for LVCSR of Russian
Institute of Informatics and Telecommunications, NCSR ‘Demokritos’, GREECE Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Institute of Technology of Cambodia, CAMBODIA Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
Institut für Deutsche Sprache, GERMANY EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language
The New IDS Corpus Analysis Platform: Challenges and Prospects
Evaluating Query Languages for a Corpus Processing System
Institut Jožef Stefan, Department of Knowledge Technologies, SLOVENIA The goo300k corpus of historical Slovene
Irregularity Detection in Categorized Document Corpora
Instituto Internacional da Língua Portuguesa, CAPE VERDE The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN Body-conductive acoustic sensors in human-robot communication
International Institute of Information Technology, INDIA Parsing Any Domain English text to CoNLL dependencies
Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification
Interverbum Technology, SWEDEN On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Iran University of Science and Technology, IRAN, ISLAMIC REPUBLIC OF A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
IRISA ― CNRS, FRANCE Annotating Football Matches: Influence of the Source Medium on Manual Annotation
IRISA, FRANCE A French Fairy Tale Corpus syntactically and semantically annotated
IRIT ― CNRS, FRANCE A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
IRIT, FRANCE Clause-based Discourse Segmentation of Arabic Texts
ISI / USC, UNITED STATES Evaluating Machine Reading Systems through Comprehension Tests
Islamic Azad University, IRAN, ISLAMIC REPUBLIC OF Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
Istanbul Technical University, TURKEY The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish
Word Alignment for English-Turkish Language Pair
ISTI - CNR, ITALY The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
From medical language processing to BioNLP domain
IXA NLP Group, University of the Basque Country, SPAIN SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Highlighting relevant concepts from Topic Signatures
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
Mapping WordNet to the Kyoto ontology

 

J
Jawharlal Nehru University, Centre for Linguistics, INDIA Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi
Johannes Kepler University Linz, AUSTRIA EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
Johns Hopkins University, UNITED STATES Creating and Curating a Cross-Language Person-Entity Linking Collection

 

K
Katholieke Universiteit Leuven, BELGIUM Annotating Story Timelines as Temporal Dependency Structures
Large aligned treebanks for syntax-based machine translation
Katholieke Universiteit Leuven, CCL, BELGIUM Beyond SoNaR: towards the facilitation of large corpus building efforts
Example-Based Treebank Querying
Keio University, JAPAN Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English
KIIT College of Engineering, INDIA Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
KIIT University, School of Computer Engineering, INDIA A GUI to Detect and Correct Errors in Hindi Dependency Treebank
Konstanz Universität, GERMANY The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
A Rule-based Morphological Analyzer for Murrinh-Patha
A Reference Dependency Bank for Analyzing Complex Predicates
Korea University, KOREA, REPUBLIC OF ISO 24617-2: A semantically-based standard for dialogue annotation
Kyoto University, JAPAN Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Two Phase Evaluation for Selecting Machine Translation Services
Service Composition Scenarios for Task-Oriented Translation
Annotation of anaphoric relations and topic continuity in Japanese conversation

 

L
L2F/INESC-ID Lisboa, PORTUGAL Building and Exploring Semantic Equivalences Resources
Laboratoire National de métrologie et d'Essais (LNE), FRANCE Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
The REPERE Corpus : a multimodal corpus for person recognition
Lancaster University, UNITED KINGDOM Document Attrition in Web Corpora: an Exploration
Language Computer Corporation, UNITED STATES Pursing power in Arabic on-line discussion forums
Language Technologies Research Centre, IIIT, Hyderabad, INDIA A GUI to Detect and Correct Errors in Hindi Dependency Treebank
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation
LDC, UNITED STATES LDC Forced Aligner
Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
Linguistic Resources for Handwriting Recognition and Translation Evaluation
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LDC Language Resource Database: Building a Bibliographic Database
LDC / University of Pennsylvania, UNITED STATES Further Developments in Treebank Error Detection Using Derivation Trees
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Expanding Arabic Treebank to Speech: Results from Broadcast News
Leshan Normal University, CHINA CLTC: A Chinese-English Cross-lingual Topic Corpus
Lexical Computing Ltd, UNITED KINGDOM Word Sketches for Turkish
LIF - CNRS, FRANCE Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
LIG, FRANCE Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
LIMSI ― CNRS, FRANCE Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
WebAnnotator, an Annotation Tool for Web Pages
Cross-lingual studies of ASR errors: paradigms for perceptual evaluations
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Evolution of Event Designation in Media: Preliminary Study
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
A Rough Set Formalization of Quantitative Evaluation with Ambiguity
Kitten: a tool for normalizing HTML and extracting its textual content
Evaluation of Unsupervised Information Extraction
A contrastive review of paraphrase acquisition techniques
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
The LRE Map. Harmonising Community Descriptions of Resources
Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis
Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier
Designing French Tale Corpora for Entertaining Text To Speech Synthesis
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
The META-SHARE Metadata Schema for the Description of Language Resources
LIMSI ― CNRS & Université Paris-Sud, FRANCE Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French
Linguamatics LTD, UNITED KINGDOM CALBC: Releasing the Final Corpora
Linguateca/FCCN, PORTUGAL Págico: Evaluating Wikipedia-based information retrieval in Portuguese
Linguatec, GERMANY Large Scale Lexical Analysis
LinkedIn Corp., UNITED STATES Annotations for Power Relations on Email Threads
Linköping University, SWEDEN Alignment-based reordering for SMT
Eye Tracking as a Tool for Machine Translation Error Analysis
On the practice of error analysis for machine translation evaluation
Error profiling for evaluation of machine-translated text: a Polish-English case study
LIPN ― CNRS, FRANCE The Quaero Evaluation Initiative on Term Extraction
LORIA ― CNRS, FRANCE Representation of linguistic and domain knowledge for second language learning in virtual worlds
Loyola College in Maryland, UNITED STATES Creating and Curating a Cross-Language Person-Entity Linking Collection
LPP ― CNRS, FRANCE Designing French Tale Corpora for Entertaining Text To Speech Synthesis
Ludwig-Maximilians-Universität München, Bavarian Archive for Speech Signals, GERMANY Statistical Evaluation of Pronunciation Encoding
Lund University, SWEDEN SMALLWorlds -- Multilingual Content-Controlled Monologues
Constructing Large Proposition Databases
Visualizing Sentiment Analysis on a User Forum
Lymba Corporation, UNITED STATES A Tool for Extracting Conversational Implicatures
Polaris: Lymba's Semantic Parser

 

M
Macquarie University, AUSTRALIA Irish Treebanking and Parsing: A Preliminary Evaluation
The Australian National Corpus: National Infrastructure for Language Resources
A Framework for Evaluating Text Correction
Masarykova Univerzita, Brno, CZECH REPUBLIC Building a 70 billion word corpus of English from ClueWeb
A database of semantic clusters of verb usages
Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification
Legal electronic dictionary for Czech
Max Planck Institute for Evolutionary Anthropology, GERMANY Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages
The Open Linguistics Working Group
Max Planck Institute for Psycholinguistics, NETHERLANDS AVATecH ― automated annotation through audio and video analysis
ELAN development, keeping pace with communities' needs
Semantic metadata mapping in practice: the Virtual Language Observatory
Towards Automatic Gesture Stroke Detection
Federated Search: Towards a Common Search Infrastructure
Proper Language Resource Centers
Standardizing a Component Metadata Infrastructure
Citing on-line Language Resources
”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
The Language Archive ― a new hub for language resources
RELcat: a Relation Registry for ISOcat data categories
Meertens Institute, NETHERLANDS Dynamic web service deployment in a cloud environment
Meraka Institute, CSIR, SOUTH AFRICA Aspects of a Legal Framework for Language Resource Management
Mercedes-Benz R&D North America, UNITED STATES Textual Characteristics for Language Engineering
A Survey of Text Mining Architectures and the UIMA Standard
Mersin University, TURKEY Construction of the Turkish National Corpus (TNC)
MICA Institute, HUST, VIET NAM Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
MICA Institute, VIET NAM MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
Microsoft Research Asia, CHINA Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Microsoft Research India, INDIA An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Microsoft Research, UNITED STATES Statistical Section Segmentation in Free-Text Clinical Records
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
Middle East Technical University, TURKEY METU Turkish Discourse Bank Browser
MIRACL, TUNISIA Clause-based Discourse Segmentation of Arabic Texts
MIT Computer Science and Artificial Intelligence Lab, UNITED STATES Word Sense Inventories by Non-Experts.
Representing General Relational Knowledge in ConceptNet 5
A large scale annotated child language construction database
MITRE Corporation, UNITED STATES International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MODUL University Vienna, Department of New Media Technology, AUSTRIA MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
Møller-Trøndelag Kompetansesenter, NORWAY BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users

 

N
Nagoya University, JAPAN Causal analysis of task completion errors in spoken music retrieval interactions
Dictionary Look-up with Katakana Variant Recognition
Nara Institute of Science and Technology, JAPAN Causal analysis of task completion errors in spoken music retrieval interactions
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
National Institute for Japanese Language, JAPAN Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
National Institute of Informatics, JAPAN Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
Annotating Factive Verbs
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure
National Institute of Information and Communications Technology, JAPAN The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
National Institute of Standards and Technology (NIST), UNITED STATES Creating HAVIC: Heterogeneous Audio Visual Internet Collection
National Taiwan University, TAIWAN, PROVINCE OF CHINA Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition
NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
National Technical University of Athens, Department of Electrical and Computer Engineering, GREECE A hierarchical approach with feature selection for emotion recognition from speech
National University of Ireland, IRELAND Semi-Supervised Technical Term Tagging With Minimal User Feedback
Expertise Mining for Enterprise Content Management
National University of Singapore, SINGAPORE Affective Common Sense Knowledge Acquisition for Sentiment Analysis
Natural Language Processing Research Group, UNITED KINGDOM Unsupervised document zone identification using probabilistic graphical models
NCSR Demokritos Institute, GREECE Evaluation of Online Dialogue Policy Learning Techniques
The SYNC3 Collaborative Annotation Tool
Nederlandse Taalunie, NETHERLANDS Smooth Sailing for STEVIN
Neofonie GmbH, GERMANY DBpedia: A Multilingual Cross-domain Knowledge Base
NETCONOMY Software & Consulting, AUSTRIA EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
New York University, UNITED STATES Challenges in the Knowledge Base Population Slot Filling Task
NILC, BRAZIL The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
NI Systems (India) Pvt. Ltd. Bangalore, INDIA Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Nokia Research Center, CHINA Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
North-West University, SOUTH AFRICA Aspects of a Legal Framework for Language Resource Management
Norwegian University of Science and Technology, NORWAY The Nordic Dialect Corpus
NTT Cyber Space Laboratories, NTT Corporation, JAPAN Constructing a Class-Based Lexical Dictionary using Interactive Topic Models

 

O
Ohio State University, UNITED STATES Associative and Semantic Features Extracted From Web-Harvested Corpora
Open University, UNITED KINGDOM Quantising Opinions for Political Tweets Analysis
Organisations, Information and Knowledge Research Group, UNITED KINGDOM Unsupervised document zone identification using probabilistic graphical models
Osaka University, JAPAN Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types
Otto-von-Guericke-University Magdeburg, GERMANY LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Oxford Internet Institute, UNITED KINGDOM LIE: Leadership, Influence and Expertise

 

P
Paideia Inc, UNITED STATES A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
Pattern Science AG, GERMANY Textual Characteristics for Language Engineering
A Survey of Text Mining Architectures and the UIMA Standard
Polish Academy of Sciences, Institute of Computer Science, POLAND Creating a Coreference Resolution System for Polish
Towards a comprehensive open repository of Polish language resources
Towards an LFG parser for Polish: An exercise in parasitic grammar development
PoliMorf: a (not so) new open morphological dictionary for Polish
Web Service integration platform for Polish linguistic resources
The Polish Sejm Corpus
Politecnico di Milano, DEI, ITALY Evaluating Multi-focus Natural Language Queries over Data Services
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY A platform-independent user-friendly dictionary from Italian to LIS
Postech, KOREA, REPUBLIC OF Grammatical Error Annotation for Korean Learners of Spoken English
PRESTO, JST, JAPAN Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako
Princeton University, UNITED STATES The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
PUC-Rio, BRAZIL Págico: Evaluating Wikipedia-based information retrieval in Portuguese
PUCRS, BRAZIL Corpus+WordNet thesaurus generation for ontology enriching
Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study
A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever

 

Q
Qliktech, SWEDEN Visualizing Sentiment Analysis on a User Forum

 

R
RACAI, Romanian Academy Bucharest, ROMANIA ROMBAC: The Romanian Balanced Annotated Corpus
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora
Collecting and Using Comparable Corpora for Statistical Machine Translation
Radboud University Nijmegen, CLST, NETHERLANDS An Oral History Annotation Tool for INTER-VIEWs
Collecting and Analysing Chats and Tweets in SoNaR
Collection of a corpus of Dutch SMS
Radboud University Nijmegen, NETHERLANDS DutchSemCor: Targeting the ideal sense-tagged corpus
Beyond SoNaR: towards the facilitation of large corpus building efforts
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
RATP, FRANCE DECODA: a call-centre human-human spoken conversation corpus
Research Computing Center Moscow State University Vorolievy Gory, RUSSIAN FEDERATION Automatic Term Recognition Needs Multiple Evidence
Rijksuniversiteit Groningen, NETHERLANDS Developing a large semantically annotated corpus
Multi-Layer Discourse Annotation of a Dutch Text Corpus
Large aligned treebanks for syntax-based machine translation
Romanian Academy, Research Institute for Artificial Intelligence, ROMANIA Adding Morpho-semantic Relations to the Romanian Wordnet
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information
Ruhr-Universität Bochum, GERMANY The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
Rutgers University, UNITED STATES Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
RWTH Aachen University, GERMANY Arabic-Segmentation Combination Strategies for Statistical Machine Translation
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora

 

S
Saarland University, Department of Applied Linguistics, GERMANY Coreference in Spoken vs. Written Texts: a Corpus-based Analysis
Saarland University, Department of Computational Linguistics, GERMANY Using Verb Subcategorization for Word Sense Disambiguation
German and English Treebanks and Lexica for Tree-Adjoining Grammars
CLIMB grammars: three projects using metagrammar engineering
Saarland University, Spoken Language Systems, GERMANY MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
A Gold Standard for Relation Extraction in the Food Domain
Sabanci University, TURKEY SUTAV: A Turkish Audio-Visual Database
Santa Anna IT Research Institute AB & Linköping University, SWEDEN A good space: Lexical predictors in word space evaluation
This also affects the context - Errors in extraction based summaries
SAP AG, GERMANY The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Sarah M. Taylor Consulting, UNITED STATES Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
Sara, NETHERLANDS Dynamic web service deployment in a cloud environment
SDL Language Weaver, UNITED STATES An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
Senshu University, JAPAN A Search Tool for FrameNet Constructicon
Seoul National University, Center for Humanities and Information, KOREA, REPUBLIC OF Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Seoul National University, Department of Linguistics, KOREA, REPUBLIC OF Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Seoul National University, KOREA, REPUBLIC OF Dysarthric Speech Database for Development of QoLT Software Technology
Seoul National University of Education, KOREA, REPUBLIC OF Grammatical Error Annotation for Korean Learners of Spoken English
Shahid Beheshti University, IRAN, ISLAMIC REPUBLIC OF Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Shanghai Jiao Tong University, CHINA Semi-Supervised Technical Term Tagging With Minimal User Feedback
Spell Checking for Chinese
Shiraz University, IRAN, ISLAMIC REPUBLIC OF Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Siena College, Department of Computer Science, UNITED STATES Revealing Contentious Concepts Across Social Groups
Simon Fraser University, CANADA A review corpus annotated for negation, speculation and their scope
Social Media Research Group, Institute of Technology Tallaght, IRELAND Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Society for Danish Language and Literature, DENMARK Towards a richer wordnet representation of properties
Sofia University, BULGARIA NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Sonear, FRANCE DECODA: a call-centre human-human spoken conversation corpus
Sorano Ltd., DENMARK Creation and use of Language Resources in a Question-Answering eHealth System
Stanford University, UNITED STATES SUTime: A library for recognizing and normalizing time expressions
Suffix Trees as Language Models
The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
State University of Maringa, Department of Informatics, BRAZIL Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Stockholm University, Department of Linguistics, SWEDEN 3rd party observer gaze as a continuous measure of dialogue flow
Stockholm University, SWEDEN Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text
Systems Plus, Inc., UNITED STATES Creating HAVIC: Heterogeneous Audio Visual Internet Collection
SYSTRAN SA, FRANCE Automatic Translation of Scientific Documents in the HAL Archive

 

T
TAUS, NETHERLANDS On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Technical University Braunschweig, GERMANY Association Norms of German Noun Compounds
Technical University of Crete, GREECE SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks
Associative and Semantic Features Extracted From Web-Harvested Corpora
Technical University of Denmark, DENMARK Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
Technische Universität Berlin, DAI-Labor, GERMANY GerNED: A German Corpus for Named Entity Disambiguation
Technische Universität Darmstadt, GERMANY Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution
UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
Technische Universität Darmstadt, Ubiquitous Knowledge Processing Lab, GERMANY The Open Linguistics Working Group
Technische Universität Wien, AUSTRIA Applying Random Indexing to Structured Data to Find Contextually Similar Words
Telefónica Investigación y Desarrollo, SPAIN The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
TextShuttle GmbH, SWITZERLAND SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
The British University in Dubai, UNITED ARAB EMIRATES Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
The Catholic University of Korea, KOREA, REPUBLIC OF Dysarthric Speech Database for Development of QoLT Software Technology
The Communication Centre for the Deaf and Hard-of-Hearing, ICELAND BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users
The Institute of Statistical Mathematics, JAPAN Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
The Netherlands Cancer Institute, NETHERLANDS NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
Tilde, LATVIA Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Collecting and Using Comparable Corpora for Statistical Machine Translation
Latvian and Lithuanian Named Entity Recognition with TildeNER
Tokyo Institute of Technology, JAPAN The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues
Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions
Tokyo University of Foreign Studies, JAPAN Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Tokyo University of Technology, JAPAN Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Toyohashi University of Technology, JAPAN Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
Trinity College Dublin, IRELAND An audiovisual political speech analysis incorporating eye-tracking and perception data
Irish Treebanking and Parsing: A Preliminary Evaluation
The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
Tsinghua University, CHINA Affective Common Sense Knowledge Acquisition for Sentiment Analysis
CLTC: A Chinese-English Cross-lingual Topic Corpus
Tübitak-Bilgem, TURKEY Turkish Paraphrase Corpus

 

U
UCSC UARC, UNITED STATES A Corpus for Research on Deliberation and Debate
UJF-Grenoble1, FRANCE Dbnary: Wiktionary as a LMF based Multilingual RDF network
Collection of a Large Database of French-English SMT Output Corrections
UNDL Foundation, SWITZERLAND Le Petit Prince in UNL
Union College, UNITED STATES Revealing Contentious Concepts Across Social Groups
Universidad Autónoma de Madrid, Departamento de Lingüística, SPAIN Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation
Universidad Autónoma de Madrid, SPAIN Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Complutense de Madrid, SPAIN SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
MaltOptimizer: A System for MaltParser Optimization
Universidad de Alicante, Departamento Lenguajes y Sistemas Informáticos, SPAIN Massively Increasing TIMEX3 Resources: A Transduction Approach
Universidad de Alicante (UA), SPAIN TIMEN: An Open Temporal Expression Normalisation Resource
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
Universidad de Barcelona (UB), SPAIN VERTa: Linguistic features in MT evaluation
Universidad de Granada, SPAIN Linguistic knowledge for specialized text production
Medical Term Extraction in an Arabic Medical Corpus
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad de Huelva, SPAIN A review corpus annotated for negation, speculation and their scope
Universidad del País Vasco, SPAIN BUCEADOR, a multi-language search engine for digital libraries
Universidad de Santiago de Compostela, SPAIN A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
Universidad de Vigo, SPAIN BUCEADOR, a multi-language search engine for digital libraries
Universidad de Zaragoza, GTC, I3A, SPAIN Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Universidade Aberta, PORTUGAL Building and Exploring Semantic Equivalences Resources
Universidade de Lisboa, PORTUGAL TimeBankPT: A TimeML Annotated Corpus of Portuguese
A PropBank for Portuguese: the CINTIL-PropBank
Rembrandt - a named-entity recognition framework
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese
Universidade de São Paulo, BRAZIL Propbank-Br: a Brazilian Treebank annotated with semantic role labels
Universidade de São Paulo, Center of Computational Linguistics (NILC), BRAZIL Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Universidade de São Paulo, Department of Informatics, BRAZIL Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Universidade de São Paulo, School of Arts, Sciences and Humanities, BRAZIL Corpus-based Referring Expressions Generation
Portuguese Text Generation from Large Corpora
Universidade de Vigo, SPAIN A Galician Syntactic Corpus with Application to Intonation Modeling
Universidade do Minho, PORTUGAL Structural alignment of plain text books
Universidade Federal de Minas Gerais, BRAZIL The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
Universidade Técnica de Lisboa, Instituto Superior Técnico, PORTUGAL An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Dealing with unknown words in statistical machine translation
Universidad Nacional Autónoma de México, Instituto de Ingeniería, MEXICO Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Universidad Nacional de Córdoba, FaMAF, ARGENTINA Evaluating and improving syntactic lexica by plugging them within a parser
Universidad Nacional de Córdoba, Medina Allende s/n, Ciudad Universitaria, FaMAF, ARGENTINA Extending the adverbial coverage of a French morphological lexicon
Universidad Nacional de Educación a Distancia (UNED), Madrid, NLP&IR group, SPAIN Evaluating Machine Reading Systems through Comprehension Tests
Universidad Politécnica de Cataluña, SPAIN Building Synthetic Voices in the META-NET Framework
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output
FreeLing 3.0: Towards Wider Multilinguality
BUCEADOR, a multi-language search engine for digital libraries
Universidad Politécnica de Madrid, E.T.S.I. de Telecomunicacion, SPAIN Investigating Verbal Intelligence Using the TF-IDF Approach
Universidad Politécnica de Madrid, Ontology Engineering Group, SPAIN Collaborative semantic editing of linked data lexica
Universidad Politécnica de Valencia, SPAIN Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Universidad Pompeu Fabra, SPAIN Iula2Standoff: a tool for creating standoff documents for the IULACT
The IULA Treebank
Towards a User-Friendly Platform for Building Language Resources based on Web Services
Spell Checking in Spanish: The Case of Diacritic Accents
The FLaReNet Strategic Language Resource Agenda
Università Cattolica del Sacro Cuore, Milan, ITALY First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin
Università degli Studi di Siena, ITALY The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
Università degli Studi di Verona, Department of Foreign Languages and Literatures, ITALY A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Università degli Studi Roma Tre, Department of Linguistics, ITALY A topologic view of Topic and Focus marking in Italian
Università degli Studi Roma Tre, Dipartimento di Scienze dell’Educazione, ITALY Pedagogical stances and their multimodal signals.
Università di Bologna, Department of Computer Science, ITALY AnIta: a powerful morphological analyser for Italian
Università di Bologna, Department of Linguistics and Oriental Studies, ITALY AnIta: a powerful morphological analyser for Italian
A topologic view of Topic and Focus marking in Italian
Università di Firenze, Department of Italian Studies, ITALY A topologic view of Topic and Focus marking in Italian
Università di Firenze, ITALY The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
Università di Macerata, ITALY A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Padova, Department of Information Engineering, ITALY A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Università di Pavia, ITALY Annotating Qualia Relations in Italian and French Complex Nominals
Università di Pisa, Dipartimento di Linguistica, ITALY LexIt: A Computational Resource on Italian Argument Structure
Università di Pisa, ITALY Creation of a bottom-up corpus-based ontology for Italian Linguistics
Enriching the ISST-TANL Corpus with Semantic Frames
Università di Roma La Sapienza, Dipartimenot di Comunicazione e Ricerca Sociale, ITALY Pedagogical stances and their multimodal signals.
Università di Roma La Sapienza, ITALY A New Method for Evaluating Automatically Learned Terminological Taxonomies
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Università di Roma Tor Vergata, ART Research Group, ITALY PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases
Università di Roma Tor Vergata, ITALY Intelligibility assessment in forensic applications
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
Università di Torino, Dipartimento di Informatica, ITALY The Parallel-TUT: a multilingual and multiformat treebank
A treebank-based study on the influence of Italian word order on parsing performance
Università di Torino, ITALY NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
Università di Trento, Dipartimento Ingegneria e Scienza dell'Informazione, ITALY Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
A Parallel Corpus of Music and Lyrics Annotated with Emotions
Domain-specific vs. Uniform Modeling for Coreference Resolution
Università di Trento, ITALY DeCour: a corpus of DEceptive statements in Italian COURts
Università di Verona, ITALY Annotating Qualia Relations in Italian and French Complex Nominals
Università Federico II di Napoli, Dipartimento di Scienze Fisiche, ITALY A treebank-based study on the influence of Italian word order on parsing performance
Università Federico II di Napoli, ITALY W-PhAMT: A web tool for phonetic multilevel timeline visualization
Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication
Università San Raffaele di Roma, ITALY Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
Universität Augsburg, Institut für Informatik, GERMANY Using DiAML and ANVIL for multimodal dialogue annotations
Universitat Autònoma de Barcelona (UAB), SPAIN Using Language Resources in Humanities research
Universität Bielefeld, Applied Informatics Group, GERMANY PAMOCAT: Automatic retrieval of specified postures
Universität Bielefeld, Artificial Intelligence Group, GERMANY MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Universität Bielefeld, CITEC Central Lab, GERMANY PAMOCAT: Automatic retrieval of specified postures
Collaborative semantic editing of linked data lexica
Universität Bielefeld, GERMANY Suffix Trees as Language Models
ULex: new data models and a mobile environment for corpus enrichment.
Universität Bremen, GERMANY A Speech and Gesture Spatial Corpus in Assisted Living
An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use
Universitat d’Alacant, DLSI, SPAIN Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Universitat d'Alacant (UA), SPAIN A finite-state morphological transducer for Kyrgyz
Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation
Universitat de Barcelona (UB), SPAIN The IULA Treebank
Universität des Saarlandes, GERMANY Suffix Trees as Language Models
A methodology for the extraction of information about the usage of formulaic expressions in scientific texts
Universitatea Alexandru Ioan Cuza, ROMANIA Evaluating Machine Reading Systems through Comprehension Tests
Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
Universitatea din Bucuresti, ROMANIA The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System
Universität Erlangen-Nuremberg, GERMANY Efficient Dependency Graph Matching with the IMS Open Corpus Workbench
Universität Freiburg, GERMANY Creation and use of Language Resources in a Question-Answering eHealth System
Universität Hamburg, GERMANY Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation
Universität Heidelberg, GERMANY Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards
Universität Heidelberg, ICL, GERMANY French and German Corpora for Audience-based Text Type Classification
Universität Hildesheim, GERMANY MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content
Yes we can!? Annotating English modal verbs
A Tool/Database Interface for Multi-Level Analyses
Universität Innsbruck, AUSTRIA RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
Universitat Jaume I, SPAIN Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Universität Jena, Julie Lab, GERMANY CALBC: Releasing the Final Corpora
Universität Karlsruhe, GERMANY The KIT Lecture Corpus for Speech Translation
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Universität Leipzig, Department of Computer Science, Natural Language Processing Group, GERMANY MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
The Open Linguistics Working Group
Universität Leipzig, GERMANY Learning Categories and their Instances by Contextual Features
Textual Characteristics for Language Engineering
Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages
Universität München, Research Unit “Quantitative Language Comparison”, GERMANY The Open Linguistics Working Group
Universitat Politècnica de Catalunya, TALP center, SPAIN Highlighting relevant concepts from Topic Signatures
Summarizing a multimodal set of documents in a Smart Room
Universitat Politècnica de Valencia, SPAIN The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Universitat Pompeu Fabra, Instituto Universitario de Lingüística Aplicada, SPAIN The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
NeoTag: a POS Tagger for Grammatical Neologism Detection
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Universitat Pompeu Fabra, SPAIN A Classification of Adjectives for Polarity Lexicons Enhancement
A voting scheme to detect semantic underspecification
A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
Automatic lexical semantic classification of nouns
The IULA Treebank
The CONCISUS Corpus of Event Summaries
Using Language Resources in Humanities research
Text Simplification Tools for Spanish
Universität Potsdam, GERMANY Yes we can!? Annotating English modal verbs
Fine-grained German Sentiment Analysis on Social Media
Universität Saarland, Computational Linguistics Department, GERMANY The Open Linguistics Working Group
Universität Saarland, Department of Computational Linguistics and Phonetics, GERMANY Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Universität Saarland, GERMANY Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach
Evaluating Machine Reading Systems through Comprehension Tests
Universität Saarland, Spoken Language Systems, GERMANY Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Universität Stuttgart, GERMANY The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data
Association Norms of German Noun Compounds
Analyzing and Aligning German compound nouns
A Tool/Database Interface for Multi-Level Analyses
Universität Stuttgart, IMS, GERMANY Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank
Automatic classification of German """"an"""" particle verbs
Making Ellipses Explicit in Dependency Conversion for a German Treebank
Logical metonymies and qualia structures: an annotated database of logical metonymies for German
A Corpus-based Study of the German Recipient Passive
Irish Treebanking and Parsing: A Preliminary Evaluation
French and German Corpora for Audience-based Text Type Classification
Adapting and evaluating a generic term extraction tool
Universität Tübingen, GERMANY Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC
A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Standardizing a Component Metadata Infrastructure
Large aligned treebanks for syntax-based machine translation
Universität Tübingen - SFS, GERMANY A Comparative Evaluation of Word Sense Disambiguation Algorithms for German
Universität Ulm, GERMANY A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System
Universität Ulm, Institute of Communications Engineering, GERMANY Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues
Using multimodal resources for explanation approaches in intelligent systems
Investigating Verbal Intelligence Using the TF-IDF Approach
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores
Speech and Language Resources for LVCSR of Russian
Universität Ulm, Institute of Neural Information Processing, GERMANY An audiovisual political speech analysis incorporating eye-tracking and perception data
Universität Wien, Zentrum für Translationswissenschaft, AUSTRIA Knowledge-Rich Context Extraction and Ranking with KnowPipe
Universität Zürich, SWITZERLAND German Verb Patterns and Their Implementation in an Electronic Dictionary
Automatic MT Error Analysis: Hjerson Helping Addicter
Terra: a Collection of Translation Error-Annotated Corpora
Université Aix-Marseille, FRANCE Annotated Bibliographical Reference Corpora in Digital Humanities
Université Aix-Marseille, Laboratoire Parole et Langage, FRANCE SPPAS: a tool for the phonetic segmentation of speech
Orthographic Transcription: which enrichment is required for phonetization?
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Université Aix-Marseille, LIF-CNRS, FRANCE Syntactic annotation of spontaneous speech: application to call-center conversation data
DECODA: a call-centre human-human spoken conversation corpus
Université catholique de Louvain, BELGIUM Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Université catholique de Louvain, CENTAL, Institut Langage et Communication, BELGIUM Extraction of unmarked quotations in Newspapers
Université d'Avignon, CERI/LIA, FRANCE Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université d'Avignon, FRANCE DECODA: a call-centre human-human spoken conversation corpus
Université d'Avignon, LIA, FRANCE Annotated Bibliographical Reference Corpora in Digital Humanities
Université de Genève, Département de Linguistique, SWITZERLAND Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction
Université de Genève, FTI/TIM, SWITZERLAND Acquisition of Syntactic Simplification Rules for French
Evaluating Appropriateness Of System Responses In A Spoken CALL Game
Université de Genève, SWITZERLAND Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification
Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
Université de Genève, TIM/ISSCO, SWITZERLAND A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
Annotating Qualia Relations in Italian and French Complex Nominals
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System
Université de Montréal, CANADA Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology
Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, OLST, CANADA Logic Based Methods for Terminological Assessment
Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Université de Nancy, LORIA, FRANCE Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Representation of linguistic and domain knowledge for second language learning in virtual worlds
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université de Nantes, LINA, FRANCE Tackling interoperability issues within UIMA workflows
Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
Université de Nice Sophia-Antipolis, équipe RL, Laboratoire I3S, FRANCE Unsupervised acquisition of concatenative morphology
Université de Provence, FRANCE An ontological approach to model and query multimodal concurrent linguistic annotations
Université de Rennes 1, IRISA, FRANCE Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora
Towards Fully Automatic Annotation of Audio Books for TTS
Université de Savoie, FRANCE Ontoterminology: How to unify terminology and ontology into a single paradigm
Université de Strasbourg, LiLPA, FRANCE French and German Corpora for Audience-based Text Type Classification
Université de Toulouse, CLLE, FRANCE An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Université de Toulouse, IRIT, FRANCE An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Université du Maine, LIUM, FRANCE TED-LIUM: an Automatic Speech Recognition dedicated corpus
Automatic Translation of Scientific Documents in the HAL Archive
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université du Sud Toulon Var, FRANCE An ontological approach to model and query multimodal concurrent linguistic annotations
Universiteit Antwerp, BELGIUM From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Irregularity Detection in Categorized Document Corpora
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
Universiteit Antwerp, CliPS, BELGIUM ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories
Universiteit Antwerpen, BELGIUM “Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.
Universiteit Gent, BELGIUM From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Universiteit Twente, NETHERLANDS Evaluating automatic cross-domain Dutch semantic role annotation
Creating a Data Collection for Evaluating Rich Speech Retrieval
Universiteit Utrecht, NETHERLANDS The FLaReNet Strategic Language Resource Agenda
Universiteit Utrecht, UiL-OTS, NETHERLANDS Recent Developments in CLARIN-NL
Evaluating automatic cross-domain Dutch semantic role annotation
Universiteit van Amsterdam, NETHERLANDS Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
Universiteit van Tilburg, NETHERLANDS Using DiAML and ANVIL for multimodal dialogue annotations
The coding and annotation of multimodal dialogue acts
DutchSemCor: Targeting the ideal sense-tagged corpus
ISO 24617-2: A semantically-based standard for dialogue annotation
Universiteit van Tilburg, TiCC, NETHERLANDS Beyond SoNaR: towards the facilitation of large corpus building efforts
Université Montpellier 2, LIRMM, FRANCE A new dynamic approach for lexical networks evaluation
Université Paris 13, LDI, FRANCE Building a database of French frozen adverbial phrases
Université Paris 3 - sorbonne Nouvelle, LaTTiCe, CNRS, FRANCE Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
Université Paris 7, FRANCE A Phonemic Corpus of Polish Child-Directed Speech
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
Semantic annotation of French corpora: animacy and verb semantic classes
Université Paris 8 Saint Denis, FRANCE Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Université Paris-Est, FRANCE Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
Université Paris-Est, LIGM, FRANCE A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
Extending the adverbial coverage of a French morphological lexicon
Université Paris Ouest Nanterre La Défense, MoDyCo-CNRS, FRANCE Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Université Paul Sabatier, IRIT, FRANCE Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
Universitetet i Bergen, Department of Linguistic, NORWAY A Morphological Analyzer For Wolof Using Finite-State Techniques
Universitetet i Bergen, NORWAY Design and compilation of a specialized Spanish-German parallel corpus
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Universitetet i Oslo, Institutt for lingvistiske og nordiske studier, NORWAY The Nordic Dialect Corpus
Universitetet i Oslo, NORWAY Págico: Evaluating Wikipedia-based information retrieval in Portuguese
The Nordic Dialect Corpus
The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Université Toulouse II, U.R.I Octogone-Lordat, FRANCE MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.
University at Albany, SUNY, UNITED STATES Revealing Contentious Concepts Across Social Groups
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University College Dublin, IRELAND English to Indonesian Transliteration to Support English Pronunciation Practice
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College London, UNITED KINGDOM A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
University of Amsterdam, ACLC, NETHERLANDS NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
University of Applied Sciences Chur; Faculty of Information Science, SWITZERLAND Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
University of Applied Sciences Vern, Zagreb, CROATIA Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Aston, UNITED KINGDOM Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
University of Athens, GREECE Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
University of Belgrade, Faculty of Mathematics, SERBIA Multimedia database of the cultural heritage of the Balkans
University of Belgrade, SERBIA A tool for enhanced search of multilingual digital libraries of e-journals
University of Birmingham, School of Computer Science, UNITED KINGDOM Building Text-to-Speech Systems for Resource Poor Languages
University of Boston, UNITED STATES Detecting Reduplication in Videos of American Sign Language
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
University of Brighton, UNITED KINGDOM A Repository of Data and Evaluation Resources for Natural Language Generation
LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments
University of California, Santa Cruz, UNITED STATES An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
University of Cambridge, UNITED KINGDOM Reclassifying subcategorization frames for experimental analysis and stimulus generation
Annotating progressive aspect constructions in the spoken section of the British National Corpus
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
University of Colorado at Boulder, UNITED STATES Empty Argument Insertion in the Hindi PropBank
University of Colorado, UNITED STATES Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
Annotating Story Timelines as Temporal Dependency Structures
University of Copenhagen, CST, DENMARK Automatic annotation of head velocity and acceleration in Anvil
DSim, a Danish Parallel Corpus for Text Simplification
Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, DENMARK A voting scheme to detect semantic underspecification
Towards a richer wordnet representation of properties
A Distributed Resource Repository for Cloud-Based Machine Translation
Creation and use of Language Resources in a Question-Answering eHealth System
Multimodal Behaviour and Feedback in Different Types of Interaction
Feedback in Nordic First-Encounters: a Comparative Study
Polish Multimodal Corpus ― a collection of referential gestures
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Düsseldorf, Institut für Sprache und Information, GERMANY German and English Treebanks and Lexica for Tree-Adjoining Grammars
University of Edinburgh, UNITED KINGDOM The BladeMistress Corpus: From Talk to Action in Virtual Worlds
A Database of Attribution Relations
University of Essex, UNITED KINGDOM DeCour: a corpus of DEceptive statements in Italian COURts
Assessing Crowdsourcing Quality through Objective Tasks
Applying Random Indexing to Structured Data to Find Contextually Similar Words
Domain-specific vs. Uniform Modeling for Coreference Resolution
University of Gothenburg, Sprakbanken, SWEDEN Korp ― the corpus infrastructure of Spräkbanken
The open lexical infrastructure of Spräkbanken
University of Groningen, Institute of Artificial Intelligence, NETHERLANDS Pragmatic identification of the witness sets
University of Helsinki, FINLAND Boosting statistical tagger accuracy with simple rule-based grammars
Constructive Interaction for Talking about Interesting Topics
Representing the Translation Relation in a Bilingual Wordnet
Multimodal Corpus of Multi-party Conversations in Second Language
Rule-Based Detection of Clausal Coordinate Ellipsis
Improving corpus annotation productivity: a method and experiment with interactive tagging
Feedback in Nordic First-Encounters: a Comparative Study
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
University of Iceland, ICELAND The Icelandic Parsed Historical Corpus (IcePaHC)
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Illinois at Urbana-Champaign, UNITED STATES YADAC: Yet another Dialectal Arabic Corpus
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
University of Jyväskylä, FINLAND Comparing computer vision analysis of signed language video with motion capture recordings
University of Latvia, LATVIA An implementation of a Latvian resource grammar in Grammatical Framework
University of Leeds, UNITED KINGDOM QurAna: Corpus of the Quran annotated with Pronominal Anaphora
QurSim: A corpus for evaluation of relatedness in short texts
Predicting Phrase Breaks in Classical and Modern Standard Arabic Text
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis
Development and Application of a Cross-language Document Comparability Metric
Identifying Word Translations from Comparable Documents Without a Seed Lexicon
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Leipzig, Natural Language Processing Group, GERMANY The Influence of Corpus Quality on Statistical Measurements on Language Resources
University of Limerick, IRELAND Evaluating Machine Reading Systems through Comprehension Tests
University of Ljubljana, Faculty of Arts, SLOVENIA Addressing polysemy in bilingual lexicon extraction from comparable corpora
Irregularity Detection in Categorized Document Corpora
University of Ljubljana, SLOVENIA Cleaning noisy wordnets
University of Łódz, POLAND Towards a comprehensive open repository of Polish language resources
University of Luxembourg, LUXEMBOURG NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
University of Malta, MALTA Incorporating an Error Corpus into a Spellchecker for Maltese
Multimodal Behaviour and Feedback in Different Types of Interaction
A Repository of Data and Evaluation Resources for Natural Language Generation
University of Manchester, UNITED KINGDOM A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Identification of Manner in Bio-Events
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench
GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
University of Maribor, SLOVENIA SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
University of Maryland, UNITED STATES Typing Race Games as a Method to Create Spelling Error Corpora
Creating and Curating a Cross-Language Person-Entity Linking Collection
Linguistic Resources for Handwriting Recognition and Translation Evaluation
University of Minho, LIAAD-INESC, PORTUGAL The Minho Quotation Resource
University of Newcastle, AUSTRALIA The Icelandic Parsed Historical Corpus (IcePaHC)
University of North Texas, UNITED STATES Learning Sentiment Lexicons in Spanish
Unsupervised Word Sense Disambiguation with Multilingual Representations
A Parallel Corpus of Music and Lyrics Annotated with Emotions
University of Osnabrück, Institute for Cognitive Science, GERMANY LexIt: A Computational Resource on Italian Argument Structure
University of Pennsylvania, IRCS, UNITED STATES Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
University of Pennsylvania, UNITED STATES Experiences in Resource Generation for Machine Translation through Crowdsourcing
The Icelandic Parsed Historical Corpus (IcePaHC)
Automatic word alignment tools to scale production of manually aligned parallel texts
A corpus of general and specific sentences from news
University of Potsdam, Applied Computational Linguistics Lab, POLAND SemScribe: Natural Language Generation for Medical Reports
University of Rijeka, Department of Informatics, CROATIA BLEU Evaluation of Machine-Translated English-Croatian Legislation
University of Rochester, UNITED STATES An Annotation Scheme for Quantifier Scope Disambiguation
University of Sheffield, DCS, UNITED KINGDOM Mapping WordNet synsets to Wikipedia articles
University of Sheffield, UNITED KINGDOM Matching Cultural Heritage items to Wikipedia
Assessing the Comparability of News Texts
Large Scale Semantic Annotation, Indexing and Search at The National Archives
TIMEN: An Open Temporal Expression Normalisation Resource
LIE: Leadership, Influence and Expertise
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles
Massively Increasing TIMEX3 Resources: A Transduction Approach
A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
Assessing Crowdsourcing Quality through Objective Tasks
A light way to collect comparable corpora from the Web
Applying Random Indexing to Structured Data to Find Contextually Similar Words
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Southern California, Information Sciences Institute, UNITED STATES Ontologies of Linguistic Annotation: Survey and perspectives
The Open Linguistics Working Group
A generic formalism to represent linguistic corpora in RDF and OWL/DL
University of Southern California, Institute for Creative Technologies, UNITED STATES An audiovisual political speech analysis incorporating eye-tracking and perception data
The BladeMistress Corpus: From Talk to Action in Virtual Worlds
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
The Twins Corpus of Museum Visitor Questions
University of Southern California, UNITED STATES The Twins Corpus of Museum Visitor Questions
University of Southern Denmark, DENMARK The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
University of Stirling, UNITED KINGDOM Affective Common Sense Knowledge Acquisition for Sentiment Analysis
University of Sussex, UNITED KINGDOM Corpus Annotation as a Scientific Task
Linguistic Analysis Processing Line for Bulgarian
University of Szeged, Department of Hungarian Linguistics, HUNGARY HunOr: A Hungarian―Russian Parallel Corpus
University of Szeged, Department of Informatics, HUNGARY HunOr: A Hungarian―Russian Parallel Corpus
University of Tartu, ESTONIA Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data
Constructive Interaction for Talking about Interesting Topics
University of Tartu, Institute of Computer Science, ESTONIA Robust clause boundary identification for corpus annotation
University of Texas at Arlington, UNITED STATES Evaluation of Online Dialogue Policy Learning Techniques
University of Texas at Dallas, Human Language Technology Research Institute, UNITED STATES EmpaTweet: Annotating and Detecting Emotions on Twitter
University of Texas at Dallas, UNITED STATES Tajik-Farsi Persian Transliteration Using Statistical Machine Translation
Annotating Spatial Containment Relations Between Events
University of Thessaly, Department of Computer and Communication Engineering, GREECE A hierarchical approach with feature selection for emotion recognition from speech
University of Toronto, CANADA Measuring Interlanguage: Native Language Identification with L1-influence Metrics
Detecting Reduplication in Videos of American Sign Language
University of Tsukuba, JAPAN Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Turin, Department of Computer Science, ITALY Pragmatic identification of the witness sets
University of Washington, Biomedical and Health Informatics, UNITED STATES Statistical Section Segmentation in Free-Text Clinical Records
University of Washington, Department of Linguistics, UNITED STATES Statistical Section Segmentation in Free-Text Clinical Records
University of Washington, UNITED STATES Effort of Genre Variation and Prediction of System Performance
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation
University of Wisconsin-Milwaukee, HIA, UNITED STATES Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
University of Wolverhampton, Research Institute in Information and Language Processing, UNITED KINGDOM CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations
University of Wolverhampton, UNITED KINGDOM Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach
A review corpus annotated for negation, speculation and their scope
Annotating Near-Identity from Coreference Disagreements
PET: a Tool for Post-editing and Assessing Machine Translation
University of Yamanashi, JAPAN Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Zagreb, CROATIA Croatian Dependency Treebank: Recent Development and Initial Experiments
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA Generation of Verbal Stems in Derivationally Rich Language
BLEU Evaluation of Machine-Translated English-Croatian Legislation
Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Zurich, Institute of Computational Linguistics, SWITZERLAND Dependency parsing for interaction detection in pharmacogenomics
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank
UPMF-Grenoble2, FRANCE Collection of a Large Database of French-English SMT Output Corrections
Uppsala University, SWEDEN A Basic Language Resource Kit for Persian
A Distributed Resource Repository for Cloud-Based Machine Translation
Parallel Data, Tools and Interfaces in OPUS
MaltOptimizer: A System for MaltParser Optimization
Large aligned treebanks for syntax-based machine translation
US Department of Defense, UNITED STATES Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?
UTEM, CHILE A proposal for improving WordNet Domains

 

V
Vassar College, UNITED STATES The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
Veterans Institute, centre for knowledge and expertise, NETHERLANDS An Oral History Annotation Tool for INTER-VIEWs
Vrije Universiteit Amsterdam, Faculteit der Letteren, NETHERLANDS Mapping WordNet to the Kyoto ontology
Vrije Universiteit Amsterdam, NETHERLANDS Building a fine-grained subjectivity lexicon from a web corpus
DutchSemCor: Targeting the ideal sense-tagged corpus

 

W
W4 S.A., FRANCE The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Wake Forest University, UNITED STATES Boosting statistical tagger accuracy with simple rule-based grammars
Waseda University, JAPAN Method for Collection of Acted Speech Using Various Situation Scripts
Wonkwang University, KOREA, REPUBLIC OF Dysarthric Speech Database for Development of QoLT Software Technology
Wrocław University of Technology, Institute of Applied Informatics, POLAND Inforex -- a web-based tool for text corpus management and semantic annotation
Wrocław University of Technology, POLAND Constraint Based Description of Polish Multiword Expressions
Tools for plWordNet Development. Presentation and Perspectives
Recognition of Polish Derivational Relations Based on Supervised Learning Scheme
KPWr: Towards a Free Corpus of Polish

 

X
Xerox India Research Centre, INDIA Experiences in Resource Generation for Machine Translation through Crowdsourcing
Xerox Research Centre Europe, FRANCE Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
XTM International, UNITED KINGDOM On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market

 

Y
Yamagata University, JAPAN Collecting humorous expressions from a community-based question-answering-service corpus
Yaşar University, TURKEY Construction of the Turkish National Corpus (TNC)

 

Z
Zemanta, UNITED STATES Collecting and Using Comparable Corpora for Statistical Machine Translation

Powered by ELDA © 2012 ELDA/ELRA