LREC 2012 Proceedings

AFFILIATIONS: Browse articles of the conference sorted by institution

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z

A
Aalto University School of Science, FINLAND	Comparing computer vision analysis of signed language video with motion capture recordings
Adam Mickiewicz University, Faculty of Mathematics and Computer Science, Department of Computer Linguistics and Artificial Intelligence, POLAND	Wordnet Based Lexicon Grammar for Polish
Adam Mickiewicz University, Poznan, Institute of Linguistics, POLAND	Developing and evaluating an emergency scenario dialogue corpus
Advocate, Member of the Johannesburg Bar, SOUTH AFRICA	Aspects of a Legal Framework for Language Resource Management
Agence France-Presse, FRANCE	Aleda, a free large-scale entity database for French
Alexandru Philippide Institute of Philology, Romanian Academy, the Iași, ROMANIA	Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
Allameh Tabatabaiee University, Linguistic Department, IRAN, ISLAMIC REPUBLIC OF	Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Alpage, INRIA Paris-Rocquencourt & Université Paris 7, FRANCE	Applying cross-lingual WSD to wordnet development Evaluating and improving syntactic lexica by plugging them within a parser Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
Amirkabir University of Technology, IRAN, ISLAMIC REPUBLIC OF	A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
Applied Language Solutions Ltd., UNITED KINGDOM	SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Arabic Textware, JORDAN	Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Association Francophone de la Communication Parlée, FRANCE	The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Athens Technology Center, GREECE	Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
ATILF Nancy, FRANCE	Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary
AT&T Labs ― Research, UNITED STATES	Building Text-To-Speech Voices in the Cloud
Austrian Academy of Sciences, AUSTRIA	Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies.
Austrian Research Institute for Artificial Intelligence, AUSTRIA	Applying Random Indexing to Structured Data to Find Contextually Similar Words
Autónoma University Madrid, Computational Linguistics Laboratory, SPAIN	Medical Term Extraction in an Arabic Medical Corpus

B
BAE Systems, UNITED STATES	Identifying Nuggets of Information in GALE Distillation Evaluation
Barcelona Media Research Center, SPAIN	Holaaa!! writin like u talk is kewl but kinda hard 4 NLP BUCEADOR, a multi-language search engine for digital libraries
Barcelona Media, SPAIN	A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
BME MOKK, HUNGARY	Rapid creation of large-scale corpora and frequency dictionaries
Brandeis University, Computer Science Department, UNITED STATES	Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Brandeis University, UNITED STATES	Word Sense Inventories by Non-Experts. ATLIS: Identifying Locational Information in Text Automatically The Role of Model Testing in Standards Development: The Case of ISO-Space Annotating dropped pronouns in Chinese newswire text The TARSQI Toolkit
Brigham Young University, UNITED STATES	Item Development and Scoring for Japanese Oral Proficiency Testing First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Bulgarian Academy of Sciences, BULGARIA	Bulgarian X-language Parallel Corpus Expanding Parallel Resources for Medium-Density Languages for Free Linguistic Analysis Processing Line for Bulgarian

C
Cairo University, Faculty of Arts, EGYPT	Medical Term Extraction in an Arabic Medical Corpus
Campus de Espinardo, SPAIN	A New Method for Evaluating Automatically Learned Terminological Taxonomies
Carnegie Mellon University, Language Technologies Institute, UNITED STATES	Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
Carnegie Mellon University, UNITED STATES	A Universal Part-of-Speech Tagset Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic Morphosyntactic Analysis of the CHILDES and TalkBank Corpora Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource
CASSIDIAN, FRANCE	New language resources for the Pashto language
CEA - LIST, FRANCE	Evaluation of Unsupervised Information Extraction Evaluation of a Complex Information Extraction Application in Specific Domain Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
CEHUM/UM, PORTUGAL	Págico: Evaluating Wikipedia-based information retrieval in Portuguese
CELCT, ITALY	The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation CAT: the CELCT Annotation Tool Evaluating Machine Reading Systems through Comprehension Tests Chinese Whispers: Cooperative Paraphrase Acquisition
CELI, ITALY	Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
Centro de Estudios Lingüísticos y Literarios, El Colegio de México, MEXICO	Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Centro de Linguística da Universidade de Lisboa, PORTUGAL	Project FLY: a multidisciplinary project within Linguistics Modality in Text: a Proposal for Corpus Annotation
Centro de Linguística da Universidade Nova de Lisboa, PORTUGAL	An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT Dealing with unknown words in statistical machine translation
Centro Ramon Pineiro para a Investigacion en Humanidades. Xunta de Galicia, SPAIN	A Galician Syntactic Corpus with Application to Intonation Modeling
Chalmers University, SWEDEN	An Open Source Persian Computational Grammar
Charles University in Prague, CZECH REPUBLIC	A database of semantic clusters of verb usages Alternative Lexicalizations of Discourse Connectives in Czech HamleDT: To Parse or Not to Parse? Terra: a Collection of Translation Error-Annotated Corpora Announcing Prague Czech-English Dependency Treebank 2.0 The Joy of Parallelism with CzEng 1.0 Getting more data -- Schoolkids as annotators Building a learner corpus
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC	Building a multilingual parallel corpus for human users
Charles University in Prague, Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, CZECH REPUBLIC	Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component Language Richness of the Web A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation Prague Dependency Style Treebank for Tamil Arabic Word Generation and Modelling for Spell Checking The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Charles University in Prague, Institute of the Czech National Corpus, CZECH REPUBLIC	Balanced data repository of spontaneous spoken Czech
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC	A High-Quality Web Corpus of Czech Automatic MT Error Analysis: Hjerson Helping Addicter IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus
Chiba Institute of Technology, JAPAN	Method for Collection of Acted Speech Using Various Situation Scripts
Chiba University, JAPAN	Annotation of response tokens and their triggering expressions in Japanese multi-party conversations Annotation of anaphoric relations and topic continuity in Japanese conversation UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
Chinese University of Hong Kong, HONG KONG	Quantising Opinions for Political Tweets Analysis
City University of New York, UNITED STATES	Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
CLEO, Centre for Open Electronic Publishing, FRANCE	Annotated Bibliographical Reference Corpora in Digital Humanities
CLiC-Universitat de Barcelona, SPAIN	Annotating Near-Identity from Coreference Disagreements
CLUL, PORTUGAL	Introducing the Reference Corpus of Contemporary Portuguese Online
CNAM-CRTD, FRANCE	Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
CNR, ITALY	Towards a User-Friendly Platform for Building Language Resources based on Web Services META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
CNRS, FRANCE	DISLOG: A logic-based language for processing discourse structures ANALEC: a New Tool for the Dynamic Annotation of Textual Data
Columbia University, UNITED STATES	Annotations for Power Relations on Email Threads The Dependency-Parsed FrameNet Corpus AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis Annotating Agreement and Disagreement in Threaded Discussion Conventional Orthography for Dialectal Arabic The MASC Word Sense Corpus Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations DramaBank: Annotating Agency in Narrative Discourse Empirical Comparisons of MASC Word Sense Annotations
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES	Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
Computer Research Center of Islamic Sciences, IRAN, ISLAMIC REPUBLIC OF	A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Copenhagen Business School, DENMARK	Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies SMALLWorlds -- Multilingual Content-Controlled Monologues Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research

D
DANS, Data Archiving & Networked Services, NETHERLANDS	An Oral History Annotation Tool for INTER-VIEWs
De La Salle University, Center for Empathic Human-Computer Interactions, PHILIPPINES	Building a Multimodal Laughter Database for Emotion Recognition
Delft University of Technology, NETHERLANDS	The BladeMistress Corpus: From Talk to Action in Virtual Worlds Creating a Data Collection for Evaluating Rich Speech Retrieval
Deluxe Digital Studios, UNITED KINGDOM	SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH, SWEDEN	3rd party observer gaze as a continuous measure of dialogue flow
Deutsche Telekom Labs, GERMANY	“You Seem Aggressive!” Monitoring Anger in a Practical Application Fast Labeling and Transcription with the Speechalyzer Toolkit
DFKI GmbH, GERMANY	Constructing a Question Corpus for Textual Semantic Relations Using Verb Subcategorization for Word Sense Disambiguation Involving Language Professionals in the Evaluation of Machine Translation Automatic MT Error Analysis: Hjerson Helping Addicter Joint Grammar and Treebank Development for Mandarin Chinese with HPSG An Adaptive Framework for Named Entity Combination A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation Terra: a Collection of Translation Error-Annotated Corpora Evaluating the Impact of Phrase Recognition on Concept Tagging Annotating Opinions in German Political News MultiUN v2: UN Documents with Multilingual Alignments Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World A Graphical Citation Browser for the ACL Anthology META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies CLIMB grammars: three projects using metagrammar engineering The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation The META-SHARE Metadata Schema for the Description of Language Resources
Direction Générale de l'Armement, FRANCE	The ETAPE corpus for the evaluation of speech-based TV content processing in the French language The REPERE Corpus : a multimodal corpus for person recognition New language resources for the Pashto language
Doshisha University, JAPAN	Multimodal Corpus of Multi-party Conversations in Second Language
Dräger Safety AG & Co. KGaA, GERMANY	Adapting and evaluating a generic term extraction tool
Dublin City University, IRELAND	Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner Irish Treebanking and Parsing: A Preliminary Evaluation A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation Towards a User-Friendly Platform for Building Language Resources based on Web Services Arabic Word Generation and Modelling for Spell Checking Automatic Extraction and Evaluation of Arabic LFG Resources Creating a Data Collection for Evaluating Rich Speech Retrieval The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Duke University, UNITED STATES	A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Duquesne University, EVL Lab, UNITED STATES	Distractorless Authorship Verification

E
Eastern Michigan University, UNITED STATES	”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
Ecole Polytechnique de Montréal, CANADA	A disambiguation resource extracted from Wikipedia for semantic annotation
ELDA/ELRA, FRANCE	The ETAPE corpus for the evaluation of speech-based TV content processing in the French language Towards a User-Friendly Platform for Building Language Resources based on Web Services ELRA in the heart of a cooperative HLT world The REPERE Corpus : a multimodal corpus for person recognition Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora VERTa: Linguistic features in MT evaluation The FLaReNet Strategic Language Resource Agenda META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools New language resources for the Pashto language Using the International Standard Language Resource Number: Practical and Technical Aspects An Analytical Model of Language Resource Sustainability On the Way to a Legal Sharing of Web Applications in NLP The META-SHARE Metadata Schema for the Description of Language Resources
Elhuyar Foundation, SPAIN	PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
Elhuyar R&D, SPAIN	Building a Basque-Chinese Dictionary by Using English as Pivot
Eolaistriu Technologies, IRELAND	Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Erasmus Medical Center / Linguamatics, NETHERLANDS	Centroids: Gold standards with distributional variation
Erasmus University Medical Centre, NETHERLANDS	CALBC: Releasing the Final Corpora
European Bioinformatics Institute, UNITED KINGDOM	Reclassifying subcategorization frames for experimental analysis and stimulus generation Annotating progressive aspect constructions in the spoken section of the British National Corpus Centroids: Gold standards with distributional variation CALBC: Releasing the Final Corpora
European Commission, Directorate General for Translation (DGT), LUXEMBOURG	MultiUN v2: UN Documents with Multilingual Alignments DGT-TM: A freely available Translation Memory in 22 languages
European Commission, Joint Research Centre (JRC), ITALY	DGT-TM: A freely available Translation Memory in 22 languages JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
Euroscript Deutschland, GERMANY	Involving Language Professionals in the Evaluation of Machine Translation
Euskal Herriko Unibertsitatea, Informatika Fakultatea, SPAIN	Multilingual Central Repository version 3.0 A proposal for improving WordNet Domains
Euskal Herriko Unibertsitatea, SPAIN	Matching Cultural Heritage items to Wikipedia Versatile Speech Databases for High Quality Synthesis for Basque KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments Strategies to Improve a Speaker Diarisation Tool Using an ASR database to design a pronunciation evaluation system in Basque

F
Fachhochschule Potsdam, University of Applied Science, GERMANY	Is it Useful to Support Users with Lexical Resources? A User Study.
Federal University of Rio Grande do Sul, BRAZIL	A large scale annotated child language construction database
Florida Institute of Human and Machine Cognition, UNITED STATES	LIE: Leadership, Influence and Expertise
Fondazione Bruno Kessler, Trento, ITALY	Buildind a Resource of Patterns Using Semantic Types The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation Improving the Recall of a Discourse Parser by Constraint-based Postprocessing An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction A treebank-based study on the influence of Italian word order on parsing performance Brand Pitt: A Corpus to Explore the Art of Naming A Parallel Corpus of Music and Lyrics Annotated with Emotions NgramQuery - Smart Information Extraction from Google N-gram using External Resources Chinese Whispers: Cooperative Paraphrase Acquisition META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty The KnowledgeStore: an Entity-Based Storage System
Fondazione Ugo Bordoni, ITALY	Intelligibility assessment in forensic applications
Fordham University, Computer and Information Science Department, UNITED STATES	Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing
Fraunhofer-Heinrich Hertz Institute, GERMANY	AVATecH ― automated annotation through audio and video analysis
Fraunhofer IAIS Institute, GERMANY	Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel AVATecH ― automated annotation through audio and video analysis
Freelance translator, CROATIA	BLEU Evaluation of Machine-Translated English-Croatian Legislation
FreeLing project developer, RUSSIAN FEDERATION	FreeLing 3.0: Towards Wider Multilinguality
Freie Universität Berlin, GERMANY	Building Large Corpora from the Web Using a New Efficient Tool Chain From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach
Freie Universität Berlin, Web Based Systems Group, GERMANY	Evaluating the Impact of Phrase Recognition on Concept Tagging DBpedia: A Multilingual Cross-domain Knowledge Base
Friedrich-Schiller Universität, Jena, GERMANY	Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
FTW Telecommunications Research Center Vienna, AUSTRIA	Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis
Fundació Barcelona Media, SPAIN	Spell Checking in Spanish: The Case of Diacritic Accents VERTa: Linguistic features in MT evaluation
Fundación CTIC, SPAIN	The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends

G
GEIE ERCIM, FRANCE	The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Georgetown University, UNITED STATES	A New Twitter Verb Lexicon for Natural Language Processing Using semi-experts to derive judgments on word sense alignment: a pilot study
German Research Center for Artificial Intelligence, GERMANY	Evaluating the Impact of Phrase Recognition on Concept Tagging
Goethe Universität Frankfurt am Main, GERMANY	Customization of the Europarl Corpus for Translation Studies ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
Google Inc., UNITED STATES	A Universal Part-of-Speech Tagset A light way to collect comparable corpora from the Web
Google Research, Google Inc., UNITED STATES	A Cross-Lingual Dictionary for English Wikipedia Concepts
Goss Interactive Limited, UNITED KINGDOM	Evaluating the Impact of Phrase Recognition on Concept Tagging
Gothenburg University, SWEDEN	An Open Source Persian Computational Grammar Cloud Logic Programming for Integrating Language Technology Resources Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish Improving the Recall of a Discourse Parser by Constraint-based Postprocessing The Rocky Road towards a Swedish FrameNet - Creating SweFN Combining Language Resources Into A Grammar-Driven Swedish Parser Semantic Role Labeling with the Swedish FrameNet Feedback in Nordic First-Encounters: a Comparative Study Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Greedy Intelligence Ltd., CHINA	Constructing a Question Corpus for Textual Semantic Relations
Griffith University, AUSTRALIA	The Australian National Corpus: National Infrastructure for Language Resources

H
Heinrich-Heine-Universität, GERMANY	Arabic Word Generation and Modelling for Spell Checking
Hiroshima City University, JAPAN	Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
HITS gGmbH, GERMANY	Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
Hochschule Augsburg, GERMANY	Annotation Facilities for the Reliable Analysis of Human Motion
Hogeschool Gent, BELGIUM	Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation Beyond SoNaR: towards the facilitation of large corpus building efforts
Hogeschool Gent, LT3, BELGIUM	Collection of a corpus of Dutch SMS
Hogeschool Gent, LT3 / Universiteit Gent, BELGIUM	Evaluating automatic cross-domain Dutch semantic role annotation
Hokkaido University, JAPAN	Two Database Resources for Processing Social Media English Text
Holzweg e-Commerce Solutions, AUSTRIA	Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
Hong Kong Polytechnic University, Department of Computing, HONG KONG	A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, HONG KONG	Assigning Connotation Values to Events Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong University of Science and Technology, HONG KONG	A Multilingual Natural Stress Emotion Database
Hong Kong University of Science and Technology, Human Language Technology Center, Department of Computer Science and Engineering, HONG KONG	A Mandarin-English Code-Switching Corpus
Human Speech and Language Technologies, Vicomtech-IK4, SPAIN	Using DiAML and ANVIL for multimodal dialogue annotations SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles The coding and annotation of multimodal dialogue acts Highlighting relevant concepts from Topic Signatures
Humboldt-Universität Berlin, GERMANY	Multi-Layer Discourse Annotation of a Dutch Text Corpus
Hungarian Academy of Sciences, HUNGARY	Light Verb Constructions in the SzegedParalellFX English--Hungarian Parallel Corpus Rapid creation of large-scale corpora and frequency dictionaries Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
Hungarian Academy of Sciences, Research Group on Artificial Intelligence, HUNGARY	HunOr: A Hungarian―Russian Parallel Corpus Automatically Generated Online Dictionaries

I
Ibaraki University, JAPAN	Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples
IBM Research India, INDIA	A Study of Word-Classing for MT Reordering
ICLTT, AUSTRIA	Federated Search: Towards a Common Search Infrastructure
ICSI, UNITED STATES	The MASC Word Sense Corpus Empirical Comparisons of MASC Word Sense Annotations
ID GmbH & Co. KgaA, GERMANY	SemScribe: Natural Language Generation for Medical Reports
IDIAP Research Institute, SWITZERLAND	Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
IICT-BAS, Linguistic Modelling Department, BULGARIA	A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian The Political Speech Corpus of Bulgarian
IIT Bombay, INDIA	Experiences in Resource Generation for Machine Translation through Crowdsourcing Re-ordering Source Sentences for SMT Cost and Benefit of Using WordNet Senses for Sentiment Analysis
IIT - CNR, ITALY	Visualizing word senses in WordNet Atlas
ILC ― CNR, ITALY	Assigning Connotation Values to Events Customizable SCF Acquisition in Italian The Language Library: supporting community effort for collective resource production The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation In the same boat and other idiomatic seafaring expressions Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access From medical language processing to BioNLP domain Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser Creation of a bottom-up corpus-based ontology for Italian Linguistics The LRE Map. Harmonising Community Descriptions of Resources The FLaReNet Strategic Language Resource Agenda Enriching the ISST-TANL Corpus with Semantic Frames The META-SHARE Metadata Schema for the Description of Language Resources
ILSP - Athena RC, GREECE	The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions Standardizing a Component Metadata Infrastructure The FLaReNet Strategic Language Resource Agenda META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools Collecting and Using Comparable Corpora for Statistical Machine Translation The META-SHARE Metadata Schema for the Description of Language Resources
ILTEC, PORTUGAL	The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
IMC AG, GERMANY	An Adaptive Framework for Named Entity Combination
IMMI - CNRS, FRANCE	The FLaReNet Strategic Language Resource Agenda
Indiana University, UNITED STATES	A finite-state morphological transducer for Kyrgyz Using semi-experts to derive judgments on word sense alignment: a pilot study Annotating Errors in a Hungarian Learner Corpus
INESC-ID/L2F, PORTUGAL	An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT Extending a wordnet framework for simplicity and scalability
INESC-IST, PORTUGAL	Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
INIST - CNRS, FRANCE	Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign Annotating Football Matches: Influence of the Source Medium on Manual Annotation The Quaero Evaluation Initiative on Term Extraction
INRIA, FRANCE	Aleda, a free large-scale entity database for French Cleaning noisy wordnets Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources
INRIA / LORIA / Université Henri Poincaré, FRANCE	CoALT: A Software for Comparing Automatic Labelling Tools
INRIA-Nancy, FRANCE	Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Insitute for System Analysis, Siberian State Aerospace University, RUSSIAN FEDERATION	Speech and Language Resources for LVCSR of Russian
Institute for Balkan Studies of Serbian Academy of Sciences and Arts, SERBIA	Multimedia database of the cultural heritage of the Balkans
Institute for Information Industry, TAIWAN, PROVINCE OF CHINA	NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Institute for Research in Fundamental Sciences, IRAN, ISLAMIC REPUBLIC OF	Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Institute for Specialised Communication and Multilingualism, European Academy of Bozen/Bolzano, ITALY	Unsupervised acquisition of concatenative morphology
Institute for System Analysis of RAS, RUSSIAN FEDERATION	Speech and Language Resources for LVCSR of Russian
Institute of Informatics and Telecommunications, NCSR ‘Demokritos’, GREECE	Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Institute of Technology of Cambodia, CAMBODIA	Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
Institut für Deutsche Sprache, GERMANY	EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language The New IDS Corpus Analysis Platform: Challenges and Prospects Evaluating Query Languages for a Corpus Processing System
Institut Jožef Stefan, Department of Knowledge Technologies, SLOVENIA	The goo300k corpus of historical Slovene Irregularity Detection in Categorized Document Corpora
Instituto Internacional da Língua Portuguesa, CAPE VERDE	The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN	Body-conductive acoustic sensors in human-robot communication
International Institute of Information Technology, INDIA	Parsing Any Domain English text to CoNLL dependencies Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification
Interverbum Technology, SWEDEN	On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Iran University of Science and Technology, IRAN, ISLAMIC REPUBLIC OF	A Framework for Spelling Correction in Persian Language Using Noisy Channel Model Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
IRISA ― CNRS, FRANCE	Annotating Football Matches: Influence of the Source Medium on Manual Annotation
IRISA, FRANCE	A French Fairy Tale Corpus syntactically and semantically annotated
IRIT ― CNRS, FRANCE	A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
IRIT, FRANCE	Clause-based Discourse Segmentation of Arabic Texts
ISI / USC, UNITED STATES	Evaluating Machine Reading Systems through Comprehension Tests
Islamic Azad University, IRAN, ISLAMIC REPUBLIC OF	Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
Istanbul Technical University, TURKEY	The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish Word Alignment for English-Turkish Language Pair
ISTI - CNR, ITALY	The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends From medical language processing to BioNLP domain
IXA NLP Group, University of the Basque Country, SPAIN	SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles Highlighting relevant concepts from Topic Signatures Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques Mapping WordNet to the Kyoto ontology

J
Jawharlal Nehru University, Centre for Linguistics, INDIA	Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi
Johannes Kepler University Linz, AUSTRIA	EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
Johns Hopkins University, UNITED STATES	Creating and Curating a Cross-Language Person-Entity Linking Collection

K
Katholieke Universiteit Leuven, BELGIUM	Annotating Story Timelines as Temporal Dependency Structures Large aligned treebanks for syntax-based machine translation
Katholieke Universiteit Leuven, CCL, BELGIUM	Beyond SoNaR: towards the facilitation of large corpus building efforts Example-Based Treebank Querying
Keio University, JAPAN	Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English
KIIT College of Engineering, INDIA	Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
KIIT University, School of Computer Engineering, INDIA	A GUI to Detect and Correct Errors in Hindi Dependency Treebank
Konstanz Universität, GERMANY	The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English A Rule-based Morphological Analyzer for Murrinh-Patha A Reference Dependency Bank for Analyzing Complex Predicates
Korea University, KOREA, REPUBLIC OF	ISO 24617-2: A semantically-based standard for dialogue annotation
Kyoto University, JAPAN	Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese Annotation of response tokens and their triggering expressions in Japanese multi-party conversations Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task Two Phase Evaluation for Selecting Machine Translation Services Service Composition Scenarios for Task-Oriented Translation Annotation of anaphoric relations and topic continuity in Japanese conversation

L
L2F/INESC-ID Lisboa, PORTUGAL	Building and Exploring Semantic Equivalences Resources
Laboratoire National de métrologie et d'Essais (LNE), FRANCE	Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign The ETAPE corpus for the evaluation of speech-based TV content processing in the French language Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign The REPERE Corpus : a multimodal corpus for person recognition
Lancaster University, UNITED KINGDOM	Document Attrition in Web Corpora: an Exploration
Language Computer Corporation, UNITED STATES	Pursing power in Arabic on-line discussion forums
Language Technologies Research Centre, IIIT, Hyderabad, INDIA	A GUI to Detect and Correct Errors in Hindi Dependency Treebank Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation
LDC, UNITED STATES	LDC Forced Aligner Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure Linguistic Resources for Handwriting Recognition and Translation Evaluation Creating HAVIC: Heterogeneous Audio Visual Internet Collection LDC Language Resource Database: Building a Bibliographic Database
LDC / University of Pennsylvania, UNITED STATES	Further Developments in Treebank Error Detection Using Derivation Trees Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual Expanding Arabic Treebank to Speech: Results from Broadcast News
Leshan Normal University, CHINA	CLTC: A Chinese-English Cross-lingual Topic Corpus
Lexical Computing Ltd, UNITED KINGDOM	Word Sketches for Turkish
LIF - CNRS, FRANCE	Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
LIG, FRANCE	Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
LIMSI ― CNRS, FRANCE	Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results WebAnnotator, an Annotation Tool for Web Pages Cross-lingual studies of ASR errors: paradigms for perceptual evaluations Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement Evolution of Event Designation in Media: Preliminary Study Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign A Rough Set Formalization of Quantitative Evaluation with Ambiguity Kitten: a tool for normalizing HTML and extracting its textual content Evaluation of Unsupervised Information Extraction A contrastive review of paraphrase acquisition techniques DEGELS1: A comparable corpus of French Sign Language and co-speech gestures Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs The LRE Map. Harmonising Community Descriptions of Resources Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier Designing French Tale Corpora for Entertaining Text To Speech Synthesis Identifying bilingual Multi-Word Expressions for Statistical Machine Translation The META-SHARE Metadata Schema for the Description of Language Resources
LIMSI ― CNRS & Université Paris-Sud, FRANCE	Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French
Linguamatics LTD, UNITED KINGDOM	CALBC: Releasing the Final Corpora
Linguateca/FCCN, PORTUGAL	Págico: Evaluating Wikipedia-based information retrieval in Portuguese
Linguatec, GERMANY	Large Scale Lexical Analysis
LinkedIn Corp., UNITED STATES	Annotations for Power Relations on Email Threads
Linköping University, SWEDEN	Alignment-based reordering for SMT Eye Tracking as a Tool for Machine Translation Error Analysis On the practice of error analysis for machine translation evaluation Error profiling for evaluation of machine-translated text: a Polish-English case study
LIPN ― CNRS, FRANCE	The Quaero Evaluation Initiative on Term Extraction
LORIA ― CNRS, FRANCE	Representation of linguistic and domain knowledge for second language learning in virtual worlds
Loyola College in Maryland, UNITED STATES	Creating and Curating a Cross-Language Person-Entity Linking Collection
LPP ― CNRS, FRANCE	Designing French Tale Corpora for Entertaining Text To Speech Synthesis
Ludwig-Maximilians-Universität München, Bavarian Archive for Speech Signals, GERMANY	Statistical Evaluation of Pronunciation Encoding
Lund University, SWEDEN	SMALLWorlds -- Multilingual Content-Controlled Monologues Constructing Large Proposition Databases Visualizing Sentiment Analysis on a User Forum
Lymba Corporation, UNITED STATES	A Tool for Extracting Conversational Implicatures Polaris: Lymba's Semantic Parser

M
Macquarie University, AUSTRALIA	Irish Treebanking and Parsing: A Preliminary Evaluation The Australian National Corpus: National Infrastructure for Language Resources A Framework for Evaluating Text Correction
Masarykova Univerzita, Brno, CZECH REPUBLIC	Building a 70 billion word corpus of English from ClueWeb A database of semantic clusters of verb usages Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification Legal electronic dictionary for Czech
Max Planck Institute for Evolutionary Anthropology, GERMANY	Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages The Open Linguistics Working Group
Max Planck Institute for Psycholinguistics, NETHERLANDS	AVATecH ― automated annotation through audio and video analysis ELAN development, keeping pace with communities' needs Semantic metadata mapping in practice: the Virtual Language Observatory Towards Automatic Gesture Stroke Detection Federated Search: Towards a Common Search Infrastructure Proper Language Resource Centers Standardizing a Component Metadata Infrastructure Citing on-line Language Resources ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project The Language Archive ― a new hub for language resources RELcat: a Relation Registry for ISOcat data categories
Meertens Institute, NETHERLANDS	Dynamic web service deployment in a cloud environment
Meraka Institute, CSIR, SOUTH AFRICA	Aspects of a Legal Framework for Language Resource Management
Mercedes-Benz R&D North America, UNITED STATES	Textual Characteristics for Language Engineering A Survey of Text Mining Architectures and the UIMA Standard
Mersin University, TURKEY	Construction of the Turkish National Corpus (TNC)
MICA Institute, HUST, VIET NAM	Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
MICA Institute, VIET NAM	MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
Microsoft Research Asia, CHINA	Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Microsoft Research India, INDIA	An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Microsoft Research, UNITED STATES	Statistical Section Segmentation in Free-Text Clinical Records Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
Middle East Technical University, TURKEY	METU Turkish Discourse Bank Browser
MIRACL, TUNISIA	Clause-based Discourse Segmentation of Arabic Texts
MIT Computer Science and Artificial Intelligence Lab, UNITED STATES	Word Sense Inventories by Non-Experts. Representing General Relational Knowledge in ConceptNet 5 A large scale annotated child language construction database
MITRE Corporation, UNITED STATES	International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MODUL University Vienna, Department of New Media Technology, AUSTRIA	MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
Møller-Trøndelag Kompetansesenter, NORWAY	BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users

N
Nagoya University, JAPAN	Causal analysis of task completion errors in spoken music retrieval interactions Dictionary Look-up with Katakana Variant Recognition
Nara Institute of Science and Technology, JAPAN	Causal analysis of task completion errors in spoken music retrieval interactions UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
National Institute for Japanese Language, JAPAN	Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
National Institute of Informatics, JAPAN	Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques Annotating Factive Verbs Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure
National Institute of Information and Communications Technology, JAPAN	The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
National Institute of Standards and Technology (NIST), UNITED STATES	Creating HAVIC: Heterogeneous Audio Visual Internet Collection
National Taiwan University, TAIWAN, PROVINCE OF CHINA	Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
National Technical University of Athens, Department of Electrical and Computer Engineering, GREECE	A hierarchical approach with feature selection for emotion recognition from speech
National University of Ireland, IRELAND	Semi-Supervised Technical Term Tagging With Minimal User Feedback Expertise Mining for Enterprise Content Management
National University of Singapore, SINGAPORE	Affective Common Sense Knowledge Acquisition for Sentiment Analysis
Natural Language Processing Research Group, UNITED KINGDOM	Unsupervised document zone identification using probabilistic graphical models
NCSR Demokritos Institute, GREECE	Evaluation of Online Dialogue Policy Learning Techniques The SYNC3 Collaborative Annotation Tool
Nederlandse Taalunie, NETHERLANDS	Smooth Sailing for STEVIN
Neofonie GmbH, GERMANY	DBpedia: A Multilingual Cross-domain Knowledge Base
NETCONOMY Software & Consulting, AUSTRIA	EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
New York University, UNITED STATES	Challenges in the Knowledge Base Population Slot Filling Task
NILC, BRAZIL	The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
NI Systems (India) Pvt. Ltd. Bangalore, INDIA	Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Nokia Research Center, CHINA	Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
North-West University, SOUTH AFRICA	Aspects of a Legal Framework for Language Resource Management
Norwegian University of Science and Technology, NORWAY	The Nordic Dialect Corpus
NTT Cyber Space Laboratories, NTT Corporation, JAPAN	Constructing a Class-Based Lexical Dictionary using Interactive Topic Models

O
Ohio State University, UNITED STATES	Associative and Semantic Features Extracted From Web-Harvested Corpora
Open University, UNITED KINGDOM	Quantising Opinions for Political Tweets Analysis
Organisations, Information and Knowledge Research Group, UNITED KINGDOM	Unsupervised document zone identification using probabilistic graphical models
Osaka University, JAPAN	Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types
Otto-von-Guericke-University Magdeburg, GERMANY	LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Oxford Internet Institute, UNITED KINGDOM	LIE: Leadership, Influence and Expertise

P
Paideia Inc, UNITED STATES	A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
Pattern Science AG, GERMANY	Textual Characteristics for Language Engineering A Survey of Text Mining Architectures and the UIMA Standard
Polish Academy of Sciences, Institute of Computer Science, POLAND	Creating a Coreference Resolution System for Polish Towards a comprehensive open repository of Polish language resources Towards an LFG parser for Polish: An exercise in parasitic grammar development PoliMorf: a (not so) new open morphological dictionary for Polish Web Service integration platform for Polish linguistic resources The Polish Sejm Corpus
Politecnico di Milano, DEI, ITALY	Evaluating Multi-focus Natural Language Queries over Data Services
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY	A platform-independent user-friendly dictionary from Italian to LIS
Postech, KOREA, REPUBLIC OF	Grammatical Error Annotation for Korean Learners of Spoken English
PRESTO, JST, JAPAN	Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako
Princeton University, UNITED STATES	The MASC Word Sense Corpus Empirical Comparisons of MASC Word Sense Annotations
PUC-Rio, BRAZIL	Págico: Evaluating Wikipedia-based information retrieval in Portuguese
PUCRS, BRAZIL	Corpus+WordNet thesaurus generation for ontology enriching Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever

Q
Qliktech, SWEDEN	Visualizing Sentiment Analysis on a User Forum

R
RACAI, Romanian Academy Bucharest, ROMANIA	ROMBAC: The Romanian Balanced Annotated Corpus PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora Collecting and Using Comparable Corpora for Statistical Machine Translation
Radboud University Nijmegen, CLST, NETHERLANDS	An Oral History Annotation Tool for INTER-VIEWs Collecting and Analysing Chats and Tweets in SoNaR Collection of a corpus of Dutch SMS
Radboud University Nijmegen, NETHERLANDS	DutchSemCor: Targeting the ideal sense-tagged corpus Beyond SoNaR: towards the facilitation of large corpus building efforts The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
RATP, FRANCE	DECODA: a call-centre human-human spoken conversation corpus
Research Computing Center Moscow State University Vorolievy Gory, RUSSIAN FEDERATION	Automatic Term Recognition Needs Multiple Evidence
Rijksuniversiteit Groningen, NETHERLANDS	Developing a large semantically annotated corpus Multi-Layer Discourse Annotation of a Dutch Text Corpus Large aligned treebanks for syntax-based machine translation
Romanian Academy, Research Institute for Artificial Intelligence, ROMANIA	Adding Morpho-semantic Relations to the Romanian Wordnet Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information
Ruhr-Universität Bochum, GERMANY	The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
Rutgers University, UNITED STATES	Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
RWTH Aachen University, GERMANY	Arabic-Segmentation Combination Strategies for Statistical Machine Translation RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora

S
Saarland University, Department of Applied Linguistics, GERMANY	Coreference in Spoken vs. Written Texts: a Corpus-based Analysis
Saarland University, Department of Computational Linguistics, GERMANY	Using Verb Subcategorization for Word Sense Disambiguation German and English Treebanks and Lexica for Tree-Adjoining Grammars CLIMB grammars: three projects using metagrammar engineering
Saarland University, Spoken Language Systems, GERMANY	MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis A Gold Standard for Relation Extraction in the Food Domain
Sabanci University, TURKEY	SUTAV: A Turkish Audio-Visual Database
Santa Anna IT Research Institute AB & Linköping University, SWEDEN	A good space: Lexical predictors in word space evaluation This also affects the context - Errors in extraction based summaries
SAP AG, GERMANY	The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Sarah M. Taylor Consulting, UNITED STATES	Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
Sara, NETHERLANDS	Dynamic web service deployment in a cloud environment
SDL Language Weaver, UNITED STATES	An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
Senshu University, JAPAN	A Search Tool for FrameNet Constructicon
Seoul National University, Center for Humanities and Information, KOREA, REPUBLIC OF	Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Seoul National University, Department of Linguistics, KOREA, REPUBLIC OF	Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Seoul National University, KOREA, REPUBLIC OF	Dysarthric Speech Database for Development of QoLT Software Technology
Seoul National University of Education, KOREA, REPUBLIC OF	Grammatical Error Annotation for Korean Learners of Spoken English
Shahid Beheshti University, IRAN, ISLAMIC REPUBLIC OF	Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Shanghai Jiao Tong University, CHINA	Semi-Supervised Technical Term Tagging With Minimal User Feedback Spell Checking for Chinese
Shiraz University, IRAN, ISLAMIC REPUBLIC OF	Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Siena College, Department of Computer Science, UNITED STATES	Revealing Contentious Concepts Across Social Groups
Simon Fraser University, CANADA	A review corpus annotated for negation, speculation and their scope
Social Media Research Group, Institute of Technology Tallaght, IRELAND	Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Society for Danish Language and Literature, DENMARK	Towards a richer wordnet representation of properties
Sofia University, BULGARIA	NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Sonear, FRANCE	DECODA: a call-centre human-human spoken conversation corpus
Sorano Ltd., DENMARK	Creation and use of Language Resources in a Question-Answering eHealth System
Stanford University, UNITED STATES	SUTime: A library for recognizing and normalizing time expressions Suffix Trees as Language Models The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
State University of Maringa, Department of Informatics, BRAZIL	Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Stockholm University, Department of Linguistics, SWEDEN	3rd party observer gaze as a continuous measure of dialogue flow
Stockholm University, SWEDEN	Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text
Systems Plus, Inc., UNITED STATES	Creating HAVIC: Heterogeneous Audio Visual Internet Collection
SYSTRAN SA, FRANCE	Automatic Translation of Scientific Documents in the HAL Archive

T
TAUS, NETHERLANDS	On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Technical University Braunschweig, GERMANY	Association Norms of German Noun Compounds
Technical University of Crete, GREECE	SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks Associative and Semantic Features Extracted From Web-Harvested Corpora
Technical University of Denmark, DENMARK	Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
Technische Universität Berlin, DAI-Labor, GERMANY	GerNED: A German Corpus for Named Entity Disambiguation
Technische Universität Darmstadt, GERMANY	Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
Technische Universität Darmstadt, Ubiquitous Knowledge Processing Lab, GERMANY	The Open Linguistics Working Group
Technische Universität Wien, AUSTRIA	Applying Random Indexing to Structured Data to Find Contextually Similar Words
Telefónica Investigación y Desarrollo, SPAIN	The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
TextShuttle GmbH, SWITZERLAND	SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
The British University in Dubai, UNITED ARAB EMIRATES	Arabic Word Generation and Modelling for Spell Checking Automatic Extraction and Evaluation of Arabic LFG Resources
The Catholic University of Korea, KOREA, REPUBLIC OF	Dysarthric Speech Database for Development of QoLT Software Technology
The Communication Centre for the Deaf and Hard-of-Hearing, ICELAND	BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users
The Institute of Statistical Mathematics, JAPAN	Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
The Netherlands Cancer Institute, NETHERLANDS	NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
Tilde, LATVIA	Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries Collecting and Using Comparable Corpora for Statistical Machine Translation Latvian and Lithuanian Named Entity Recognition with TildeNER
Tokyo Institute of Technology, JAPAN	The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions
Tokyo University of Foreign Studies, JAPAN	Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Tokyo University of Technology, JAPAN	Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Toyohashi University of Technology, JAPAN	Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
Trinity College Dublin, IRELAND	An audiovisual political speech analysis incorporating eye-tracking and perception data Irish Treebanking and Parsing: A Preliminary Evaluation The Herme Database of Spontaneous Multimodal Human-Robot Dialogues On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
Tsinghua University, CHINA	Affective Common Sense Knowledge Acquisition for Sentiment Analysis CLTC: A Chinese-English Cross-lingual Topic Corpus
Tübitak-Bilgem, TURKEY	Turkish Paraphrase Corpus

U
UCSC UARC, UNITED STATES	A Corpus for Research on Deliberation and Debate
UJF-Grenoble1, FRANCE	Dbnary: Wiktionary as a LMF based Multilingual RDF network Collection of a Large Database of French-English SMT Output Corrections
UNDL Foundation, SWITZERLAND	Le Petit Prince in UNL
Union College, UNITED STATES	Revealing Contentious Concepts Across Social Groups
Universidad Autónoma de Madrid, Departamento de Lingüística, SPAIN	Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation
Universidad Autónoma de Madrid, SPAIN	Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Complutense de Madrid, SPAIN	SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation MaltOptimizer: A System for MaltParser Optimization
Universidad de Alicante, Departamento Lenguajes y Sistemas Informáticos, SPAIN	Massively Increasing TIMEX3 Resources: A Transduction Approach
Universidad de Alicante (UA), SPAIN	TIMEN: An Open Temporal Expression Normalisation Resource Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
Universidad de Barcelona (UB), SPAIN	VERTa: Linguistic features in MT evaluation
Universidad de Granada, SPAIN	Linguistic knowledge for specialized text production Medical Term Extraction in an Arabic Medical Corpus Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad de Huelva, SPAIN	A review corpus annotated for negation, speculation and their scope
Universidad del País Vasco, SPAIN	BUCEADOR, a multi-language search engine for digital libraries
Universidad de Santiago de Compostela, SPAIN	A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
Universidad de Vigo, SPAIN	BUCEADOR, a multi-language search engine for digital libraries
Universidad de Zaragoza, GTC, I3A, SPAIN	Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Universidade Aberta, PORTUGAL	Building and Exploring Semantic Equivalences Resources
Universidade de Lisboa, PORTUGAL	TimeBankPT: A TimeML Annotated Corpus of Portuguese A PropBank for Portuguese: the CINTIL-PropBank Rembrandt - a named-entity recognition framework Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese
Universidade de São Paulo, BRAZIL	Propbank-Br: a Brazilian Treebank annotated with semantic role labels
Universidade de São Paulo, Center of Computational Linguistics (NILC), BRAZIL	Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Universidade de São Paulo, Department of Informatics, BRAZIL	Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Universidade de São Paulo, School of Arts, Sciences and Humanities, BRAZIL	Corpus-based Referring Expressions Generation Portuguese Text Generation from Large Corpora
Universidade de Vigo, SPAIN	A Galician Syntactic Corpus with Application to Intonation Modeling
Universidade do Minho, PORTUGAL	Structural alignment of plain text books
Universidade Federal de Minas Gerais, BRAZIL	The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
Universidade Técnica de Lisboa, Instituto Superior Técnico, PORTUGAL	An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT Dealing with unknown words in statistical machine translation
Universidad Nacional Autónoma de México, Instituto de Ingeniería, MEXICO	Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Universidad Nacional de Córdoba, FaMAF, ARGENTINA	Evaluating and improving syntactic lexica by plugging them within a parser
Universidad Nacional de Córdoba, Medina Allende s/n, Ciudad Universitaria, FaMAF, ARGENTINA	Extending the adverbial coverage of a French morphological lexicon
Universidad Nacional de Educación a Distancia (UNED), Madrid, NLP&IR group, SPAIN	Evaluating Machine Reading Systems through Comprehension Tests
Universidad Politécnica de Cataluña, SPAIN	Building Synthetic Voices in the META-NET Framework An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output FreeLing 3.0: Towards Wider Multilinguality BUCEADOR, a multi-language search engine for digital libraries
Universidad Politécnica de Madrid, E.T.S.I. de Telecomunicacion, SPAIN	Investigating Verbal Intelligence Using the TF-IDF Approach
Universidad Politécnica de Madrid, Ontology Engineering Group, SPAIN	Collaborative semantic editing of linked data lexica
Universidad Politécnica de Valencia, SPAIN	Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Universidad Pompeu Fabra, SPAIN	Iula2Standoff: a tool for creating standoff documents for the IULACT The IULA Treebank Towards a User-Friendly Platform for Building Language Resources based on Web Services Spell Checking in Spanish: The Case of Diacritic Accents The FLaReNet Strategic Language Resource Agenda
Università Cattolica del Sacro Cuore, Milan, ITALY	First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin
Università degli Studi di Siena, ITALY	The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
Università degli Studi di Verona, Department of Foreign Languages and Literatures, ITALY	A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Università degli Studi Roma Tre, Department of Linguistics, ITALY	A topologic view of Topic and Focus marking in Italian
Università degli Studi Roma Tre, Dipartimento di Scienze dell’Educazione, ITALY	Pedagogical stances and their multimodal signals.
Università di Bologna, Department of Computer Science, ITALY	AnIta: a powerful morphological analyser for Italian
Università di Bologna, Department of Linguistics and Oriental Studies, ITALY	AnIta: a powerful morphological analyser for Italian A topologic view of Topic and Focus marking in Italian
Università di Firenze, Department of Italian Studies, ITALY	A topologic view of Topic and Focus marking in Italian
Università di Firenze, ITALY	The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
Università di Macerata, ITALY	A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Padova, Department of Information Engineering, ITALY	A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Università di Pavia, ITALY	Annotating Qualia Relations in Italian and French Complex Nominals
Università di Pisa, Dipartimento di Linguistica, ITALY	LexIt: A Computational Resource on Italian Argument Structure
Università di Pisa, ITALY	Creation of a bottom-up corpus-based ontology for Italian Linguistics Enriching the ISST-TANL Corpus with Semantic Frames
Università di Roma La Sapienza, Dipartimenot di Comunicazione e Ricerca Sociale, ITALY	Pedagogical stances and their multimodal signals.
Università di Roma La Sapienza, ITALY	A New Method for Evaluating Automatically Learned Terminological Taxonomies Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Università di Roma Tor Vergata, ART Research Group, ITALY	PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases
Università di Roma Tor Vergata, ITALY	Intelligibility assessment in forensic applications Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
Università di Torino, Dipartimento di Informatica, ITALY	The Parallel-TUT: a multilingual and multiformat treebank A treebank-based study on the influence of Italian word order on parsing performance
Università di Torino, ITALY	NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
Università di Trento, Dipartimento Ingegneria e Scienza dell'Informazione, ITALY	Improving the Recall of a Discourse Parser by Constraint-based Postprocessing A Parallel Corpus of Music and Lyrics Annotated with Emotions Domain-specific vs. Uniform Modeling for Coreference Resolution
Università di Trento, ITALY	DeCour: a corpus of DEceptive statements in Italian COURts
Università di Verona, ITALY	Annotating Qualia Relations in Italian and French Complex Nominals
Università Federico II di Napoli, Dipartimento di Scienze Fisiche, ITALY	A treebank-based study on the influence of Italian word order on parsing performance
Università Federico II di Napoli, ITALY	W-PhAMT: A web tool for phonetic multilevel timeline visualization Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabification
Università San Raffaele di Roma, ITALY	Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
Universität Augsburg, Institut für Informatik, GERMANY	Using DiAML and ANVIL for multimodal dialogue annotations
Universitat Autònoma de Barcelona (UAB), SPAIN	Using Language Resources in Humanities research
Universität Bielefeld, Applied Informatics Group, GERMANY	PAMOCAT: Automatic retrieval of specified postures
Universität Bielefeld, Artificial Intelligence Group, GERMANY	MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Universität Bielefeld, CITEC Central Lab, GERMANY	PAMOCAT: Automatic retrieval of specified postures Collaborative semantic editing of linked data lexica
Universität Bielefeld, GERMANY	Suffix Trees as Language Models ULex: new data models and a mobile environment for corpus enrichment.
Universität Bremen, GERMANY	A Speech and Gesture Spatial Corpus in Assisted Living An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use
Universitat d’Alacant, DLSI, SPAIN	Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Universitat d'Alacant (UA), SPAIN	A finite-state morphological transducer for Kyrgyz Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation
Universitat de Barcelona (UB), SPAIN	The IULA Treebank
Universität des Saarlandes, GERMANY	Suffix Trees as Language Models A methodology for the extraction of information about the usage of formulaic expressions in scientific texts
Universitatea Alexandru Ioan Cuza, ROMANIA	Evaluating Machine Reading Systems through Comprehension Tests Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
Universitatea din Bucuresti, ROMANIA	The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System
Universität Erlangen-Nuremberg, GERMANY	Efficient Dependency Graph Matching with the IMS Open Corpus Workbench
Universität Freiburg, GERMANY	Creation and use of Language Resources in a Question-Answering eHealth System
Universität Hamburg, GERMANY	Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation
Universität Heidelberg, GERMANY	Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards
Universität Heidelberg, ICL, GERMANY	French and German Corpora for Audience-based Text Type Classification
Universität Hildesheim, GERMANY	MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content Yes we can!? Annotating English modal verbs A Tool/Database Interface for Multi-Level Analyses
Universität Innsbruck, AUSTRIA	RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
Universitat Jaume I, SPAIN	Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Universität Jena, Julie Lab, GERMANY	CALBC: Releasing the Final Corpora
Universität Karlsruhe, GERMANY	The KIT Lecture Corpus for Speech Translation The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Universität Leipzig, Department of Computer Science, Natural Language Processing Group, GERMANY	MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis The Open Linguistics Working Group
Universität Leipzig, GERMANY	Learning Categories and their Instances by Contextual Features Textual Characteristics for Language Engineering Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages
Universität München, Research Unit “Quantitative Language Comparison”, GERMANY	The Open Linguistics Working Group
Universitat Politècnica de Catalunya, TALP center, SPAIN	Highlighting relevant concepts from Topic Signatures Summarizing a multimodal set of documents in a Smart Room
Universitat Politècnica de Valencia, SPAIN	The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Universitat Pompeu Fabra, Instituto Universitario de Lingüística Aplicada, SPAIN	The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language NeoTag: a POS Tagger for Grammatical Neologism Detection Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Universitat Pompeu Fabra, SPAIN	A Classification of Adjectives for Polarity Lexicons Enhancement A voting scheme to detect semantic underspecification A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality Automatic lexical semantic classification of nouns The IULA Treebank The CONCISUS Corpus of Event Summaries Using Language Resources in Humanities research Text Simplification Tools for Spanish
Universität Potsdam, GERMANY	Yes we can!? Annotating English modal verbs Fine-grained German Sentiment Analysis on Social Media
Universität Saarland, Computational Linguistics Department, GERMANY	The Open Linguistics Working Group
Universität Saarland, Department of Computational Linguistics and Phonetics, GERMANY	Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Universität Saarland, GERMANY	Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach Evaluating Machine Reading Systems through Comprehension Tests
Universität Saarland, Spoken Language Systems, GERMANY	Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Universität Stuttgart, GERMANY	The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data Association Norms of German Noun Compounds Analyzing and Aligning German compound nouns A Tool/Database Interface for Multi-Level Analyses
Universität Stuttgart, IMS, GERMANY	Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank Automatic classification of German """"an"""" particle verbs Making Ellipses Explicit in Dependency Conversion for a German Treebank Logical metonymies and qualia structures: an annotated database of logical metonymies for German A Corpus-based Study of the German Recipient Passive Irish Treebanking and Parsing: A Preliminary Evaluation French and German Corpora for Audience-based Text Type Classification Adapting and evaluating a generic term extraction tool
Universität Tübingen, GERMANY	Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC A Metadata Editor to Support the Description of Linguistic Resources A Repository for the Sustainable Management of Research Data Standardizing a Component Metadata Infrastructure Large aligned treebanks for syntax-based machine translation
Universität Tübingen - SFS, GERMANY	A Comparative Evaluation of Word Sense Disambiguation Algorithms for German
Universität Ulm, GERMANY	A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System
Universität Ulm, Institute of Communications Engineering, GERMANY	Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues Using multimodal resources for explanation approaches in intelligent systems Investigating Verbal Intelligence Using the TF-IDF Approach Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores Speech and Language Resources for LVCSR of Russian
Universität Ulm, Institute of Neural Information Processing, GERMANY	An audiovisual political speech analysis incorporating eye-tracking and perception data
Universität Wien, Zentrum für Translationswissenschaft, AUSTRIA	Knowledge-Rich Context Extraction and Ranking with KnowPipe
Universität Zürich, SWITZERLAND	German Verb Patterns and Their Implementation in an Electronic Dictionary Automatic MT Error Analysis: Hjerson Helping Addicter Terra: a Collection of Translation Error-Annotated Corpora
Université Aix-Marseille, FRANCE	Annotated Bibliographical Reference Corpora in Digital Humanities
Université Aix-Marseille, Laboratoire Parole et Langage, FRANCE	SPPAS: a tool for the phonetic segmentation of speech Orthographic Transcription: which enrichment is required for phonetization? DEGELS1: A comparable corpus of French Sign Language and co-speech gestures An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Université Aix-Marseille, LIF-CNRS, FRANCE	Syntactic annotation of spontaneous speech: application to call-center conversation data DECODA: a call-centre human-human spoken conversation corpus
Université catholique de Louvain, BELGIUM	Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Université catholique de Louvain, CENTAL, Institut Langage et Communication, BELGIUM	Extraction of unmarked quotations in Newspapers
Université d'Avignon, CERI/LIA, FRANCE	Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université d'Avignon, FRANCE	DECODA: a call-centre human-human spoken conversation corpus
Université d'Avignon, LIA, FRANCE	Annotated Bibliographical Reference Corpora in Digital Humanities
Université de Genève, Département de Linguistique, SWITZERLAND	Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction
Université de Genève, FTI/TIM, SWITZERLAND	Acquisition of Syntactic Simplification Rules for French Evaluating Appropriateness Of System Responses In A Spoken CALL Game
Université de Genève, SWITZERLAND	Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
Université de Genève, TIM/ISSCO, SWITZERLAND	A Scalable Architecture For Web Deployment of Spoken Dialogue Systems Annotating Qualia Relations in Italian and French Complex Nominals A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System
Université de Montréal, CANADA	Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, OLST, CANADA	Logic Based Methods for Terminological Assessment Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Université de Nancy, LORIA, FRANCE	Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents Representation of linguistic and domain knowledge for second language learning in virtual worlds Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université de Nantes, LINA, FRANCE	Tackling interoperability issues within UIMA workflows Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
Université de Nice Sophia-Antipolis, équipe RL, Laboratoire I3S, FRANCE	Unsupervised acquisition of concatenative morphology
Université de Provence, FRANCE	An ontological approach to model and query multimodal concurrent linguistic annotations
Université de Rennes 1, IRISA, FRANCE	Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora Towards Fully Automatic Annotation of Audio Books for TTS
Université de Savoie, FRANCE	Ontoterminology: How to unify terminology and ontology into a single paradigm
Université de Strasbourg, LiLPA, FRANCE	French and German Corpora for Audience-based Text Type Classification
Université de Toulouse, CLLE, FRANCE	An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Université de Toulouse, IRIT, FRANCE	An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Université du Maine, LIUM, FRANCE	TED-LIUM: an Automatic Speech Recognition dedicated corpus Automatic Translation of Scientific Documents in the HAL Archive Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université du Sud Toulon Var, FRANCE	An ontological approach to model and query multimodal concurrent linguistic annotations
Universiteit Antwerp, BELGIUM	From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information Irregularity Detection in Categorized Document Corpora The DISCO ASR-based CALL system: practicing L2 oral skills and beyond The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
Universiteit Antwerp, CliPS, BELGIUM	ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories
Universiteit Antwerpen, BELGIUM	“Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.
Universiteit Gent, BELGIUM	From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Universiteit Twente, NETHERLANDS	Evaluating automatic cross-domain Dutch semantic role annotation Creating a Data Collection for Evaluating Rich Speech Retrieval
Universiteit Utrecht, NETHERLANDS	The FLaReNet Strategic Language Resource Agenda
Universiteit Utrecht, UiL-OTS, NETHERLANDS	Recent Developments in CLARIN-NL Evaluating automatic cross-domain Dutch semantic role annotation
Universiteit van Amsterdam, NETHERLANDS	Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
Universiteit van Tilburg, NETHERLANDS	Using DiAML and ANVIL for multimodal dialogue annotations The coding and annotation of multimodal dialogue acts DutchSemCor: Targeting the ideal sense-tagged corpus ISO 24617-2: A semantically-based standard for dialogue annotation
Universiteit van Tilburg, TiCC, NETHERLANDS	Beyond SoNaR: towards the facilitation of large corpus building efforts
Université Montpellier 2, LIRMM, FRANCE	A new dynamic approach for lexical networks evaluation
Université Paris 13, LDI, FRANCE	Building a database of French frozen adverbial phrases
Université Paris 3 - sorbonne Nouvelle, LaTTiCe, CNRS, FRANCE	Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
Université Paris 7, FRANCE	A Phonemic Corpus of Polish Child-Directed Speech Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus Semantic annotation of French corpora: animacy and verb semantic classes
Université Paris 8 Saint Denis, FRANCE	Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Université Paris-Est, FRANCE	Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
Université Paris-Est, LIGM, FRANCE	A new semantically annotated corpus with syntactic-semantic and cross-lingual senses Extending the adverbial coverage of a French morphological lexicon
Université Paris Ouest Nanterre La Défense, MoDyCo-CNRS, FRANCE	Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Université Paul Sabatier, IRIT, FRANCE	Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber) Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
Universitetet i Bergen, Department of Linguistic, NORWAY	A Morphological Analyzer For Wolof Using Finite-State Techniques
Universitetet i Bergen, NORWAY	Design and compilation of a specialized Spanish-German parallel corpus Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Universitetet i Oslo, Institutt for lingvistiske og nordiske studier, NORWAY	The Nordic Dialect Corpus
Universitetet i Oslo, NORWAY	Págico: Evaluating Wikipedia-based information retrieval in Portuguese The Nordic Dialect Corpus The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Université Toulouse II, U.R.I Octogone-Lordat, FRANCE	MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.
University at Albany, SUNY, UNITED STATES	Revealing Contentious Concepts Across Social Groups Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University College Dublin, IRELAND	English to Indonesian Transliteration to Support English Pronunciation Practice Evaluating expressive speech synthesis from audiobook corpora for conversational phrases Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College London, UNITED KINGDOM	A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
University of Amsterdam, ACLC, NETHERLANDS	NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
University of Applied Sciences Chur; Faculty of Information Science, SWITZERLAND	Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
University of Applied Sciences Vern, Zagreb, CROATIA	Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Aston, UNITED KINGDOM	Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
University of Athens, GREECE	Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
University of Belgrade, Faculty of Mathematics, SERBIA	Multimedia database of the cultural heritage of the Balkans
University of Belgrade, SERBIA	A tool for enhanced search of multilingual digital libraries of e-journals
University of Birmingham, School of Computer Science, UNITED KINGDOM	Building Text-to-Speech Systems for Resource Poor Languages
University of Boston, UNITED STATES	Detecting Reduplication in Videos of American Sign Language Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
University of Brighton, UNITED KINGDOM	A Repository of Data and Evaluation Resources for Natural Language Generation LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments
University of California, Santa Cruz, UNITED STATES	An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
University of Cambridge, UNITED KINGDOM	Reclassifying subcategorization frames for experimental analysis and stimulus generation Annotating progressive aspect constructions in the spoken section of the British National Corpus Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
University of Colorado at Boulder, UNITED STATES	Empty Argument Insertion in the Hindi PropBank
University of Colorado, UNITED STATES	Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events Annotating Story Timelines as Temporal Dependency Structures
University of Copenhagen, CST, DENMARK	Automatic annotation of head velocity and acceleration in Anvil DSim, a Danish Parallel Corpus for Text Simplification Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, DENMARK	A voting scheme to detect semantic underspecification Towards a richer wordnet representation of properties A Distributed Resource Repository for Cloud-Based Machine Translation Creation and use of Language Resources in a Question-Answering eHealth System Multimodal Behaviour and Feedback in Different Types of Interaction Feedback in Nordic First-Encounters: a Comparative Study Polish Multimodal Corpus ― a collection of referential gestures Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Düsseldorf, Institut für Sprache und Information, GERMANY	German and English Treebanks and Lexica for Tree-Adjoining Grammars
University of Edinburgh, UNITED KINGDOM	The BladeMistress Corpus: From Talk to Action in Virtual Worlds A Database of Attribution Relations
University of Essex, UNITED KINGDOM	DeCour: a corpus of DEceptive statements in Italian COURts Assessing Crowdsourcing Quality through Objective Tasks Applying Random Indexing to Structured Data to Find Contextually Similar Words Domain-specific vs. Uniform Modeling for Coreference Resolution
University of Gothenburg, Sprakbanken, SWEDEN	Korp ― the corpus infrastructure of Spräkbanken The open lexical infrastructure of Spräkbanken
University of Groningen, Institute of Artificial Intelligence, NETHERLANDS	Pragmatic identification of the witness sets
University of Helsinki, FINLAND	Boosting statistical tagger accuracy with simple rule-based grammars Constructive Interaction for Talking about Interesting Topics Representing the Translation Relation in a Bilingual Wordnet Multimodal Corpus of Multi-party Conversations in Second Language Rule-Based Detection of Clausal Coordinate Ellipsis Improving corpus annotation productivity: a method and experiment with interactive tagging Feedback in Nordic First-Encounters: a Comparative Study Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
University of Iceland, ICELAND	The Icelandic Parsed Historical Corpus (IcePaHC) Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Illinois at Urbana-Champaign, UNITED STATES	YADAC: Yet another Dialectal Arabic Corpus An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
University of Jyväskylä, FINLAND	Comparing computer vision analysis of signed language video with motion capture recordings
University of Latvia, LATVIA	An implementation of a Latvian resource grammar in Grammatical Framework
University of Leeds, UNITED KINGDOM	QurAna: Corpus of the Quran annotated with Pronominal Anaphora QurSim: A corpus for evaluation of relatedness in short texts Predicting Phrase Breaks in Classical and Modern Standard Arabic Text Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis Development and Application of a Cross-language Document Comparability Metric Identifying Word Translations from Comparable Documents Without a Seed Lexicon Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Leipzig, Natural Language Processing Group, GERMANY	The Influence of Corpus Quality on Statistical Measurements on Language Resources
University of Limerick, IRELAND	Evaluating Machine Reading Systems through Comprehension Tests
University of Ljubljana, Faculty of Arts, SLOVENIA	Addressing polysemy in bilingual lexicon extraction from comparable corpora Irregularity Detection in Categorized Document Corpora
University of Ljubljana, SLOVENIA	Cleaning noisy wordnets
University of Łódz, POLAND	Towards a comprehensive open repository of Polish language resources
University of Luxembourg, LUXEMBOURG	NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
University of Malta, MALTA	Incorporating an Error Corpus into a Spellchecker for Maltese Multimodal Behaviour and Feedback in Different Types of Interaction A Repository of Data and Evaluation Resources for Natural Language Generation
University of Manchester, UNITED KINGDOM	A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic. Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries Identification of Manner in Bio-Events Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
University of Maribor, SLOVENIA	SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
University of Maryland, UNITED STATES	Typing Race Games as a Method to Create Spelling Error Corpora Creating and Curating a Cross-Language Person-Entity Linking Collection Linguistic Resources for Handwriting Recognition and Translation Evaluation
University of Minho, LIAAD-INESC, PORTUGAL	The Minho Quotation Resource
University of Newcastle, AUSTRALIA	The Icelandic Parsed Historical Corpus (IcePaHC)
University of North Texas, UNITED STATES	Learning Sentiment Lexicons in Spanish Unsupervised Word Sense Disambiguation with Multilingual Representations A Parallel Corpus of Music and Lyrics Annotated with Emotions
University of Osnabrück, Institute for Cognitive Science, GERMANY	LexIt: A Computational Resource on Italian Argument Structure
University of Pennsylvania, IRCS, UNITED STATES	Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
University of Pennsylvania, UNITED STATES	Experiences in Resource Generation for Machine Translation through Crowdsourcing The Icelandic Parsed Historical Corpus (IcePaHC) Automatic word alignment tools to scale production of manually aligned parallel texts A corpus of general and specific sentences from news
University of Potsdam, Applied Computational Linguistics Lab, POLAND	SemScribe: Natural Language Generation for Medical Reports
University of Rijeka, Department of Informatics, CROATIA	BLEU Evaluation of Machine-Translated English-Croatian Legislation
University of Rochester, UNITED STATES	An Annotation Scheme for Quantifier Scope Disambiguation
University of Sheffield, DCS, UNITED KINGDOM	Mapping WordNet synsets to Wikipedia articles
University of Sheffield, UNITED KINGDOM	Matching Cultural Heritage items to Wikipedia Assessing the Comparability of News Texts Large Scale Semantic Annotation, Indexing and Search at The National Archives TIMEN: An Open Temporal Expression Normalisation Resource LIE: Leadership, Influence and Expertise Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles Massively Increasing TIMEX3 Resources: A Transduction Approach A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English Assessing Crowdsourcing Quality through Objective Tasks A light way to collect comparable corpora from the Web Applying Random Indexing to Structured Data to Find Contextually Similar Words Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Southern California, Information Sciences Institute, UNITED STATES	Ontologies of Linguistic Annotation: Survey and perspectives The Open Linguistics Working Group A generic formalism to represent linguistic corpora in RDF and OWL/DL
University of Southern California, Institute for Creative Technologies, UNITED STATES	An audiovisual political speech analysis incorporating eye-tracking and perception data The BladeMistress Corpus: From Talk to Action in Virtual Worlds Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems The Twins Corpus of Museum Visitor Questions
University of Southern California, UNITED STATES	The Twins Corpus of Museum Visitor Questions
University of Southern Denmark, DENMARK	The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
University of Stirling, UNITED KINGDOM	Affective Common Sense Knowledge Acquisition for Sentiment Analysis
University of Sussex, UNITED KINGDOM	Corpus Annotation as a Scientific Task Linguistic Analysis Processing Line for Bulgarian
University of Szeged, Department of Hungarian Linguistics, HUNGARY	HunOr: A Hungarian―Russian Parallel Corpus
University of Szeged, Department of Informatics, HUNGARY	HunOr: A Hungarian―Russian Parallel Corpus
University of Tartu, ESTONIA	Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data Constructive Interaction for Talking about Interesting Topics
University of Tartu, Institute of Computer Science, ESTONIA	Robust clause boundary identification for corpus annotation
University of Texas at Arlington, UNITED STATES	Evaluation of Online Dialogue Policy Learning Techniques
University of Texas at Dallas, Human Language Technology Research Institute, UNITED STATES	EmpaTweet: Annotating and Detecting Emotions on Twitter
University of Texas at Dallas, UNITED STATES	Tajik-Farsi Persian Transliteration Using Statistical Machine Translation Annotating Spatial Containment Relations Between Events
University of Thessaly, Department of Computer and Communication Engineering, GREECE	A hierarchical approach with feature selection for emotion recognition from speech
University of Toronto, CANADA	Measuring Interlanguage: Native Language Identification with L1-influence Metrics Detecting Reduplication in Videos of American Sign Language
University of Tsukuba, JAPAN	Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Turin, Department of Computer Science, ITALY	Pragmatic identification of the witness sets
University of Washington, Biomedical and Health Informatics, UNITED STATES	Statistical Section Segmentation in Free-Text Clinical Records
University of Washington, Department of Linguistics, UNITED STATES	Statistical Section Segmentation in Free-Text Clinical Records
University of Washington, UNITED STATES	Effort of Genre Variation and Prediction of System Performance Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation
University of Wisconsin-Milwaukee, HIA, UNITED STATES	Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
University of Wolverhampton, Research Institute in Information and Language Processing, UNITED KINGDOM	CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations
University of Wolverhampton, UNITED KINGDOM	Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach A review corpus annotated for negation, speculation and their scope Annotating Near-Identity from Coreference Disagreements PET: a Tool for Post-editing and Assessing Machine Translation
University of Yamanashi, JAPAN	Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Zagreb, CROATIA	Croatian Dependency Treebank: Recent Development and Initial Experiments Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA	Generation of Verbal Stems in Derivationally Rich Language BLEU Evaluation of Machine-Translated English-Croatian Legislation Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Zurich, Institute of Computational Linguistics, SWITZERLAND	Dependency parsing for interaction detection in pharmacogenomics MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank
UPMF-Grenoble2, FRANCE	Collection of a Large Database of French-English SMT Output Corrections
Uppsala University, SWEDEN	A Basic Language Resource Kit for Persian A Distributed Resource Repository for Cloud-Based Machine Translation Parallel Data, Tools and Interfaces in OPUS MaltOptimizer: A System for MaltParser Optimization Large aligned treebanks for syntax-based machine translation
US Department of Defense, UNITED STATES	Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?
UTEM, CHILE	A proposal for improving WordNet Domains

V
Vassar College, UNITED STATES	The MASC Word Sense Corpus Empirical Comparisons of MASC Word Sense Annotations
Veterans Institute, centre for knowledge and expertise, NETHERLANDS	An Oral History Annotation Tool for INTER-VIEWs
Vrije Universiteit Amsterdam, Faculteit der Letteren, NETHERLANDS	Mapping WordNet to the Kyoto ontology
Vrije Universiteit Amsterdam, NETHERLANDS	Building a fine-grained subjectivity lexicon from a web corpus DutchSemCor: Targeting the ideal sense-tagged corpus

W
W4 S.A., FRANCE	The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Wake Forest University, UNITED STATES	Boosting statistical tagger accuracy with simple rule-based grammars
Waseda University, JAPAN	Method for Collection of Acted Speech Using Various Situation Scripts
Wonkwang University, KOREA, REPUBLIC OF	Dysarthric Speech Database for Development of QoLT Software Technology
Wrocław University of Technology, Institute of Applied Informatics, POLAND	Inforex -- a web-based tool for text corpus management and semantic annotation
Wrocław University of Technology, POLAND	Constraint Based Description of Polish Multiword Expressions Tools for plWordNet Development. Presentation and Perspectives Recognition of Polish Derivational Relations Based on Supervised Learning Scheme KPWr: Towards a Free Corpus of Polish

X
Xerox India Research Centre, INDIA	Experiences in Resource Generation for Machine Translation through Crowdsourcing
Xerox Research Centre Europe, FRANCE	Task-Driven Linguistic Analysis based on an Underspecified Features Representation Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
XTM International, UNITED KINGDOM	On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market

Y
Yamagata University, JAPAN	Collecting humorous expressions from a community-based question-answering-service corpus
Yaşar University, TURKEY	Construction of the Turkish National Corpus (TNC)

Z
Zemanta, UNITED STATES	Collecting and Using Comparable Corpora for Statistical Machine Translation