AUTHORS: Browse articles of the conference sorted by author

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z

A
Abbott, Rob A Corpus for Research on Deliberation and Debate
UCSC UARC, UNITED STATES
Abdul-Mageed, Muhammad AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis
Columbia University, UNITED STATES
Abe, Yusuke Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Tsukuba, JAPAN
Abolahrar, Elnaz An Open Source Persian Computational Grammar
Chalmers University, SWEDEN
Abou-Zleikha, Mohamed Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Abrate, Matteo Visualizing word senses in WordNet Atlas
IIT-CNR, ITALY
Acar, Süleyman Word Alignment for English-Turkish Language Pair
Istanbul Technical University, TURKEY
Adda-Decker, Martine Cross-lingual studies of ASR errors: paradigms for perceptual evaluations
Designing French Tale Corpora for Entertaining Text To Speech Synthesis
LIMSI - CNRS, FRANCE
Adda, Gilles The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
AFCP, FRANCE
Adell, Jordi BUCEADOR, a multi-language search engine for digital libraries
Universitat Politècnica de Catalunya, SPAIN
Adophs, Peter Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World
DFKI GmbH, GERMANY
Adson, Kristina Annotating Opinions in German Political News
DFKI GmbH, GERMANY
Afantenos, Stergos An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
IRIT, Univ. Toulouse, FRANCE
Agarwal, Rahul A GUI to Detect and Correct Errors in Hindi Dependency Treebank
Language Technologies Research Center (LTRC), IIIT-H, INDIA
Agerri, Rodrigo SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
IXA NLP Group, University of the Basque Country, SPAIN
Aggarwal, Priti The Twins Corpus of Museum Visitor Questions
University of Southern California, Institute for Creative Technologies, UNITED STATES
Agic, Zeljko Croatian Dependency Treebank: Recent Development and Initial Experiments
University of Zagreb, CROATIA
Agirre, Eneko Matching Cultural Heritage items to Wikipedia
University of the Basque Country, SPAIN
Agosti, Maristella A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Department of Information Engineering― University of Padua, ITALY
Agrawal, Shyam Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
KIIT College of Engineering, INDIA
Ahlberg, Malin Combining Language Resources Into A Grammar-Driven Swedish Parser
Gothenburg University, SWEDEN
Ahlsén, Elisabeth Feedback in Nordic First-Encounters: a Comparative Study
University of Gothenburg, SWEDEN
Ahmad, Nadeem A platform-independent user-friendly dictionary from Italian to LIS
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY
Ahmed, Tafseer A Reference Dependency Bank for Analyzing Complex Predicates
Universität Konstanz, GERMANY
Ahmed, Zeeshan Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Ahrenberg, Lars Alignment-based reordering for SMT
On the practice of error analysis for machine translation evaluation
Error profiling for evaluation of machine-translated text: a Polish-English case study
Linköping University, SWEDEN
Ahtaridis, Eleftheria LDC Language Resource Database: Building a Bibliographic Database
LDC, UNITED STATES
Aikawa, Kiyoaki Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Tokyo University of Technology, JAPAN
Aizawa, Akiko Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure
National Institute of Informatics, JAPAN
Akagi, Toshiki Collecting humorous expressions from a community-based question-answering-service corpus
Yamagata University, JAPAN
Akcam, Zumrut Revealing Contentious Concepts Across Social Groups
University at Albany, SUNY, UNITED STATES
Aker, Ahmet Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles
A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
Assessing Crowdsourcing Quality through Objective Tasks
A light way to collect comparable corpora from the Web
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Sheffield, UNITED KINGDOM
Akiba, Tomoyosi Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Toyohashi University of Technology, JAPAN
Aksan, Mustafa Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Aksan, Yeşim Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Alazard, Charlotte MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.
U.R.I Octogone-Lordat, EA 4156, Université Toulouse II, FRANCE
Albakour, M-Dyaa Assessing Crowdsourcing Quality through Objective Tasks
Applying Random Indexing to Structured Data to Find Contextually Similar Words
University of Essex, UNITED KINGDOM
Albayrak, Sahin GerNED: A German Corpus for Named Entity Disambiguation
DAI-Labor, Technische Universität Berlin, GERMANY
Alber, Birgit A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Department of Foreign Languages and Literatures― University of Verona, ITALY
Albert, Camille Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
IRIT-CNRS, FRANCE
Alcina, Amparo Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Universitat Jaume I, SPAIN
Alegria, Iñaki Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
IXA group, UPV/EHU, SPAIN
Aleksandrov, Martin NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Sofia University, BULGARIA
Aleksic, Vera Large Scale Lexical Analysis
Linguatec, GERMANY
Alexandersson, Jan ISO 24617-2: A semantically-based standard for dialogue annotation
Tilburg University, NETHERLANDS
Alexandersson, Simon 3rd party observer gaze as a continuous measure of dialogue flow
KTH Speech, Music and Hearing, SWEDEN
Alfano, Iolanda Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication
University of Naples “Federico II”, ITALY
Alicante, Anita A treebank-based study on the influence of Italian word order on parsing performance
Dipartimento di Scienze Fisiche, Università ”Federico II” di Napoli, ITALY
Allen, James An Annotation Scheme for Quantifier Scope Disambiguation
University of Rochester, UNITED STATES
Allwood, Jens Feedback in Nordic First-Encounters: a Comparative Study
University of Gothenburg, SWEDEN
Almeida, José João The Minho Quotation Resource
Structural alignment of plain text books
LIAAD-INESC, University of Minho, PORTUGAL
Almissour, Zewar Pursing power in Arabic on-line discussion forums
Language Computer Corporation, UNITED STATES
Aloni, Maria Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
University of Amsterdam, NETHERLANDS
Al-Sabbagh, Rania YADAC: Yet another Dialectal Arabic Corpus
Department of Linguistics, University of Illinois at Urbana-Champaign, UNITED STATES
Aluísio, Sandra Maria Propbank-Br: a Brazilian Treebank annotated with semantic role labels
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Universidade de São Paulo, BRAZIL
Ambati, Bharat Ram Word Sketches for Turkish
A GUI to Detect and Correct Errors in Hindi Dependency Treebank
Lexical Computing Ltd, UNITED KINGDOM
Amoia, Marilisa Coreference in Spoken vs. Written Texts: a Corpus-based Analysis
Department of Applied Linguistics, Saarland University, GERMANY
Anand, Pranav A Corpus for Research on Deliberation and Debate
UCSC UARC, UNITED STATES
Ananiadou, Sophia A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Identification of Manner in Bio-Events
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench
University of Manchester, UNITED KINGDOM
Anastasiou, Dimitra A Speech and Gesture Spatial Corpus in Assisted Living
Universität Bremen, GERMANY
Andersen, Ulrich Creation and use of Language Resources in a Question-Answering eHealth System
Sorano Ltd., DENMARK
Andreas, Jacob Annotating Agreement and Disagreement in Threaded Discussion
Columbia University, UNITED STATES
Andrich, Rico Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Antonishek, Brian Creating HAVIC: Heterogeneous Audio Visual Internet Collection
NIST, UNITED STATES
Apidianaki, Marianna Applying cross-lingual WSD to wordnet development
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
Alpage, INRIA Paris-Rocquencourt & Université Paris 7, FRANCE
Araki, Kenji Two Database Resources for Processing Social Media English Text
Hokkaido University, JAPAN
Ar, Balamurali Cost and Benefit of Using WordNet Senses for Sentiment Analysis
IIT Bombay, INDIA
Arbillot, Eric DECODA: a call-centre human-human spoken conversation corpus
RATP, FRANCE
Aristar-Dry, Helen ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
Eastern Michigan University, UNITED STATES
Armenta, Ana The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Speech Technology Group, Telefónica Investigación y Desarrollo, SPAIN
Arnulphy, Béatrice Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French
Evolution of Event Designation in Media: Preliminary Study
LIMSI-CNRS & Univ. Paris-Sud, FRANCE
Arora, Piyush Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification
International Institute of Information Technology, INDIA
Arranz, Victoria ELRA in the heart of a cooperative HLT world
VERTa: Linguistic features in MT evaluation
Using the International Standard Language Resource Number: Practical and Technical Aspects
An Analytical Model of Language Resource Sustainability
On the Way to a Legal Sharing of Web Applications in NLP
The META-SHARE Metadata Schema for the Description of Language Resources
ELDA/ELRA, FRANCE
Artstein, Ron The Twins Corpus of Museum Visitor Questions
University of Southern California, Institute for Creative Technologies, UNITED STATES
Arza, Montserrat A Galician Syntactic Corpus with Application to Intonation Modeling
Centro Ramon Pineiro para a Investigacion en Humanidades. Xunta de Galicia, SPAIN
Asher, Nicholas An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
IRIT, Univ. Toulouse, FRANCE
Astésano, Corine MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.
U.R.I Octogone-Lordat, EA 4156, Université Toulouse II, FRANCE
Atalla, Malik SemScribe: Natural Language Generation for Medical Reports
ID GmbH & Co. KgaA, GERMANY
Atasoy, Gülsüm Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Atserias, Jordi Spell Checking in Spanish: The Case of Diacritic Accents
VERTa: Linguistic features in MT evaluation
Fundació Barcelona Media, SPAIN
Attard, Andrew Incorporating an Error Corpus into a Spellchecker for Maltese
University of Malta, MALTA
Attia, Mohammed Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
School of Computing, Dublin City University, IRELAND
Atwell, Eric QurAna: Corpus of the Quran annotated with Pronominal Anaphora
QurSim: A corpus for evaluation of relatedness in short texts
Predicting Phrase Breaks in Classical and Modern Standard Arabic Text
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis
University of Leeds, UNITED KINGDOM
Auer, Eric Federated Search: Towards a Common Search Infrastructure
Max Planck Institute for Psycholinguistics, NETHERLANDS
Augustinus, Liesbeth Example-Based Treebank Querying
Centre for Computational Linguistics, KU Leuven, BELGIUM
Avgustinova, Tania CLIMB grammars: three projects using metagrammar engineering
Saarland University, Department of Computational Linguistics, GERMANY
Avramidis, Eleftherios Involving Language Professionals in the Evaluation of Machine Translation
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
DFKI GmbH, GERMANY
Aziz, Wilker PET: a Tool for Post-editing and Assessing Machine Translation
University of Wolverhampton, UNITED KINGDOM

 

B
Babko-Malaya, Olga Identifying Nuggets of Information in GALE Distillation Evaluation
BAE Systems, UNITED STATES
Babych, Bogdan Development and Application of a Cross-language Document Comparability Metric
Identifying Word Translations from Comparable Documents Without a Seed Lexicon
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Leeds, UNITED KINGDOM
Bacciu, Clara Visualizing word senses in WordNet Atlas
IIT-CNR, ITALY
Bachan, Jolanta Developing and evaluating an emergency scenario dialogue corpus
Adam Mickiewicz University, Poznan, Institute of Linguistics, POLAND
Bagherbeygi, Somayeh Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Linguistic Dept., Allameh Tabatabaiee University, IRAN, ISLAMIC REPUBLIC OF
Baker, Collin F. The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
ICSI, UNITED STATES
Bakliwal, Akshat Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification
International Institute of Information Technology, INDIA
Balahur, Alexandra Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
European Commission, Joint Research Centre (JRC), ITALY
Bali, Kalika Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Arabic Textware, JORDAN
Ballesteros, Miguel MaltOptimizer: A System for MaltParser Optimization
Universidad Complutense de Madrid, SPAIN
Banea, Carmen Learning Sentiment Lexicons in Spanish
University of North Texas, UNITED STATES
Bank, Mathias Textual Characteristics for Language Engineering
A Survey of Text Mining Architectures and the UIMA Standard
Pattern Science AG, GERMANY
Bański, Piotr The New IDS Corpus Analysis Platform: Challenges and Prospects
Evaluating Query Languages for a Corpus Processing System
Institute for the German Language (IDS), GERMANY
Baran, Elizabeth Annotating dropped pronouns in Chinese newswire text
Brandeis University, UNITED STATES
Barbot, Nelly Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora
IRISA - University of Rennes 1, FRANCE
Barbu Mititelu, Verginica Adding Morpho-semantic Relations to the Romanian Wordnet
Romanian Academy Research Institute for Artificial Intelligence, ROMANIA
Barcellini, Flore Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
CNAM-CRTD, FRANCE
Barcellos de Oliveira, Gladis The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
NILC, BRAZIL
Barker, Emma Assessing the Comparability of News Texts
University of Sheffield, UNITED KINGDOM
Barone, Rossano Corpus Annotation as a Scientific Task
University of Sussex, UNITED KINGDOM
Barreaux, Sabine The Quaero Evaluation Initiative on Term Extraction
INIST - CNRS, FRANCE
Barrena, Ander Matching Cultural Heritage items to Wikipedia
University of the Basque Country, SPAIN
Bartalesi Lenzi, Valentina CAT: the CELCT Annotation Tool
CELCT, ITALY
Basile, Valerio Developing a large semantically annotated corpus
University of Groningen, NETHERLANDS
Battocchi, Alberto A Parallel Corpus of Music and Lyrics Annotated with Emotions
DISI - University of Trento, ITALY
Bauer, Daniel The Dependency-Parsed FrameNet Corpus
Columbia University, UNITED STATES
Bazillon, Thierry Syntactic annotation of spontaneous speech: application to call-center conversation data
DECODA: a call-centre human-human spoken conversation corpus
Aix Marseille Univ, LIF-CNRS, FRANCE
Bechet, Frederic Syntactic annotation of spontaneous speech: application to call-center conversation data
DECODA: a call-centre human-human spoken conversation corpus
Aix Marseille Univ, LIF-CNRS, FRANCE
Becks, Daniela A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content
University of Hildesheim, GERMANY
Beisswanger, Elena Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
Friedrich-Schiller-Universität Jena, GERMANY
Bellot, Patrice Annotated Bibliographical Reference Corpora in Digital Humanities
Université Aix-Marseille, FRANCE
Bel, Núria A Classification of Adjectives for Polarity Lexicons Enhancement
A voting scheme to detect semantic underspecification
Iula2Standoff: a tool for creating standoff documents for the IULACT
Automatic lexical semantic classification of nouns
The IULA Treebank
Towards a User-Friendly Platform for Building Language Resources based on Web Services
Using Language Resources in Humanities research
The FLaReNet Strategic Language Resource Agenda
Universitat Pompeu Fabra, SPAIN
Belz, Anja A Repository of Data and Evaluation Resources for Natural Language Generation
LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments
University of Brighton, UNITED KINGDOM
Benamara, Farah An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Clause-based Discourse Segmentation of Arabic Texts
IRIT, Univ. Toulouse, FRANCE
Bennett, Paul GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
University of Manchester, UNITED KINGDOM
Bensley, Jeremy Pursing power in Arabic on-line discussion forums
Language Computer Corporation, UNITED STATES
Bentivogli, Luisa The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Chinese Whispers: Cooperative Paraphrase Acquisition
CELCT, ITALY
Berka, Jan Automatic MT Error Analysis: Hjerson Helping Addicter
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC
Berovic, Dasa Croatian Dependency Treebank: Recent Development and Initial Experiments
University of Zagreb, CROATIA
Berridge, Damon Document Attrition in Web Corpora: an Exploration
Lancaster University, UNITED KINGDOM
Bertrand, Roxane Orthographic Transcription: which enrichment is required for phonetization?
Laboratoire Parole et Langage, CNRS & Aix-Marseille Université, FRANCE
Berwick, Robert A large scale annotated child language construction database
MIT, UNITED STATES
Berzlánovich, Ildikó Multi-Layer Discourse Annotation of a Dutch Text Corpus
University of Groningen, NETHERLANDS
Besacier, Laurent Collection of a Large Database of French-English SMT Output Corrections
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
UJF-Grenoble1, FRANCE
Besançon, Romaric Evaluation of Unsupervised Information Extraction
Evaluation of a Complex Information Extraction Application in Specific Domain
CEA, LIST, Vision and Content Engineering Laboratory, FRANCE
Beskow, Jonas 3rd party observer gaze as a continuous measure of dialogue flow
KTH Speech, Music and Hearing, SWEDEN
Bethard, Steven Annotating Story Timelines as Temporal Dependency Structures
University of Colorado, UNITED STATES
Bhattacharyya, Pushpak Experiences in Resource Generation for Machine Translation through Crowdsourcing
Cost and Benefit of Using WordNet Senses for Sentiment Analysis
IIT Bombay, INDIA
Biagioni, Stefania From medical language processing to BioNLP domain
, ITALY
Bianchi, Elisa Creation of a bottom-up corpus-based ontology for Italian Linguistics
Università di Pisa, ITALY
Bianchini, Claudia S. Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Université Paris 8 Saint Denis, FRANCE
Biber, Hanno Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies.
Austrian Academy of Sciences, AUSTRIA
Bick, Eckhard The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
University of Southern Denmark, DENMARK
Bieler, Heike SemScribe: Natural Language Generation for Medical Reports
Applied Computational Linguistics Lab, University of Potsdam, POLAND
Biemann, Chris Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution
Technische Universität Darmstadt, GERMANY
Bies, Ann Further Developments in Treebank Error Detection Using Derivation Trees
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Expanding Arabic Treebank to Speech: Results from Broadcast News
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Bigi, Brigitte SPPAS: a tool for the phonetic segmentation of speech
Orthographic Transcription: which enrichment is required for phonetization?
Laboratoire Parole et Langage, CNRS & Aix-Marseille Université, FRANCE
Bigouroux, Nicolas DECODA: a call-centre human-human spoken conversation corpus
Sonear, FRANCE
Bildhauer, Felix Building Large Corpora from the Web Using a New Efficient Tool Chain
Freie Universität Berlin, GERMANY
Billières, Michel MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.
U.R.I Octogone-Lordat, EA 4156, Université Toulouse II, FRANCE
Bini, Johan Visualizing Sentiment Analysis on a User Forum
Qliktech, SWEDEN
Bittar, André Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Xerox Research Centre Europe, FRANCE
Bizer, Christian Evaluating the Impact of Phrase Recognition on Concept Tagging
DBpedia: A Multilingual Cross-domain Knowledge Base
DFKI GmbH, GERMANY
Blache, Philippe An ontological approach to model and query multimodal concurrent linguistic annotations
Université de Provence, FRANCE
Black, Alan Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
Language Technologies Institute, Carnegie Mellon University, UNITED STATES
Black, William A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
University of Manchester, UNITED KINGDOM
Blain, Frédéric Automatic Translation of Scientific Documents in the HAL Archive
SYSTRAN SA, FRANCE
Blanchon, Hervé Collection of a Large Database of French-English SMT Output Corrections
UPMF-Grenoble2, FRANCE
Blanco, Eduardo Polaris: Lymba's Semantic Parser
Lymba Corporation, UNITED STATES
Boeffard, Olivier Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora
Towards Fully Automatic Annotation of Audio Books for TTS
IRISA - University of Rennes 1, FRANCE
Boella, Guido NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
Università di Torino, ITALY
Bojar, Ondřej Automatic MT Error Analysis: Hjerson Helping Addicter
Terra: a Collection of Translation Error-Annotated Corpora
Announcing Prague Czech-English Dependency Treebank 2.0
The Joy of Parallelism with CzEng 1.0
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC
Bonafonte, Antonio Building Synthetic Voices in the META-NET Framework
BUCEADOR, a multi-language search engine for digital libraries
Universitat Politècnica de Catalunya, SPAIN
Bonansinga, Giulia LexIt: A Computational Resource on Italian Argument Structure
University of Pisa, Department of Linguistics, ITALY
Bondi Johannessen, Janne The Nordic Dialect Corpus
Universitetet i Oslo, Institutt for lingvistiske og nordiske studier, NORWAY
Bongelli, Ramona A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Macerata, ITALY
Bonneau-Maynard, Hélène Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier
LIMSI ― CNRS, FRANCE
Bordea, Georgeta Semi-Supervised Technical Term Tagging With Minimal User Feedback
Expertise Mining for Enterprise Content Management
National University of Ireland, IRELAND
Bordel, German KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
University of the Basque Country, SPAIN
Borgia, Fabrizio Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Université Paul Sabatier, IRIT, FRANCE
Borgwaldt, Susanne Association Norms of German Noun Compounds
TU Braunschweig, GERMANY
Borin, Lars Korp ― the corpus infrastructure of Spräkbanken
The open lexical infrastructure of Spräkbanken
Sprakbanken, University of Gothenburg, SWEDEN
Boruta, Luc A Phonemic Corpus of Polish Child-Directed Speech
Université Paris 7, FRANCE
Bosca, Alessio Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
CELI, ITALY
Bosco, Cristina The Parallel-TUT: a multilingual and multiformat treebank
A treebank-based study on the influence of Italian word order on parsing performance
Dipartimento di Informatica, Università di Torino, ITALY
Bos, Johan Developing a large semantically annotated corpus
University of Groningen, NETHERLANDS
Botchan, Nick Word Sense Inventories by Non-Experts.
Brandeis University, UNITED STATES
Bott, Stefan Text Simplification Tools for Spanish
Universitat Pompeu Fabra, SPAIN
Bouamor, Dhouha Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
CEA-LIST, FRANCE
Bouamor, Houda A contrastive review of paraphrase acquisition techniques
LIMSI - CNRS, FRANCE
Boudahmane, Karim New language resources for the Pashto language
Direction Générale de l'Armement, FRANCE
Bouillon, Pierrette Evaluating Appropriateness Of System Responses In A Spoken CALL Game
Annotating Qualia Relations in Italian and French Complex Nominals
University of Geneva, FTI/TIM, SWITZERLAND
Bouma, Gosse Multi-Layer Discourse Annotation of a Dutch Text Corpus
University of Groningen, NETHERLANDS
Bourse, Sarah A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures
IRIT-CNRS, FRANCE
Boutora, Leïla DEGELS1: A comparable corpus of French Sign Language and co-speech gestures
LPL & Aix-Marseille University, FRANCE
Boz, Umit Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, UNITED STATES
Braasch, Anna Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, CST, DENMARK
Bracewell, David Pursing power in Arabic on-line discussion forums
Language Computer Corporation, UNITED STATES
Braffort, Annelies DEGELS1: A comparable corpus of French Sign Language and co-speech gestures
, FRANCE
Branco, António TimeBankPT: A TimeML Annotated Corpus of Portuguese
A PropBank for Portuguese: the CINTIL-PropBank
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese
University of Lisbon, PORTUGAL
Bras, Myriam An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Breiteneder, Evelyn Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies.
Austrian Academy of Sciences, AUSTRIA
Bremin, Sofia Eye Tracking as a Tool for Machine Translation Error Analysis
Linköping University, SWEDEN
Brierley, Claire Predicting Phrase Breaks in Classical and Modern Standard Arabic Text
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
University of Leeds, UNITED KINGDOM
Briesch, Douglas Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES
Brkić, Marija BLEU Evaluation of Machine-Translated English-Croatian Legislation
University of Rijeka, Department of Informatics, CROATIA
Broadwell, Aaron Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, UNITED STATES
Broda, Bartosz Constraint Based Description of Polish Multiword Expressions
Inforex -- a web-based tool for text corpus management and semantic annotation
Tools for plWordNet Development. Presentation and Perspectives
KPWr: Towards a Free Corpus of Polish
Wroclaw University of Technology, POLAND
Broeder, Daan Federated Search: Towards a Common Search Infrastructure
Proper Language Resource Centers
Standardizing a Component Metadata Infrastructure
Citing on-line Language Resources
The Language Archive ― a new hub for language resources
Max Planck Institute for Psycholinguistics, NETHERLANDS
Brooke, Julian Measuring Interlanguage: Native Language Identification with L1-influence Metrics
University of Toronto, CANADA
Brouwer, Matthijs Dynamic web service deployment in a cloud environment
Meertens Institute, NETHERLANDS
Brown, Guy J. A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
University of Sheffield, UNITED KINGDOM
Brunessaux, Sylvie New language resources for the Pashto language
CASSIDIAN, FRANCE
Brüning, Bernhard PAMOCAT: Automatic retrieval of specified postures
CITEC Central Lab, University of Bielefeld, GERMANY
Bruno, Emmanuel An ontological approach to model and query multimodal concurrent linguistic annotations
Université du Sud Toulon Var, FRANCE
Buendía-Castro, Miriam Linguistic knowledge for specialized text production
University of Granada, SPAIN
Bueno-Díaz, Conchi Medical Term Extraction in an Arabic Medical Corpus
Computational Linguistics Laboratory-Autónoma University Madrid, SPAIN
Buitelaar, Paul Semi-Supervised Technical Term Tagging With Minimal User Feedback
Expertise Mining for Enterprise Content Management
National University of Ireland, IRELAND
Buldorini, Cinzia A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Macerata, ITALY
Bunt, Harry Using DiAML and ANVIL for multimodal dialogue annotations
The coding and annotation of multimodal dialogue acts
ISO 24617-2: A semantically-based standard for dialogue annotation
Tilburg Center for Cognition and Communication, Tilburg University, NETHERLANDS
Burchardt, Aljoscha Involving Language Professionals in the Evaluation of Machine Translation
DFKI GmbH, GERMANY
Burger, Birgitta Comparing computer vision analysis of signed language video with motion capture recordings
University of Jyväskyä, FINLAND
Burkhardt, Felix “You Seem Aggressive!” Monitoring Anger in a Practical Application
Fast Labeling and Transcription with the Speechalyzer Toolkit
Deutsche Telekom Laboratories, GERMANY
Buttery, Paula Reclassifying subcategorization frames for experimental analysis and stimulus generation
Annotating progressive aspect constructions in the spoken section of the British National Corpus
University of Cambridge, UNITED KINGDOM
Butt, Miriam A Reference Dependency Bank for Analyzing Complex Predicates
Universität Konstanz, GERMANY
Buyko, Ekaterina Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
Friedrich-Schiller-Universität Jena, GERMANY

 

C
Cabral, Joao Paulo Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Cabrera-Diego, Luis Adrián Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Instituto de Ingeniería, Universidad Nacional Autónoma de México, MEXICO
Caelen-Haumont, Geneviève Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
MICA Institute, HUST, VIET NAM
Cahill, Peter Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Caines, Andrew Reclassifying subcategorization frames for experimental analysis and stimulus generation
Annotating progressive aspect constructions in the spoken section of the British National Corpus
European Bioinformatics Institute, UNITED KINGDOM
Çakıcı, Ruket METU Turkish Discourse Bank Browser
Middle East Technical University, TURKEY
Çakmak, Mehmet Talha Word Alignment for English-Turkish Language Pair
Istanbul Technical University, TURKEY
Calabrese, Omar The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
University of Siena, ITALY
Callahan, Brendan Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
LDC, UNITED STATES
Calzolari, Nicoletta The Language Library: supporting community effort for collective resource production
The LRE Map. Harmonising Community Descriptions of Resources
The FLaReNet Strategic Language Resource Agenda
ILC-CNR, ITALY
Cambria, Erik Affective Common Sense Knowledge Acquisition for Sentiment Analysis
National University of Singapore, SINGAPORE
Camelin, Nathalie Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
LIUM, FRANCE
Caminero, Javier The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Telefónica Investigación y Desarrollo, SPAIN
Campbell, Nick An audiovisual political speech analysis incorporating eye-tracking and perception data
The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
Trinity College Dublin, IRELAND
Campillo, Francisco A Galician Syntactic Corpus with Application to Intonation Modeling
Universidade de Vigo, SPAIN
Campillos Llanos, Leonardo Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Autónoma de Madrid, Departamento de Lingüística, SPAIN
Cancedda, Nicola Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Xerox Research Centre Europe, FRANCE
Candido Jr., Arnaldo Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Center of Computational Linguistics (NILC), University of São Paulo, BRAZIL
Candito, Marie Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
Université Paris 7, FRANCE
Canestrari, Carla A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Macerata, ITALY
Capurro, Daniel Statistical Section Segmentation in Free-Text Clinical Records
Biomedical and Health Informatics, University of Washington, UNITED STATES
Carbonell, Jaime Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
LTI/CMU, UNITED STATES
Cardenal, Antonio BUCEADOR, a multi-language search engine for digital libraries
Universidad de Vigo, SPAIN
Cardiff, John Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Social Media Research Group, IIT Dublin, IRELAND
Cardoso, Nuno Rembrandt - a named-entity recognition framework
University of Lisbon, PORTUGAL
Carl, Michael Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research
Copenhagen Business School, DENMARK
Carré, Matthieu The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
ELRA in the heart of a cooperative HLT world
The REPERE Corpus : a multimodal corpus for person recognition
ELDA, FRANCE
Carrillo de Albornoz, Jorge SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis
Universidad Complutense de Madrid, SPAIN
Carson-Berndsen, Julie English to Indonesian Transliteration to Support English Pronunciation Practice
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Cartoni, Bruno Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction
Université de Genève, Département de Linguistique, SWITZERLAND
Caruso, Christopher Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LDC, UNITED STATES
Carvalheiro, Catarina A PropBank for Portuguese: the CINTIL-PropBank
University of Lisbon, PORTUGAL
Carvalho, Gracinda Building and Exploring Semantic Equivalences Resources
Universidade Aberta, PORTUGAL
Carvalho, Leonardo Corpus+WordNet thesaurus generation for ontology enriching
PUCRS, BRAZIL
Carvalho, Nuno Structural alignment of plain text books
Universidade do Minho, PORTUGAL
Caselli, Tommaso Assigning Connotation Values to Events
Customizable SCF Acquisition in Italian
Hong Kong Polytechnic University, HONG KONG
Cassidy, Steve The Australian National Corpus: National Infrastructure for Language Resources
Macquarie University, AUSTRALIA
Castellón, Irene VERTa: Linguistic features in MT evaluation
Universidad de Barcelona (UB), SPAIN
Castilho, Fernando Corpus+WordNet thesaurus generation for ontology enriching
PUCRS, BRAZIL
Castilho, Sheila PET: a Tool for Post-editing and Assessing Machine Translation
University of Wolverhampton, UNITED KINGDOM
Castillo, Mauro A proposal for improving WordNet Domains
UTEM, CHILE
Castro, Sérgio A PropBank for Portuguese: the CINTIL-PropBank
University of Lisbon, PORTUGAL
Catizone, Roberta LIE: Leadership, Influence and Expertise
University of Sheffield, UNITED KINGDOM
Cattoni, Roldano The KnowledgeStore: an Entity-Based Storage System
Fondazione Bruno Kessler, ITALY
Caudai, Claudia Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
ILC - CNR, Pisa, ITALY
Cetinoglu, Ozlem Irish Treebanking and Parsing: A Preliminary Evaluation
Universität Stuttgart, IMS, GERMANY
Cettolo, Mauro The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Fondazione Bruno Kessler, ITALY
Chang, Angel X. A Cross-Lingual Dictionary for English Wikipedia Concepts
SUTime: A library for recognizing and normalizing time expressions
Google Research, Google Inc., UNITED STATES
Charonnat, Laure Towards Fully Automatic Annotation of Audio Books for TTS
IRISA - University of Rennes 1, FRANCE
Charton, Eric A disambiguation resource extracted from Wikipedia for semantic annotation
École Polytechnique de Montréal, CANADA
Cheng, Shuk-Man NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
National Taiwan University, TAIWAN, PROVINCE OF CHINA
Cheng, Xiwen Annotating Opinions in German Political News
DFKI GmbH, GERMANY
Chen, Helen Kaiyun Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, HONG KONG
Chen, Hsin-Hsi Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition
NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
National Taiwan University, TAIWAN, PROVINCE OF CHINA
Chen, Tianqi Semi-Supervised Technical Term Tagging With Minimal User Feedback
Shanghai Jiao Tong University, CHINA
Chen, Yu Joint Grammar and Treebank Development for Mandarin Chinese with HPSG
MultiUN v2: UN Documents with Multilingual Alignments
DFKI GmbH, GERMANY
Chiarcos, Christian Ontologies of Linguistic Annotation: Survey and perspectives
The Open Linguistics Working Group
A generic formalism to represent linguistic corpora in RDF and OWL/DL
Information Science Institute, University of Southern California, UNITED STATES
Chiu, Tin-Shing A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, Department of Computing, HONG KONG
Choe, Jae-Woong ISO 24617-2: A semantically-based standard for dialogue annotation
Korea University, KOREA, REPUBLIC OF
Cho, Eunah The KIT Lecture Corpus for Speech Translation
Karlsruhe Institute of Technology, GERMANY
Choi, Dae-Lim Dysarthric Speech Database for Development of QoLT Software Technology
Wonkwang Univ., KOREA, REPUBLIC OF
Choi, Jinho D. Empty Argument Insertion in the Hindi PropBank
University of Colorado at Boulder, UNITED STATES
Choudhury, Monojit An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
Microsoft Research India, INDIA
Choudhury, Rahzeb On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
TAUS, NETHERLANDS
Choukri, Khalid ELRA in the heart of a cooperative HLT world
The FLaReNet Strategic Language Resource Agenda
New language resources for the Pashto language
Using the International Standard Language Resource Number: Practical and Technical Aspects
An Analytical Model of Language Resource Sustainability
ELDA/ELRA, FRANCE
Chowdhury, Md. Faisal Mahbub An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction
Fondazione Bruno Kessler, ITALY
Christiansen, Thomas Ulrich Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
Technical University of Denmark, DENMARK
Chu, Chenhui Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
Kyoto University, JAPAN
Chung, Minhwa Dysarthric Speech Database for Development of QoLT Software Technology
Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Seoul National Univ., KOREA, REPUBLIC OF
Cieri, Christopher Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
LDC Language Resource Database: Building a Bibliographic Database
LDC, UNITED STATES
Cignoni, Laura In the same boat and other idiomatic seafaring expressions
Istituto di Linguistica Computazionale, C.N.R., ITALY
Cimiano, Philipp Collaborative semantic editing of linked data lexica
CITEC, Universität Bielefeld, GERMANY
Cinková, Silvie A database of semantic clusters of verb usages
Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Ciravegna, Fabio Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
Unsupervised document zone identification using probabilistic graphical models
University of Sheffield, UNITED KINGDOM
Clapham, R.P. NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
University of Amsterdam/ACLC, NETHERLANDS
Clarke, James An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
University of Illinois (Urbana-Champaign), UNITED STATES
Clark, Eleanor Two Database Resources for Processing Social Media English Text
Hokkaido University, JAPAN
Claveau, Vincent Annotating Football Matches: Influence of the Source Medium on Manual Annotation
IRISA ― CNRS, FRANCE
Clematide, Simon Dependency parsing for interaction detection in pharmacogenomics
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Institute of Computational Linguistics, University of Zurich, SWITZERLAND
Clough, Paul Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Sheffield, UNITED KINGDOM
C.M. de Sousa, Sheila A review corpus annotated for negation, speculation and their scope
University of Wolverhampton, UNITED KINGDOM
Coesemans, Roel Irregularity Detection in Categorized Document Corpora
Universiteit Antwerp, BELGIUM
Coheur, Luísa An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Extending a wordnet framework for simplicity and scalability
Dealing with unknown words in statistical machine translation
IST, PORTUGAL
Collet, Christophe Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
Université Paul Sabatier, IRIT, FRANCE
Colpaert, Jozef The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
University of Antwerp, BELGIUM
Comeliau, Jean-Loup The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
W4 S.A., FRANCE
Comelles, Elisabet VERTa: Linguistic features in MT evaluation
Universidad de Barcelona (UB), SPAIN
Conkie, Alistair Building Text-To-Speech Voices in the Cloud
AT&T Labs ― Research, UNITED STATES
Constant, Matthieu A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
Extending the adverbial coverage of a French morphological lexicon
Université Paris-Est, LIGM, FRANCE
Copestake, Ann Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
University of Cambridge, UNITED KINGDOM
Corazza, Anna A treebank-based study on the influence of Italian word order on parsing performance
Dipartimento di Scienze Fisiche, Università ”Federico II” di Napoli, ITALY
Corcoglioniti, Francesco The KnowledgeStore: an Entity-Based Storage System
Fondazione Bruno Kessler, ITALY
Correia, Margarita The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
ILTEC, PORTUGAL
Corvey, William J. Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
University of Colorado, UNITED STATES
Costa, Ângela An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Dealing with unknown words in statistical machine translation
Centro de Linguistica da Universidade Nova de Lisboa, PORTUGAL
Costa, Francisco TimeBankPT: A TimeML Annotated Corpus of Portuguese
University of Lisbon, PORTUGAL
Costa-Jussà, Marta R. A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
BUCEADOR, a multi-language search engine for digital libraries
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Barcelona Media, SPAIN
Costa, Luís Págico: Evaluating Wikipedia-based information retrieval in Portuguese
Linguateca/FCCN, PORTUGAL
Costantini, Giovanni Intelligibility assessment in forensic applications
Department of Electronic Engineering, University of Rome “Tor Vergata”, ITALY
Cottin, Florent Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, CANADA
Crabbé, Benoit Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
Université Paris 7, FRANCE
Cristea, Dan Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
Faculty of Computer Science, “Alexandru Ioan Cuza” University Iași, ROMANIA
Cuadros, Montse Highlighting relevant concepts from Topic Signatures
Vicomtech-IK4, SPAIN
Cucchiarini, Catia The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
Radboud University Nijmegen, NETHERLANDS
Cuevas - Alonso, Miguel A Galician Syntactic Corpus with Application to Intonation Modeling
Universidade de Vigo, SPAIN
Cu, Jocelynn Building a Multimodal Laughter Database for Emotion Recognition
Center for Empathic Human-Computer Interactions, De La Salle University, PHILIPPINES
Curto, Sérgio Extending a wordnet framework for simplicity and scalability
INESC-ID/L2F, PORTUGAL
Cutrullà, Maria Grazia Enriching the ISST-TANL Corpus with Semantic Frames
Università di Pisa, ITALY
Cutugno, Francesco W-PhAMT: A web tool for phonetic multilevel timeline visualization
Università di Napoli Federico II, ITALY
Cvrček, František Legal electronic dictionary for Czech
Faculty of Informatics Masaryk University, CZECH REPUBLIC

 

D
Dacos, Marin Annotated Bibliographical Reference Corpora in Digital Humanities
CLEO, Centre for Open Electronic Publishing, FRANCE
Daelemans, Walter ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories
“Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
CLiPS - University of Antwerp, BELGIUM
Daiber, Joachim Evaluating the Impact of Phrase Recognition on Concept Tagging
German Research Center for Artificial Intelligence, GERMANY
Dalbelo Bašić, Bojana Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian
University of Zagreb, CROATIA
Dale, Robert A Framework for Evaluating Text Correction
Macquarie University, AUSTRALIA
d'Alessandro, Christophe Designing French Tale Corpora for Entertaining Text To Speech Synthesis
LIMSI-CNRS, FRANCE
Dalianis, Hercules Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text
Stockholm University, SWEDEN
Dalle, Patrice Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Université Paul Sabatier, IRIT, FRANCE
Damani, Om Re-ordering Source Sentences for SMT
IIT Bombay, INDIA
Damljanovic, Danica Applying Random Indexing to Structured Data to Find Contextually Similar Words
University of Sheffield, UNITED KINGDOM
Danielsson, Henrik Eye Tracking as a Tool for Machine Translation Error Analysis
A good space: Lexical predictors in word space evaluation
This also affects the context - Errors in extraction based summaries
Linköping University, SWEDEN
Danlos, Laurence Semantic annotation of French corpora: animacy and verb semantic classes
Université Paris 7, FRANCE
Darme, Cécile Unsupervised acquisition of concatenative morphology
Team RL, Laboratory I3S, University of Nice Sophia-Antipolis, FRANCE
Das, Dipanjan A Universal Part-of-Speech Tagset
Carnegie Mellon University, UNITED STATES
Davis, Chris Irwin Tajik-Farsi Persian Transliteration Using Statistical Machine Translation
University of Texas at Dallas, UNITED STATES
Dayrell, Carmen Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Department of Modern Languages, University of São Paulo, BRAZIL
Declerck, Thierry Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
The META-SHARE Metadata Schema for the Description of Language Resources
DFKI GmbH, GERMANY
De Clercq, Orphée Collection of a corpus of Dutch SMS
Evaluating automatic cross-domain Dutch semantic role annotation
LT3, Language and Translation Technology Team, University College Ghent, BELGIUM
De Cock, Martine Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Ghent University, BELGIUM
De Decker, Benny The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Degaetano-Ortlieb, Stefania Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach
Saarland University, GERMANY
Dekova, Rositsa Bulgarian X-language Parallel Corpus
Department of Computational Linguistics, Institute for Bulgarian, Bulgarian Academy of Sciences, BULGARIA
Delaborde, Agnes Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis
LIMSI ― CNRS, FRANCE
Deléglise, Paul TED-LIUM: an Automatic Speech Recognition dedicated corpus
Université du Maine, LIUM, FRANCE
Del Gratta, Riccardo The Language Library: supporting community effort for collective resource production
The LRE Map. Harmonising Community Descriptions of Resources
ILC-CNR, ITALY
Delhay, Arnaud Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora
IRISA - University of Rennes 1, FRANCE
DeLooze, Celine The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
Trinity College Dublin, IRELAND
del Pozo, Arantza SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Human Speech and Language Technologies, Vicomtech-IK4, SPAIN
De Luca, Ernesto William GerNED: A German Corpus for Named Entity Disambiguation
Is it Useful to Support Users with Lexical Resources? A User Study.
DAI-Labor, Technische Universität Berlin, GERMANY
De Marsico, Maria Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Università di Roma La Sapienza, ITALY
Demberg, Vera German and English Treebanks and Lexica for Tree-Adjoining Grammars
Saarland University, Department of Computational Linguistics, GERMANY
de Melo, Gerard Empirical Comparisons of MASC Word Sense Annotations
ICSI, UNITED STATES
Demirhan, Umut Ufuk Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Demir, Seniz Turkish Paraphrase Corpus
Tübitak-Bilgem, TURKEY
De Mori, Renato DECODA: a call-centre human-human spoken conversation corpus
Universite d’Avignon, FRANCE
Denis, Alexandre Representation of linguistic and domain knowledge for second language learning in virtual worlds
CNRS/LORIA, FRANCE
Den, Yasuharu Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Annotation of anaphoric relations and topic continuity in Japanese conversation
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
Chiba University, JAPAN
De Pauw, Guy The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Deplano, Melanie Syntactic annotation of spontaneous speech: application to call-center conversation data
Aix Marseille Univ, LIF-CNRS, FRANCE
Derczynski, Leon TIMEN: An Open Temporal Expression Normalisation Resource
Massively Increasing TIMEX3 Resources: A Transduction Approach
University of Sheffield, UNITED KINGDOM
D'Errico, Francesca Pedagogical stances and their multimodal signals.
Dipartimento di Scienze dell’Educazione ― Università Roma Tre, ITALY
Desipri, Elina The META-SHARE Metadata Schema for the Description of Language Resources
ILSP - Athena RC, GREECE
De Smedt, Koenraad Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Universitetet i Bergen, NORWAY
De Smedt, Tom “Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.
Universiteit Antwerpen, BELGIUM
Devillers, Laurence Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis
LIMSI ― CNRS, FRANCE
De Weerdt, Danny Comparing computer vision analysis of signed language video with motion capture recordings
University of Jyväskyä, FINLAND
D'Halleweyn, Elisabeth Smooth Sailing for STEVIN
Nederlandse Taalunie, NETHERLANDS
Diab, Mona Annotations for Power Relations on Email Threads
AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis
Conventional Orthography for Dialectal Arabic
Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations
Columbia University, UNITED STATES
di Caro, Luigi NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
Università di Torino, ITALY
Dickinson, Markus Using semi-experts to derive judgments on word sense alignment: a pilot study
Annotating Errors in a Hungarian Learner Corpus
Indiana University, UNITED STATES
Dickinson, Sven Detecting Reduplication in Videos of American Sign Language
University of Toronto, CANADA
Diez, Mireia KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
University of the Basque Country, SPAIN
Di Fabbrizio, Giuseppe Building Text-To-Speech Voices in the Cloud
AT&T Labs ― Research, UNITED STATES
Dima, Emanuel A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Universität Tübingen, GERMANY
Dinarelli, Marco Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
LIMSI ― CNRS, FRANCE
Dini, Luca Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
CELI, ITALY
Dinu, Liviu P. The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System
University of Bucharest, ROMANIA
Di Nunzio, Giorgio Maria A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Department of Information Engineering― University of Padua, ITALY
Dione, Cheikh M. Bamba A Morphological Analyzer For Wolof Using Finite-State Techniques
Department of Linguistic, University of Bergen, NORWAY
DiPersio, Denise Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
LDC Language Resource Database: Building a Bibliographic Database
LDC, UNITED STATES
Dipper, Stefanie The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
Ruhr-Universität Bochum, GERMANY
Doermann, David Linguistic Resources for Handwriting Recognition and Translation Evaluation
University of Maryland, UNITED STATES
Domingo, Judith Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
Barcelona Media Research Center, SPAIN
Dong, Yang Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Autónoma de Madrid, SPAIN
Doukhan, David Designing French Tale Corpora for Entertaining Text To Speech Synthesis
LIMSI-CNRS, FRANCE
Draper, Mary Pursing power in Arabic on-line discussion forums
Language Computer Corporation, UNITED STATES
Dras, Mark Irish Treebanking and Parsing: A Preliminary Evaluation
Macquarie University, AUSTRALIA
Dridan, Rebecca The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Universitetet i Oslo, NORWAY
Drouin, Patrick Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, CANADA
Drude, Sebastian ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
The Language Archive ― a new hub for language resources
Max Planck Institute for Psycholinguistics, NETHERLANDS
Drury, Brett The Minho Quotation Resource
LIAAD-INESC, University of Minho, PORTUGAL
Duka, Angelina GerNED: A German Corpus for Named Entity Disambiguation
DAI-Labor, Technische Universität Berlin, GERMANY
Dukes, Kais LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis
University of Leeds, UNITED KINGDOM
Duran, Magali Sanches Propbank-Br: a Brazilian Treebank annotated with semantic role labels
Universidade de São Paulo, BRAZIL
Durco, Matej Federated Search: Towards a Common Search Infrastructure
ICLTT, AUSTRIA
Durgar El-Kahlout, Ilknur Turkish Paraphrase Corpus
Tübitak-Bilgem, TURKEY
Durrell, Martin GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
University of Manchester, UNITED KINGDOM
Dušek, Ondřej The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Dussin, Marco A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Department of Information Engineering― University of Padua, ITALY

 

E
Eberle, Kurt A Tool/Database Interface for Multi-Level Analyses
Universität Stuttgart, GERMANY
Ebrahim, Mohamed JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool
European Commision, Joint Research Centre (JRC), ITALY
Eckart, Kerstin German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data
A Tool/Database Interface for Multi-Level Analyses
University of Stuttgart, GERMANY
Eckart, Thomas Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages
The Influence of Corpus Quality on Statistical Measurements on Language Resources
Universität Leipzig, GERMANY
Eckle-Kohler, Judith UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
The Open Linguistics Working Group
Technische Universität Darmstadt, GERMANY
Edlund, Jens 3rd party observer gaze as a continuous measure of dialogue flow
KTH Speech, Music and Hearing, SWEDEN
Egg, Markus Multi-Layer Discourse Annotation of a Dutch Text Corpus
Humboldt University, GERMANY
Eickhoff, Carsten The BladeMistress Corpus: From Talk to Action in Virtual Worlds
TUDelft, NETHERLANDS
Eisele, Andreas MultiUN v2: UN Documents with Multilingual Alignments
DGT-TM: A freely available Translation Memory in 22 languages
Directorate-General for Translation (DGT), LUXEMBOURG
Elahimanesh, Mohammad Hossein Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
Islamic Azad University, IRAN, ISLAMIC REPUBLIC OF
Elahi, Mohammad Fazleh An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use
University of Bremen, GERMANY
Elbers, Willem Proper Language Resource Centers
Max Planck Institute for Psycholinguistics, NETHERLANDS
El-Beze, Marc DECODA: a call-centre human-human spoken conversation corpus
Universite d’Avignon, FRANCE
Elfardy, Heba Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations
Columbia University, UNITED STATES
El-Haj, Mahmoud Assessing Crowdsourcing Quality through Objective Tasks
University of Essex, UNITED KINGDOM
Ellis, Joe Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
El Maarouf, Ismaïl A French Fairy Tale Corpus syntactically and semantically annotated
IRISA, FRANCE
Elson, David DramaBank: Annotating Agency in Narrative Discourse
Columbia University, UNITED STATES
Enache, Ramona Combining Language Resources Into A Grammar-Driven Swedish Parser
Gothenburg University, SWEDEN
Eom, Soojeong Using semi-experts to derive judgments on word sense alignment: a pilot study
Georgetown University, UNITED STATES
Erdogan, Hakan SUTAV: A Turkish Audio-Visual Database
Sabanci University, TURKEY
Eriksen, Olle BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users
Møller-Trøndelag Kompetansesenter, NORWAY
Eriksson, Anders Visualizing Sentiment Analysis on a User Forum
Lund University, SWEDEN
Erjavec, Tomaž The goo300k corpus of historical Slovene
Institut Jozef Stefan, Department of Knowledge Technologies, SLOVENIA
Erro, Daniel Versatile Speech Databases for High Quality Synthesis for Basque
Strategies to Improve a Speaker Diarisation Tool
Using an ASR database to design a pronunciation evaluation system in Basque
University of the Basque Country, SPAIN
Eryiğit, Gülşen The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish
Word Alignment for English-Turkish Language Pair
Istanbul Technical University, TURKEY
Escalada, José Gregorio The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Speech Technology Group, Telefónica Investigación y Desarrollo, SPAIN
Eskevich, Maria Creating a Data Collection for Evaluating Rich Speech Retrieval
Centre for Digital Video Processing, School of Computing, Dublin City University, IRELAND
Esperança-Rodier, Emmanuelle Collection of a Large Database of French-English SMT Output Corrections
UJF-Grenoble1, FRANCE
Esplà-Gomis, Miquel Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation
Universitat d'Alacant (UA), SPAIN
Estève, Yannick TED-LIUM: an Automatic Speech Recognition dedicated corpus
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Université du Maine, LIUM, FRANCE
Evang, Kilian Developing a large semantically annotated corpus
University of Groningen, NETHERLANDS
Exner, Peter Constructing Large Proposition Databases
Lund University, SWEDEN

 

F
Faath, Elodie Annotated Bibliographical Reference Corpora in Digital Humanities
CLEO, Centre for Open Electronic Publishing, FRANCE
Fabbri, Marco RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
Università di Firenze, ITALY
Fabre, Cecile An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Faessler, Erik Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
Friedrich-Schiller-Universität Jena, GERMANY
Falco, Mathieu-Henri Kitten: a tool for normalizing HTML and extracting its textual content
LIMSI - CNRS, FRANCE
Falk, Ingrid Representation of linguistic and domain knowledge for second language learning in virtual worlds
Lorraine University/LORIA, FRANCE
Fallu, Mark The Australian National Corpus: National Infrastructure for Language Resources
Griffith University, AUSTRALIA
Fang, Alex Chengyu ISO 24617-2: A semantically-based standard for dialogue annotation
Tilburg University, NETHERLANDS
Faralli, Stefano A New Method for Evaluating Automatically Learned Terminological Taxonomies
Sapienza Università di Roma, ITALY
Farré, Jacques Unsupervised acquisition of concatenative morphology
Team RL, Laboratory I3S, University of Nice Sophia-Antipolis, FRANCE
Faulstich, Lukas C. SemScribe: Natural Language Generation for Medical Reports
ID GmbH & Co. KgaA, GERMANY
Favre, Benoit Syntactic annotation of spontaneous speech: application to call-center conversation data
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Aix Marseille Univ, LIF-CNRS, FRANCE
Fazly, Afsaneh Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Institute for Research in Fundamental Sciences, IRAN, ISLAMIC REPUBLIC OF
Federico, Marcello The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Fondazione Bruno Kessler, ITALY
Federmann, Christian Involving Language Professionals in the Evaluation of Machine Translation
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
DFKI GmbH, GERMANY
Feilmayr, Christina EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
Johannes Kepler University Linz, AUSTRIA
Fellbaum, Christiane The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
Princeton University, UNITED STATES
Felt, Paul First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Brigham Young University, UNITED STATES
Feltrim, Valéria Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Department of Informatics, State University of Maringa, BRAZIL
Fernandes, Paulo A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever
PUCRS, BRAZIL
Fernández Martínez, Fernando Investigating Verbal Intelligence Using the TF-IDF Approach
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores
E.T.S.I. de Telecomunicacion, Universidad Politécnica de Madrid, SPAIN
Fernandez-Ordonez, Erwin Unsupervised Word Sense Disambiguation with Multilingual Representations
University of North Texas, UNITED STATES
Fernandez, Raquel Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
University of Amsterdam, NETHERLANDS
Fernando, Samuel Matching Cultural Heritage items to Wikipedia
Mapping WordNet synsets to Wikipedia articles
University of Sheffield, UNITED KINGDOM
Ferreira, José Pedro The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
ILTEC, PORTUGAL
Ferret, Olivier Evaluation of Unsupervised Information Extraction
Evaluation of a Complex Information Extraction Application in Specific Domain
CEA, LIST, Vision and Content Engineering Laboratory, FRANCE
Ferro, Marcello Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
ILC - CNR, Pisa, ITALY
Fialho, Pedro Extending a wordnet framework for simplicity and scalability
INESC-ID/L2F, PORTUGAL
Filatova, Elena Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing
Computer and Information Science Department, Fordham University, UNITED STATES
Filhol, Michael Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
LIMSI - CNRS, FRANCE
Finley, James International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MITRE Corporation, UNITED STATES
Fisas, Beatríz The IULA Treebank
Universitat Pompeu Fabra, SPAIN
Fischer, Peter M. The New IDS Corpus Analysis Platform: Challenges and Prospects
Institute for the German Language (IDS), GERMANY
Fiscus, Jonathan Creating HAVIC: Heterogeneous Audio Visual Internet Collection
NIST, UNITED STATES
Fišer, Darja Cleaning noisy wordnets
Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Ljubljana, SLOVENIA
Fishel, Mark SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Automatic MT Error Analysis: Hjerson Helping Addicter
Terra: a Collection of Translation Error-Annotated Corpora
TextShuttle GmbH, SWITZERLAND
Fiumara, James Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LDC, UNITED STATES
Flickinger, Dan The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Stanford University, UNITED STATES
Fohr, Dominique CoALT: A Software for Comparing Automatic Labelling Tools
INRIA/LORIA, Université Henri Poincaré, FRANCE
Fokkens, Antske CLIMB grammars: three projects using metagrammar engineering
Saarland University, Department of Computational Linguistics, GERMANY
Fonollosa, José A. R. BUCEADOR, a multi-language search engine for digital libraries
Universitat Politècnica de Catalunya, SPAIN
Forascu, Corina Evaluating Machine Reading Systems through Comprehension Tests
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information
Al. I. Cuza University of Iasi, ROMANIA
Formiga, Lluís The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output
Universitat Politècnica de Catalunya, SPAIN
Fornaciari, Tommaso DeCour: a corpus of DEceptive statements in Italian COURts
University of Trento, ITALY
Forner, Pamela Evaluating Machine Reading Systems through Comprehension Tests
CELCT, ITALY
Forsberg, Markus Cloud Logic Programming for Integrating Language Technology Resources
Korp ― the corpus infrastructure of Spräkbanken
The open lexical infrastructure of Spräkbanken
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Gothenburg, SWEDEN
Forster, Jens RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
RWTH Aachen University, GERMANY
Fort, Karën Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
Annotating Football Matches: Influence of the Source Medium on Manual Annotation
INIST - CNRS, FRANCE
Fosler-Lussier, Eric Associative and Semantic Features Extracted From Web-Harvested Corpora
Ohio State University, UNITED STATES
Foster, Jennifer Irish Treebanking and Parsing: A Preliminary Evaluation
Dublin City University, IRELAND
Fox Tree, Jean A Corpus for Research on Deliberation and Debate
UCSC UARC, UNITED STATES
François, Claire Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
INIST - CNRS, FRANCE
Francom, Jerid Boosting statistical tagger accuracy with simple rule-based grammars
Wake Forest University, UNITED STATES
Francopoulo, Gil The LRE Map. Harmonising Community Descriptions of Resources
The META-SHARE Metadata Schema for the Description of Language Resources
LIMSI ― CNRS, FRANCE
Frederking, Robert Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
LTI/CMU, UNITED STATES
Freitas, Cláudia Págico: Evaluating Wikipedia-based information retrieval in Portuguese
PUC-Rio, BRAZIL
Friberg Heppin, Karin The Rocky Road towards a Swedish FrameNet - Creating SweFN
Semantic Role Labeling with the Swedish FrameNet
University of Gothenburg, SWEDEN
Frick, Elena The New IDS Corpus Analysis Platform: Challenges and Prospects
Evaluating Query Languages for a Corpus Processing System
Institute for the German Language (IDS), GERMANY
Friedrich, Annemarie Suffix Trees as Language Models
Universität des Saarlandes, GERMANY
Friesen, Rafael LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Frommer, Jörg LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Frontini, Francesca Customizable SCF Acquisition in Italian
The Language Library: supporting community effort for collective resource production
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser
The META-SHARE Metadata Schema for the Description of Language Resources
ILC-CNR, ITALY
Fuchs, Matthew A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
Paideia Inc, UNITED STATES
Fučíková, Eva Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Fuentes, Maria Spell Checking in Spanish: The Case of Diacritic Accents
Summarizing a multimodal set of documents in a Smart Room
Fundació Barcelona Media, SPAIN
Fujii, Atsushi Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions
Tokyo Institute of Technology, JAPAN
Fujii, Yuya Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions
Tokyo Institute of Technology, JAPAN
Fung, Pascale A Multilingual Natural Stress Emotion Database
A Mandarin-English Code-Switching Corpus
Hong Kong University of Science and Technology, HONG KONG
Fürstenau, Hagen The Dependency-Parsed FrameNet Corpus
Columbia University, UNITED STATES
Furukawa, Hirohisa Multimodal Corpus of Multi-party Conversations in Second Language
Doshisha University, JAPAN

 

G
Gábor, Kata Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
Alpage, INRIA Paris-Rocquencourt & Université Paris 7, FRANCE
Gagliardi, Gloria A topologic view of Topic and Focus marking in Italian
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
Department of Italian Studies, University of Firenze, ITALY
Gagnon, Michel A disambiguation resource extracted from Wikipedia for semantic annotation
École Polytechnique de Montréal, CANADA
Gahbiche-Braham, Souhir Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier
LIMSI ― CNRS, FRANCE
Gaizauskas, Robert Assessing the Comparability of News Texts
TIMEN: An Open Temporal Expression Normalisation Resource
Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles
A light way to collect comparable corpora from the Web
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Sheffield, UNITED KINGDOM
Galibert, Olivier Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
The REPERE Corpus : a multimodal corpus for person recognition
Laboratoire National de métrologie et d'Essais / INRA, Jouy-en-Josas, France / Synapse Développement, Toulouse, France / Jouve, Mayenne, France, FRANCE
Galuščáková, Petra The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Ganis, James The BladeMistress Corpus: From Talk to Action in Virtual Worlds
University of Edinburgh, UNITED KINGDOM
Garcia Casademont, Emília Building Synthetic Voices in the META-NET Framework
Universitat Politècnica de Catalunya, SPAIN
Garcia, Fernando The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Universitat Politècnica de Valencia, SPAIN
García-Miguel, José M. A Galician Syntactic Corpus with Application to Intonation Modeling
Universidade de Vigo, SPAIN
Gardent, Claire Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Representation of linguistic and domain knowledge for second language learning in virtual worlds
CNRS/LORIA Nancy, FRANCE
Garrido, Juan María The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Speech and Language Group, Barcelona Media Centre d’Innovació, SPAIN
Garrote-Salazar, Marta Medical Term Extraction in an Arabic Medical Corpus
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Computational Linguistics Laboratory-Autónoma University Madrid, SPAIN
Gatt, Albert Incorporating an Error Corpus into a Spellchecker for Maltese
A Repository of Data and Evaluation Resources for Natural Language Generation
University of Malta, MALTA
Gavrila, Monica Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation
University of Hamburg, GERMANY
Gavrilidou, Maria Standardizing a Component Metadata Infrastructure
The META-SHARE Metadata Schema for the Description of Language Resources
ILSP, Athens, GREECE
Gavrilov, Zoya Detecting Reduplication in Videos of American Sign Language
University of Toronto, CANADA
Gayo, Iria A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
Universidad de Santiago de Compostela, SPAIN
Gebre, Binyam Gebrekidan AVATecH ― automated annotation through audio and video analysis
Towards Automatic Gesture Stroke Detection
Max Planck Institute for Psycholinguistics, NETHERLANDS
Généreux, Michel Introducing the Reference Corpus of Contemporary Portuguese Online
CLUL, PORTUGAL
Genov, Angel Bulgarian X-language Parallel Corpus
Expanding Parallel Resources for Medium-Density Languages for Free
Department of Computational Linguistics, Institute for Bulgarian, Bulgarian Academy of Sciences, BULGARIA
Georgakopoulou, Panayota SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Deluxe Digital Studios, UNITED KINGDOM
Georgila, Kallirroi Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
Institute for Creative Technologies, University of Southern California, UNITED STATES
Georgi, Ryan Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
University of Washington, UNITED STATES
Gerlach, Johanna Evaluating Appropriateness Of System Responses In A Spoken CALL Game
University of Geneva, FTI/TIM, SWITZERLAND
Gershman, Anatole Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
LTI/CMU, UNITED STATES
Gerten, Jillian The Twins Corpus of Museum Visitor Questions
University of Southern California, Institute for Creative Technologies, UNITED STATES
Gertz, Michael Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards
Universität Heidelberg, GERMANY
Gervás, Pablo SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis
Universidad Complutense de Madrid, SPAIN
Gesmundo, Andrea Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification
University of Geneva, SWITZERLAND
Ghayoomi, Masood From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach
Freie Universität Berlin, GERMANY
Gheorghita, Inga Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary
ATILF Nancy / University of Riga / HAS Busapest, FRANCE
Ghosh, Sucheta Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
DISI, University of Trento, ITALY
Ghribi, Maha Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
INIST - CNRS, FRANCE
Giampiccolo, Danilo Chinese Whispers: Cooperative Paraphrase Acquisition
CELCT, ITALY
Giannopoulou, Ioanna META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
ELDA, FRANCE
Giannoudaki, Maria Associative and Semantic Features Extracted From Web-Harvested Corpora
Technical University of Crete, GREECE
Giannoulis, Panagiotis A hierarchical approach with feature selection for emotion recognition from speech
Department of Electrical and Computer Engineering, National Technical University of Athens, GREECE
Gibbon, Dafydd ULex: new data models and a mobile environment for corpus enrichment.
Universität Bielefeld, GERMANY
Gilmartin, Emer The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
Trinity College Dublin, IRELAND
Gindl, Stefan MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
MODUL University Vienna, Department of New Media Technology, AUSTRIA
Giovannetti, Emiliano Creation of a bottom-up corpus-based ontology for Italian Linguistics
ILC - CNR, Pisa, ITALY
Gippert, Jost ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
University of Frankfurt, GERMANY
Girardi, Christian META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
The KnowledgeStore: an Entity-Based Storage System
Fondazione Bruno Kessler, ITALY
Giraudel, Aude The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
The REPERE Corpus : a multimodal corpus for person recognition
DGA, FRANCE
Girju, Roxana YADAC: Yet another Dialectal Arabic Corpus
Department of Linguistics, University of Illinois at Urbana-Champaign, UNITED STATES
Glaros, Nikos Collecting and Using Comparable Corpora for Statistical Machine Translation
ILSP - Athena RC, GREECE
Goggi, Sara From medical language processing to BioNLP domain
, ITALY
Göhring, Anne A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank
Institute for Computational Linguistics, University of Zurich, SWITZERLAND
Gojun, Anita Adapting and evaluating a generic term extraction tool
Institute for Natural Language Processing, University of Stuttgart, GERMANY
Goldhahn, Dirk Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages
The Influence of Corpus Quality on Statistical Measurements on Language Resources
Universität Leipzig, GERMANY
Gomes, Mariana Project FLY: a multidisciplinary project within Linguistics
Centro de Linguística da Universidade de Lisboa, PORTUGAL
Gonçalves, Patricia Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese
University of Lisbon, PORTUGAL
Gonzalez-Agirre, Aitor Multilingual Central Repository version 3.0
A proposal for improving WordNet Domains
Euskal Herriko Unibertsitatea, Informatika Fakultatea, SPAIN
Gonzalez, Matilde Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
Université Paul Sabatier, IRIT, FRANCE
Gonzalo, Carlos Using Language Resources in Humanities research
Universitat Pompeu Fabra, SPAIN
Goodwin, Travis Annotating Spatial Containment Relations Between Events
University of Texas at Dallas, UNITED STATES
Gornostay, Tatiana Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Tilde, LATVIA
Görög, Attila DutchSemCor: Targeting the ideal sense-tagged corpus
VU University Amsterdam, NETHERLANDS
Gozalo, Paula Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Autónoma de Madrid, SPAIN
Graça, João A PropBank for Portuguese: the CINTIL-PropBank
University of Lisbon, PORTUGAL
Graff, David Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects
LDC, UNITED STATES
Granada, Roger Corpus+WordNet thesaurus generation for ontology enriching
PUCRS, BRAZIL
Grau, Brigitte Evaluation of Unsupervised Information Extraction
LIMSI ― CNRS, FRANCE
Gravier, Guillaume The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
AFCP, FRANCE
Gray, Steven A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
University College London, UNITED KINGDOM
Greenwood, Mark A. Large Scale Semantic Annotation, Indexing and Search at The National Archives
University of Sheffield, UNITED KINGDOM
Gregori, Lorenzo RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
Università di Firenze, ITALY
Grezka, Aude Building a database of French frozen adverbial phrases
LDI, University of Paris 13, FRANCE
Griffitt, Kira Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Grimes, Stephen Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Automatic word alignment tools to scale production of manually aligned parallel texts
Linguistic Resources for Handwriting Recognition and Translation Evaluation
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Grishman, Ralph Challenges in the Knowledge Base Population Slot Filling Task
New York University, UNITED STATES
Grissom II, Alvin Annotating Factive Verbs
National Institute of Informatics, JAPAN
Grosse, Corinne Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
CNAM-CRTD, FRANCE
Grotz, Maximilian Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues
Institute of Communications Engineering ― Dialogue Systems, University of Ulm, GERMANY
Grouin, Cyril Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
LIMSI - CNRS, FRANCE
Gruzitis, Normunds An implementation of a Latvian resource grammar in Grammatical Framework
Institute of Mathematics and Computer Science, University of Latvia, LATVIA
Guerini, Marco Brand Pitt: A Corpus to Explore the Art of Naming
FBK-Irst, ITALY
Guerrisi, Vincenzo Evaluating Multi-focus Natural Language Queries over Data Services
DEI - Politecnico di Milano, ITALY
Guilherme, Ana Project FLY: a multidisciplinary project within Linguistics
Centro de Linguística da Universidade de Lisboa, PORTUGAL
Guirao, José M. Medical Term Extraction in an Arabic Medical Corpus
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Granada University, SPAIN
Gulati, Asheesh Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
University of Geneva, SWITZERLAND
Gupta, Kanika Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
NI Systems (India) Pvt. Ltd. Bangalore, INDIA
Gupta, Somya Experiences in Resource Generation for Machine Translation through Crowdsourcing
IIT Bombay, INDIA
Gurevych, Iryna UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
The Open Linguistics Working Group
Technische Universität Darmstadt, GERMANY
Gurrutxaga, Antton Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
Elhuyar Foundation, SPAIN
Gustavsson, Lisa 3rd party observer gaze as a continuous measure of dialogue flow
Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH, SWEDEN
Guthrie, Josh EmpaTweet: Annotating and Detecting Emotions on Twitter
Human Language Technology Research Institute, University of Texas at Dallas, UNITED STATES
Guthrie, Louise LIE: Leadership, Influence and Expertise
University of Sheffield, UNITED KINGDOM

 

H
Haase, Matthias LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Habash, Nizar Conventional Orthography for Dialectal Arabic
Columbia University, UNITED STATES
Hadrich Belguith, Lamia Clause-based Discourse Segmentation of Arabic Texts
MIRACL, TUNISIA
Haertel, Robbie First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Brigham Young University, UNITED STATES
Hagège, Caroline Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Xerox Research Centre Europe, FRANCE
Hagen, Kristin The Nordic Dialect Corpus
Universitetet i Oslo, Institutt for lingvistiske og nordiske studier, NORWAY
Hagita, Norihiro Body-conductive acoustic sensors in human-robot communication
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN
Hahn, Udo Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
CALBC: Releasing the Final Corpora
Friedrich-Schiller-Universität Jena, GERMANY
Haja, Gabriela Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
“Alexandru Philippide” Institute of Philology, Romanian Academy, the Iași, ROMANIA
Hajič, Jan HamleDT: To Parse or Not to Parse?
Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Hajičová, Eva Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Haltrup Hansen, Dorte A Distributed Resource Repository for Cloud-Based Machine Translation
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Copenhagen, DENMARK
Hammarström, Harald Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages
Max Planck Institute for Evolutionary Anthropology, GERMANY
Hamon, Olivier Towards a User-Friendly Platform for Building Language Resources based on Web Services
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
Using the International Standard Language Resource Number: Practical and Technical Aspects
On the Way to a Legal Sharing of Web Applications in NLP
ELDA, FRANCE
Hamzeh, Ali Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Shiraz University, IRAN, ISLAMIC REPUBLIC OF
Hana, Jirka Getting more data -- Schoolkids as annotators
Building a learner corpus
Charles University in Prague, CZECH REPUBLIC
Han, Jing Guang The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
Trinity College Dublin, IRELAND
Hanoka, Valérie Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources
INRIA, FRANCE
Harabagiu, Sanda M. Annotating Spatial Containment Relations Between Events
EmpaTweet: Annotating and Detecting Emotions on Twitter
University of Texas at Dallas, UNITED STATES
Hara, Sunao Causal analysis of task completion errors in spoken music retrieval interactions
Nara Institute of Science and Technology, JAPAN
Hartmann, Silvana UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
The Open Linguistics Working Group
Technische Universität Darmstadt, GERMANY
Haselbach, Boris German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data
A Tool/Database Interface for Multi-Level Analyses
University of Stuttgart, GERMANY
Hasida, Koiti ISO 24617-2: A semantically-based standard for dialogue annotation
Tilburg University, NETHERLANDS
Hassan, Samer Unsupervised Word Sense Disambiguation with Multilingual Representations
University of North Texas, UNITED STATES
Haugh, Michael The Australian National Corpus: National Infrastructure for Language Resources
Griffith University, AUSTRALIA
Hautli, Annette A Reference Dependency Bank for Analyzing Complex Predicates
Universität Konstanz, GERMANY
Havasi, Catherine Representing General Relational Knowledge in ConceptNet 5
MIT, UNITED STATES
Hayashi, Yoshihiko Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types
Osaka University, JAPAN
Hazem, Amir Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
LINA, Université de Nantes, FRANCE
Heal, Kristian First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Brigham Young University, UNITED STATES
Heid, Ulrich French and German Corpora for Audience-based Text Type Classification
Adapting and evaluating a generic term extraction tool
Analyzing and Aligning German compound nouns
A Tool/Database Interface for Multi-Level Analyses
IMS, Universität Stuttgart, GERMANY
Heinroth, Tobias Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues
Institute of Communications Engineering ― Dialogue Systems, University of Ulm, GERMANY
Héja, Enikő Automatically Generated Online Dictionaries
Dept. of Language Technology, Research Institute for Linguistics, HAS, HUNGARY
Heldner, Mattias 3rd party observer gaze as a continuous measure of dialogue flow
Stockholm University, Linguistics, SWEDEN
Hellmann, Sebastian The Open Linguistics Working Group
Department of Computer Science, University of Leipzig, GERMANY
Hendrickx, Iris Introducing the Reference Corpus of Contemporary Portuguese Online
Modality in Text: a Proposal for Corpus Annotation
CLUL, PORTUGAL
Henestroza Anguiano, Enrique Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
INRIA, FRANCE
Hennig, Leonhard GerNED: A German Corpus for Named Entity Disambiguation
DAI-Labor, Technische Universität Berlin, GERMANY
Henrichsen, Peter Juel Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
SMALLWorlds -- Multilingual Content-Controlled Monologues
Copenhagen Business School, DENMARK
Henrich, Verena A Comparative Evaluation of Word Sense Disambiguation Algorithms for German
A Repository for the Sustainable Management of Research Data
Universität Tübingen - SFS, GERMANY
Henriksen, Lina Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, CST, DENMARK
Heracleous, Panikos Body-conductive acoustic sensors in human-robot communication
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN
Hermida, Jesús M. Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
University of Alicante, SPAIN
Hernáez, Inma Versatile Speech Databases for High Quality Synthesis for Basque
Using an ASR database to design a pronunciation evaluation system in Basque
University of the Basque Country, SPAIN
Hernandez, Nicolas Tackling interoperability issues within UIMA workflows
LINA ― University of Nantes, FRANCE
Herrmann, Teresa The KIT Lecture Corpus for Speech Translation
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Karlsruhe Institute of Technology, GERMANY
He, Ruixin Evolution of Event Designation in Media: Preliminary Study
LIMSI ― CNRS, FRANCE
He, Yulan Quantising Opinions for Political Tweets Analysis
Open University, UNITED KINGDOM
Hilgers, F.J.M. NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
The Netherlands Cancer Institute, NETHERLANDS
Hinrichs, Erhard A Comparative Evaluation of Word Sense Disambiguation Algorithms for German
Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC
A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Universität Tübingen - SFS, GERMANY
Hinrichs, Marie A Repository for the Sustainable Management of Research Data
Universität Tübingen, GERMANY
Hirst, Graeme Measuring Interlanguage: Native Language Identification with L1-influence Metrics
University of Toronto, CANADA
Hjalmarsson, Anna 3rd party observer gaze as a continuous measure of dialogue flow
KTH Speech, Music and Hearing, SWEDEN
Hladka, Barbora Getting more data -- Schoolkids as annotators
Charles University in Prague, CZECH REPUBLIC
Ho-Dac, Mai An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Hofer, Gregor Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis
FTW Telecommunications Research Center Vienna, AUSTRIA
Holmqvist, Maria Alignment-based reordering for SMT
Linköping University, SWEDEN
Holub, Martin A database of semantic clusters of verb usages
Charles University in Prague, CZECH REPUBLIC
Hong, Hyejin Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Department of Linguistics, Seoul National University, KOREA, REPUBLIC OF
Hoppermann, Christina A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Universität Tübingen, GERMANY
Horák, Aleš Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification
Masarykova Univerzita, Brno, CZECH REPUBLIC
Hornbostel, Kerstin Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
Friedrich-Schiller-Universität Jena, GERMANY
Hoste, Veronique From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Evaluating automatic cross-domain Dutch semantic role annotation
Beyond SoNaR: towards the facilitation of large corpus building efforts
University College Ghent and Ghent University, BELGIUM
Hovy, Eduard Evaluating Machine Reading Systems through Comprehension Tests
ISI / USC, UNITED STATES
Hoyoux, Thomas RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
University of Innsbruck, AUSTRIA
Huang, Chu-Ren A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, HONG KONG
Huang, Pei-Wen NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Institute for Information Industry, TAIWAN, PROVINCE OF CHINA
Hu, Hongzhan Eye Tracking as a Tool for Machine Translation Error Analysis
Linköping University, SWEDEN
Hulden, Mans Boosting statistical tagger accuracy with simple rule-based grammars
University of Helsinki, FINLAND
Humphreys, Llio NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
Università di Torino, ITALY
Hurtado, Lluis-F. The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Universitat Politècnica de Valencia, SPAIN
Hussain, Amir Affective Common Sense Knowledge Acquisition for Sentiment Analysis
University of Stirling, UNITED KINGDOM
Huszka, Csaba Creation and use of Language Resources in a Question-Answering eHealth System
Freiburg University, GERMANY

 

I
Ide, Nancy The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
Vassar College, UNITED STATES
Idiart, Marco A large scale annotated child language construction database
Federal University of Rio Grande do Sul, BRAZIL
Iida, Ryu The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues
Tokyo Institute of Technology, JAPAN
Iliev, Georgi Expanding Parallel Resources for Medium-Density Languages for Free
Bulgarian Academy of Sciences, BULGARIA
Illouz, Gabriel A contrastive review of paraphrase acquisition techniques
LIMSI - CNRS, FRANCE
Imamura, Kenji Constructing a Class-Based Lexical Dictionary using Interactive Topic Models
NTT Cyber Space Laboratories, NTT Corporation, JAPAN
Ingason, Anton Karl The Icelandic Parsed Historical Corpus (IcePaHC)
University of Pennsylvania, UNITED STATES
Inoue, Masashi Collecting humorous expressions from a community-based question-answering-service corpus
Yamagata University, JAPAN
Ion, Radu ROMBAC: The Romanian Balanced Annotated Corpus
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora
RACAI, Romanian Academy Bucharest, ROMANIA
Iosif, Elias SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks
Associative and Semantic Features Extracted From Web-Harvested Corpora
Technical University of Crete, GREECE
Ipasov, Mirlan A finite-state morphological transducer for Kyrgyz
Indiana University, UNITED STATES
Irimia, Elena ROMBAC: The Romanian Balanced Annotated Corpus
RACAI, Romanian Academy Bucharest, ROMANIA
Irsig, Kristin SemScribe: Natural Language Generation for Medical Reports
ID GmbH & Co. KgaA, GERMANY
Ishi, Carlos Body-conductive acoustic sensors in human-robot communication
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN
Ishida, Toru Two Phase Evaluation for Selecting Machine Translation Services
Service Composition Scenarios for Task-Oriented Translation
Kyoto University, JAPAN
Islam, Zahurul Customization of the Europarl Corpus for Translation Studies
Goethe-Universität Frankfurt, GERMANY
Ismael, Safa Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Linguistic Resources for Handwriting Recognition and Translation Evaluation
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Ivanova, Nedelina BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users
The Communication Centre for the Deaf and Hard-of-Hearing, ICELAND
Izquierdo, Rubén DutchSemCor: Targeting the ideal sense-tagged corpus
Tilburg University, NETHERLANDS

 

J
Jabaian, Bassam Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
University of Avignon, LIA-CERI, FRANCE
Jäger, Petr Building a learner corpus
Charles University in Prague, CZECH REPUBLIC
Jaja, Claire Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES
Jakob, Max DBpedia: A Multilingual Cross-domain Knowledge Base
Neofonie GmbH, GERMANY
Jakubíček, Miloš Building a 70 billion word corpus of English from ClueWeb
Masaryk University, CZECH REPUBLIC
Janssen, Maarten The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
NeoTag: a POS Tagger for Grammatical Neologism Detection
Universitat Pompeu Fabra, Instituto Universitario de Lingüística Aplicada, SPAIN
Jantunen, Tommi Comparing computer vision analysis of signed language video with motion capture recordings
University of Jyväskyä, FINLAND
Jastrzebska, Justyna A Phonemic Corpus of Polish Child-Directed Speech
Université Paris 7, FRANCE
Jauch, Ronny Association Norms of German Noun Compounds
Universität Stuttgart, GERMANY
Jean-Louis, Ludovic Evaluation of a Complex Information Extraction Application in Specific Domain
CEA, LIST, Vision and and Content Engineering Laboratory, FRANCE
Jezek, Elisabetta Annotating Qualia Relations in Italian and French Complex Nominals
Università di Pavia, ITALY
Ji, Heng Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Computer Science Department and Linguistics Department, Queens College and Graduate Center, City University of New York, UNITED STATES
Jínová, Pavlína Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component
Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics, CZECH REPUBLIC
Jin, Peng CLTC: A Chinese-English Cross-lingual Topic Corpus
Leshan Normal University, CHINA
J. Maña, Manuel A review corpus annotated for negation, speculation and their scope
Universidad de Huelva, SPAIN
Joachimsen, Jan Incorporating an Error Corpus into a Spellchecker for Maltese
University of Malta, MALTA
Jóhannsdóttir, Kristín Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Iceland, ICELAND
Johannsen, Anders Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, CST, DENMARK
Johansson Kokkinakis, Sofie Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish
University of Gothenburg, SWEDEN
Johansson, Richard Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
Semantic Role Labeling with the Swedish FrameNet
University of Gothenburg, SWEDEN
Johnson, Joseph EmpaTweet: Annotating and Detecting Emotions on Twitter
Human Language Technology Research Institute, University of Texas at Dallas, UNITED STATES
Jokinen, Kristiina Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data
Constructive Interaction for Talking about Interesting Topics
Multimodal Corpus of Multi-party Conversations in Second Language
Feedback in Nordic First-Encounters: a Comparative Study
University of Tartu, ESTONIA
Jones, Gareth J.F. Creating a Data Collection for Evaluating Rich Speech Retrieval
Centre for Digital Video Processing, School of Computing, Dublin City University, IRELAND
Jongejan, Bart Automatic annotation of head velocity and acceleration in Anvil
University of Copenhagen, CST, DENMARK
Jönsson, Arne A good space: Lexical predictors in word space evaluation
This also affects the context - Errors in extraction based summaries
Santa Anna IT Research Institute AB & Linköping University, SWEDEN
Joshi, Aditya Cost and Benefit of Using WordNet Senses for Sentiment Analysis
IIT Bombay, INDIA
Joshi, Aravind Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
IRCS, University of Pennsylvania, UNITED STATES
Joubert, Alain A new dynamic approach for lexical networks evaluation
LIRMM, Université Montpellier II, FRANCE
Judea, Alex Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
University of Stuttgart, GERMANY
Juzi, Hossein A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Computer Research Center of Islamic Sciences, IRAN, ISLAMIC REPUBLIC OF

 

K
Kaalep, Heiki-Jaan Robust clause boundary identification for corpus annotation
University of Tartu, Institute of Computer Science, ESTONIA
Kaeshammer, Miriam German and English Treebanks and Lexica for Tree-Adjoining Grammars
University of Düsseldorf, Institut für Sprache und Information, GERMANY
Kafkas, Şenay Centroids: Gold standards with distributional variation
CALBC: Releasing the Final Corpora
European Bioinformatics Institute, UNITED KINGDOM
Kahn, Juliette The REPERE Corpus : a multimodal corpus for person recognition
Laboratoire National de métrologie et d'Essais (LNE), FRANCE
Kallionen, Petter 3rd party observer gaze as a continuous measure of dialogue flow
Stockholm University, Linguistics, SWEDEN
Kancheva, Stanislava A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian
Linguistic Analysis Processing Line for Bulgarian
Linguistic Modelling Department, IICT-BAS, BULGARIA
Kane, John An audiovisual political speech analysis incorporating eye-tracking and perception data
Trinity College Dublin, IRELAND
Kane, Mark Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Kanoulas, Evangelos A light way to collect comparable corpora from the Web
Google Inc., UNITED STATES
Kano, Yoshinobu Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako
PRESTO, JST, JAPAN
Karan, Mladen Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian
University of Zagreb, CROATIA
Karkaletsis, Vangelis Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Evaluation of Online Dialogue Policy Learning Techniques
Institute of Informatics and Telecommunications, NCSR ‘Demokritos’, GREECE
Karlsson, Johanna Eye Tracking as a Tool for Machine Translation Error Analysis
Linköping University, SWEDEN
Karppa, Matti Comparing computer vision analysis of signed language video with motion capture recordings
Aalto University School of Science, FINLAND
Kaspersson, Thomas This also affects the context - Errors in extraction based summaries
Santa Anna IT Research Institute AB & Linköping University, SWEDEN
Katsamanis, Athanasios The Twins Corpus of Museum Visitor Questions
University of Southern California, UNITED STATES
Katz, Graham A New Twitter Verb Lexicon for Natural Language Processing
Using semi-experts to derive judgments on word sense alignment: a pilot study
Georgetown University, UNITED STATES
Kawahara, Tatsuya Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Kyoto University, JAPAN
Kaya, Hamza Turkish Paraphrase Corpus
Tübitak-Bilgem, TURKEY
Kay, Martin Suffix Trees as Language Models
Stanford University, UNITED STATES
Kayser, Lars Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, DENMARK
Kemps-Snijders, Marc Dynamic web service deployment in a cloud environment
Meertens Institute, NETHERLANDS
Kempton, Timothy A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
University of Sheffield, UNITED KINGDOM
Kennington, Casey Redd Suffix Trees as Language Models
Universität Bielefeld, GERMANY
Kermes, Hannah A methodology for the extraction of information about the usage of formulaic expressions in scientific texts
Universität des Saarlandes, GERMANY
Keskes, Iskandar Clause-based Discourse Segmentation of Arabic Texts
IRIT, FRANCE
Kestemont, Mike The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Ketzan, Erik The New IDS Corpus Analysis Platform: Challenges and Prospects
Institute for the German Language (IDS), GERMANY
Khademian, Mahdi A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
Amirkabir University of Technology, IRAN, ISLAMIC REPUBLIC OF
Khadivi, Shahram A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
Amirkabir University of Technology, IRAN, ISLAMIC REPUBLIC OF
Khapra, Mitesh M. Experiences in Resource Generation for Machine Translation through Crowdsourcing
IIT Bombay, INDIA
Kikuchi, Hideaki Method for Collection of Acted Speech Using Various Situation Scripts
Waseda University, JAPAN
Kilgarriff, Adam Word Sketches for Turkish
Lexical Computing Ltd, UNITED KINGDOM
Kim, Bong-Wan Dysarthric Speech Database for Development of QoLT Software Technology
Wonkwang Univ., KOREA, REPUBLIC OF
Kim, Hae-Ri Grammatical Error Annotation for Korean Learners of Spoken English
Seoul National University of Education, KOREA, REPUBLIC OF
Kim, Sunhee Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
Center for Humanities and Information, Seoul National University, KOREA, REPUBLIC OF
Kimura, Chieko Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Autónoma de Madrid, SPAIN
Kim, Yeon-Jun Building Text-To-Speech Voices in the Cloud
AT&T Labs ― Research, UNITED STATES
Kim, Yeon-Whoa Dysarthric Speech Database for Development of QoLT Software Technology
Wonkwang Univ., KOREA, REPUBLIC OF
Kim, Young-Min Annotated Bibliographical Reference Corpora in Digital Humanities
Université d'Avignon, LIA, FRANCE
King, Joseph A Corpus for Research on Deliberation and Debate
UCSC UARC, UNITED STATES
Kipp, Michael Using DiAML and ANVIL for multimodal dialogue annotations
Annotation Facilities for the Reliable Analysis of Human Motion
University of Applied Sciences, Augsburg, GERMANY
Kirrane, Sabrina Expertise Mining for Enterprise Content Management
National University of Ireland, IRELAND
Kirshboim, Tal Annotating Opinions in German Political News
DFKI GmbH, GERMANY
Kisselew, Max French and German Corpora for Audience-based Text Type Classification
IMS, Universität Stuttgart, GERMANY
Kitaoka, Norihide Causal analysis of task completion errors in spoken music retrieval interactions
Nagoya University, JAPAN
Klakow, Dietrich A Gold Standard for Relation Extraction in the Food Domain
Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Saarland University, Spoken Language Systems, GERMANY
Klenner, Manfred MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
University of Zürich, Institute of Comp. Linguistics, SWITZERLAND
Klerke, Sigrid DSim, a Danish Parallel Corpus for Text Simplification
University of Copenhagen, CST, DENMARK
Klocek, Szymon DGT-TM: A freely available Translation Memory in 22 languages
European Commission, Directorate General for Translation (DGT), LUXEMBOURG
Kluewer, Tina Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World
DFKI GmbH, GERMANY
Kocoń, Jan Inforex -- a web-based tool for text corpus management and semantic annotation
Wroclaw University of Technology, Institute of Applied Informatics, POLAND
Koeling, Rob Corpus Annotation as a Scientific Task
University of Sussex, UNITED KINGDOM
Koeva, Svetla Bulgarian X-language Parallel Corpus
Department of Computational Linguistics, Institute for Bulgarian, Bulgarian Academy of Sciences, BULGARIA
Koiso, Hanae Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
National Institute for Japanese Language, JAPAN
Kokkinakis, Dimitrios Semantic Role Labeling with the Swedish FrameNet
University of Gothenburg, SWEDEN
Kolachina, Prasanth Parsing Any Domain English text to CoNLL dependencies
International Institute of Information Technology, INDIA
Kolachina, Sudheer Parsing Any Domain English text to CoNLL dependencies
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
International Institute of Information Technology, INDIA
Koller, Oscar RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
RWTH Aachen University, GERMANY
Kolomiyets, Oleksandr Annotating Story Timelines as Temporal Dependency Structures
Katholieke Universiteit Leuven, BELGIUM
Koltuksuz, Ahmet Construction of the Turkish National Corpus (TNC)
Yaşar University, TURKEY
Komachi, Mamoru UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
NAIST, JAPAN
Konstantinova, Natalia A review corpus annotated for negation, speculation and their scope
University of Wolverhampton, UNITED KINGDOM
Konstantopoulos, Stasinos Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Institute of Informatics and Telecommunications, NCSR ‘Demokritos’, GREECE
Kopeć, Mateusz Creating a Coreference Resolution System for Polish
Polish Academy of Sciences, Institute of Computer Science, POLAND
Kordoni, Valia Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Using Verb Subcategorization for Word Sense Disambiguation
Department of Computational Linguistics and Phonetics, Saarland University, GERMANY
Kornai, András Rapid creation of large-scale corpora and frequency dictionaries
Hungarian Academy of Sciences, HUNGARY
Kors, Jan CALBC: Releasing the Final Corpora
Erasmus Medical Centre, NETHERLANDS
Köser, Stephanie A Gold Standard for Relation Extraction in the Food Domain
Saarland University, Spoken Language Systems, GERMANY
Kotzé, Gideon Large aligned treebanks for syntax-based machine translation
University of Groningen, NETHERLANDS
Kouroupetroglou, Georgios Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
University of Athens, GREECE
Kouylekov, Milen Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
CELI, ITALY
Kow, Eric LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments
University of Brighton, UNITED KINGDOM
Kraft, Florian The KIT Lecture Corpus for Speech Translation
Karlsruhe Institute of Technology, GERMANY
Křen, Michal Balanced data repository of spontaneous spoken Czech
Institute of the Czech National Corpus, Charles University in Prague, CZECH REPUBLIC
Krisch, Jennifer French and German Corpora for Audience-based Text Type Classification
IMS, Universität Stuttgart, GERMANY
Krstev, Cvetana A tool for enhanced search of multilingual digital libraries of e-journals
University of Belgrade, SERBIA
Kruschwitz, Udo Assessing Crowdsourcing Quality through Objective Tasks
Applying Random Indexing to Structured Data to Find Contextually Similar Words
University of Essex, UNITED KINGDOM
Kubelka, Ozren Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Applied Sciences Vern, Zagreb, CROATIA
Kuhn, Jonas Making Ellipses Explicit in Dependency Conversion for a German Treebank
A Corpus-based Study of the German Recipient Passive
Universität Stuttgart, IMS, GERMANY
Kulick, Seth Further Developments in Treebank Error Detection Using Derivation Trees
Expanding Arabic Treebank to Speech: Results from Broadcast News
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Kumaran, A An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
Microsoft Research India, INDIA
Kumar, Ritesh Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi
Centre for Linguistics, Jawharlal Nehru University, INDIA
Kunchukuttan, Anoop Experiences in Resource Generation for Machine Translation through Crowdsourcing
IIT Bombay, INDIA
Kunst, Jan Pieter Dynamic web service deployment in a cloud environment
Meertens Institute, NETHERLANDS
Kunze, Manuela Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Kunz, Kerstin Coreference in Spoken vs. Written Texts: a Corpus-based Analysis
Department of Applied Linguistics, Saarland University, GERMANY
Kupietz, Marc The New IDS Corpus Analysis Platform: Challenges and Prospects
Institute for the German Language (IDS), GERMANY
Kurc, Roman Constraint Based Description of Polish Multiword Expressions
Wroclaw University of Technology, POLAND
Kuriyama, Naoko The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues
Tokyo Institute of Technology, JAPAN
Kurohashi, Sadao Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
Kyoto University, JAPAN
Kurtic, Emina A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
University of Sheffield, UNITED KINGDOM
Kurtoğlu, Özlem Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Kushkuley, Sophie Word Sense Inventories by Non-Experts.
Brandeis University, UNITED STATES
Kusumoto, Takanori Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension
Toyohashi University of Technology, JAPAN
Kvist, Maria Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text
Stockholm University, SWEDEN
Kweon, Soo-Ok Grammatical Error Annotation for Korean Learners of Spoken English
Postech, KOREA, REPUBLIC OF

 

L
Laaksonen, Jorma Comparing computer vision analysis of signed language video with motion capture recordings
Aalto University School of Science, FINLAND
Labropoulou, Penny The META-SHARE Metadata Schema for the Description of Language Resources
ILSP - Athena RC, GREECE
Ladha, Kushal Experiences in Resource Generation for Machine Translation through Crowdsourcing
IIT Bombay, INDIA
Lafourcade, Mathieu A new dynamic approach for lexical networks evaluation
LIRMM, Université Montpellier II, FRANCE
Lager, Torbjörn Cloud Logic Programming for Integrating Language Technology Resources
University of Gothenburg, SWEDEN
Lamel, Lori Cross-lingual studies of ASR errors: paradigms for perceptual evaluations
LIMSI - CNRS, FRANCE
Lampen, Lari Semantic metadata mapping in practice: the Virtual Language Observatory
Max Planck Institute for Psycholinguistics, NETHERLANDS
Landragin, Frederic ANALEC: a New Tool for the Dynamic Annotation of Textual Data
CNRS, FRANCE
Lange, Julia LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Langlais, Philippe Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, CANADA
Laoudi, Jamal Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES
Laparra, Egoitz Multilingual Central Repository version 3.0
Mapping WordNet to the Kyoto ontology
Euskal Herriko Unibertsitatea, Informatika Fakultatea, SPAIN
Lapesa, Gabriella LexIt: A Computational Resource on Italian Argument Structure
University of Osnabrück, Institute for Cognitive Science, GERMANY
Laplaza, Yesika The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Speech and Language Group, Barcelona Media Centre d’Innovació, SPAIN
Laporte, Éric A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
Université Paris-Est, LIGM, FRANCE
Lapshinova-Koltunski, Ekaterina Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach
Coreference in Spoken vs. Written Texts: a Corpus-based Analysis
Saarland University, GERMANY
Larasati, Septina Dian IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC
Larson, Martha Creating a Data Collection for Evaluating Rich Speech Retrieval
Delft University of Technology, Delft, NETHERLANDS
Lasarcyk, Eva A Gold Standard for Relation Extraction in the Food Domain
Saarland University, Spoken Language Systems, GERMANY
Laskova, Laska A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian
Linguistic Analysis Processing Line for Bulgarian
Linguistic Modelling Department, IICT-BAS, BULGARIA
La Torre, Pietro Evaluating Multi-focus Natural Language Queries over Data Services
DEI - Politecnico di Milano, ITALY
Lavelli, Alberto An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction
A treebank-based study on the influence of Italian word order on parsing performance
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Fondazione Bruno Kessler, ITALY
Lavergne, Thomas Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier
LIMSI ― CNRS, FRANCE
Lavrac, Nada Irregularity Detection in Categorized Document Corpora
Institut Jozef Stefan, Department of Knowledge Technologies, SLOVENIA
Lavrenko, Victor The BladeMistress Corpus: From Talk to Action in Virtual Worlds
University of Edinburgh, UNITED KINGDOM
Lawrie, Dawn Creating and Curating a Cross-Language Person-Entity Linking Collection
Loyola College in Maryland, UNITED STATES
Layher, Georg An audiovisual political speech analysis incorporating eye-tracking and perception data
Universität Ulm - Institute of Neural Information Processing, GERMANY
Leano, Vincenza Anna W-PhAMT: A web tool for phonetic multilevel timeline visualization
Università di Napoli Federico II, ITALY
Ledbetter, Scott Annotating Errors in a Hungarian Learner Corpus
Indiana University, UNITED STATES
Le Draoulec, Anne An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Lee, Gary Geunbae Grammatical Error Annotation for Korean Learners of Spoken English
Postech, KOREA, REPUBLIC OF
Lee, Haejoong Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LDC, UNITED STATES
Lee, Kyusong Grammatical Error Annotation for Korean Learners of Spoken English
Postech, KOREA, REPUBLIC OF
Lee, Lung-Hao NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
National Taiwan University, TAIWAN, PROVINCE OF CHINA
Lee, Mark Building Text-to-Speech Systems for Resource Poor Languages
University of Birmingham, School of Computer Science, UNITED KINGDOM
Lee, Yong-Ju Dysarthric Speech Database for Development of QoLT Software Technology
Wonkwang Univ., KOREA, REPUBLIC OF
Lefever, Els Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
University College Ghent, BELGIUM
Lefèvre, Fabrice Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
University of Avignon, LIA-CERI, FRANCE
Leijten, Marielle From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
University of Antwerp, BELGIUM
Le Maguer, Sébastien Towards Fully Automatic Annotation of Audio Books for TTS
IRISA - University of Rennes 1, FRANCE
Lenart, Michał Web Service integration platform for Polish linguistic resources
Institute of Computer Science, Polish Academy of Sciences, POLAND
Lenci, Alessandro LexIt: A Computational Resource on Italian Argument Structure
Enriching the ISST-TANL Corpus with Semantic Frames
University of Pisa, Department of Linguistics, ITALY
Lendvai, Piroska Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
Hungarian Academy of Sciences, HUNGARY
Lenkiewicz, Przemyslaw AVATecH ― automated annotation through audio and video analysis
Towards Automatic Gesture Stroke Detection
Max Planck Institute for Psycholinguistics, NETHERLANDS
Leone, Giovanna Pedagogical stances and their multimodal signals.
Dipartimenot di Comunicazione e Ricerca Sociale ― Sapienza Università di Roma, ITALY
Lesmo, Leonardo The Parallel-TUT: a multilingual and multiformat treebank
Dipartimento di Informatica, Università di Torino, ITALY
Leuski, Anton The BladeMistress Corpus: From Talk to Action in Virtual Worlds
University of Southern California, Institute for Creative Technologies, UNITED STATES
Lewin, Ian Centroids: Gold standards with distributional variation
CALBC: Releasing the Final Corpora
Erasmus Medical Center / Linguamatics, NETHERLANDS
Lewis, David On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Trinity College Dublin, IRELAND
Lewis, William Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
Microsoft Research, UNITED STATES
L'Homme, Marie-Claude Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Université de Montréal, CANADA
Liberman, Mark Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
LDC, UNITED STATES
Li, Hong Annotating Opinions in German Political News
DFKI GmbH, GERMANY
Lima, Gabriel Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Center of Computational Linguistics (NILC), University of São Paulo, BRAZIL
Lima, Vera Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study
PUCRS, BRAZIL
Lin, Ching-Sheng Revealing Contentious Concepts Across Social Groups
University at Albany, SUNY, UNITED STATES
Lindén, Krister Representing the Translation Relation in a Bilingual Wordnet
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
University of Helsinki, FINLAND
Lin, Donghui Two Phase Evaluation for Selecting Machine Translation Services
Service Composition Scenarios for Task-Oriented Translation
Kyoto University, JAPAN
Lin, Grace An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
University of California Santa Cruz, UNITED STATES
Lin, Shih-Peng NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Institute for Information Industry, TAIWAN, PROVINCE OF CHINA
Linsmayr, Elisabeth EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
NETCONOMY Software & Consulting, AUSTRIA
Li, Shuguang Constructing a Question Corpus for Textual Semantic Relations
Greedy Intelligence Ltd., CHINA
Lis, Magdalena Polish Multimodal Corpus ― a collection of referential gestures
University of Copenhagen, DENMARK
Li, Tian A Multilingual Natural Stress Emotion Database
Hong Kong University of Science and Technology, HONG KONG
Littauer, Richard The Open Linguistics Working Group
Computational Linguistics Department, Saarland University, GERMANY
Liu, Bo Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
Rutgers University, UNITED STATES
Liu, Ting Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, UNITED STATES
Liu, Wen-shen NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Institute for Information Industry, TAIWAN, PROVINCE OF CHINA
Li, Xuansong Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Automatic word alignment tools to scale production of manually aligned parallel texts
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Liyanapathirana, Jeevanthi Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
IDIAP Research Institute, SWITZERLAND
Li, Ying A Mandarin-English Code-Switching Corpus
HKUST, HONG KONG
Ljubešić, Nikola Addressing polysemy in bilingual lexicon extraction from comparable corpora
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA
Llorens, Hector TIMEN: An Open Temporal Expression Normalisation Resource
Massively Increasing TIMEX3 Resources: A Transduction Approach
University of Alicante, SPAIN
Loáiciga, Sharid Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
University of Geneva, SWITZERLAND
Lolive, Damien Towards Fully Automatic Annotation of Audio Books for TTS
IRISA - University of Rennes 1, FRANCE
Lombardi Vallauri, Edoardo A topologic view of Topic and Focus marking in Italian
Department of Linguistics, University of Roma Tre, ITALY
Lonsdale, Deryle Item Development and Scoring for Japanese Oral Proficiency Testing
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Brigham Young University, UNITED STATES
Lopes, Lucelene A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever
PUCRS, BRAZIL
Lopez de Lacalle, Oier Matching Cultural Heritage items to Wikipedia
University of the Basque Country, SPAIN
Lorente, Mercè The IULA Treebank
Universitat Pompeu Fabra, SPAIN
Lorenzo, Alejandra Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Lorraine University/LORIA Nancy, FRANCE
Loth, Carola Adapting and evaluating a generic term extraction tool
Dräger Safety AG & Co. KGaA, GERMANY
Louis, Annie A corpus of general and specific sentences from news
University of Pennsylvania, UNITED STATES
Loukachevitch, Natalia Automatic Term Recognition Needs Multiple Evidence
Research Computing Center Moscow State University Vorolievy Gory, RUSSIAN FEDERATION
Lu, Bao-liang Spell Checking for Chinese
Shanghai Jiao Tong University, CHINA
Luder, Marc German Verb Patterns and Their Implementation in an Electronic Dictionary
Universität Zürich, SWITZERLAND
Luís, Tiago An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Spoken Language Systems Laboratory ― L2F/INESC-ID, PORTUGAL
Lupu, Mihai Applying Random Indexing to Structured Data to Find Contextually Similar Words
Vienna University of Technology, AUSTRIA
Lu, Qin A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, Department of Computing, HONG KONG
Luyckx, Kim The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Lynn, Teresa Irish Treebanking and Parsing: A Preliminary Evaluation
Dublin City University, IRELAND
Lynum, André The Nordic Dialect Corpus
Norwegian University of Science and Technology, NORWAY
Lyse, Gunn Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Universitetet i Bergen, NORWAY

 

M
Maamouri, Mohamed Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects
Expanding Arabic Treebank to Speech: Results from Broadcast News
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Machado Jr., Danilo Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Department of Informatics, State University of Maringa, BRAZIL
Macken, Lieve From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
University College Ghent and Ghent University, BELGIUM
MacWhinney, Brian Morphosyntactic Analysis of the CHILDES and TalkBank Corpora
Carnegie Mellon University, UNITED STATES
Maegaard, Bente Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, CST, DENMARK
Maekawa, Kikuo Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system
National Institute for Japanese Language, JAPAN
Magnini, Bernardo A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
The KnowledgeStore: an Entity-Based Storage System
Fondazione Bruno Kessler, ITALY
Majliš, Martin Language Richness of the Web
The Joy of Parallelism with CzEng 1.0
Charles University in Prague, Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, CZECH REPUBLIC
Makedon, Fillia Evaluation of Online Dialogue Policy Learning Techniques
University of Texas at Arlington, UNITED STATES
Maks, Isa Building a fine-grained subjectivity lexicon from a web corpus
VU University, Faculty of Arts, NETHERLANDS
Malekinezhad, Hossein Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
Islamic Azad University, IRAN, ISLAMIC REPUBLIC OF
Malvessi Mittmann, Maryualê The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
Universidade Federal de Minas Gerais, BRAZIL
Mambrini, Francesco First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin
Università Cattolica del Sacro Cuore, Milan, ITALY
Mandl, Thomas A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content
University of Hildesheim, GERMANY
Manning, Christopher SUTime: A library for recognizing and normalizing time expressions
Stanford University, UNITED STATES
Manshadi, Mehdi An Annotation Scheme for Quantifier Scope Disambiguation
University of Rochester, UNITED STATES
Mansour, Saab Arabic-Segmentation Combination Strategies for Statistical Machine Translation
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
RWTH Aachen University, GERMANY
Manterola, Iker Building a Basque-Chinese Dictionary by Using English as Pivot
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web
Elhuyar R&D, SPAIN
Mapelli, Valérie ELRA in the heart of a cooperative HLT world
The REPERE Corpus : a multimodal corpus for person recognition
The META-SHARE Metadata Schema for the Description of Language Resources
ELDA/ELRA, FRANCE
Marchetti, Alessandro Chinese Whispers: Cooperative Paraphrase Acquisition
CELCT, ITALY
Marcińczuk, Michał Inforex -- a web-based tool for text corpus management and semantic annotation
KPWr: Towards a Free Corpus of Polish
Wroclaw University of Technology, Institute of Applied Informatics, POLAND
Mareček, David HamleDT: To Parse or Not to Parse?
The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Mariani, Joseph The LRE Map. Harmonising Community Descriptions of Resources
The FLaReNet Strategic Language Resource Agenda
LIMSI-CNRS, FRANCE
Marimon, Montserrat The IULA Treebank
Universitat de Barcelona, SPAIN
Marinelli, Rita In the same boat and other idiomatic seafaring expressions
Istituto di Linguistica Computazionale, C.N.R., ITALY
Marín, Ignacio The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Fundación CTIC, SPAIN
Mariotti, Andre Corpus-based Referring Expressions Generation
School of Arts, Sciences and Humanities, University of Sao Paulo, BRAZIL
Marklund, Ellen 3rd party observer gaze as a continuous measure of dialogue flow
Stockholm University, Linguistics, SWEDEN
Maroto, Nava Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
Universidad Complutense de Madrid, SPAIN
Màrquez, Lluís An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output
Universitat Politècnica de Catalunya, SPAIN
Marquilhas, Rita Project FLY: a multidisciplinary project within Linguistics
Centro de Linguística da Universidade de Lisboa, PORTUGAL
Marquina, Montse Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Barcelona Media Research Center, SPAIN
Maršík, Jiří The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Martens, Scott Large aligned treebanks for syntax-based machine translation
University of Tübingen, GERMANY
Martí, M. Antònia Annotating Near-Identity from Coreference Disagreements
CLiC - University of Barcelona, SPAIN
Martindale, Marianna J. Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?
US Department of Defense, UNITED STATES
Martineau, Claude Extending the adverbial coverage of a French morphological lexicon
LIGM, Université Paris-Est, FRANCE
Martínez Alonso, Héctor A voting scheme to detect semantic underspecification
University of Copenhagen, DENMARK
Martínez Cortés, Juan Pablo Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
GTC, I3A, Universidad de Zaragoza, SPAIN
Martin, James H. Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
University of Colorado, UNITED STATES
Martins de Matos, David Building and Exploring Semantic Equivalences Resources
L2F/INESC-ID Lisboa, PORTUGAL
Martins, Ronaldo Le Petit Prince in UNL
UNDL Foundation, SWITZERLAND
Marujo, Luís Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
LTI/CMU and INESC-ID/IST, UNITED STATES
Marzi, Claudia Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
ILC - CNR, Pisa, ITALY
Masneri, Stefano AVATecH ― automated annotation through audio and video analysis
Fraunhofer-Heinrich Hertz Institute, GERMANY
Mastropavlos, Nikos Collecting and Using Comparable Corpora for Statistical Machine Translation
ILSP - Athena RC, GREECE
Matsubayashi, Yuichiroh Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure
National Institute of Informatics, JAPAN
Matsui, Kengo Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Tokyo University of Foreign Studies, JAPAN
Matsui, Tomoko Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
The Institute of Statistical Mathematics, JAPAN
Matsumoto, Yuji UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
NAIST, JAPAN
Matsuo, Yoshihiro Constructing a Class-Based Lexical Dictionary using Interactive Topic Models
NTT Cyber Space Laboratories, NTT Corporation, JAPAN
Matsushita, Hitokazu Item Development and Scoring for Japanese Oral Proficiency Testing
Brigham Young University, UNITED STATES
Matsuyoshi, Suguru Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Yamanashi, JAPAN
Matuschek, Michael UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
The Open Linguistics Working Group
Technische Universität Darmstadt, GERMANY
Mavroeidis, Dimitris META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
ILSP - Athena RC, GREECE
Max, Aurélien A contrastive review of paraphrase acquisition techniques
LIMSI - CNRS, FRANCE
Ma, Xiaoyi LDC Forced Aligner
LDC, UNITED STATES
Mayfield, James Creating and Curating a Cross-Language Person-Entity Linking Collection
Johns Hopkins University, UNITED STATES
May, Jonathan An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
SDL Language Weaver, UNITED STATES
Maynard, Diana Large Scale Semantic Annotation, Indexing and Search at The National Archives
University of Sheffield, UNITED KINGDOM
Maza, Benjamin DECODA: a call-centre human-human spoken conversation corpus
Universite d’Avignon, FRANCE
Maziarz, Marek Tools for plWordNet Development. Presentation and Perspectives
Recognition of Polish Derivational Relations Based on Supervised Learning Scheme
KPWr: Towards a Free Corpus of Polish
Wroclaw University of Technology, POLAND
Mazo, Hélène ELRA in the heart of a cooperative HLT world
ELDA/ELRA, FRANCE
McCrae, John Collaborative semantic editing of linked data lexica
CITEC, Universität Bielefeld, GERMANY
McDonald, Ryan A Universal Part-of-Speech Tagset
Google Inc., UNITED STATES
McKeown, Kathleen Annotating Agreement and Disagreement in Threaded Discussion
Columbia University, UNITED STATES
McLeod, Sarah International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MITRE Corporation, UNITED STATES
McNamee, Paul Creating and Curating a Cross-Language Person-Entity Linking Collection
Johns Hopkins University, UNITED STATES
Megyesi, Beáta A Basic Language Resource Kit for Persian
Uppsala University, SWEDEN
Mehdad, Yashar Chinese Whispers: Cooperative Paraphrase Acquisition
Fondazione Bruno Kessler, Trento, ITALY
Mehler, Alexander Customization of the Europarl Corpus for Translation Studies
Goethe-Universität Frankfurt, GERMANY
Melandri, Matias AnIta: a powerful morphological analyser for Italian
Dept. of Computer Science, University of Bologna, ITALY
Melero, Maite A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Barcelona Media, SPAIN
Mella, Odile CoALT: A Software for Comparing Automatic Labelling Tools
INRIA/LORIA, Université Henri Poincaré, FRANCE
Mello, Heliana The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
UFMG, BRAZIL
Melloni, Chiara Annotating Qualia Relations in Italian and French Complex Nominals
Università di Verona, ITALY
Mencarelli, Silvia Modality in Text: a Proposal for Corpus Annotation
Centro de Linguística da Universidade de Lisboa, PORTUGAL
Mendes, Amália Introducing the Reference Corpus of Contemporary Portuguese Online
Modality in Text: a Proposal for Corpus Annotation
CLUL, PORTUGAL
Mendes, Ana Cristina An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Extending a wordnet framework for simplicity and scalability
IST, PORTUGAL
Mendes, Pablo Evaluating the Impact of Phrase Recognition on Concept Tagging
DBpedia: A Multilingual Cross-domain Knowledge Base
Web Based Systems Group, Freie Universität Berlin, GERMANY
Meneghetti, Breno Corpus+WordNet thesaurus generation for ontology enriching
PUCRS, BRAZIL
Merkel, Magnus Alignment-based reordering for SMT
Linköping University, SWEDEN
Merkus, Iris Statistical Evaluation of Pronunciation Encoding
Bavarian Archive for Speech Signals, Ludwig-Maximilians-Universität München, GERMANY
Mersinli, Ümit Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Metaxas, Dimitris Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
Rutgers University, UNITED STATES
Meyer, Christian M. UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
The Open Linguistics Working Group
Technische Universität Darmstadt, GERMANY
Meyer, Thomas Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
IDIAP Research Institute, SWITZERLAND
Michaelis, Bernd Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Michael, Nicholas Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
Rutgers University, UNITED STATES
Michel, Martial Creating HAVIC: Heterogeneous Audio Visual Internet Collection
Systems Plus, Inc., UNITED STATES
Mihalcea, Rada Learning Sentiment Lexicons in Spanish
Unsupervised Word Sense Disambiguation with Multilingual Representations
A Parallel Corpus of Music and Lyrics Annotated with Emotions
University of North Texas, UNITED STATES
Mikelić Preradović, Nives Generation of Verbal Stems in Derivationally Rich Language
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA
Mikulová, Marie Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Milette, Greg Identifying Nuggets of Information in GALE Distillation Evaluation
BAE Systems, UNITED STATES
Miłkowski, Marcin PoliMorf: a (not so) new open morphological dictionary for Polish
Institute of Computer Science, Polish Academy of Sciences, POLAND
Miller, Keith J. International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MITRE Corporation, UNITED STATES
Mille, Simon Text Simplification Tools for Spanish
Universitat Pompeu Fabra, SPAIN
Milward, David CALBC: Releasing the Final Corpora
Linguamatics LTD, UNITED KINGDOM
Minaei, Behrouz Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
Iran University of Science and Technology, IRAN, ISLAMIC REPUBLIC OF
Minaei-Bidgoli, Behrouz A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Iran University of Science and Technology, IRAN, ISLAMIC REPUBLIC OF
Min, Bonan Challenges in the Knowledge Base Population Slot Filling Task
New York University, UNITED STATES
Mingers, Insa Adapting and evaluating a generic term extraction tool
Dräger Safety AG & Co. KGaA, GERMANY
Minker, Wolfgang Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues
Using multimodal resources for explanation approaches in intelligent systems
A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System
Investigating Verbal Intelligence Using the TF-IDF Approach
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores
Speech and Language Resources for LVCSR of Russian
Institute of Communications Engineering ― Dialogue Systems, University of Ulm, GERMANY
Minutoli, Salvatore META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
Consiglio Nazionale Ricerche (CNR), ITALY
Mírovský, Jiří Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component
Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics, CZECH REPUBLIC
Misra Sharma, Dipti Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
LTRC, IIIT-Hyderabad, INDIA
Mitamura, Teruko Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource
Carnegie Mellon University, UNITED STATES
Mitkov, Ruslan Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach
A review corpus annotated for negation, speculation and their scope
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations
University of Wolverhampton, UNITED KINGDOM
Miyajima, Takahiro Method for Collection of Acted Speech Using Various Situation Scripts
Waseda University, JAPAN
Miyao, Yusuke Annotating Factive Verbs
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure
National Institute of Informatics, JAPAN
Miyashita, Takahiro Body-conductive acoustic sensors in human-robot communication
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN
Moens, Marie-Francine Annotating Story Timelines as Temporal Dependency Structures
Katholieke Universiteit Leuven, BELGIUM
Mohamed, Emad Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
Carnegie Mellon University, UNITED STATES
Mohit, Behrang Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
Carnegie Mellon University, UNITED STATES
Mohr, Christian The KIT Lecture Corpus for Speech Translation
Karlsruhe Institute of Technology, GERMANY
Moldovan, Dan A Tool for Extracting Conversational Implicatures
Polaris: Lymba's Semantic Parser
Lymba Corporation, UNITED STATES
Momtazi, Saeedeh Fine-grained German Sentiment Analysis on Social Media
Universität Potsdam, GERMANY
Monachesi, Paola Evaluating automatic cross-domain Dutch semantic role annotation
An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use
, NETHERLANDS
Monachini, Monica The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
The FLaReNet Strategic Language Resource Agenda
The META-SHARE Metadata Schema for the Description of Language Resources
ILC - CNR, Pisa, ITALY
Mondary, Thibault The Quaero Evaluation Initiative on Term Extraction
LIPN - CNRS, FRANCE
Moneglia, Massimo The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
Università di Firenze, ITALY
Montemagni, Simonetta Enriching the ISST-TANL Corpus with Semantic Frames
ILC - CNR, Pisa, ITALY
Montiel-Ponsoda, Elena Collaborative semantic editing of linked data lexica
Ontology Engineering Group, Universidad Politécnica de Madrid, SPAIN
Moraes, Silvia Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study
PUCRS, BRAZIL
Moran, Steven The Open Linguistics Working Group
Research Unit “Quantitative Language Comparison”, University of Munich, GERMANY
Morante, Roser ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
CLiPS - University of Antwerp, BELGIUM
Morell, Carlos Iula2Standoff: a tool for creating standoff documents for the IULACT
UPF, SPAIN
Moreno, Amparo Using Language Resources in Humanities research
Universitat Autònoma de Barcelona, SPAIN
Moreno, Asunción Building Synthetic Voices in the META-NET Framework
BUCEADOR, a multi-language search engine for digital libraries
Universitat Politècnica de Catalunya, SPAIN
Moreno-Sandoval, Antonio Medical Term Extraction in an Arabic Medical Corpus
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Computational Linguistics Laboratory-Autónoma University Madrid, SPAIN
Moretti, Giovanni The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
CAT: the CELCT Annotation Tool
CELCT, ITALY
Moriceau, Véronique Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Evolution of Event Designation in Media: Preliminary Study
Kitten: a tool for normalizing HTML and extracting its textual content
LIMSI ― CNRS, FRANCE
Morin, Emmanuel Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
LINA, Université de Nantes, FRANCE
Morris, Amanda Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LDC, UNITED STATES
Mörth, Karlheinz Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
Austrian Academy of Sciences, AUSTRIA
Mostefa, Djamel ELRA in the heart of a cooperative HLT world
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
New language resources for the Pashto language
ELDA/ELRA, FRANCE
Moszkowicz, Jessica The Role of Model Testing in Standards Development: The Case of ISO-Space
Brandeis University, UNITED STATES
Mota, Cristina Págico: Evaluating Wikipedia-based information retrieval in Portuguese
Linguateca/FCCN, PORTUGAL
Mott, Justin Further Developments in Treebank Error Detection Using Derivation Trees
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Muhonen, Kristiina Rule-Based Detection of Clausal Coordinate Ellipsis
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
University of Helsinki, FINLAND
Muischnek, Kadri Robust clause boundary identification for corpus annotation
University of Tartu, Institute of Computer Science, ESTONIA
Müller de Oliveira, Gilvan The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
Instituto Internacional da Língua Portuguesa - IILP, CAPE VERDE
Muller, Philippe An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
IRIT, Univ. Toulouse, FRANCE
Murisasco, Elisabeth An ontological approach to model and query multimodal concurrent linguistic annotations
Université du Sud Toulon Var, FRANCE

 

N
Nagy T., István HunOr: A Hungarian―Russian Parallel Corpus
University of Szeged, Department of Informatics, HUNGARY
Nakagawa, Natsuko Annotation of anaphoric relations and topic continuity in Japanese conversation
Kyoto University, JAPAN
Nakagawa, Seiichi Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
Toyohashi University of Technology, JAPAN
Nakazawa, Toshiaki Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
Kyoto University, JAPAN
Nanba, Hidetsugu Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
Hiroshima City University, JAPAN
Narasimhan, Bhuvana Empty Argument Insertion in the Hindi PropBank
University of Colorado at Boulder, UNITED STATES
Narawa, Chiharu Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types
Osaka University, JAPAN
Narayanan, Shrikanth The Twins Corpus of Museum Visitor Questions
University of Southern California, UNITED STATES
Narroway, George A Framework for Evaluating Text Correction
Macquarie University, AUSTRALIA
Nasr, Alexis Syntactic annotation of spontaneous speech: application to call-center conversation data
Aix Marseille Univ, LIF-CNRS, FRANCE
Nastase, Vivi Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
HITS gGmbH, GERMANY
Navarretta, Costanza Multimodal Behaviour and Feedback in Different Types of Interaction
Feedback in Nordic First-Encounters: a Comparative Study
University of Copenhagen, DENMARK
Navas, Eva Versatile Speech Databases for High Quality Synthesis for Basque
Strategies to Improve a Speaker Diarisation Tool
Using an ASR database to design a pronunciation evaluation system in Basque
BUCEADOR, a multi-language search engine for digital libraries
University of the Basque Country, SPAIN
Navigli, Roberto A New Method for Evaluating Automatically Learned Terminological Taxonomies
Sapienza Università di Roma, ITALY
Nawaz, Raheel Identification of Manner in Bio-Events
University of Manchester, UNITED KINGDOM
Nazarenko, Adeline The Quaero Evaluation Initiative on Term Extraction
LIPN - CNRS, FRANCE
Nazarian, Angela The Twins Corpus of Museum Visitor Questions
University of Southern California, Institute for Creative Technologies, UNITED STATES
Nazar, Rogelio Spell Checking in Spanish: The Case of Diacritic Accents
UPF, SPAIN
Negri, Matteo Chinese Whispers: Cooperative Paraphrase Acquisition
Fondazione Bruno Kessler, Trento, ITALY
Neidle, Carol Detecting Reduplication in Videos of American Sign Language
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
Boston University, UNITED STATES
Nenkova, Ani A corpus of general and specific sentences from news
University of Pennsylvania, UNITED STATES
Neralwala, Huzaifa Annotations for Power Relations on Email Threads
LinkedIn Corp., UNITED STATES
Neumann, Günter An Adaptive Framework for Named Entity Combination
DFKI GmbH, GERMANY
Neumann, Heiko An audiovisual political speech analysis incorporating eye-tracking and perception data
Universität Ulm - Institute of Neural Information Processing, GERMANY
Nevskaya, Irina ”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
University of Frankfurt, GERMANY
Ney, Hermann Arabic-Segmentation Combination Strategies for Statistical Machine Translation
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
RWTH Aachen University, GERMANY
Nicolas, Lionel Unsupervised acquisition of concatenative morphology
Institute for Specialised Communication and Multilingualism, European Academy of Bozen/Bolzano, ITALY
Niculae, Vlad The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System
University of Bucharest, ROMANIA
Niehues, Jan The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Karlsruhe Institute of Technology, GERMANY
Nieman, Annamart Aspects of a Legal Framework for Language Resource Management
Advocate, Member of the Johannesburg Bar, SOUTH AFRICA
Niemi, Jyrki Representing the Translation Relation in a Bilingual Wordnet
University of Helsinki, FINLAND
Nimb, Sanni Towards a richer wordnet representation of properties
Society for Danish Language and Literature, DENMARK
Nishida, Masafumi Multimodal Corpus of Multi-party Conversations in Second Language
Doshisha University, JAPAN
Nishizaki, Hiromitsu Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
University of Yamanashi, JAPAN
Nivre, Joakim A Basic Language Resource Kit for Persian
MaltOptimizer: A System for MaltParser Optimization
Uppsala University, SWEDEN
Noecker Jr, John Distractorless Authorship Verification
Duquesne University EVL Lab, UNITED STATES
Nøklestad, Anders The Nordic Dialect Corpus
Universitetet i Oslo, NORWAY
Nordhoff, Sebastian Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages
The Open Linguistics Working Group
Max Planck Institute for Evolutionary Anthropology, GERMANY
Norgaard, Ole Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, DENMARK
Nothdurft, Florian Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues
Using multimodal resources for explanation approaches in intelligent systems
Institute of Communications Engineering ― Dialogue Systems, University of Ulm, GERMANY
Novais, Eder Corpus-based Referring Expressions Generation
Portuguese Text Generation from Large Corpora
School of Arts, Sciences and Humanities, University of Sao Paulo, BRAZIL
Novák, Michal The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Nugues, Pierre Constructing Large Proposition Databases
Visualizing Sentiment Analysis on a User Forum
Lund University, SWEDEN

 

O
Oard, Douglas Creating and Curating a Cross-Language Person-Entity Linking Collection
University of Maryland, UNITED STATES
Obradović, Ivan A tool for enhanced search of multilingual digital libraries of e-journals
University of Belgrade, SERBIA
O'Connor, Alexander On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Trinity College Dublin, IRELAND
Odijk, Jan Recent Developments in CLARIN-NL
The FLaReNet Strategic Language Resource Agenda
UiL-OTS, Utrecht University, NETHERLANDS
Odriozola, Igor Versatile Speech Databases for High Quality Synthesis for Basque
Using an ASR database to design a pronunciation evaluation system in Basque
University of the Basque Country, SPAIN
Oepen, Stephan The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Universitetet i Oslo, NORWAY
Offersgaard, Lene A Distributed Resource Repository for Cloud-Based Machine Translation
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Copenhagen, DENMARK
Oflazer, Kemal Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
Carnegie Mellon University, UNITED STATES
Ogbureke, Kalu Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Ogiso, Toshinobu UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
NAIST, JAPAN
Ogrodniczuk, Maciej Creating a Coreference Resolution System for Polish
Towards a comprehensive open repository of Polish language resources
PoliMorf: a (not so) new open morphological dictionary for Polish
Web Service integration platform for Polish linguistic resources
The Polish Sejm Corpus
Polish Academy of Sciences, Institute of Computer Science, POLAND
Ohara, Kyoko Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English
Keio University, JAPAN
Ohta, Kengo Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
Toyohashi University of Technology, JAPAN
Okawa, Shigeki Method for Collection of Acted Speech Using Various Situation Scripts
Chiba Institute of Technology, JAPAN
Okita, Tsuyoshi Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner
Dublin City University, IRELAND
Okken, Thomas Building Text-To-Speech Voices in the Cloud
AT&T Labs ― Research, UNITED STATES
Oksanen, Ville Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Helsinki, FINLAND
Olsen, Sussi A Distributed Resource Repository for Cloud-Based Machine Translation
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Copenhagen, DENMARK
Olson, Jesper Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
Nokia Research Center, CHINA
Olsson, Leif-Jöran The open lexical infrastructure of Spräkbanken
Sprakbanken, University of Gothenburg, SWEDEN
Oostdijk, Nelleke Collection of a corpus of Dutch SMS
Beyond SoNaR: towards the facilitation of large corpus building efforts
CLST, Centre for Language and Speech Technology, Radboud University Nijmegen, NETHERLANDS
Orasan, Constantin Annotating Near-Identity from Coreference Disagreements
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations
University of Wolverhampton, UNITED KINGDOM
Ordelman, Roeland Creating a Data Collection for Evaluating Rich Speech Retrieval
University of Twente, NETHERLANDS
O'Regan, Jim Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Eolaistriu Technologies, IRELAND
Origlia, Antonio W-PhAMT: A web tool for phonetic multilevel timeline visualization
Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication
Università di Napoli Federico II, ITALY
Osenova, Petya A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian
Linguistic Analysis Processing Line for Bulgarian
The Political Speech Corpus of Bulgarian
Linguistic Modelling Department, IICT-BAS, BULGARIA
Otto, Mirko LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Otto-von-Guericke-University Magdeburg, GERMANY
Over, Paul Creating HAVIC: Heterogeneous Audio Visual Internet Collection
NIST, UNITED STATES
Øvrelid, Lilja The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Universitetet i Oslo, NORWAY
Ozbal, Gozde Brand Pitt: A Corpus to Explore the Art of Naming
FBK-Irst, ITALY
Öz, Seda Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY

 

P
Pado, Sebastian French and German Corpora for Audience-based Text Type Classification
ICL, Universität Heidelberg, GERMANY
Padró, Lluís Highlighting relevant concepts from Topic Signatures
FreeLing 3.0: Towards Wider Multilinguality
TALP center, Universitat Politècnica de Catalunya, SPAIN
Padró, Muntsa Automatic lexical semantic classification of nouns
Universitat Pompeu Fabra, SPAIN
Paggio, Patrizia Multimodal Behaviour and Feedback in Different Types of Interaction
Feedback in Nordic First-Encounters: a Comparative Study
University of Malta, MALTA
Paikens, Peteris An implementation of a Latvian resource grammar in Grammatical Framework
Institute of Mathematics and Computer Science, University of Latvia, LATVIA
Pajas, Petr Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Paladini, Samuele RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
Università di Firenze, ITALY
Pala, Karel Legal electronic dictionary for Czech
Faculty of Informatics Masaryk University, CZECH REPUBLIC
Palmer, Martha Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
Empty Argument Insertion in the Hindi PropBank
University of Colorado, UNITED STATES
Panevová, Jarmila Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Panning, Axel Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Panunzi, Alessandro The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
Università di Firenze, ITALY
Paoloni, Andrea Intelligibility assessment in forensic applications
Fondazione “Ugo Bordoni”, ITALY
Papageorgiou, Haris The META-SHARE Metadata Schema for the Description of Language Resources
ILSP - Athena RC, GREECE
Papangelis, Alexandros Evaluation of Online Dialogue Policy Learning Techniques
University of Texas at Arlington, UNITED STATES
Paraboni, Ivandre Corpus-based Referring Expressions Generation
Portuguese Text Generation from Large Corpora
School of Arts, Sciences and Humanities, University of Sao Paulo, BRAZIL
Paramita, Monica Lestari Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Sheffield, UNITED KINGDOM
Pardelli, Gabriella From medical language processing to BioNLP domain
, ITALY
Pareti, Silvia A Database of Attribution Relations
University of Edinburgh, UNITED KINGDOM
Park, Jungyeul Using the International Standard Language Resource Number: Practical and Technical Aspects
ELDA, FRANCE
Paroubek, Patrick A Rough Set Formalization of Quantitative Evaluation with Ambiguity
LIMSI ― CNRS, FRANCE
Parra Escartín, Carla Design and compilation of a specialized Spanish-German parallel corpus
Universitetet i Bergen, NORWAY
Passarotti, Marco First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin
Università Cattolica del Sacro Cuore, Milan, ITALY
Passonneau, Rebecca J. The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
Columbia University, UNITED STATES
Patejuk, Agnieszka Towards an LFG parser for Polish: An exercise in parasitic grammar development
Polish Academy of Sciences, Institute of Computer Science, POLAND
Patel, Pratik Experiences in Resource Generation for Machine Translation through Crowdsourcing
University of Pennsylvania, UNITED STATES
Paternò, Fabio The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
ISTI - CNR, ITALY
Patrik, Lambert Automatic Translation of Scientific Documents in the HAL Archive
LIUM, University of Le Mans, FRANCE
Paul, Michael The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
National Institute of Information and Communications Technology, JAPAN
Paulsson, Niklas The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
ELDA, FRANCE
Paulus, Amélie Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, CANADA
Pavlović-Lažetić, Gordana Multimedia database of the cultural heritage of the Balkans
Faculty of Mathematics, University of Belgrade, SERBIA
Pazienza, Maria Teresa PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
ART Research Group, University of Rome, ITALY
P. Cruz, Noa A review corpus annotated for negation, speculation and their scope
Universidad de Huelva, SPAIN
Pearman, Andrea The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Speech and Language Group, Barcelona Media Centre d’Innovació, SPAIN
Pecina, Pavel A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Arabic Word Generation and Modelling for Spell Checking
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Faculty of Mathematics and Physics, Charles University in Prague, CZECH REPUBLIC
Pedersen, Bolette Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Copenhagen, DENMARK
Peersman, Claudia The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Penagarikano, Mikel KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
University of the Basque Country, SPAIN
Peñas, Anselmo Evaluating Machine Reading Systems through Comprehension Tests
NLP&IR group, UNED, SPAIN
Penkale, Sergio SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Applied Language Solutions Ltd., UNITED KINGDOM
Pereira, Bianca Expertise Mining for Enterprise Content Management
National University of Ireland, IRELAND
Pereira, Hilder Corpus-based Referring Expressions Generation
School of Arts, Sciences and Humanities, University of Sao Paulo, BRAZIL
Pereira, Sílvia A PropBank for Portuguese: the CINTIL-PropBank
University of Lisbon, PORTUGAL
Perez-Beltrachini, Laura Representation of linguistic and domain knowledge for second language learning in virtual worlds
Lorraine University/LORIA, FRANCE
Pérez-Ortiz, Juan Antonio Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation
Universitat d'Alacant (UA), SPAIN
Perez-Rosas, Veronica Learning Sentiment Lexicons in Spanish
University of North Texas, UNITED STATES
Péri, Pauline Orthographic Transcription: which enrichment is required for phonetization?
Laboratoire Parole et Langage, CNRS & Aix-Marseille Université, FRANCE
Pery-Woodley, Marie-Paul An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Petasis, Georgios The SYNC3 Collaborative Annotation Tool
NCSR Demokritos Institute, GREECE
Peterson, Katherine Automatic word alignment tools to scale production of manually aligned parallel texts
University of Pennsylvania, UNITED STATES
Peters, Pam The Australian National Corpus: National Infrastructure for Language Resources
Macquarie University, AUSTRALIA
Petrakis, Stefanos MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
University of Zürich, Institute of Comp. Linguistics, SWITZERLAND
Petrak, Johann Applying Random Indexing to Structured Data to Find Contextually Similar Words
Austrian Research Institute for Artificial Intelligence, AUSTRIA
Petrov, Slav A Universal Part-of-Speech Tagset
Google Inc., UNITED STATES
Petukhova, Volha Using DiAML and ANVIL for multimodal dialogue annotations
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
The coding and annotation of multimodal dialogue acts
ISO 24617-2: A semantically-based standard for dialogue annotation
Human Speech and Language Technologies, Vicomtech-IK4, SPAIN
Pęzik, Piotr Towards a comprehensive open repository of Polish language resources
University of Łódz, POLAND
Pham, Binh Hai MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
MICA Institute, VIET NAM
Piasecki, Maciej Constraint Based Description of Polish Multiword Expressions
Tools for plWordNet Development. Presentation and Perspectives
Recognition of Polish Derivational Relations Based on Supervised Learning Scheme
Wroclaw University of Technology, POLAND
Piater, Justus RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
University of Innsbruck, AUSTRIA
Picton, Aurélie Annotating Qualia Relations in Italian and French Complex Nominals
Université de Genève, TIM/ISSCO, SWITZERLAND
Pierrel, Jean-Marie Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary
ATILF Nancy / University of Riga / HAS Busapest, FRANCE
Pietrobon, Ricardo A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Duke University, UNITED STATES
Pighin, Daniele An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output
Universitat Politècnica de Catalunya, SPAIN
Pilos, Spyridon DGT-TM: A freely available Translation Memory in 22 languages
European Commission, Directorate General for Translation (DGT), LUXEMBOURG
Pimentel, Janine Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology
Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology
Université de Montréal, OLST, CANADA
Pinnis, Mārcis Collecting and Using Comparable Corpora for Statistical Machine Translation
Latvian and Lithuanian Named Entity Recognition with TildeNER
Tilde, LATVIA
Piperidis, Stelios The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions
The FLaReNet Strategic Language Resource Agenda
The META-SHARE Metadata Schema for the Description of Language Resources
ILSP - Athena RC, GREECE
Pirrelli, Vito Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
ILC - CNR, Pisa, ITALY
Pitsch, Karola PAMOCAT: Automatic retrieval of specified postures
Applied Informatics Group, University of Bielefeld, GERMANY
Plaza, Laura SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis
Universidad Complutense de Madrid, SPAIN
Ploch, Danuta GerNED: A German Corpus for Named Entity Disambiguation
DAI-Labor, Technische Universität Berlin, GERMANY
P. Neto, João Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
INESC-IST, PORTUGAL
Poch, Marc Towards a User-Friendly Platform for Building Language Resources based on Web Services
UPF, SPAIN
Poesio, Massimo DeCour: a corpus of DEceptive statements in Italian COURts
Domain-specific vs. Uniform Modeling for Coreference Resolution
University of Essex, UNITED KINGDOM
Poggi, Isabella Pedagogical stances and their multimodal signals.
Dipartimento di Scienze dell’Educazione ― Università Roma Tre, ITALY
Poibeau, Thierry ANALEC: a New Tool for the Dynamic Annotation of Textual Data
CNRS, FRANCE
Poláková, Lucie Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component
Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics, CZECH REPUBLIC
Pollak, Senja Irregularity Detection in Categorized Document Corpora
University of Ljubljana, Faculty of Arts, SLOVENIA
Pomikálek, Jan Building a 70 billion word corpus of English from ClueWeb
Masaryk University, CZECH REPUBLIC
Popelka, Jan Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Popel, Martin HamleDT: To Parse or Not to Parse?
The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Popescu-Belis, Andrei Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
ISO 24617-2: A semantically-based standard for dialogue annotation
IDIAP Research Institute, SWITZERLAND
Popescu, Octavian Buildind a Resource of Patterns Using Semantic Types
Fondazione Bruno Kessler, Trento, ITALY
Popović, Maja Involving Language Professionals in the Evaluation of Machine Translation
Automatic MT Error Analysis: Hjerson Helping Addicter
Terra: a Collection of Translation Error-Annotated Corpora
DFKI GmbH, GERMANY
Potamianos, Alexandros SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks
Associative and Semantic Features Extracted From Web-Harvested Corpora
Technical University of Crete, GREECE
Potamianos, Gerasimos A hierarchical approach with feature selection for emotion recognition from speech
Department of Computer and Communication Engineering, University of Thessaly, GREECE
Potet, Marion Collection of a Large Database of French-English SMT Output Corrections
UJF-Grenoble1, FRANCE
Poudat, Céline Building a database of French frozen adverbial phrases
LDI, University of Paris 13, FRANCE
Pozzi, María Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Centro de Estudios Lingüísticos y Literarios, El Colegio de México, MEXICO
Prabhakaran, Vinodkumar Annotations for Power Relations on Email Threads
Columbia University, UNITED STATES
Prasad, Rashmi Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
HIA, University of Wisconsin-Milwaukee, UNITED STATES
Preotiuc-Pietro, Daniel Unsupervised document zone identification using probabilistic graphical models
Natural Language Processing Research Group, UNITED KINGDOM
Prevot, Laurent An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
LPL, Aix-Marseille Univ., FRANCE
Priestley, Joel The Nordic Dialect Corpus
Universitetet i Oslo, Institutt for lingvistiske og nordiske studier, NORWAY
Prinetto, Paolo A platform-independent user-friendly dictionary from Italian to LIS
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY
Procter, Rob A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
University of Manchester, UNITED KINGDOM
Proisl, Thomas Efficient Dependency Graph Matching with the IMS Open Corpus Workbench
Universität Erlangen-Nuremberg, GERMANY
Pröll, Birgit EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
Johannes Kepler University Linz, AUSTRIA
Prolo, Carlos A. A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever
PUCRS, BRAZIL
Prytz Lillkull, Anna Eye Tracking as a Tool for Machine Translation Error Analysis
Linköping University, SWEDEN
Przepiórkowski, Adam Towards a comprehensive open repository of Polish language resources
Towards an LFG parser for Polish: An exercise in parasitic grammar development
PoliMorf: a (not so) new open morphological dictionary for Polish
Institute of Computer Science, Polish Academy of Sciences, POLAND
Pucher, Michael Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis
FTW Telecommunications Research Center Vienna, AUSTRIA
Purtonen, Tanja Rule-Based Detection of Clausal Coordinate Ellipsis
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
University of Helsinki, FINLAND
Pustejovsky, James Word Sense Inventories by Non-Experts.
ATLIS: Identifying Locational Information in Text Automatically
The Role of Model Testing in Standards Development: The Case of ISO-Space
The TARSQI Toolkit
Brandeis University, UNITED STATES

 

Q
QasemiZadeh, Behrang Semi-Supervised Technical Term Tagging With Minimal User Feedback
National University of Ireland, IRELAND
Quarteroni, Silvia Evaluating Multi-focus Natural Language Queries over Data Services
DEI - Politecnico di Milano, ITALY
Quasthoff, Uwe Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages
The Influence of Corpus Quality on Statistical Measurements on Language Resources
Universität Leipzig, GERMANY
Quignard, Matthieu Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
LORIA, FRANCE
Quintard, Ludovic Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
The REPERE Corpus : a multimodal corpus for person recognition
Laboratoire National de métrologie et d'Essais / INRA, Jouy-en-Josas, France / Synapse Développement, Toulouse, France / Jouve, Mayenne, France, FRANCE
Quixal, Martí Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
Barcelona Media Research Center, SPAIN
Quochi, Valeria Customizable SCF Acquisition in Italian
Towards a User-Friendly Platform for Building Language Resources based on Web Services
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser
The FLaReNet Strategic Language Resource Agenda
ILC-CNR, ITALY

 

R
Rabanus, Stefan A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Department of Foreign Languages and Literatures― University of Verona, ITALY
Radziszewski, Adam KPWr: Towards a Free Corpus of Polish
Wroclaw University of Technology, POLAND
Rafelsberger, Walter Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
Holzweg e-Commerce Solutions, AUSTRIA
Raggett, Dave The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
GEIE ERCIM, FRANCE
Rahim, Umair Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores
Institute of Communications Engineering, University of Ulm, GERMANY
Rajapakse, Rohana Evaluating the Impact of Phrase Recognition on Concept Tagging
Goss Interactive Limited, UNITED KINGDOM
Rakho, Myriam A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
Université Paris-Est, LIGM, FRANCE
Rak, Rafal Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench
University of Manchester, UNITED KINGDOM
Ramanathan, Ananthakrishnan A Study of Word-Classing for MT Reordering
IBM Research India, INDIA
Ramasamy, Loganathan HamleDT: To Parse or Not to Parse?
Prague Dependency Style Treebank for Tamil
Charles University in Prague, CZECH REPUBLIC
Rambousek, Adam A database of semantic clusters of verb usages
Masarykova Univerzita, Brno, CZECH REPUBLIC
Rambow, Owen Annotations for Power Relations on Email Threads
The Dependency-Parsed FrameNet Corpus
Conventional Orthography for Dialectal Arabic
Columbia University, UNITED STATES
Ramocki, Radoslaw Recognition of Polish Derivational Relations Based on Supervised Learning Scheme
Wroclaw University of Technology, POLAND
Rapp, Reinhard Identifying Word Translations from Comparable Documents Without a Seed Lexicon
University of Leeds, UNITED KINGDOM
Raso, Tommaso The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
UFMG, BRAZIL
Rayner, Manny Evaluating Appropriateness Of System Responses In A Spoken CALL Game
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System
University of Geneva, FTI/TIM, SWITZERLAND
Rayson, Paul Document Attrition in Web Corpora: an Exploration
Lancaster University, UNITED KINGDOM
R. Banga, Eduardo BUCEADOR, a multi-language search engine for digital libraries
Universidad de Vigo, SPAIN
Read, Jonathon The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
Universitetet i Oslo, NORWAY
Rebeyrolles, Josette An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Rebholz-Schuhmann, Dietrich Centroids: Gold standards with distributional variation
CALBC: Releasing the Final Corpora
European Bioinformatics Institute, UNITED KINGDOM
Recasens, Marta Annotating Near-Identity from Coreference Disagreements
CLiC - University of Barcelona, SPAIN
Recski, Gábor Rapid creation of large-scale corpora and frequency dictionaries
Hungarian Academy of Sciences, HUNGARY
Reddy, Siva Word Sketches for Turkish
Lexical Computing Ltd, UNITED KINGDOM
Redeker, Gisela Multi-Layer Discourse Annotation of a Dutch Text Corpus
University of Groningen, NETHERLANDS
Reed, Marian Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
LDC, UNITED STATES
Rehbein, Ines Yes we can!? Annotating English modal verbs
Universität Potsdam, GERMANY
Rello, Luz A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
Universitat Pompeu Fabra, SPAIN
Remus, Robert MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Learning Categories and their Instances by Contextual Features
Textual Characteristics for Language Engineering
University of Leipzig, Department of Computer Science, Natural Language Processing Group, GERMANY
Renau, Irene Spell Checking in Spanish: The Case of Diacritic Accents
UPF, SPAIN
Renders, Jean-Michel Task-Driven Linguistic Analysis based on an Underspecified Features Representation
Xerox Research Centre Europe, FRANCE
Ren, Xiaoai Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, UNITED STATES
Rett, Joerg The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
SAP AG, GERMANY
Reynaert, Martin Beyond SoNaR: towards the facilitation of large corpus building efforts
TiCC, Tilburg University, NETHERLANDS
Ribeiro, Joana An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
IST, PORTUGAL
Riccardi, Giuseppe Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
DISI, University of Trento, ITALY
Riccioni, Ilaria A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Macerata, ITALY
Rigau, German Multilingual Central Repository version 3.0
A proposal for improving WordNet Domains
Highlighting relevant concepts from Topic Signatures
Mapping WordNet to the Kyoto ontology
Euskal Herriko Unibertsitatea, Informatika Fakultatea, SPAIN
Rilliard, Albert Designing French Tale Corpora for Entertaining Text To Speech Synthesis
LIMSI-CNRS, FRANCE
Rinaldi, Fabio Dependency parsing for interaction detection in pharmacogenomics
Institute of Computational Linguistics, University of Zurich, SWITZERLAND
Ringger, Eric First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Brigham Young University, UNITED STATES
Rios, Annette A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank
Institute for Computational Linguistics, University of Zurich, SWITZERLAND
Rizov, Borislav Bulgarian X-language Parallel Corpus
Department of Computational Linguistics, Institute for Bulgarian, Bulgarian Academy of Sciences, BULGARIA
Roach, Michael A. EmpaTweet: Annotating and Detecting Emotions on Twitter
Human Language Technology Research Institute, University of Texas at Dallas, UNITED STATES
Robaldo, Livio NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
Pragmatic identification of the witness sets
Università di Torino, ITALY
Roberts, Kirk Annotating Spatial Containment Relations Between Events
EmpaTweet: Annotating and Detecting Emotions on Twitter
University of Texas at Dallas, UNITED STATES
Roberts, Will Using Verb Subcategorization for Word Sense Disambiguation
Saarland University, Department of Computational Linguistics, GERMANY
Robichaud, Benoît Logic Based Methods for Terminological Assessment
OLST, Université de Montréal, CANADA
Roche, Christophe Ontoterminology: How to unify terminology and ontology into a single paradigm
University of Savoie, FRANCE
Rocio, Vitor Building and Exploring Semantic Equivalences Resources
Universidade Aberta, PORTUGAL
Rodrigo, Álvaro Evaluating Machine Reading Systems through Comprehension Tests
NLP&IR group, UNED, SPAIN
Rodrigues, Paul Typing Race Games as a Method to Create Spelling Error Corpora
University of Maryland, UNITED STATES
Rodriguez-Fuentes, Luis Javier KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
University of the Basque Country, SPAIN
Rodríguez, Horacio Summarizing a multimodal set of documents in a Smart Room
TALP Research Center, Technical University of Catalonia, SPAIN
Rodríguez, Mari Carmen The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Telefónica Investigación y Desarrollo, SPAIN
Rodríguez, Miguel Ángel The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
Speech Technology Group, Telefónica Investigación y Desarrollo, SPAIN
Rögnvaldsson, Eiríkur The Icelandic Parsed Historical Corpus (IcePaHC)
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
University of Iceland, ICELAND
Rojas-Barahona, Lina M. Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
Inria Nancy, FRANCE
Romeo, Lauren Automatic lexical semantic classification of nouns
Universitat Pompeu Fabra, SPAIN
Rompré Brodeur, Eugénie Texto4Science: a Quebec French Database of Annotated Short Text Messages
Université de Montréal, CANADA
Rosen, Alexandr Building a multilingual parallel corpus for human users
Building a learner corpus
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC
Rosenthal, Sara Annotating Agreement and Disagreement in Threaded Discussion
Columbia University, UNITED STATES
Roshchina, Alexandra Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Social Media Research Group, IIT Dublin, IRELAND
Rösner, Dietmar LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Rosner, Michael Incorporating an Error Corpus into a Spellchecker for Maltese
University of Malta, MALTA
Roßdeutscher, Antje Automatic classification of German """"an"""" particle verbs
Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, GERMANY
Rosset, Sophie Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
Designing French Tale Corpora for Entertaining Text To Speech Synthesis
LIMSI ― CNRS, FRANCE
Rosso, Paolo Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
Universidad Politécnica de Valencia, SPAIN
Roth, Benjamin A Gold Standard for Relation Extraction in the Food Domain
Saarland University, Spoken Language Systems, GERMANY
Roth, Dan An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
University of Illinois (Urbana-Champaign), UNITED STATES
Rousseau, Anthony TED-LIUM: an Automatic Speech Recognition dedicated corpus
Université du Maine, LIUM, FRANCE
Roux, Justus Aspects of a Legal Framework for Language Resource Management
North-West University, SOUTH AFRICA
Rowley, Andrew Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench
University of Manchester, UNITED KINGDOM
Roxendal, Johan Korp ― the corpus infrastructure of Spräkbanken
Sprakbanken, University of Gothenburg, SWEDEN
Roy, Shourya Experiences in Resource Generation for Machine Translation through Crowdsourcing
Xerox India Research Centre, INDIA
Rozis, Roberts Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Tilde, LATVIA
Rubino, Francesco Assigning Connotation Values to Events
Customizable SCF Acquisition in Italian
The Language Library: supporting community effort for collective resource production
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser
The LRE Map. Harmonising Community Descriptions of Resources
ILC - CNR, Pisa, ITALY
Rued, Stefan Logical metonymies and qualia structures: an annotated database of logical metonymies for German
Universität Stuttgart, IMS, GERMANY
Ruiz-Martinez, Juana Maria A New Method for Evaluating Automatically Learned Terminological Taxonomies
Campus de Espinardo, SPAIN
Rumshisky, Anna Word Sense Inventories by Non-Experts.
MIT, UNITED STATES
Ruppenhofer, Josef MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Yes we can!? Annotating English modal verbs
University of Hildesheim, GERMANY
Russo, Irene Assigning Connotation Values to Events
Customizable SCF Acquisition in Italian
The Language Library: supporting community effort for collective resource production
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
The LRE Map. Harmonising Community Descriptions of Resources
ILC - CNR, Pisa, ITALY
Russo, Lorenza Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
University of Geneva, SWITZERLAND
Rutten, Robin An Oral History Annotation Tool for INTER-VIEWs
CLST, Radboud University Nijmegen, NETHERLANDS
Ryan, Michael Distractorless Authorship Verification
Duquesne University EVL Lab, UNITED STATES
Rychlý, Pavel Building a 70 billion word corpus of English from ClueWeb
Legal electronic dictionary for Czech
Masaryk University, CZECH REPUBLIC
Rygl, Jan Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification
Masarykova Univerzita, Brno, CZECH REPUBLIC
Rysova, Magdalena Alternative Lexicalizations of Discourse Connectives in Czech
Charles University in Prague, CZECH REPUBLIC
Rytting, C. Anton Typing Race Games as a Method to Create Spelling Error Corpora
University of Maryland, UNITED STATES

 

S
Sabou, Marta Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
MODUL University Vienna, Department of New Media Technology, AUSTRIA
Sacaleanu, Bogdan An Adaptive Framework for Named Entity Combination
IMC AG, GERMANY
Sadamitsu, Kugatsu Constructing a Class-Based Lexical Dictionary using Interactive Topic Models
NTT Cyber Space Laboratories, NTT Corporation, JAPAN
Sagae, Kenji Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
Institute for Creative Technologies, University of Southern California, UNITED STATES
Saggion, Horacio The CONCISUS Corpus of Event Summaries
Text Simplification Tools for Spanish
Universitat Pompeu Fabra, SPAIN
Sagot, Benoît Aleda, a free large-scale entity database for French
Cleaning noisy wordnets
Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources
Applying cross-lingual WSD to wordnet development
Evaluating and improving syntactic lexica by plugging them within a parser
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
INRIA, FRANCE
Saif, Hassan Quantising Opinions for Political Tweets Analysis
Open University, UNITED KINGDOM
Saint-Dizier, Patrick DISLOG: A logic-based language for processing discourse structures
A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
CNRS, FRANCE
Sainz, Iñaki Versatile Speech Databases for High Quality Synthesis for Basque
Using an ASR database to design a pronunciation evaluation system in Basque
University of the Basque Country, SPAIN
Saito, Kuniko Constructing a Class-Based Lexical Dictionary using Interactive Topic Models
NTT Cyber Space Laboratories, NTT Corporation, JAPAN
Sales, Afonso A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever
PUCRS, BRAZIL
Samardzic, Tanja Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification
University of Geneva, SWITZERLAND
Samih, Younes Arabic Word Generation and Modelling for Spell Checking
Heinrich-Heine-Universität, GERMANY
Sammons, Mark An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
University of Illinois (Urbana-Champaign), UNITED STATES
Sam, Sethserey Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
Institute of Technology of Cambodia, CAMBODIA
Samsudin, Nur-Hana Building Text-to-Speech Systems for Resource Poor Languages
University of Birmingham, School of Computer Science, UNITED KINGDOM
Samy, Doaa Medical Term Extraction in an Arabic Medical Corpus
Cairo University, Faculty of Arts, EGYPT
Sánchez-Cárdenas, Beatriz Linguistic knowledge for specialized text production
University of Granada, SPAIN
Sánchez-Cartagena, Víctor M. Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation
Universitat d'Alacant (UA), SPAIN
Sánchez, Jon Versatile Speech Databases for High Quality Synthesis for Basque
Using an ASR database to design a pronunciation evaluation system in Basque
University of the Basque Country, SPAIN
Sanchis, Emilio The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Universitat Politècnica de Valencia, SPAIN
Sanders, Eric An Oral History Annotation Tool for INTER-VIEWs
Collecting and Analysing Chats and Tweets in SoNaR
CLST, Radboud University Nijmegen, NETHERLANDS
Sandford Pedersen, Bolette A voting scheme to detect semantic underspecification
Towards a richer wordnet representation of properties
University of Copenhagen, DENMARK
Sangodkar, Amit Re-ordering Source Sentences for SMT
IIT Bombay, INDIA
Sanguinetti, Manuela The Parallel-TUT: a multilingual and multiformat treebank
Dipartimento di Informatica, Università di Torino, ITALY
Santos, André Structural alignment of plain text books
Universidade do Minho, PORTUGAL
Santos, Diana Págico: Evaluating Wikipedia-based information retrieval in Portuguese
University of Oslo, NORWAY
Santos, Rita Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese
University of Lisbon, PORTUGAL
San Vicente, Iñaki Building a Basque-Chinese Dictionary by Using English as Pivot
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web
Elhuyar R&D, SPAIN
Saquete, Estela TIMEN: An Open Temporal Expression Normalisation Resource
Massively Increasing TIMEX3 Resources: A Transduction Approach
University of Alicante, SPAIN
Saralegi, Xabier Building a Basque-Chinese Dictionary by Using English as Pivot
Elhuyar R&D, SPAIN
Saratxaga, Ibon Versatile Speech Databases for High Quality Synthesis for Basque
Strategies to Improve a Speaker Diarisation Tool
Using an ASR database to design a pronunciation evaluation system in Basque
University of the Basque Country, SPAIN
Saravanan, K An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
Microsoft Research India, INDIA
Sasaki, Felix Evaluating the Impact of Phrase Recognition on Concept Tagging
German Research Center for Artificial Intelligence, GERMANY
Sasaki, Minoru Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples
Ibaraki University, JAPAN
Sassi, Manuela From medical language processing to BioNLP domain
, ITALY
Sato, Hiroaki A Search Tool for FrameNet Constructicon
Senshu University, JAPAN
Sato, Satoshi Dictionary Look-up with Katakana Variant Recognition
Nagoya University, JAPAN
Savkov, Aleksandar Linguistic Analysis Processing Line for Bulgarian
University of Sussex, UNITED KINGDOM
Sawalha, Majdi Predicting Phrase Breaks in Classical and Modern Standard Arabic Text
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
University of Leeds, UNITED KINGDOM
Sawyer, Jennifer An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
University of California Santa Cruz, UNITED STATES
Scagliola, Stef An Oral History Annotation Tool for INTER-VIEWs
Veterans Institute, centre for knowledge and expertise, NETHERLANDS
Scarpato, Noemi Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
University of Rome, Tor Vergata, ITALY
Schabus, Dietmar Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis
FTW Telecommunications Research Center Vienna, AUSTRIA
Schäfer, Roland Building Large Corpora from the Web Using a New Efficient Tool Chain
Freie Universität Berlin, GERMANY
Schäfer, Ulrich A Graphical Citation Browser for the ACL Anthology
DFKI GmbH, GERMANY
Scharl, Arno Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
MODUL University Vienna, Department of New Media Technology, AUSTRIA
Scheible, Christian Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank
Universität Stuttgart, IMS, GERMANY
Scheible, Silke GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
University of Manchester, UNITED KINGDOM
Schein, Aaron International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MITRE Corporation, UNITED STATES
Scherer, Stefan An audiovisual political speech analysis incorporating eye-tracking and perception data
University of Southern California, Institute for Creative Technologies, UNITED STATES
Scherrer, Yves The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction
Université de Genève, Département de Linguistique, SWITZERLAND
Schiel, Florian Statistical Evaluation of Pronunciation Encoding
Bavarian Archive for Speech Signals, Ludwig-Maximilians-Universität München, GERMANY
Schierle, Martin Textual Characteristics for Language Engineering
A Survey of Text Mining Architectures and the UIMA Standard
Mercedes-Benz R&D North America, UNITED STATES
Schlaf, Antje Learning Categories and their Instances by Contextual Features
Universität Leipzig, GERMANY
Schlogl, Stephan Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
Trinity College Dublin, IRELAND
Schlüter, Patrick DGT-TM: A freely available Translation Memory in 22 languages
European Commission, Directorate General for Translation (DGT), LUXEMBOURG
Schmidt, Christoph RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
RWTH Aachen University, GERMANY
Schmidt, Thomas EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language
Institut für Deutsche Sprache, GERMANY
Schmitt, Alexander A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System
Universität Ulm, GERMANY
Schneider, Daniel AVATecH ― automated annotation through audio and video analysis
Fraunhofer IAIS Institute, GERMANY
Schneider, Gerold Dependency parsing for interaction detection in pharmacogenomics
Institute of Computational Linguistics, University of Zurich, SWITZERLAND
Schneider, Michael Identifying Nuggets of Information in GALE Distillation Evaluation
BAE Systems, UNITED STATES
Schnier, Christian PAMOCAT: Automatic retrieval of specified postures
Applied Informatics Group, University of Bielefeld, GERMANY
Schnober, Carsten The New IDS Corpus Analysis Platform: Challenges and Prospects
Evaluating Query Languages for a Corpus Processing System
Institute for the German Language (IDS), GERMANY
Schonefeld, Oliver The New IDS Corpus Analysis Platform: Challenges and Prospects
Institute for the German Language (IDS), GERMANY
Schreer, Oliver AVATecH ― automated annotation through audio and video analysis
Fraunhofer-Heinrich Hertz Institute, GERMANY
Schröder, Marc META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
DFKI GmbH, GERMANY
Schröder, Susann Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
Friedrich-Schiller-Universität Jena, GERMANY
Schroeder Richerson, Elizabeth International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
MITRE Corporation, UNITED STATES
Schulte im Walde, Sabine Automatic classification of German """"an"""" particle verbs
Association Norms of German Noun Compounds
Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, GERMANY
Schulz, Julia Maria A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content
University of Hildesheim, GERMANY
Schulz, Stefan Creation and use of Language Resources in a Question-Answering eHealth System
Freiburg University, GERMANY
Schumann, Anne-Kathrin Knowledge-Rich Context Extraction and Ranking with KnowPipe
Universität Wien, Zentrum für Translationswissenschaft, AUSTRIA
Schütze, Hinrich Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank
Universität Stuttgart, IMS, GERMANY
Schuurman, Ineke Beyond SoNaR: towards the facilitation of large corpus building efforts
CCL, Leuven University, BELGIUM
Schwarz, Christoph Large Scale Lexical Analysis
Linguatec, GERMANY
Schwenk, Holger Automatic Translation of Scientific Documents in the HAL Archive
LIUM, University of Le Mans, FRANCE
Sclaroff, Stan Detecting Reduplication in Videos of American Sign Language
Boston University, UNITED STATES
Scogin, Sarah Identifying Nuggets of Information in GALE Distillation Evaluation
BAE Systems, UNITED STATES
Scott, Donia Corpus Annotation as a Scientific Task
University of Sussex, UNITED KINGDOM
Seddah, Djamé Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
INRIA, FRANCE
Seeker, Wolfgang Making Ellipses Explicit in Dependency Conversion for a German Treebank
German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data
Universität Stuttgart, IMS, GERMANY
Segarra, Encarna The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Universitat Politècnica de Valencia, SPAIN
Seinturier, Julien An ontological approach to model and query multimodal concurrent linguistic annotations
Université du Sud Toulon Var, FRANCE
Seiss, Melanie The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
A Rule-based Morphological Analyzer for Murrinh-Patha
Konstanz University, GERMANY
Seljan, Sanja BLEU Evaluation of Machine-Translated English-Croatian Legislation
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA
Semecký, Jiří Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Semenkin, Eugene Speech and Language Resources for LVCSR of Russian
Insitute for System Analysis, Siberian State Aerospace University, RUSSIAN FEDERATION
Semmar, Nasredine Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
CEA-LIST, FRANCE
Senft, Gunter Citing on-line Language Resources
Max-Planck Institute for Psycholinguistics, NETHERLANDS
Seo, Hongsuck Grammatical Error Annotation for Korean Learners of Spoken English
Postech, KOREA, REPUBLIC OF
Sepesy Maucec, Mirjam SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
University of Maribor, SLOVENIA
Seppi, Kevin First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
Brigham Young University, UNITED STATES
Serafini, Luciano The KnowledgeStore: an Entity-Based Storage System
Fondazione Bruno Kessler, ITALY
Seraji, Mojgan A Basic Language Resource Kit for Persian
Uppsala University, SWEDEN
Sérasset, Gilles Dbnary: Wiktionary as a LMF based Multilingual RDF network
UJF-Grenoble 1, Laboratoire d’Informatique de Grenoble, FRANCE
Seretan, Violeta Acquisition of Syntactic Simplification Rules for French
FTI/TIM, University of Geneva, SWITZERLAND
Sezer, Taner Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Sgall, Petr Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Shaalan, Khaled Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
The British University in Dubai, UNITED ARAB EMIRATES
Shaikh, Samira Revealing Contentious Concepts Across Social Groups
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, SUNY, UNITED STATES
Shamsfard, Mehrnoush Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Shahid Beheshti University, IRAN, ISLAMIC REPUBLIC OF
Sharaf, Abdul-Baquee QurAna: Corpus of the Quran annotated with Pronominal Anaphora
QurSim: A corpus for evaluation of relatedness in short texts
University of Leeds, UNITED KINGDOM
Sharma Grover, Aditi Aspects of a Legal Framework for Language Resource Management
Meraka Institute, CSIR, SOUTH AFRICA
Sharoff, Serge Identifying Word Translations from Comparable Documents Without a Seed Lexicon
University of Leeds, UNITED KINGDOM
Shaw, Barbara Creating HAVIC: Heterogeneous Audio Visual Internet Collection
NIST, UNITED STATES
Sheykholeslam, Mohammad Hoseyn A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Computer Research Center of Islamic Sciences, IRAN, ISLAMIC REPUBLIC OF
Shi, Chunqi Two Phase Evaluation for Selecting Machine Translation Services
Service Composition Scenarios for Task-Oriented Translation
Kyoto University, JAPAN
Shi, Dingxu A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, HONG KONG
Shimada, Masahiko Two Phase Evaluation for Selecting Machine Translation Services
Kyoto University, JAPAN
Shima, Hideki Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource
Carnegie Mellon University, UNITED STATES
Shinnou, Hiroyuki Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples
Ibaraki University, JAPAN
Shirai, Katsuhiko Method for Collection of Acted Speech Using Various Situation Scripts
Waseda University, JAPAN
Shi, Ying Pursing power in Arabic on-line discussion forums
Language Computer Corporation, UNITED STATES
Shoaib, Umar A platform-independent user-friendly dictionary from Italian to LIS
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY
Shvets, Alexander Speech and Language Resources for LVCSR of Russian
Institute for System Analysis of RAS, RUSSIAN FEDERATION
Sidorov, Maxim Speech and Language Resources for LVCSR of Russian
Insitute for System Analysis, Siberian State Aerospace University, RUSSIAN FEDERATION
Siegert, Ingo Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Sierra, Gerardo Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
Instituto de Ingeniería, Universidad Nacional Autónoma de México, MEXICO
Sigurðsson, Einar Freyr The Icelandic Parsed Historical Corpus (IcePaHC)
University of Iceland, ICELAND
Sikimić, Biljana Multimedia database of the cultural heritage of the Balkans
Institute for Balkan Studies of Serbian Academy of Sciences and Arts, SERBIA
Silva, Douglas Portuguese Text Generation from Large Corpora
School of Arts, Sciences and Humanities, University of Sao Paulo, BRAZIL
Silva, João A PropBank for Portuguese: the CINTIL-PropBank
Dealing with unknown words in statistical machine translation
University of Lisbon, PORTUGAL
Silveira, Sara A PropBank for Portuguese: the CINTIL-PropBank
University of Lisbon, PORTUGAL
Simelio, Nuria Using Language Resources in Humanities research
Universitat Autònoma de Barcelona, SPAIN
Simionescu, Radu Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
Faculty of Computer Science, “Alexandru Ioan Cuza” University Iași, ROMANIA
Simões, Alberto Págico: Evaluating Wikipedia-based information retrieval in Portuguese
CEHUM/UM, PORTUGAL
Simov, Kiril A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian
Linguistic Analysis Processing Line for Bulgarian
The Political Speech Corpus of Bulgarian
Linguistic Modelling Department, IICT-BAS, BULGARIA
Šindlerová, Jana Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Singh, Anil Kumar A GUI to Detect and Correct Errors in Hindi Dependency Treebank
A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation
School of Computer Engg., KIIT University, INDIA
Singh, Pooja Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
KIIT College of Engineering, INDIA
Sinha, Shweta Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
KIIT College of Engineering, INDIA
Şirin, Utku METU Turkish Discourse Bank Browser
Middle East Technical University, TURKEY
Sjögren, Gerd On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Interverbum Technology, SWEDEN
Skadiņa, Inguna Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Collecting and Using Comparable Corpora for Statistical Machine Translation
Tilde, LATVIA
Skeppstedt, Maria Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text
Stockholm University, SWEDEN
Sloetjes, Han ELAN development, keeping pace with communities' needs
Max Planck Institute for Psycholinguistics, NETHERLANDS
Sluban, Borut Irregularity Detection in Categorized Document Corpora
Institut Jozef Stefan, Department of Knowledge Technologies, SLOVENIA
Small, Sharon Revealing Contentious Concepts Across Social Groups
Department of Computer Science, Siena College, UNITED STATES
Smejkalová, Lenka A database of semantic clusters of verb usages
Charles University in Prague, CZECH REPUBLIC
Smith, Christian A good space: Lexical predictors in word space evaluation
This also affects the context - Errors in extraction based summaries
Santa Anna IT Research Institute AB & Linköping University, SWEDEN
Šnajder, Jan Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian
University of Zagreb, CROATIA
Søgaard, Anders DSim, a Danish Parallel Corpus for Text Simplification
University of Copenhagen, CST, DENMARK
Šojat, Krešimir Generation of Verbal Stems in Derivationally Rich Language
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA
Somasundaram, Aarthy ELAN development, keeping pace with communities' needs
Max Planck Institute for Psycholinguistics, NETHERLANDS
Song, Yan Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation
University of Washington, UNITED STATES
Song, Zhiyi Linguistic Resources for Handwriting Recognition and Translation Evaluation
LDC, UNITED STATES
Soria, Claudia The LRE Map. Harmonising Community Descriptions of Resources
The FLaReNet Strategic Language Resource Agenda
ILC/CNR, ITALY
Soroa, Aitor Matching Cultural Heritage items to Wikipedia
University of the Basque Country, SPAIN
Specia, Lucia PET: a Tool for Post-editing and Assessing Machine Translation
University of Wolverhampton, UNITED KINGDOM
Speer, Robert Representing General Relational Knowledge in ConceptNet 5
MIT, UNITED STATES
Spiliotopoulos, Dimitris Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
Athens Technology Center, GREECE
Spitkovsky, Valentin I. A Cross-Lingual Dictionary for English Wikipedia Concepts
Google Research, Google Inc., UNITED STATES
Sporleder, Caroline Evaluating Machine Reading Systems through Comprehension Tests
Saarland University, GERMANY
Spousta, Miroslav A High-Quality Web Corpus of Czech
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC
Spoustová, Johanka A High-Quality Web Corpus of Czech
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC
Springorum, Sylvia Automatic classification of German """"an"""" particle verbs
Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, GERMANY
Sprugnoli, Rachele CAT: the CELCT Annotation Tool
CELCT, ITALY
Spyns, Peter Smooth Sailing for STEVIN
Nederlandse Taalunie, NETHERLANDS
Srikumar, Vivek An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
University of Illinois (Urbana-Champaign), UNITED STATES
Stahl, Ken Revealing Contentious Concepts Across Social Groups
University at Albany, SUNY, UNITED STATES
Štajner, Sanja Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach
University of Wolverhampton, UNITED KINGDOM
Sta. Maria, Madelene Building a Multimodal Laughter Database for Emotion Recognition
Center for Empathic Human-Computer Interactions, De La Salle University, PHILIPPINES
Stanilovsky, Evgeny FreeLing 3.0: Towards Wider Multilinguality
FreeLing project developer, RUSSIAN FEDERATION
Stanković, Ranka A tool for enhanced search of multilingual digital libraries of e-journals
University of Belgrade, SERBIA
Stavropoulou, Pepi Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
University of Athens, GREECE
Stede, Manfred SemScribe: Natural Language Generation for Medical Reports
Applied Computational Linguistics Lab, University of Potsdam, POLAND
Ștefănescu, Dan ROMBAC: The Romanian Balanced Annotated Corpus
RACAI, Romanian Academy Bucharest, ROMANIA
Stehouwer, Herman Semantic metadata mapping in practice: the Virtual Language Observatory
Federated Search: Towards a Common Search Infrastructure
Max Planck Institute for Psycholinguistics, NETHERLANDS
Steinberger, Ralf DGT-TM: A freely available Translation Memory in 22 languages
JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool
European Commission, Joint Research Centre (JRC), ITALY
Stein, Daniel Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel
Fraunhofer IAIS, GERMANY
Stellato, Armando PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
ART Research Group, University of Rome, ITALY
Štěpánek, Jan HamleDT: To Parse or Not to Parse?
Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Stern, Rosa Aleda, a free large-scale entity database for French
Agence France-Presse, FRANCE
Stevenson, Mark Matching Cultural Heritage items to Wikipedia
Mapping WordNet synsets to Wikipedia articles
University of Sheffield, UNITED KINGDOM
Štindlová, Barbora Building a learner corpus
Charles University in Prague, CZECH REPUBLIC
Stoyanova, Ivelina Bulgarian X-language Parallel Corpus
Department of Computational Linguistics, Institute for Bulgarian, Bulgarian Academy of Sciences, BULGARIA
Strapparava, Carlo Brand Pitt: A Corpus to Explore the Art of Naming
A Parallel Corpus of Music and Lyrics Annotated with Emotions
NgramQuery - Smart Information Extraction from Google N-gram using External Resources
FBK-Irst, ITALY
Strassel, Stephanie Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
Linguistic Resources for Handwriting Recognition and Translation Evaluation
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES
Strik, Helmer The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
Radboud University Nijmegen, NETHERLANDS
Stromer-Galley, Jennifer Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, UNITED STATES
Strötgen, Jannik Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards
Universität Heidelberg, GERMANY
Strube, Michael Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
HITS gGmbH, GERMANY
Strzalkowski, Tomek Revealing Contentious Concepts Across Social Groups
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, SUNY, UNITED STATES
Stüker, Sebastian The KIT Lecture Corpus for Speech Translation
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Karlsruhe Institute of Technology, GERMANY
Stymne, Sara Alignment-based reordering for SMT
Eye Tracking as a Tool for Machine Translation Error Analysis
On the practice of error analysis for machine translation evaluation
Linköping University, SWEDEN
Suarez, Merlin Teodosia Building a Multimodal Laughter Database for Emotion Recognition
Center for Empathic Human-Computer Interactions, De La Salle University, PHILIPPINES
Su, Fangzhong Development and Application of a Cross-language Document Comparability Metric
Collecting and Using Comparable Corpora for Statistical Machine Translation
University of Leeds, UNITED KINGDOM
Şulea, Octavia-Maria The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System
University of Bucharest, ROMANIA
Sulger, Sebastian A Reference Dependency Bank for Analyzing Complex Predicates
Universität Konstanz, GERMANY
Sundberg, Rasmus Visualizing Sentiment Analysis on a User Forum
Lund University, SWEDEN
Sutcliffe, Richard Evaluating Machine Reading Systems through Comprehension Tests
University of Limerick, IRELAND
Suzuki, Takafumi Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Tsukuba, JAPAN
Swift, Mary An Annotation Scheme for Quantifier Scope Disambiguation
University of Rochester, UNITED STATES
Szabó, Martina Katalin HunOr: A Hungarian―Russian Parallel Corpus
University of Szeged, Department of Hungarian Linguistics, HUNGARY
Szasz, Sandra The CONCISUS Corpus of Event Summaries
Universitat Pompeu Fabra, SPAIN
Szekely, Eva Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Sznajder, Marta Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
University of Amsterdam, NETHERLANDS
Szymanik, Jakub Pragmatic identification of the witness sets
Institute of Artificial Intelligence, University of Groningen, NETHERLANDS

 

T
Taboada, Maite A review corpus annotated for negation, speculation and their scope
Simon Fraser University, CANADA
Tadić, Marko Generation of Verbal Stems in Derivationally Rich Language
Croatian Dependency Treebank: Recent Development and Initial Experiments
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA
Taghipour, Kaveh A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
Amirkabir University of Technology, IRAN, ISLAMIC REPUBLIC OF
Tagnin, Stella Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
Department of Modern Languages, University of São Paulo, BRAZIL
Tahon, Marie Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis
LIMSI ― CNRS, FRANCE
Takács, Dávid Automatically Generated Online Dictionaries
Dept. of Language Technology, Research Institute for Linguistics, HAS, HUNGARY
Takamori, Emi Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Universidad Autónoma de Madrid, SPAIN
Takanashi, Katsuya Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Kyoto University, JAPAN
Takeda, Kazuya Causal analysis of task completion errors in spoken music retrieval interactions
Nagoya University, JAPAN
Takezawa, Toshiyuki Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
Hiroshima City University, JAPAN
Tamburini, Fabio AnIta: a powerful morphological analyser for Italian
A topologic view of Topic and Focus marking in Italian
Dept. of Linguistics and Oriental Studies, University of Bologna, ITALY
Tamchyna, Aleš The Joy of Parallelism with CzEng 1.0
Charles University in Prague, CZECH REPUBLIC
Tanasijević, Ivana Multimedia database of the cultural heritage of the Balkans
Faculty of Mathematics, University of Belgrade, SERBIA
Tang, Guoyu CLTC: A Chinese-English Cross-lingual Topic Corpus
Tsinghua University, CHINA
Tanguy, Ludovic An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Tang, Yi-jie Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition
Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
National Taiwan University, TAIWAN, PROVINCE OF CHINA
Tannier, Xavier WebAnnotator, an Annotation Tool for Web Pages
Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Evolution of Event Designation in Media: Preliminary Study
A Rough Set Formalization of Quantitative Evaluation with Ambiguity
LIMSI ― CNRS, FRANCE
Taslimipoor, Shiva Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
Shiraz University, IRAN, ISLAMIC REPUBLIC OF
Tatu, Marta A Tool for Extracting Conversational Implicatures
Lymba Corporation, UNITED STATES
Tavares, Leonor Project FLY: a multidisciplinary project within Linguistics
Centro de Linguística da Universidade de Lisboa, PORTUGAL
Tavarez, David Strategies to Improve a Speaker Diarisation Tool
University of the Basque Country, SPAIN
Tavosanis, Mirko Creation of a bottom-up corpus-based ontology for Italian Linguistics
Università di Pisa, ITALY
Taylor, Sarah Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
Sarah M. Taylor Consulting, UNITED STATES
Teich, Elke Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach
Saarland University, GERMANY
Teissèdre, Charles Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
MoDyCo-CNRS - UMR 7114, Université Paris Ouest Nanterre La Défense, FRANCE
Tellier, Isabelle Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
Université Paris 3 - sorbonne Nouvelle, LaTTiCe, CNRS, FRANCE
Temnikova, Irina CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations
Research Institute in Information and Language Processing, University of Wolverhampton, UNITED KINGDOM
Tenjes, Silvi Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data
University of Tartu, ESTONIA
Tepper, Michael Statistical Section Segmentation in Free-Text Clinical Records
Department of Linguistics, University of Washington, UNITED STATES
Terai, Asuka The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues
Tokyo Institute of Technology, JAPAN
Thomas, Arthur LIE: Leadership, Influence and Expertise
Oxford Internet Institute, UNITED KINGDOM
Thompson, Paul Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Identification of Manner in Bio-Events
University of Manchester, UNITED KINGDOM
Thuilier, Juliette Semantic annotation of French corpora: animacy and verb semantic classes
Université Paris 7, FRANCE
Thurmair, Gregor Large Scale Lexical Analysis
Linguatec, GERMANY
Tiedemann, Jörg A Distributed Resource Repository for Cloud-Based Machine Translation
Parallel Data, Tools and Interfaces in OPUS
Large aligned treebanks for syntax-based machine translation
Uppsala University, SWEDEN
Tiotto, Gabriele A platform-independent user-friendly dictionary from Italian to LIS
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY
Todirascu, Amalia French and German Corpora for Audience-based Text Type Classification
LiLPA, Université de Strasbourg, FRANCE
Todisco, Massimiliano Intelligibility assessment in forensic applications
Department of Electronic Engineering, University of Rome “Tor Vergata”, ITALY
Tokunaga, Takenobu The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues
Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions
Tokyo Institute of Technology, JAPAN
Tolone, Elsa Evaluating and improving syntactic lexica by plugging them within a parser
Extending the adverbial coverage of a French morphological lexicon
FaMAF, Universidad Nacional de Cordoba, ARGENTINA
Toman, Josef Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Tomaselli, Alessandra A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
Department of Foreign Languages and Literatures― University of Verona, ITALY
Tomlinson, Marc Pursing power in Arabic on-line discussion forums
Language Computer Corporation, UNITED STATES
Tonelli, Sara Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
FBK-IRST, ITALY
Topkaya, Ibrahim Saygin SUTAV: A Turkish Audio-Visual Database
Sabanci University, TURKEY
Toporowska Gronostaj, Maria The Rocky Road towards a Swedish FrameNet - Creating SweFN
University of Gothenburg, SWEDEN
Toral, Antonio Towards a User-Friendly Platform for Building Language Resources based on Web Services
DCU, IRELAND
Torner, Sergi The IULA Treebank
Universitat Pompeu Fabra, SPAIN
Tounsi, Lamia Automatic Extraction and Evaluation of Arabic LFG Resources
School of Computing, Dublin City University, IRELAND
Toyota, Itsuki Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Tsukuba, JAPAN
Trancoso, Isabel Dealing with unknown words in statistical machine translation
Instituto Superior Técnico, Technical University of Lisbon, PORTUGAL
Tran, Do-Dat MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
MICA Institute, VIET NAM
Traum, David ISO 24617-2: A semantically-based standard for dialogue annotation
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
The Twins Corpus of Museum Visitor Questions
Tilburg University, NETHERLANDS
Traumüller, Jenny Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
Friedrich-Schiller-Universität Jena, GERMANY
Treurniet, Maaske Collection of a corpus of Dutch SMS
CLST, Centre for Language and Speech Technology, Radboud University Nijmegen, NETHERLANDS
Trevisan, Marco Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
CELI, ITALY
Trilsbeek, Paul The Language Archive ― a new hub for language resources
Max Planck Institute for Psycholinguistics, NETHERLANDS
Trippel, Thorsten A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Standardizing a Component Metadata Infrastructure
Universität Tübingen, GERMANY
Trtovac, Aleksandra A tool for enhanced search of multilingual digital libraries of e-journals
University of Belgrade, SERBIA
Tscherwinka, Cindy Involving Language Professionals in the Evaluation of Machine Translation
Euroscript Deutschland, GERMANY
Tschöpel, Sebastian AVATecH ― automated annotation through audio and video analysis
Fraunhofer IAIS Institute, GERMANY
Tsourakis, Nikos A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System
University of Geneva, TIM/ISSCO, SWITZERLAND
Tsuchiya, Masatoshi Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
Toyohashi University of Technology, JAPAN
Tsujii, Jun'ichi Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Microsoft Research Asia, CHINA
Tufiș, Dan ROMBAC: The Romanian Balanced Annotated Corpus
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information
Collecting and Using Comparable Corpora for Statistical Machine Translation
RACAI, Romanian Academy Bucharest, ROMANIA
Turbati, Andrea PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases
ART Research Group, University of Rome, ITALY
Turchi, Marco JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool
European Commision, Joint Research Centre (JRC), ITALY
Turmo, Jordi Summarizing a multimodal set of documents in a Smart Room
TALP Research Center, Technical University of Catalonia, SPAIN
Tyers, Francis A finite-state morphological transducer for Kyrgyz
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
Universitat d'Alacant (UA), SPAIN

 

U
Uchiyama, Kiyoko Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
National Institute of Informatics, JAPAN
Udupa, Raghavendra An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
Microsoft Research India, INDIA
Uhrig, Peter Efficient Dependency Graph Matching with the IMS Open Corpus Workbench
Universität Erlangen-Nuremberg, GERMANY
Uí Dhonnchadha, Elaine Irish Treebanking and Parsing: A Preliminary Evaluation
Trinity College Dublin, IRELAND
Ultes, Stefan A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System
Universität Ulm, GERMANY
Um, Yongnam Dysarthric Speech Database for Development of QoLT Software Technology
The Catholic Univ. of Korea, KOREA, REPUBLIC OF
Unal, Erdem Turkish Paraphrase Corpus
Tübitak-Bilgem, TURKEY
Uneson, Marcus SMALLWorlds -- Multilingual Content-Controlled Monologues
Lund University, SWEDEN
Uppström, Jonatan The open lexical infrastructure of Spräkbanken
Sprakbanken, University of Gothenburg, SWEDEN
Uren, Victoria Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
University of Aston, UNITED KINGDOM
Urešová, Zdeňka Announcing Prague Czech-English Dependency Treebank 2.0
Charles University in Prague, CZECH REPUBLIC
Uryupina, Olga Domain-specific vs. Uniform Modeling for Coreference Resolution
Università di Trento, Dipartimento Ingegneria e Scienza dell'Informazione, ITALY
Usabaev, Bela Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel
Fraunhofer IAIS, GERMANY
Uszkoreit, Hans Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World
DFKI GmbH, GERMANY
Utsuro, Takehito Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
University of Tsukuba, JAPAN
Utvić, Miloš A tool for enhanced search of multilingual digital libraries of e-journals
University of Belgrade, SERBIA

 

V
Vaassen, Frederik The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Vaidya, Ashwini Empty Argument Insertion in the Hindi PropBank
University of Colorado at Boulder, UNITED STATES
Válková, Lucie Balanced data repository of spontaneous spoken Czech
Institute of the Czech National Corpus, Charles University in Prague, CZECH REPUBLIC
van Cranenburgh, Andreas Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
University of Amsterdam, NETHERLANDS
Vandeghinste, Vincent Example-Based Treebank Querying
Large aligned treebanks for syntax-based machine translation
Centre for Computational Linguistics, KU Leuven, BELGIUM
van de Loo, Janneke The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
University of Antwerp, BELGIUM
Van den Bosch, Antal DutchSemCor: Targeting the ideal sense-tagged corpus
Radboud University Nijmegen, NETHERLANDS
van den Brekel, M. NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
The Netherlands Cancer Institute, NETHERLANDS
van den Heuvel, Henk An Oral History Annotation Tool for INTER-VIEWs
Collection of a corpus of Dutch SMS
CLST, Radboud University Nijmegen, NETHERLANDS
Vanderdonckt, Jean The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
Université catholique de Louvain, BELGIUM
van der Molen, L. NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
The Netherlands Cancer Institute, NETHERLANDS
van der Torre, Leon NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
University of Luxembourg, LUXEMBOURG
van der Vliet, Nynke Multi-Layer Discourse Annotation of a Dutch Text Corpus
University of Groningen, NETHERLANDS
Vanderwende, Lucy Statistical Section Segmentation in Free-Text Clinical Records
Microsoft Research, UNITED STATES
Van Doremalen, Joost The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
Radboud University Nijmegen, NETHERLANDS
Van Eynde, Frank Example-Based Treebank Querying
Centre for Computational Linguistics, KU Leuven, BELGIUM
van Genabith, Josef Irish Treebanking and Parsing: A Preliminary Evaluation
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Dublin City University, IRELAND
van Gompel, Maarten Beyond SoNaR: towards the facilitation of large corpus building efforts
Radboud University Nijmegen, NETHERLANDS
Van Huyssteen, Gerhard Aspects of a Legal Framework for Language Resource Management
North-West University, SOUTH AFRICA
van Mulligen, Erik CALBC: Releasing the Final Corpora
Erasmus Medical Centre, NETHERLANDS
van Son, R.J.JH. NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
The Netherlands Cancer Institute, NETHERLANDS
van Uytvanck, Dieter Semantic metadata mapping in practice: the Virtual Language Observatory
Proper Language Resource Centers
Standardizing a Component Metadata Infrastructure
Citing on-line Language Resources
Max Planck Institute for Psycholinguistics, NETHERLANDS
Van Waes, Luuk From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
University of Antwerp, BELGIUM
Varga, Andrea Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
Unsupervised document zone identification using probabilistic graphical models
University of Sheffield, UNITED KINGDOM
Varga, Dániel Rapid creation of large-scale corpora and frequency dictionaries
BME MOKK, HUNGARY
Varges, Sebastian SemScribe: Natural Language Generation for Medical Reports
Applied Computational Linguistics Lab, University of Potsdam, POLAND
Varma, Vasudeva Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification
International Institute of Information Technology, INDIA
Varona, Amparo KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
University of the Basque Country, SPAIN
Vasilescu, Ioana Cross-lingual studies of ASR errors: paradigms for perceptual evaluations
LIMSI - CNRS, FRANCE
Vasiljevs, Andrejs Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Collecting and Using Comparable Corpora for Statistical Machine Translation
Tilde, LATVIA
Vaughan, Brian The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
Trinity College Dublin, IRELAND
Vavřín, Martin Building a multilingual parallel corpus for human users
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC
Vázquez, Silvia A Classification of Adjectives for Polarity Lexicons Enhancement
The IULA Treebank
Universitat Pompeu Fabra, SPAIN
Velardi, Paola A New Method for Evaluating Automatically Learned Terminological Taxonomies
Sapienza Università di Roma, ITALY
Venhuizen, Noortje Developing a large semantically annotated corpus
University of Groningen, NETHERLANDS
Venturi, Giulia Enriching the ISST-TANL Corpus with Semantic Frames
ILC - CNR, Pisa, ITALY
Vergez-Couret, Marianne An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
CLLE, Univ. Toulouse, FRANCE
Verhagen, Marc ATLIS: Identifying Locational Information in Text Automatically
The TARSQI Toolkit
Brandeis University, UNITED STATES
Verlic, Mateja Collecting and Using Comparable Corpora for Statistical Machine Translation
Zemanta, UNITED STATES
Verma, Sudha Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
University of Colorado, UNITED STATES
Vertan, Cristina Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation
University of Hamburg, GERMANY
Vetulani, Zygmunt Wordnet Based Lexicon Grammar for Polish
Adam Mickiewicz University, Faculty of Mathematics and Computer Science, Department of Computer Linguistics and Artificial Intelligence, POLAND
v. Hahn, Walther Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation
University of Hamburg, GERMANY
Vičić, Tomislav BLEU Evaluation of Machine-Translated English-Croatian Legislation
Freelance translator, CROATIA
Victorri, Bernard ANALEC: a New Tool for the Dynamic Annotation of Textual Data
CNRS, FRANCE
Vieira, Renata Corpus+WordNet thesaurus generation for ontology enriching
A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever
PUCRS, BRAZIL
Vieu, Laure An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
IRIT, Univ. Toulouse, FRANCE
Vieweg, Sarah Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
University of Colorado, UNITED STATES
Viitaniemi, Ville Comparing computer vision analysis of signed language video with motion capture recordings
Aalto University School of Science, FINLAND
Vilar, David Involving Language Professionals in the Evaluation of Machine Translation
DFKI GmbH, GERMANY
Villaneau, Jeanne A French Fairy Tale Corpus syntactically and semantically annotated
IRISA, FRANCE
Villavicencio, Aline A large scale annotated child language construction database
Federal University of Rio Grande do Sul, BRAZIL
Villegas, Marta The IULA Treebank
The IULA Treebank
Using Language Resources in Humanities research
Universitat Pompeu Fabra, SPAIN
Villemonte de la Clergerie, Éric Evaluating and improving syntactic lexica by plugging them within a parser
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
Alpage, INRIA Paris-Rocquencourt & Université Paris 7, FRANCE
Vilnat, Anne Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French
Kitten: a tool for normalizing HTML and extracting its textual content
A contrastive review of paraphrase acquisition techniques
LIMSI-CNRS & Univ. Paris-Sud, FRANCE
Vincze, Veronika Light Verb Constructions in the SzegedParalellFX English--Hungarian Parallel Corpus
HunOr: A Hungarian―Russian Parallel Corpus
Hungarian Academy of Sciences, HUNGARY
Virk, Shafqat Mumtaz An Open Source Persian Computational Grammar
University of Gothenburg, SWEDEN
Visser, Tom Dynamic web service deployment in a cloud environment
Sara, NETHERLANDS
Visweswariah, Karthik A Study of Word-Classing for MT Reordering
IBM Research India, INDIA
Vivaldi, Jorge Iula2Standoff: a tool for creating standoff documents for the IULACT
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
The IULA Treebank
UPF, SPAIN
Vogel, John ATLIS: Identifying Locational Information in Text Automatically
Brandeis University, UNITED STATES
Volk, Martin SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
TextShuttle GmbH, SWITZERLAND
Volodina, Elena Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish
University of Gothenburg, SWEDEN
Voss, Clare Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES
Vossen, Piek Building a fine-grained subjectivity lexicon from a web corpus
DutchSemCor: Targeting the ideal sense-tagged corpus
Mapping WordNet to the Kyoto ontology
VU University, Faculty of Arts, NETHERLANDS
Voutilainen, Atro Improving corpus annotation productivity: a method and experiment with interactive tagging
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
University of Helsinki, FINLAND
Voyatzi, Stavroula Extending the adverbial coverage of a French morphological lexicon
LIGM, Université Paris-Est, FRANCE

 

W
Waclawičová, Martina Balanced data repository of spontaneous spoken Czech
Institute of the Czech National Corpus, Charles University in Prague, CZECH REPUBLIC
Waibel, Alex The KIT Lecture Corpus for Speech Translation
Karlsruhe Institute of Technology, GERMANY
Walker, Marilyn A Corpus for Research on Deliberation and Debate
An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
UCSC UARC, UNITED STATES
Wallenberg, Joel The Icelandic Parsed Historical Corpus (IcePaHC)
University of Newcastle, AUSTRALIA
Waltinger, Ulli MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
University of Bielefeld, Artificial Intelligence Group, GERMANY
Wang, Chieh-Jen NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
National Taiwan University, TAIWAN, PROVINCE OF CHINA
Wang, Dong Effort of Genre Variation and Prediction of System Performance
University of Washington, UNITED STATES
Wang, Rui Constructing a Question Corpus for Textual Semantic Relations
Joint Grammar and Treebank Development for Mandarin Chinese with HPSG
DFKI GmbH, GERMANY
Wang, Wei Evaluation of Unsupervised Information Extraction
CEA, LIST, Vision and Content Engineering Laboratory, FRANCE
Wang, Xiaolin Spell Checking for Chinese
Shanghai Jiao Tong University, CHINA
Wang, Xinkai Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
University of Manchester, UNITED KINGDOM
Wardyński, Adam KPWr: Towards a Free Corpus of Polish
Wroclaw University of Technology, POLAND
Washington, Jonathan A finite-state morphological transducer for Kyrgyz
Indiana University, UNITED STATES
Wasmuth, Sven PAMOCAT: Automatic retrieval of specified postures
CITEC Central Lab, University of Bielefeld, GERMANY
Watrin, Patrick Extraction of unmarked quotations in Newspapers
CENTAL - Institut Langage et Communication ― UCLouvain, BELGIUM
Wattam, Stephen Document Attrition in Web Corpora: an Exploration
Lancaster University, UNITED KINGDOM
Way, Andy SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Applied Language Solutions Ltd., UNITED KINGDOM
Webb, Nick Revealing Contentious Concepts Across Social Groups
Union College, UNITED STATES
Weber, Benoît MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
MICA Institute, VIET NAM
Webster, Philip Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
University of Sheffield, UNITED KINGDOM
Wedekind, Jürgen Creation and use of Language Resources in a Question-Answering eHealth System
University of Copenhagen, CST, DENMARK
Weichselbraun, Albert Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
University of Applied Sciences Chur; Faculty of Information Science, SWITZERLAND
Weiser, Stéphanie Extraction of unmarked quotations in Newspapers
CENTAL - Institut Langage et Communication ― UCLouvain, BELGIUM
Weißbach, Bernd Adapting and evaluating a generic term extraction tool
Dräger Safety AG & Co. KGaA, GERMANY
Weiss, Sandra Error profiling for evaluation of machine-translated text: a Polish-English case study
Linköping University, SWEDEN
Weitz, Benjamin A Graphical Citation Browser for the ACL Anthology
DFKI GmbH, GERMANY
Wei, Zhongyu Quantising Opinions for Political Tweets Analysis
Chinese University of Hong Kong, HONG KONG
Weller, Marion Analyzing and Aligning German compound nouns
Universität Stuttgart, GERMANY
Wells, Bill A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
University of Sheffield, UNITED KINGDOM
Wendemuth, Andreas Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus
Otto-von-Guericke-University Magdeburg, GERMANY
Wester, Martin Eye Tracking as a Tool for Machine Translation Error Analysis
Linköping University, SWEDEN
Whitt, Richard J. GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
University of Manchester, UNITED KINGDOM
Wiegand, Michael MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
A Gold Standard for Relation Extraction in the Food Domain
Saarland University, Spoken Language Systems, GERMANY
Wilcock, Graham Constructive Interaction for Talking about Interesting Topics
University of Helsinki, FINLAND
Wilks, Yorick LIE: Leadership, Influence and Expertise
Florida Institute of Human and Machine Cognition, UNITED STATES
Williams, Jennifer A New Twitter Verb Lexicon for Natural Language Processing
Georgetown University, UNITED STATES
Windhouwer, Menzo Standardizing a Component Metadata Infrastructure
”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project
RELcat: a Relation Registry for ISOcat data categories
MPI for Psycholinguistics, NETHERLANDS
Witkamp, Paula An Oral History Annotation Tool for INTER-VIEWs
DANS, Data Archiving & Networked Services, NETHERLANDS
Witt, Andreas The New IDS Corpus Analysis Platform: Challenges and Prospects
Institute for the German Language (IDS), GERMANY
Wittenburg, Peter Towards Automatic Gesture Stroke Detection
The Language Archive ― a new hub for language resources
Max Planck Institute for Psycholinguistics, NETHERLANDS
Woliński, Marcin PoliMorf: a (not so) new open morphological dictionary for Polish
Institute of Computer Science, Polish Academy of Sciences, POLAND
Womser-Hacker, Christa A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content
University of Hildesheim, GERMANY
Wong, Kam-Fai Quantising Opinions for Political Tweets Analysis
Chinese University of Hong Kong, HONG KONG
Wright, Jonathan Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
LDC, UNITED STATES
Wu, Jingsi Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
University at Albany, UNITED STATES

 

X
Xia, Fei Statistical Section Segmentation in Free-Text Clinical Records
Effort of Genre Variation and Prediction of System Performance
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation
Department of Linguistics, University of Washington, UNITED STATES
Xia, Yunqing Affective Common Sense Knowledge Acquisition for Sentiment Analysis
CLTC: A Chinese-English Cross-lingual Topic Corpus
Tsinghua University, CHINA
Xue, Nianwen Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Annotating dropped pronouns in Chinese newswire text
Computer Science Department, Brandeis University, UNITED STATES
Xu, Feiyu Annotating Opinions in German Political News
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World
DFKI GmbH, GERMANY
Xu, Hongzhi A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
Hong Kong Polytechnic University, HONG KONG

 

Y
Yamamoto, Seiichi Multimodal Corpus of Multi-party Conversations in Second Language
Doshisha University, JAPAN
Yamasaki, Shota Multimodal Corpus of Multi-party Conversations in Second Language
Doshisha University, JAPAN
Yang, Fei Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
Rutgers University, UNITED STATES
Yang, Peng Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
Rutgers University, UNITED STATES
Yang, Shaohua Spell Checking for Chinese
Shanghai Jiao Tong University, CHINA
Yang, Xia CLTC: A Chinese-English Cross-lingual Topic Corpus
Leshan Normal University, CHINA
Yang, Yaqin Annotating dropped pronouns in Chinese newswire text
Brandeis University, UNITED STATES
Yankama, Beracah A large scale annotated child language construction database
MIT, UNITED STATES
Yetisgen-Yildiz, Meliha Statistical Section Segmentation in Free-Text Clinical Records
Biomedical and Health Informatics, University of Washington, UNITED STATES
Yıldız, İpek Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Yılmazer, Hakan Construction of the Turkish National Corpus (TNC)
Mersin University, TURKEY
Yoshida, Nao Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Kyoto University, JAPAN
Yu, Chi-Hsin Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
National Taiwan University, TAIWAN, PROVINCE OF CHINA
Yu, Yue A Mandarin-English Code-Switching Corpus
HKUST, HONG KONG
Yvon, François Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier
LIMSI ― CNRS, FRANCE

 

Z
Zablotskaya, Kseniya Investigating Verbal Intelligence Using the TF-IDF Approach
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores
Institute of Communications Engineering, University of Ulm, GERMANY
Zablotskiy, Sergey Speech and Language Resources for LVCSR of Russian
Institute of Communications Engineering, University of Ulm, GERMANY
Žabokrtský, Zdeněk Language Richness of the Web
HamleDT: To Parse or Not to Parse?
Prague Dependency Style Treebank for Tamil
Announcing Prague Czech-English Dependency Treebank 2.0
The Joy of Parallelism with CzEng 1.0
Charles University in Prague, Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, CZECH REPUBLIC
Zahra, Amalia English to Indonesian Transliteration to Support English Pronunciation Practice
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
University College Dublin, IRELAND
Zanoli, Roberto The KnowledgeStore: an Entity-Based Storage System
Fondazione Bruno Kessler, ITALY
Zarcone, Alessandra Logical metonymies and qualia structures: an annotated database of logical metonymies for German
Universität Stuttgart, IMS, GERMANY
Zargayouna, Haïfa The Quaero Evaluation Initiative on Term Extraction
LIPN - CNRS, FRANCE
Zarrieß, Sina A Corpus-based Study of the German Recipient Passive
Universität Stuttgart, IMS, GERMANY
Zastrow, Thomas Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC
A Repository for the Sustainable Management of Research Data
University of Tübingen, GERMANY
Zelle, Uwe RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
RWTH Aachen University, GERMANY
Zeman, Daniel Automatic MT Error Analysis: Hjerson Helping Addicter
HamleDT: To Parse or Not to Parse?
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC
Zeyrek, Deniz METU Turkish Discourse Bank Browser
Middle East Technical University, TURKEY
Zhang, Yi Joint Grammar and Treebank Development for Mandarin Chinese with HPSG
CLIMB grammars: three projects using metagrammar engineering
DFKI GmbH, GERMANY
Zhang, Ziqi Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
University of Sheffield, UNITED KINGDOM
Zhao, Hai Spell Checking for Chinese
Shanghai Jiao Tong University, CHINA
Ziering, Patrick A Corpus-based Study of the German Recipient Passive
Universität Stuttgart, IMS, GERMANY
Zinn, Claus A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Universität Tübingen, GERMANY
Zinsmeister, Heike The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
Universität Stuttgart, GERMANY
Zséder, Attila Rapid creation of large-scale corpora and frequency dictionaries
Hungarian Academy of Sciences, HUNGARY
Zuczkowski, Andrzej A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
Università di Macerata, ITALY
Zufferey, Sandrine Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
Université catholique de Louvain, BELGIUM
Zumpe, Matthias A Distributed Resource Repository for Cloud-Based Machine Translation
Uppsala University, SWEDEN
Zuo, Xin A Multilingual Natural Stress Emotion Database
Hong Kong University of Science and Technology, HONG KONG
Zweigenbaum, Pierre Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
LIMSI - CNRS, FRANCE
Zydroń, Andrzej On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
XTM International, UNITED KINGDOM

Powered by ELDA © 2012 ELDA/ELRA