AUTHORS: Browse articles of the conference sorted by author

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z

A
Abe, Akinori Relationships between Nursing Converstaions and Activities
ATR Knowledge Science Labs.
Abe, Yasunori Extraction of Informative Expressions from Domain-specific Documents
Application of Resource-based Machine Translation to Real Business Scenes
Japan Airlines
Abekawa, Takeshi Constructing a Corpus that Indicates Patterns of Modification between Draft and Final Translations by Human Translators
Graduate School of Education, University of Tokyo
Aberdeen, John Applying Automated Metrics to Speech Translation Dialogs
MITRE Corporation
Aboutajdine, Driss A Multi-Word Term Extraction Program for Arabic Language
GSCM_LRIT
Abouzakhar, Nasser Unsupervised Learning-based Anomalous Arabic Text Detection
NLP Research Group, Dept of Computer Sci. The University of Sheffield
Abu Shawar, Bayan An AI-inspired intelligent agent/student architecture to combine Language Resources research and teaching
Arab Open University
Abuhakema, Ghazi Annotating an Arabic Learner Corpus for Error
Montclair State University
Adda, Gilles CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
Annotation and analysis of overlapping speech in political interviews
Developments of “Lëtzebuergesch” Resources for Automatic Speech Processing and Linguistic Studies
LIMSI-CNRS
Adda-Decker, Martine Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification
Annotation and analysis of overlapping speech in political interviews
Developments of “Lëtzebuergesch” Resources for Automatic Speech Processing and Linguistic Studies
LIMSI-CNRS
Adderley, Richard The MoveOn Motorcycle Speech Corpus
A E Solutions
Adell, Jordi Corpus and Voices for Catalan Speech Synthesis
UPC-TALP
Adler, Meni Tagging a Hebrew Corpus: the Case of Participles
Ben Gurion University of the Negev
Adolphs, Peter Some Fine Points of Hybrid Natural Language Parsing
Acquiring a Poor Man’s Inflectional Lexicon for German
DFKI GmbH
Agili, Andrea Integration of a Multilingual Keyword Extractor in a Document Management System
DrWolf
Agirre, Eneko Using the Multilingual Central Repository for Graph-Based Word Sense Disambiguation
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
WNTERM: Enriching the MCR with a Terminological Dictionary
University of the Basque Country
Aguado de Cea, Guadalupe Tagging Spanish Texts: the Problem of Problem of “SE”
Universidad Politécnica de Madrid
Ahmad, Khurshid Sentiment Analysis and the Use of Extrinsic Datasets in Evaluation
Trinity College Dublin
Ahrenberg, Lars Converting Romanized Persian to the Arabic Writing Systems
Linkoping University, Sweden
Ahrens, Kathleen Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies
National Taiwan University
Aijaz, Adil An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
Cornell University
Aikawa, Kiyoaki Test Collections for Spoken Document Retrieval from Lecture Audio Data
Tokyo University of Technology
Aikawa, Takako Post-MT Term Swapper: Supplementing a Statistical Machine Translation System with a User Dictionary
Microsoft
Akiba, Tomoyosi Test Collections for Spoken Document Retrieval from Lecture Audio Data
Toyohashi University of Technology
Alabau Gonzalvo, Vicent Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation.
Instituto Tecnológico de Informática, Universidad Politécnica de Valencia
Al-Badrashiny, Mohamed A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields
RDI
Al-Basoumy, Husein A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields
RDI
Alcázar, José Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases
Respiratory Department. Unidad de Trastornos Respiratorios del Sueño. Hospital Clínico Universitario Málaga
Aldezabal, Izaskun WNTERM: Enriching the MCR with a Terminological Dictionary
IXA NLP Research Group
Alegria, Iñaki Spelling Correction: from Two-Level Morphology to Open Source
University of the Basque Country
Alex, Beatrice Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection
University of Edinburgh
Alkhalifa, Musa Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Universitat de Barcelona
Allauzen, Alexandre Training and Evaluation of POS Taggers on the French MULTITAG Corpus
LIMSI-CNRS
Allen, James Production in a Multimodal Corpus: how Speakers Communicate Complex Actions
University of Rochester
Allison, Ben Professor or Screaming Beast? Detecting Anomalous Words in Chinese
Authorship Attribution of E-Mail: Comparing Classifiers over a New Corpus for Evaluation
Using a Probabilistic Model of Context to Detect Word Obfuscation
Unsupervised Learning-based Anomalous Arabic Text Detection
University of Sheffield
Almási, Attila Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Almberg, Jørn RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
Norwegian University of Science and Technology
Alonso Ramos, Margarita Using Semantically Annotated Corpora to Build Collocation Resources
Universidade da Coruña
Altmeyer, Randolf A Dependency Parser for Thai
University of Saarland
Alvarez Montero, Francisco Conceptual Modeling of Ontology-based Linguistic Resources with a Focus on Semantic Relations
Universidad Autonoma de Sinaloa
Alvarez, Alison Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Linguistics Department, University of Pittsburgh
Álvez, Javier Complete and Consistent Annotation of WordNet using the Top Concept Ontology
University of the Basque Country
Alzghool, Muath Combining Multiple Models for Speech Information Retrieval
University of Ottawa
Amar, Muriel Classification Procedures for Software Evaluation
Urfist-École nationale des Chartes, Paris
Ambati, Vamshi Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Language Technologies Institute, Carnegie Mellon University
Amdal, Ingunn RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
Norwegian University of Science and Technology
Amengual, J. C. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Amoia, Marilisa A Test Suite for Inference Involving Adjectives
University of Saarland
Ananiadou, Sophia Connecting Text Mining and Pathways using the PathText Resource
Clustering Related Terms with Definitions
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
National Centre for Text Mining, University of Manchester
Andersen, Øistein E. The BNC Parsed with RASP4UIMA
University of Cambridge
Anick, Peter Similar Term Discovery using Web Search
Yahoo, Inc.
Antoine, Jean-Yves Automatic Rich Annotation of Large Corpus of Conversational transcribed speech: the Chunking Task of the EPAC Project
Université François Rabelais, Tours
Aparicio, Juan AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora
CLiC-University of Barcelona
Apidianaki, Marianna Translation-oriented Word Sense Induction Based on Parallel Corpora
University Paris 7
Araki, Kenji A Multi-Lingual Dictionary of Dirty Words
What is poorly Said is a Little Funny
Hokkaido University
Aranovich, Roberto Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Language Technologies Institute, Carnegie Mellon University
Arehart, Mark A Ground Truth Dataset for Matching Culturally Diverse Romanized Person Names
Adjudicator Agreement and System Rankings for Person Name Search
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Areta, Nerea Analysis and Performance of Morphological Query Expansion and Language-Filtering Words on Basque Web Searching
Elhuyar Fundazioa, R&D
Arhar, Špela Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema
Amebis d. o. o.
Arimoto, Yoshiko Automatic Emotional Degree Labeling for Speakers’ Anger Utterance during Natural Japanese Dialog
Graduate School of Bionics, Computer and Media Sciences, Tokyo University of Technology
Arnaudov, Todor Smarty - Extendable Framework for Bilingual and Multilingual Comprehension Assistants
University of Plovdiv
Arranz, Victoria A Guide for the Production of Reusable Language Resources
Latest Developments in ELRA’s Services
ELDA
Artstein, Ron Anaphoric Annotation in the ARRAU Corpus
Institute for Creative Technologies, USC
Athitsos, Vassilis Benchmark Databases for Video-Based Automatic Sign Language Recognition
University of Texas at Arlington
Atserias, Jordi Complete and Consistent Annotation of WordNet using the Top Concept Ontology
Semantically Annotated Snapshot of the English Wikipedia
Yahoo! Research Barcelona
Attardi, Giuseppe Comparing Italian parsers on a common Treebank: the EVALITA experience
Semantically Annotated Snapshot of the English Wikipedia
Università di Pisa
Atterer, Michaela An Inverted Index for Storing and Retrieving Grammatical Dependencies
A Question Answering System for German. Experiments with Morphological Linguistic Resources
Institute for Linguistics, University of Potsdam
Attia, Mohamed A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields
MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic
RDI
Atwell, Eric ProPOSEL: A Prosody and POS English Lexicon for Language Engineering
An AI-inspired intelligent agent/student architecture to combine Language Resources research and teaching
University of Leeds
Aubergé, Véronique Multimodal Spontaneous Expressive Speech Corpus for Hungarian
ICP/Gipsa lab, Grenoble
Audibert, Nicolas Multimodal Spontaneous Expressive Speech Corpus for Hungarian
ICP/Gipsa lab, Grenoble
Ayache, Christelle Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
EASY, Evaluation of Parsers of French: what are the Results?
PASSAGE: from French Parser Evaluation to Large Sized Treebank
ELDA
Azeredo, Susana Keywords, k-NN and Neural Networks: a Support for Hierarchical Categorization of Texts in Brazilian Portuguese
PUCRS

 

B
Bański, Piotr Enhancing an English-Polish Electronic Dictionary for Multiword Expression Research
University of Warsaw
Babko-Malaya, Olga A Pilot Arabic Propbank
Annotation of Nuggets and Relevance in GALE Distillation Evaluation
BAE Systems
Babych, Bogdan Generalising Lexical Translation Strategies for MT Using Comparable Corpora
Sensitivity of Automated MT Evaluation Metrics on Higher Quality MT Output: BLEU vs Task-Based Evaluation Methods
University of Leeds
Bachan, Jolanta An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation
Uniwersytet im. Adama Mickiewicza, Poznań
Badia, Toni Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
Rapid Deployment of a New METIS Language Pair: Catalan-English
User-Centred Design of Error Correction Tools
GliCom, Fundació Barcelona Media, UPF
Bai, Lakshmi Developing Verb Frames for Hindi
Language Technologies Research Centre, IIIT, Hyderabad
Baker, Collin MASC: the Manually Annotated Sub-Corpus of American English
ICSI
Baker, Kirk Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data
OSU
Balahur-Dobrescu, Alexandra Named Entity Relation Mining using Wikipedia
Al.I.Cuza University of Iasi
Baldwin, Timothy Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German
University of Melbourne
Bali, Kalika A Common Parts-of-Speech Tagset Framework for Indian Languages
Microsoft Research Lab India
Ball, Catherine An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Ball, Julian Named Entity Recognition for Digitised Historical Texts
University of Southampton
Bamman, David The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin
Tufts University, Boston
Banea, Carmen A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources
University of North Texas
Banik, Eva A Study of Parentheticals in Discourse Corpora - Implications for NLG Systems
The Open University
Barbot, Nelly Comparing Set-Covering Strategies for Optimal Corpus Design
IRISA / Universite Rennes 1 - Enssat
Barbu Mititelu, Verginica Annotation of WordNet Verbs with TimeML Event Classes
Romanian Academy
Barbu, Ana-Maria Romanian Lexical Data Bases: Inflected and Syllabic Forms Dictionaries
Institute of Linguistics, Bucharest
Barfüßer, Sabine ALC: Alcohol Language Corpus
BAS Bavarian Archive for Speech Signals
Baroni, Marco Cleaneval: a Competition for Cleaning Web Pages
Trento University
Baroni, Paola Semantic Press
ILC-CNR, Pisa
Barrachina, S. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Barras, Claude Annotation and analysis of overlapping speech in political interviews
LIMSI-CNRS
Barreaud, Vincent WEB-Based Listening Test System for Speech Synthesis and Speech Conversion Evaluation
IRISA / Universite Rennes 1
Bartalesi Lenzi, Valentina Evaluation of Natural Language Tools for Italian: EVALITA 2007
CELCT
Bartolini, Roberto Ontology Learning and Semantic Annotation: a Necessary Symbiosis
A Bilingual Corpus of Inter-linked Events
UFRA: a UIMA-based Approach to Federated Language Resource Architecture
ILC-CNR, Pisa
Basili, Roberto Towards a Vector Space Model for FrameNet-like Resources
University of Roma Tor Vergata
Baudrion, Philippe Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis
ISSCO, University of Geneva
Bazillon, Thierry Manual vs Assisted Transcription of Prepared and Spontaneous Speech
Université du Maine
Beňa, Peter CzEng 0.7: Parallel Corpus with Community-Supplied Translations
Charles University, Prague
Beaugendre, Frédéric A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments
Voice Insight, Brussels
Bechara, Hanan The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Bechet, Frederic Local Methods for On-Demand Out-of-Vocabulary Word Retrieval
Semantic Frame Annotation on the French MEDIA corpus
University of Avignon
Begum, Rafiya Developing Verb Frames for Hindi
Language Technologies Research Centre, IIIT, Hyderabad
Beisswanger, Elena Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab
Jena University, JULIE Lab
Bejan, Cosmin A Linguistic Resource for Discovering Event Structures and Resolving Event Coreference
The University of Texas at Dallas
Bel, Núria Automatic Acquisition for low frequency lexical items
COLDIC, a Lexicographic Platform for LMF compliant lexica
Pompeu Fabra University
Bellot, Patrice Evaluating Robustness Of A QA System Through A Corpus Of Real-Life Questions
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations
LIA - University of Avignon
Benavent, Francesc User-Centred Design of Error Correction Tools
Pompeu Fabra University
Bendahman, Chomicha Quick Rich Transcriptions of Arabic Broadcast News Speech Data
ELDA
Bensley, Jeremy Unsupervised Resource Creation for Textual Inference Applications
Language Computer Corporation
Benyon, David Dialogue, Speech and Images: the Companions Project Data Set
Napier University
Berck, Peter Exploring and Enriching a Language Resource Archive via the Web
Max Planck Institute for Psycholinguistics
Berend, Nina German Today: a really extensive Corpus of Spoken Standard German
Institut für Deutsche Sprache, Mannheim
Bergler, Sabine Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles
Concordia University
Bertagna, Francesca Evaluation of Natural Language Tools for Italian: EVALITA 2007
Semantic Press
ILC-CNR, Pisa
Bertran, Manuel Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Universitat Politécnica de Catalunya
Bertrand, Roxane Creating and Exploiting Multimodal Annotated Corpora
CNRS & Aix-Marseille Universités
Besacier, Laurent First Broadcast News Transcription System for Khmer Language
Laboratoire d’Informatique de Grenoble (LIG)
Besançon, Romaric The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign
CEA LIST
Bestgen, Yves Building Affective Lexicons from Specific Corpora for Automatic Sentiment Analysis
FNRS - Universilté catholique de Louvain
Bethard, Steven Building a Corpus of Temporal-Causal Structure
University of Colorado
Bhattacharya, Tanmoy A Common Parts-of-Speech Tagset Framework for Indian Languages
Delhi University
Bhattacharyya, Pushpak A Common Parts-of-Speech Tagset Framework for Indian Languages
Lexical Resources for Semantics Extraction
IIT-Bombay
Biber, Hanno Words in Contexts: Digital Editions of Literary Journals in the “AAC - Austrian Academy Corpus”
Austrian Academy of Sciences
Bieler, Heike Measures for Term and Sentence Relevances: an Evaluation for German
University of Potsdam
Bielický, Viktor Building the Valency Lexicon of Arabic Verbs
Charles University, Prague
Biemann, Christian Unsupervised Parts-of-Speech Induction for Bengali
ASV Toolbox: a Modular Collection of Language Exploration Tools
University of Leipzig
Bies, Ann Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation
Enhancing the Arabic Treebank: a Collaborative Effort toward New Annotation Guidelines
A Pilot Arabic Propbank
Linguistic Data Consortium
Bigi, Brigitte First Broadcast News Transcription System for Khmer Language
Laboratoire d’Informatique de Grenoble (LIG)
Bilinski, Eric An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System
Developments of “Lëtzebuergesch” Resources for Automatic Speech Processing and Linguistic Studies
LIMSI-CNRS
Bindi, Remo Encoding Terms from a Scientific Domain in a Terminological Database: Methodology and Criteria
ILC-CNR, Pisa
Bird, Steven The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
University of Melbourne
Bittencourt, Evandro Evaluating Summaries Automatically - A system Proposal
Department of Foreign Trade, University of Joinville
Blache, Philippe Evaluating Robustness Of A QA System Through A Corpus Of Real-Life Questions
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations
Creating and Exploiting Multimodal Annotated Corpora
LPL - University of Provence
Black, Alan NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls
Language Technologies Institute, Carnegie Mellon University
Blanchon, Hervé SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora
GETALP, LIG, UPMF
Blanco, Eduardo Causal Relation Extraction
HLTRI at UTD
Blin, Laurent WEB-Based Listening Test System for Speech Synthesis and Speech Conversion Evaluation
IRISA / Universite Rennes 1
Bobicev, Victoria Estimating Word Phonosemantics
Technical University of Moldova
Bobrow, Daniel The Encoding of lexical implications in VerbNet Predicates of change of locations
PARC
Boeffard, Olivier WEB-Based Listening Test System for Speech Synthesis and Speech Conversion Evaluation
Automatic Phone Segmentation of Expressive Speech
Comparing Set-Covering Strategies for Optimal Corpus Design
IRISA / Universite Rennes 1
Boguraev, Branimir Navigating through Dense Annotation Spaces
A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
IBM Research
Boitet, Christian SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora
GETALP, LIG, UJF
Bojar, Ondřej CzEng 0.7: Parallel Corpus with Community-Supplied Translations
Charles University, Prague
Boleda, Gemma Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
GliCom, Fundació Barcelona Media, UPF
Bonafonte, Antonio Corpus and Voices for Catalan Speech Synthesis
UPC-TALP
Bond, Francis Boot-Strapping a WordNet Using Multiple Existing WordNets
Extraction of Attribute Concepts from Japanese Adjectives
Development of the Japanese WordNet
National Institute of Information and Communications Technology
Bonkowski, Christian The MoveOn Motorcycle Speech Corpus
Fraunhofer IAIS
Bonneau-Maynard, Hélène Training and Evaluation of POS Taggers on the French MULTITAG Corpus
LIMSI-CNRS
Bontcheva, Kalina A Text-based Query Interface to OWL Ontologies
University of Sheffield
Boonkwan, Prachya OpenCCG Workbench and Visualization Tool
National Electronics and Computer Technology Center
Boras, Damir Generating a Morphological Lexicon of Organization Entity Names
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences
Bordag, Stefan UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging
University of Leipzig
Boriboon, Monthika OpenCCG Workbench and Visualization Tool
National Electronics and Computer Technology Center
Borrajo, Daniel Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence
Departamento de Informática, Universidad Carlos III de Madrid, Leganés, Spain
Bos, Johan Let’s not Argue about Semantics
University of Rome La Sapienza
Bosch, Sonja Experimental Fast-Tracking of Morphological Analysers for Nguni Languages
University of South Africa
Bosco, Cristina Comparing Italian parsers on a common Treebank: the EVALITA experience
Automatic extraction of subcategorization frames for Italian
Evaluation of Natural Language Tools for Italian: EVALITA 2007
Università di Torino
Bouhjar, Aicha Amazigh Language Terminology in Morocco or Management of a “Multidimensional” Variation
IRCAM
Bouillon, Pierrette Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Building Mobile Spoken Dialogue Applications Using Regulus
Université de Genève/ETI/TIM
Boula de Mareuil, Philippe Annotation and analysis of overlapping speech in political interviews
LIMSI-CNRS
Boulaknadel, Siham A Multi-Word Term Extraction Program for Arabic Language
LINA - Université de Nantes & GSCM_LRIT
Boullosa, Jose R. User-Centred Design of Error Correction Tools
Pompeu Fabra University
Bouma, Gosse A Coreference Corpus and Resolution System for Dutch
University of Groningen
Bourdaillet, Julien Representation of Atypical Entities in Ontologies
LIP6 - Université Pierre et Marie Curie
Boxwell, Stephen Projecting Propbank Roles onto the CCGbank
The Ohio State University
Braasch, Anna Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet
University of Copenhagen, Centre for Language Technology (CST)
Braffort, Annelies Sign Language Corpus Annotation: toward a new Methodology
LIMSI-CNRS
Branco, António LX-Service: Web Services of Language Technology for Portuguese
Anaphora Resolution Exercise: an Overview
University of Lisbon
Brandschain, Linda New Resources for Document Classification, Analysis and Translation Technologies
Speaker Recognition: Building the Mixer 4 and 5 Corpora
Linguistic Data Consortium
Braschler, Martin From Research to Application in Multilingual Information Access: the Contribution of Evaluation
Zurich University of Applied Sciences
Braslavski, Pavel Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
Institute of Engineering Science, RAS
Breiteneder, Evelyn Words in Contexts: Digital Editions of Literary Journals in the “AAC - Austrian Academy Corpus”
Austrian Academy of Sciences
Brew, Chris Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data
OSU
Brewster, Christopher A Comparative Evaluation of Term Recognition Algorithms
Dialogue, Speech and Images: the Companions Project Data Set
University of Sheffield
Brierley, Claire ProPOSEL: A Prosody and POS English Lexicon for Language Engineering
University of Leeds
Brinckmann, Caren memasysco: XML schema based metadata management system for speech corpora
German Today: a really extensive Corpus of Spoken Standard German
Institut für Deutsche Sprache, Mannheim
Briscoe, Ted The BNC Parsed with RASP4UIMA
University of Cambridge
Bristot, Antonella Enriching the Venice Italian Treebank with Dependency and Grammatical Relations
Università di Venezia
Broda, Bartosz Corpus-based Semantic Relatedness for the Construction of Polish WordNet
Institute of Applied Informatics, Wrocław University of Technology
Broeder, Daan Foundation of a Component-based Flexible Registry for Language Resources and Technology
Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN.
A Grid of Regional Language Archives
Max Planck Institute for Psycholinguistics
Bronsart, Sebastien Translation Adequacy and Preference Evaluation Tool (TAP-ET)
Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA’s TRANSTAC Program
National Institute of Standards and Technology
Brugman, Hennie A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections
Max Planck Institute for Psycholinguistics
Brutti, Alessio WOZ Acoustic Data Collection for Interactive TV
Fondazione Bruno Kessler - FBK, Trento
Buczyński, Aleksander ♠ Demo: An Open Source Tool for Partial Parsing and Morphosyntactic Disambiguation
Institute of Computer Science, Polish Academy of Sciences
Buitelaar, Paul Ontology Search with the OntoSelect Ontology Library
Domain-Specific English-To-Spanish Translation of FrameNet
DFKI GmbH
Bungeroth, Jan The ATIS Sign Language Corpus
RWTH Aachen University
Bunt, Harry LIRICS Semantic Role Annotation: Design and Evaluation of a Set of Data Categories
Evaluating Dialogue Act Tagging with Naive and Expert Annotators
Towards Formal Interpretation of Semantic Annotation
Tilburg University
Burchardt, Aljoscha FATE: a FrameNet-Annotated Corpus for Textual Entailment
University of Saarland
Burger, Susanne Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora
CMU
Busa, Roberto The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin
Università Cattolica del Sacro Cuore (Milan - Italy)
Buscaldi, Davide Geo-WordNet: Automatic Georeferencing of WordNet
NLE Lab, Universidad Politécnica de Valencia
Busemann, Stephan Identifying Foreign Person Names in Chinese Text
DFKI GmbH
Butters, Jonathan Using Similarity Metrics For Terminology Recognition
University of Sheffield
Buyko, Ekaterina Ontology-Based Interface Specifications for a NLP Pipeline Architecture
Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab
Jena University Language & Information Engineering (JULIE) Lab. Jena
By, Tomas The Kalshnikov 691 Dependency Bank
Centro de Linguística da Universidade Nova de Lisboa
Byron, Donna K. SCARE: a Situated Corpus with Annotated Referring Expressions
The Ohio State University

 

C
Cabré, Teresa A Suite to Compile and Analyze an LSP Corpus
Pompeu Fabra University
Cabrio, Elena The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Fondazione Bruno Kessler - FBK, Trento
Cahill, Lynne Using Similarity Measures to Extend the LinGO Lexicon
University of Brighton
Cailliau, Frederik CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
SINEQUA
Calderone, Basilio Learning properties of Noun Phrases: from data to functions
Scuola Normale Superiore, Pisa
Callmeier, Ulrich Some Fine Points of Hybrid Natural Language Parsing
Acrolinx GmbH
Calzolari, Nicoletta Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy
Foundation of a Component-based Flexible Registry for Language Resources and Technology
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Adapting International Standard for Asian Language Technologies
A lexicon for biology and bioinformatics: the BOOTStrep experience.
Evaluation of Natural Language Tools for Italian: EVALITA 2007
UFRA: a UIMA-based Approach to Federated Language Resource Architecture
ILC-CNR, Pisa
Campbell, Joseph Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
MIT-LL
Campbell, Nick Tools & Resources for Visualising Conversational-Speech Interaction
National Institute of Information and Communications Technology & ATR
Campr, Pavel Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen
Capman, François A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments
Thales Communications, Signal Processing and Multimedia Dept, Colombes
Cappelli, Amedeo Evaluation of Natural Language Tools for Italian: EVALITA 2007
CELCT
Carbonell, Jaime Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Language Technologies Institute, Carnegie Mellon University
Cardie, Claire An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
Annotating Topics of Opinions
Cornell University
Carl, Michael Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
Using Log-linear Models for Tuning Machine Translation Output
IAI Saarbrücken
Carmen, Marc Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
Carpuat, Marine Evaluation of Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation
Hong Kong University of Science and Technology
Carrera, Jordi Complete and Consistent Annotation of WordNet using the Top Concept Ontology
Towards Spanish Verbs’ Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus
UOC
Carroll, James Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
Carroll, John The BNC Parsed with RASP4UIMA
University of Sussex
Cartoni, Bruno Lexical Resources for Automatic Translation of Constructed Neologisms: the Case Study of Relational Adjectives
ISSCO/TIM/ETI - University of Geneva
Caseiro, Diamantino Building a Golden Collection of Parallel Multi-Language Word Alignment
L2F INESC-ID/IST, Lisboa
Caselli, Tommaso A Bilingual Corpus of Inter-linked Events
UFRA: a UIMA-based Approach to Federated Language Resource Architecture
ILC-CNR, Pisa
Castell, Nuria Causal Relation Extraction
UPC-TALP
Castellanos, A. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Castelli, Eric First Broadcast News Transcription System for Khmer Language
MICA
Castellón, Irene Towards Spanish Verbs’ Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus
UB
Castro, M. J. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universidad Politécnica de Valencia
Catizone, Roberta Information Extraction Tools and Methods for Understanding Dialogue in a Companion
University of Sheffield
Ceauşu, Alexandru DIAC+: a Professional Diacritics Recovering System
Unsupervised Lexical Acquisition for Part of Speech Tagging
RACAI’s Linguistic Web Services
RACAI, Romanian Academy, Bucharest
Ceberio, Klara Spelling Correction: from Two-Level Morphology to Open Source
University of the Basque Country
Češka, Pavel CzEng 0.7: Parallel Corpus with Community-Supplied Translations
Charles University, Prague
Chantree, Francis Cleaneval: a Competition for Cleaning Web Pages
Lexical Computing Ltd
Charoenporn, Thatsanee Adapting International Standard for Asian Language Technologies
TCL/NICT
Charonnat, Laure Automatic Phone Segmentation of Expressive Speech
IRISA / Universite Rennes 1 - Enssat
Chatzichrisafis, Nikos A Knowledge-Modeling Approach for Multilingual Regulus Lexica
Geneva University
Chauché, Jacques Building a Bilingual Representation of the Roget Thesaurus for French to English Machine Translation
University of Montpellier 2 and LIRMM-CNRS
Chaudiron, Stéphane The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign
Lille 3 - GERIICO
Chen, Chaomei Identifying Strategic Information from Scientific Articles through Sentence Classification
University of Drexel
Chen, Hsin-Hsi Event Detection and Summarization in Weblogs with Temporal Collocations
Department of Computer Science and Information Engineering, National Taiwan University
Chen, Kai-Yun Annotating “tense” in a Tense-less Language
University of Colorado
Chen, Yirong Corpus Exploitation from Wikipedia for Ontology Construction
Chinese Core Ontology Construction from a Bilingual Term Bank
Computing Department, The Hong Kong Polytechnic University
Chen, Yu Improving Statistical Machine Translation Efficiency by Triangulation
University of Saarland
Cheng, Xiwen Fine-grained Opinion Topic and Polarity Identification
DFKI GmbH
Chételat-Pelé, Emilie Sign Language Corpus Annotation: toward a new Methodology
LIMSI-CNRS
Chevelu, Jonathan Comparing Set-Covering Strategies for Optimal Corpus Design
IRISA / Universite Rennes 1 - Enssat
Chiarcos, Christian Ontology-Based XQuery’ing of XML-Encoded Language Resources on Multiple Annotation Layers
Ontology-Based Interface Specifications for a NLP Pipeline Architecture
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Potsdam
Chiorean, Oana Andreea Evaluation of a Cross-lingual Romanian-English Multi-document Summariser
University of Wolverhampton
Chou, Ya-Min The Extended Architecture of Hantology for Japan Kanji
Ming Chuan University
Choudhury, Monojit Unsupervised Parts-of-Speech Induction for Bengali
A Common Parts-of-Speech Tagset Framework for Indian Languages
Microsoft Research Lab India
Choukri, Khalid Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
A Guide for the Production of Reusable Language Resources
Latest Developments in ELRA’s Services
The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign
MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic
ELDA
Chrupala, Grzegorz Learning Morphology with Morfette
National Center for Language Technology, Dublin City University
Chung, Siaw-Fong Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies
National Taiwan University
Chute, Christopher Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition
Mayo Clinic College of Medicine
Ciaramita, Massimiliano Semantically Annotated Snapshot of the English Wikipedia
Supersense Tagger for Italian
Yahoo! Research Barcelona
Cidral, Alexandre Evaluating Summaries Automatically - A system Proposal
Department of Informatics, University of Joinville
Cieri, Christopher The Linguistic Data Consortium Member Survey: Purpose, Execution and Results
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
15 Years of Language Resource Creation and Sharing: a Progress Report on LDC Activities
Speaker Recognition: Building the Mixer 4 and 5 Corpora
Linguistic Data Consortium
Ciravegna, Fabio Saxon: an Extensible Multimedia Annotator
A Comparative Evaluation of Term Recognition Algorithms
Using Similarity Metrics For Terminology Recognition
University of Sheffield
Civera, Jorge Bilingual Text Classification using the IBM 1 Translation Model
Departamento de Sistemas Informáticos y Computación - Universidad Politécnica de Valencia
Clark, Jonathan Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation
Carnegie Mellon University
Claveau, Vincent Automatic Translation of Biomedical Terms by Supervised Machine Learning
IRISA / Universite Rennes 1
Cleuren, Leen Children’s Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement
Katholieke Universiteit Leuven
Climent, Salvador Complete and Consistent Annotation of WordNet using the Top Concept Ontology
Towards Spanish Verbs’ Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus
UOC
Coheur, Luisa Building a Golden Collection of Parallel Multi-Language Word Alignment
L2F INESC-ID/IST, Lisboa
Colas, Jose STC-TIMIT: Generation of a Single-channel Telephone Corpus
HCTLab, Universidad Autonoma de Madrid
Coll-Florit, Marta Towards Spanish Verbs’ Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus
UOC
Comas, Pere Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
UPC-TALP
Concejero, Pedro Methodology for Evaluating the Usability of User Interfaces in Mobile Services
Telefónica Investigación y Desarrollo
Condon, Sherri Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA’s TRANSTAC Program
Applying Automated Metrics to Speech Translation Dialogs
Performance Evaluation of Speech Translation Systems
MITRE Corporation
Condoravdi, Cleo The Encoding of lexical implications in VerbNet Predicates of change of locations
PARC
Copestake, Ann Language Resources and Chemical Informatics
Computer Laboratory, University of Cambridge
Coppens, Frederik A Coreference Corpus and Resolution System for Dutch
A Coreference Corpus and Resolution System for Dutch
A Coreference Corpus and Resolution System for Dutch
Language and Computing NV
Corazza, Anna Comparing Italian parsers on a common Treebank: the EVALITA experience
Università Federico II di Napoli
Corbett, Greville Lexicon Schemas and Related Data Models: when Standards Meet Users
University of Surrey
Corbett, Peter Language Resources and Chemical Informatics
Unilever Centre for Molecular Informatics, University of Cambridge
Corpas, Gloria Mutual Bilingual Terminology Extraction
Universidad de Malaga
Corvey, William Building a Corpus of Temporal-Causal Structure
University of Colorado
Costa, Francisco LX-Service: Web Services of Language Technology for Portuguese
University of Lisbon
Cotter, Philip Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
National Centre for Text Mining, University of Manchester
Councill, Isaac ParsCit: an Open-source CRF Reference String Parsing Package
Pennsylvania State University
Cowan, Rosa Holy Moses! Leveraging Existing Tools and Resources for Entity Translation
CACI International Inc.
Cramer, Irene Exploring and Navigating: Tools for GermaNet
University of Dortmund
Crane, Gregory The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin
Tufts University, Boston
Crespo Miguel, Mario Domain-Specific English-To-Spanish Translation of FrameNet
University of Cadiz
Cristea, Dan How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach
Anaphora Resolution Exercise: an Overview
Al.I.Cuza University of Iasi
Cristoforetti, Luca WOZ Acoustic Data Collection for Interactive TV
Fondazione Bruno Kessler - FBK, Trento
Crouch, Keith A Corpus for Cross-Document Co-reference
MITRE Corporation
Crysmann, Berthold Some Fine Points of Hybrid Natural Language Parsing
DFKI GmbH
Csirik, János Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Cucchiarini, Catia Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus.
The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond
Radboud University Nijmegen
Cucurullo, Sebastiana Semantic Press
ILC-CNR, Pisa
Cui, Gaoying Corpus Exploitation from Wikipedia for Ontology Construction
Chinese Core Ontology Construction from a Bilingual Term Bank
Computing Department, The Hong Kong Polytechnic University
Cunningham, Hamish A Framework for Identity Resolution and Merging for Multi-source Information Extraction
University of Sheffield
Cylwik, Natalia JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Institute of Linguistics, Adam Mickiewicz University, Poznań

 

D
D’Halleweyn, Elisabeth The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond
Nederlandse Taalunie
D’hoore, Bart The AUTONOMATA Spoken Names Corpus
The AUTONOMATA Spoken Names Corpus
Nuance, Gent
Daelemans, Walter A Coreference Corpus and Resolution System for Dutch
Personae: a Corpus for Author and Personality Prediction from Text
University of Antwerp
Dahlqvist, Bengt Swedish-Turkish Parallel Treebank
Department of Linguistics and Philology, Uppsala University
Daille, Beatrice A Multi-Word Term Extraction Program for Arabic Language
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian
LINA - Université de Nantes
Dale, Robert Controlling Redundancy in Referring Expressions
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
Macquarie University
Dalianis, Hercules Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retrieval Setting
DSV/KTH-Stockholm University
Damljanovic, Danica A Text-based Query Interface to OWL Ontologies
University of Sheffield
Darwish, Kareem Automatic Extraction of Textual Elements from News Web Pages
Cairo University
David, Sophie Classification Procedures for Software Evaluation
MoDyCo, CNRS & Université Paris 10
Davis, Brian Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication
Linguistically Light Lexical Extensions for Ontologies
DERI/NUIG
Day, David A Corpus for Cross-Document Co-reference
MITRE Corporation
De Cao, Diego Towards a Vector Space Model for FrameNet-like Resources
University of Roma Tor Vergata
De Deyne, Simon The Construction and Evaluation of Word Space Models
University of Leuven
De Jong, Franciska Evaluation of Spoken Document Retrieval for Historic Speech Collections
University of Twente
De la Torre, Raúl Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories
Universidad Autonoma Madrid
De Luca, Ernesto William Integrating Metaphor Information into RDF/OWL EuroWordNet
A Comparative Study on Language Identification Methods
University of Magdeburg
De Matos, David Martins Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database
L2F INESC-ID/IST, Lisboa
De Melo, Gerard Mapping Roget’s Thesaurus and WordNet to French
Max Planck Institute for Informatics
De Mori, Renato Semantic Frame Annotation on the French MEDIA corpus
LIA - University of Avignon
De Oliveira, Paulo C F Evaluating Summaries Automatically - A system Proposal
Department of Informatics, University of Joinville
De Vriend, Folkert Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization
Radboud University Nijmegen
Declerck, Thierry Foundation of a Component-based Flexible Registry for Language Resources and Technology
A Framework for Standardized Syntactic Annotation
DFKI GmbH
Degórski, Łukasz Definition Extraction Using a Sequential Combination of Baseline Grammars and Machine Learning Classifiers
Institute of Computer Science, Polish Academy of Sciences
Dekel, Nurit LILA: Cellular Telephone Speech Databases from Asia
NSC
Dekens, Tomas A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments
Vrije Universiteit Brussel, dept. ETRO-DSSP, Brussels
Deksne, Daiga Dictionary of Multiword Expressions for Translation into highly Inflected Languages
Tilde
Del Gratta, Riccardo A lexicon for biology and bioinformatics: the BOOTStrep experience.
Simple-Clips ongoing research: more information with less data by implementing inheritance
UFRA: a UIMA-based Approach to Federated Language Resource Architecture
ILC-CNR, Pisa
Delaere, Isabelle Learning-based Detection of Scientific Terms in Patient Information
LT3, University College Ghent
Deléglise, Paul Combined Systems for Automatic Phonetic Transcription of Proper Nouns
Université du Maine
Delhay, Arnaud Comparing Set-Covering Strategies for Optimal Corpus Design
IRISA / Universite Rennes 1 - Enssat
Dellert, Johannes Ontology-Based XQuery’ing of XML-Encoded Language Resources on Multiple Annotation Layers
Developing a TT-MCTAG for German with an RCG-based Parser
University of Tübingen
Delmonte, Rodolfo Enriching the Venice Italian Treebank with Dependency and Grammatical Relations
Università di Venezia
Delpech, Estelle Investigating the Structure of Procedural Texts for Answering How-to Questions
IRIT-CNRS
Demenko, Grazyna JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Institute of Linguistics, Adam Mickiewicz University, Poznań
Demetriou, George ANNALIST - ANNotation ALIgnment and Scoring Tool
Department of Computer Science, University of Sheffield
Den, Yasuharu A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation
Word-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and its Application
Chiba University
Denda, Yuki Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Ritsumeikan University
Deoskar, Tejaswini Induction of Treebank-Aligned Lexical Resources
Cornell University
Derouin, Marie-Jeanne Presentation of the New ISO-Standard for the Representation of Entries in Dictionaries: ISO 1951
Langenscheidt KG/Langenscheidt Fachverlag
Derwojedowa, Magdalena Corpus-based Semantic Relatedness for the Construction of Polish WordNet
Institute of the Polish Language, Warsaw University
Désilets, Alain Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors
National Research Council of Canada
Devillers, Laurence Coding Emotional Events in Audiovisual Corpora
LIMSI-CNRS
Devitt, Ann Sentiment Analysis and the Use of Extrinsic Datasets in Evaluation
Trinity College Dublin
Di Eugenio, Barbara I saw TREE trees in the park: How to Correct Real-Word Spelling Mistakes
From Extracting to Abstracting: Generating Quasi-abstractive Summaries
University of Illinois at Chicago
Di Felippo, Ariani The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database
CELiC/NILC-UNESP
Di Nunzio, Giorgio An Evaluation Resource for Geographic Information Retrieval
From Research to Application in Multilingual Information Access: the Contribution of Evaluation
University of Padua
Diab, Mona A Pilot Arabic Propbank
Columbia University
Dias-da-Silva, Bento Carlos The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database
CELiC/NILC-UNESP
Dickgießer, Sylvia memasysco: XML schema based metadata management system for speech corpora
Institut für Deutsche Sprache, Mannheim
Dickinson, Markus Detecting Errors in Semantic Annotation
A Simple Method for Tagset Comparision
Indiana University
Dinesh, Nikhil The Penn Discourse TreeBank 2.0.
University of Pennsylvania
Dingli, Alexiei Information Extraction Tools and Methods for Understanding Dialogue in a Companion
University of Sheffield
Đinh, Quang Thắng Word Segmentation of Vietnamese Texts: a Comparison of Approaches
Vietnam National University of Hanoi
Dinu, Anca On Classifying Coherent/Incoherent Romanian Short Texts
Authorship Identification of Romanian Texts with Controversial Paternity
University of Bucharest, Faculty of Foreign Languages and Literature
Dinu, Georgiana Learning Morphology with Morfette
Universitaet des Saarlandes
Dinu, Liviu Authorship Identification of Romanian Texts with Controversial Paternity
University of Bucharest
DiPersio, Denise The Linguistic Data Consortium Member Survey: Purpose, Execution and Results
Linguistic Data Consortium
Dipper, Stefanie Measures for Term and Sentence Relevances: an Evaluation for German
Annotation of Information Structure: an Evaluation across different Types of Texts
University of Bochum
Dirix, Peter Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
Centre for Computational Linguistics - KULeuven
Dividino, Renata Semiotic-based Ontology Evaluation Tool (S-OntoEval)
Fraunhofer Institute for Computer Graphics (IGD)
Divjak, Dagmar Designing and Evaluating a Russian Tagset
University of Sheffield
Domingo, Judith User-Centred Design of Error Correction Tools
Pompeu Fabra University
Doran, Christy Applying Automated Metrics to Speech Translation Dialogs
MITRE Corporation
Dorr, Bonnie The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
University of Maryland
Dovedan, Zdravko Rule-Based Chunker for Croatian
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences
Draxler, Christoph F0 of Adolescent Speakers - First Results for the German Ph@ttSessionz Database
BAS Bavarian Archive for Speech Signals
Dreuw, Philippe Benchmark Databases for Video-Based Automatic Sign Language Recognition
The ATIS Sign Language Corpus
RWTH Aachen University
Dridan, Rebecca Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German
University of Saarland
Driesen, Joris Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus.
Katholieke Universiteit Leuven
Drissi, Youssef A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
IBM Research
Duchateau, Jacques Children’s Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement
Katholieke Universiteit Leuven
Dukers, Alex Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems
Max Planck Institute for Psycholinguistics
Duncan, Susan An Exchange Format for Multimodal Annotations
University of Chicago
Durgar El-Kahlout, Ilknur BLEU+: a Tool for Fine-Grained BLEU Computation
Sabanci University
Duvert, Frédéric Semantic Frame Annotation on the French MEDIA corpus
LIA - University of Avignon
Dzikovska, Myroslava O. Evaluating Complement-Modifier Distinctions in a Semantically Annotated Corpus
University of Edinburgh

 

E
Eck, Matthias Communicating Unknown Words in Machine Translation
Carnegie Mellon University
Eckart, Kerstin A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations.
IMS, University of Stuttgart
Eckart, Richard Ontology-Based XQuery’ing of XML-Encoded Language Resources on Multiple Annotation Layers
TU Darmstadt
Ehmer, Oliver An Exchange Format for Multimodal Annotations
University of Freiburg
Eichler, Kathrin Unsupervised Relation Extraction From Web Documents
DFKI GmbH
Eidelman, Vladimir BART: A modular toolkit for coreference resolution
Columbia University
Eigner, Thomas Ontology Search with the OntoSelect Ontology Library
DFKI GmbH
Eisele, Andreas Improving Statistical Machine Translation Efficiency by Triangulation
University of Saarland
Eishold, Florian The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Elenius, Kjell Language Resources and Tools for Swedish: A Survey
Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH
Elhadad, Michael Tagging a Hebrew Corpus: the Case of Participles
Ben Gurion University of the Negev
Ellbogen, Tania F0 of Adolescent Speakers - First Results for the German Ph@ttSessionz Database
IPS Institute of Phonetics and Speech Processing
Ellison, Noel Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
Embarek, Mehdi Learning Patterns for Building Resources about Semantic Relations in the Medical Domain
CEA LIST
Emms, Martin Tree Distance and Some Other Variants of Evalb
Department of Computer Science, Trinity College, Dublin
Enguehard, Chantal Evaluation of Virtual Keyboards for West-African Languages
LINA - Université de Nantes
Erdenebat, Dashtseren Automatic Construction of a Japanese-Chinese Dictionary via English
Shizuoka University
Erjavec, Tomaz Designing and Evaluating a Russian Tagset
The JOS Morphosyntactically Tagged Corpus of Slovene
Jozef Stefan Institute
España, S. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universidad Politécnica de Valencia
Espeja, Sergio Automatic Acquisition for low frequency lexical items
COLDIC, a Lexicographic Platform for LMF compliant lexica
Pompeu Fabra University
Esquerra, Ignasi Corpus and Voices for Catalan Speech Synthesis
UPC-TALP
Estève, Yannick Manual vs Assisted Transcription of Prepared and Spontaneous Speech
Combined Systems for Automatic Phonetic Transcription of Proper Nouns
Université du Maine
Esteve-Elizalde, Cristina BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
ATVS-UAM
Estrella, Paula Improving Contextual Quality Models for MT Evaluation Based on Evaluators’ Feedback
ISSCO/TIM/ETI - University of Geneva
Esuli, Andrea Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank
ISTI - CNR
Evang, Kilian The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Evans, David A Japanese-English Technical Lexicon for Translation and Language Research
National Institute of Informatics
Evert, Stefan A Lightweight and Efficient Tool for Cleaning Web Pages
University of Osnabrück
Ezeiza, Nerea Spelling Correction: from Two-Level Morphology to Open Source
University of the Basque Country

 

F
Fabbri, Marco Integration of a Multilingual Keyword Extractor in a Document Management System
DrWolf
Fadaee, Hakimeh A Hybrid Morphology-Based POS Tagger for Persian
NLP Research Lab, Shahid Beheshti University
Fakotakis, Nikos Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text
Audio Database in Support of Potentiel Threat and Crisis Situation Management
The MoveOn Motorcycle Speech Corpus
A Real-World Emotional Speech Corpus for Modern Greek
Wire Communications Laboratory, Department of Electrical and Computer Engineering, University of Patras
Fallucchi, Francesca Yet another Platform for Extracting Knowledge from Corpora
DISP, University of Rome Tor Vergata
Faraj, Reem Annotating an Arabic Learner Corpus for Error
Montclair State University
Farber, Benjamin Improving NER in Arabic Using a Morphological Tagger
Fair Isaac Corporation
Farina, Cynthia An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
Cornell University
Farkas, Richárd Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Farwell, David Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Universitat Politécnica de Catalunya
Federmann, Christian Extracting and Querying Relations in Scientific Papers on Language Technology
DFKI GmbH
Fék, Márk Multimodal Spontaneous Expressive Speech Corpus for Hungarian
Budapest University of Technology and Economics
Feldman, Anna Annotating an Arabic Learner Corpus for Error
Designing and Evaluating a Russian Tagset
Montclair State University
Felger, Niko Adaptation of Relation Extraction Rules to New Domains
University of Cambridge
Fellbaum, Christiane KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
MASC: the Manually Annotated Sub-Corpus of American English
Berlin-Brandenburg Academy of Sciences
Fernandez, Gabriela Mutual Bilingual Terminology Extraction
Universidad de Sevilla
Fernández, Rubén Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases
GAPS, Signals, Systems and Radiocommunications Department, Universidad Politécnica de Madrid
Ferraro, Kathleen Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application
University of Pittsburgh
Ferré, Gaëlle Creating and Exploiting Multimodal Annotated Corpora
Université de Nantes
Ferreres, Javi Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Universitat Politécnica de Catalunya
Ferret, Olivier Learning Patterns for Building Resources about Semantic Relations in the Medical Domain
CEA LIST
Ferro, Nicola An Evaluation Resource for Geographic Information Retrieval
From Research to Application in Multilingual Information Access: the Contribution of Evaluation
University of Padua
Ferrucci, David A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
IBM Research
Fersoe, Hanne LC-STAR II: Starring more Lexica
CST
Fišer, Darja Harvesting Multi-Word Expressions from Parallel Corpora
University of Ljubljana, Faculty of Arts
Fierrez, Julian BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
ATVS-UAM
Figueira, Luís Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis
L2F INESC-ID/IST, Lisboa
Fillmore, Charles MASC: the Manually Annotated Sub-Corpus of American English
ICSI
Finthammer, Marc Exploring and Navigating: Tools for GermaNet
University of Dortmund
Fishel, Mark Experiments on Processing Overlapping Parallel Corpora
University of Tartu
Fitzgerald, Erin Linguistic Resources for Reconstructing Spontaneous Speech Text
Johns Hopkins University, Center for Language and Speech Processing
Fitzpatrick, Eileen Annotating an Arabic Learner Corpus for Error
Montclair State University
Fleisch, Axel Experimental Fast-Tracking of Morphological Analysers for Nguni Languages
University of Helsinki & University of South Africa
Flickinger, Dan Some Fine Points of Hybrid Natural Language Parsing
CSLI Stanford & DFKI GmbH
Flynn, Mike Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis
IDIAP Research Institute
Font Llitjós, Ariadna Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Vivísimo, Inc.
Forăscu, Corina How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach
GMT to +2 or how can TimeML be used in Romanian
Al.I.Cuza University of Iasi
Forbes-Riley, Kate Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems
University of Pittsburgh
Forsbom, Eva Language Resources and Tools for Swedish: A Survey
Department of Linguistics and Philology, Uppsala University
Fosler-Lussier, Eric SCARE: a Situated Corpus with Annotated Referring Expressions
The Ohio State University
Fossati, Davide I saw TREE trees in the park: How to Correct Real-Word Spelling Mistakes
University of Illinois at Chicago
Foster, Jennifer Parser Evaluation and the BNC: Evaluating 4 constituency parsers with 3 metrics
Dublin City University
Fragkou, Pavlina BOEMIE Ontology-Based Text Annotation Tool
N.C.S.R. Demokritos
Francom, Jerid Parallel Multi-Theory Annotations of Syntactic Structure
The University of Arizona
Frederking, Robert Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation
Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls
Carnegie Mellon University
Freitag, Dayne Improving NER in Arabic Using a Morphological Tagger
Fair Isaac Corporation
Freitas, Tiago CORP-ORAL: Spontaneous Speech Corpus for European Portuguese
Spock - a Spoken Corpus Client
ILTEC
Friburger, Nathalie Automatic Rich Annotation of Large Corpus of Conversational transcribed speech: the Chunking Task of the EPAC Project
Université François Rabelais, Tours
Friedman, Lauren Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing
New Resources for Document Classification, Analysis and Translation Technologies
Linguistic Data Consortium
Frunza, Oana A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
University of Ottawa
Fujii, Atsushi Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
Producing an Encyclopedic Dictionary using Patent Documents
University of Tsukuba
Fujimoto, Masakiyo Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
NTT Corporation
Fürstenau, Hagen Enriching Frame Semantic Resources with Dependency Graphs
University of Saarland
Furui, Sadaoki Thai Broadcast News Corpus Construction and Evaluation
Tokyo Institute of Technology

 

G
Gabay, David Tagging a Hebrew Corpus: the Case of Participles
Ben Gurion University of the Negev
Gaizasukas, Robert Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation
ANNALIST - ANNotation ALIgnment and Scoring Tool
University of Sheffield
Galibert, Olivier An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System
LIMSI-CNRS
Gallego, Silvia Corpus and Voices for Catalan Speech Synthesis
UPC-TALP
Ganchev, Todor Audio Database in Support of Potentiel Threat and Crisis Situation Management
The MoveOn Motorcycle Speech Corpus
A Real-World Emotional Speech Corpus for Modern Greek
University of Patras
Gandcher, Franck A Guide for the Production of Reusable Language Resources
ELDA
Gangemi, Aldo LMM: an OWL-DL MetaModel to Represent Heterogeneous Lexical Knowledge
Institute for Cognitive Sciences and Technology -National Research Council
Ganguly, Niloy Unsupervised Parts-of-Speech Induction for Bengali
Indian Institute of Technology Kharagpur
Gardent, Claire A Test Suite for Inference Involving Adjectives
LORIA, University of Nancy
Garnier-Rizet, Martine CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
VECSYS
Garrido, Javier STC-TIMIT: Generation of a Single-channel Telephone Corpus
HCTLab, Universidad Autonoma de Madrid
Garrote, Marta Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories
Universidad Autonoma Madrid
Gasch, Joachim memasysco: XML schema based metadata management system for speech corpora
Institut für Deutsche Sprache, Mannheim
Gauvain, Jean-Luc CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
LIMSI-CNRS
Geeraerts, Dirk The Construction and Evaluation of Word Space Models
Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms.
University of Leuven
Geertzen, Jeroen Evaluating Dialogue Act Tagging with Naive and Expert Annotators
Tilburg University
Geoffrois, Edouard An Economic View on Human Language Technology Evaluation
DGA
Georgescul, Maria Building Mobile Spoken Dialogue Applications Using Regulus
University of Geneva
Georgila, Kallirroi A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Edinburgh
Gerassimenko, Olga From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian
University of Tartu
Gey, Fredric A Japanese-English Technical Lexicon for Translation and Language Research
An Evaluation Resource for Geographic Information Retrieval
University of California, Berkeley
Ghesquière, Pol Children’s Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement
Katholieke Universiteit Leuven
Gibbon, Dafydd An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation
University of Bielefeld
Gibson, Bryan The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
University of Michigan
Giesbers, Charlotte Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization
Radboud University Nijmegen
Giles, C Lee ParsCit: an Open-source CRF Reference String Parsing Package
Pennsylvania State University
Gilg, Thomas ALC: Alcohol Language Corpus
Institute for Legal Medicine
Gillam, Lee Automatic Document Quality Control
Lexical Ontology Extraction using Terminology Analysis: Automating Video Annotation
University of Surrey
Gillies, Breanna LILA: Cellular Telephone Speech Databases from Asia
Appen
Giménez, Jesús Towards Heterogeneous Automatic MT Error Analysis
UPC-TALP
Giouli, Voula Building a Greek corpus for Textual Entailment
ILSP - Athens
Giovannetti, Emiliano Ontology Learning and Semantic Annotation: a Necessary Symbiosis
ILC-CNR, Pisa
Girardi, Christian The TextPro Tool Suite
Fondazione Bruno Kessler - FBK, Trento
Givon, Sharon Named Entity Recognition for Digitised Historical Texts
University of Edinburgh
Gledhill, Christopher A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
LILPA, Université Marc Bloch Strasbourg
Gleim, Rüdiger A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
University of Bielefeld
Glenn, Meghan Quick Rich Transcriptions of Arabic Broadcast News Speech Data
Linguistic Data Consortium
Gliozzo, Alfio Massimiliano Supersense Tagger for Italian
LMM: an OWL-DL MetaModel to Represent Heterogeneous Lexical Knowledge
Laboratory for Applied Ontology, ISTC-CNR
Gnjatovic, Milan On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System
Otto-von-Guericke-University Magdeburg, Department of Knowledge Processing and Language Engineering
Gödde, Florian Corpus Analysis of Spoken Smart-Home Interactions with Older Users
Deutsche Telekom Labs, Berlin University of Technology
Godinho, Joaquim Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis
INOV
Goebel, Randy Targeting Chinese Nominal Compounds in Corpora
AICML, University of Alberta
Goecke, Daniela Influence of Text Type and Text Length on Anaphoric Annotation
University of Bielefeld
Goeuriot, Lorraine Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian
LINA - Université de Nantes
Goldberg, Yoav Word-Based or Morpheme-Based? Annotation Strategies for Modern Hebrew Clitics
Tagging a Hebrew Corpus: the Case of Participles
Ben Gurion University of the Negev
Goldstein, J.D. Statistical Evaluation of Information Distillation Systems
BAE Systems
Goldstein-Stewart, Jade Creating and Using a Correlated Corpus to Glean Communicative Commonalities
US Dept. of Defense
Gómez Gallo, Carlos Production in a Multimodal Corpus: how Speakers Communicate Complex Actions
University of Rochester
Gómez, J. A. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universidad Politécnica de Valencia
Gómez-Pérez, Asunción Towards a Glossary of Activities in the Ontology Engineering Field
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence
Ontology Engineering Group (Universidad Politécnica de Madrid)
González-Ledesma, Ana Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English)
Universidad Autónoma de Madrid
Gonzalez-Rodriguez, Joaquin BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
ATVS-UAM
Gonzalo, Julio From Research to Application in Multilingual Information Access: the Contribution of Evaluation
Universidad Nacional de Educación a Distancia, Madrid
Goodwin, Kerri Creating and Using a Correlated Corpus to Glean Communicative Commonalities
Loyola College in Maryland
Gorbe, J. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universidad Politécnica de Valencia
Gordo, A. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Gorjanc, Vojko Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema
Faculty of Arts, University of Ljubljana
Górski, Rafał L. Towards the National Corpus of Polish
Institute of Polish Language at the Polish Academy of Sciences
Gottwald, Sebastian Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze
University of Leipzig
Götze, Michael Annotation of Information Structure: an Evaluation across different Types of Texts
University of Potsdam
Grabar, Natalia Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian
INSERM - Université René Descartes
Graca, Joao Building a Golden Collection of Parallel Multi-Language Word Alignment
L2F INESC-ID/IST, Lisboa
Graff, David Speaker Recognition: Building the Mixer 4 and 5 Corpora
Linguistic Data Consortium
Graham, C. Ray Elicited Imitation as an Oral Proficiency Measure with ASR Scoring
BYU Linguistics
Grau, Bernat User-Centred Design of Error Correction Tools
Pompeu Fabra University
Gravier, Guillaume On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems
Morphosyntactic Resources for Automatic Speech Recognition
IRISA / Universite Rennes 1
Greenwood, Mark Saxon: an Extensible Multimedia Annotator
University of Sheffield
Grefenstette, Gregory Semi-automatic Building Method for a Multidimensional Affect Dictionary for a New Language
A Conceptual Approach to Web Image Retrieval
CEA LIST
Grimes, Stephen Lexicon Schemas and Related Data Models: when Standards Meet Users
Indiana University
Griol, David Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques
DSIC - UPV
Grishman, Ralph Is this NE tagger getting old?
New York University
Grocholewski, Stefan JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Institute of Computing Science, Poznan University of Technology
Grothe, Lena A Comparative Study on Language Identification Methods
University of Magdeburg
Grouin, Cyril Certification and Cleaning up of a Text Corpus: Towards an Evaluation of the “Grammatical” Quality of a Corpus
LIMSI-CNRS
Grover, Claire Named Entity Recognition for Digitised Historical Texts
Learning the Species of Biomedical Named Entities from Annotated Corpora
University of Edinburgh
Gubrynowicz, Ryszard Design and Data Collection for Spoken Polish Dialogs Database
Polish-Japaneese Institute of Information Technology, Warsaw,Poland
Guerini, Marco Resources for Persuasion
Valentino: A Tool for Valence Shifting of Natural Language Texts
Fondazione Bruno Kessler - FBK, Trento
Guillemin-Lanne, Sylvie CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
TEMIS
Guirao, José M. Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories
Universidad de Granada
Guo, Yikun Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation
University of Sheffield
Gupta, Piklu Enriching GermaNet with verb-noun relations - a case study of lexical acquisition
University of Tübingen
Gurevych, Iryna Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
Technische Universität Darmstadt
Gurrutxaga, Antton Analysis and Performance of Morphological Query Expansion and Language-Filtering Words on Basque Web Searching
WNTERM: Enriching the MCR with a Terminological Dictionary
Elhuyar Fundazioa, R&D
Guthrie, David An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora
University of Sheffield
Guthrie, Louise Professor or Screaming Beast? Detecting Anomalous Words in Chinese
Authorship Attribution of E-Mail: Comparing Classifiers over a New Corpus for Evaluation
Using a Probabilistic Model of Context to Detect Word Obfuscation
Unsupervised Learning-based Anomalous Arabic Text Detection
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora
University of Sheffield

 

H
Ha, Le An Mutual Bilingual Terminology Extraction
University of Wolverhampton
Habash, Nizar Improving NER in Arabic Using a Morphological Tagger
Identification of Naturally Occurring Numerical Expressions in Arabic
Columbia University
Habert, Benoit Annotation and analysis of overlapping speech in political interviews
ICAR
Haddow, Barry Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
University of Edinburgh
Haertel, Robbie Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
Hahn, Stefan A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding
RWTH Aachen University
Hahn, Udo Approximating Learning Curves for Active-Learning-Driven Annotation
Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab
Jena University, JULIE Lab
Hain, Horst-Udo Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework
Siemens AG
Hajič, Jan Validating the Quality of Full Morphological Annotation
Charles University, Prague
Hajicová, Eva From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank
Charles University, Prague
Halácsy, Péter Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report
BME MOKK
Halimi, Sonia Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Université de Genève/ETI/TIM
Hall, Keith Inter-sentential Coreferences in Semantic Networks: An Evaluation of Manual Annotation
Johns Hopkins University, Center for Language and Speech Processing
Hallett, Catalina Automatic Rewriting of Patient Record Narratives
The Open University
Halpern, Jack Exploiting Lexical Resources for Disambiguating CJK and Arabic Orthographic Variants
The CJK Dictionary Institute, Inc.
Hammarström, Harald Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)
Chalmers University, Gothenburg
Hamon, Olivier An Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation
PASSAGE: from French Parser Evaluation to Large Sized Treebank
ELDA
Han, Lei A Research on Automatic Chinese Catchword Extraction
Wuhan University
Handschuh, Siegfried Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication
Linguistically Light Lexical Extensions for Ontologies
DERI/NUIG
Hänig, Christian UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging
University of Leipzig
Hao, Yanfen Acquiring Naturalistic Concept Descriptions from the Web
University College Dublin
Hara, Sunao In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University
Harabagiu, Sanda A Linguistic Resource for Discovering Event Structures and Resolving Event Coreference
The University of Texas at Dallas
Harada, Takashi Creation of Learner Corpus and Its Application to Speech Recognition
Doshisha University
Hardcastle, David Automatic Rewriting of Patient Record Narratives
Can we Evaluate the Quality of Generated Text?
The Open University
Harris, Dave SpatialML: Annotation Scheme, Corpora, and Tools
MITRE Corporation
Hartholt, Arno A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture
USC’s Institute for Creative Technologies
Hartley, Anthony Generalising Lexical Translation Strategies for MT Using Comparable Corpora
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages
Sensitivity of Automated MT Evaluation Metrics on Higher Quality MT Output: BLEU vs Task-Based Evaluation Methods
University of Leeds
Hartung, Matthias Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration
Department of Computational Linguistics, Heidelberg University
Hasan, Saša A Multi-Genre SMT System for Arabic to French
Automatic Evaluation Measures for Statistical Machine Translation System Optimization
RWTH Aachen University
Hashimoto, Chikara A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
Yamagata University
Hasler, Laura The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Centering Theory for Evaluation of Coherence in Computer-Aided Summaries
University of Wolverhampton
Hatvani, Csaba Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Hauptmann, Alexander Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments
Carnegie Mellon University
Hayashi, Yoshihiko Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy
Osaka University & NICT
Heeren, Willemijn Evaluation of Spoken Document Retrieval for Historic Speech Collections
University of Twente
Heid, Ulrich Tools for Collocation Extraction: Preferences for Active vs. Passive
A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations.
Head or Non-head? Semi-automatic Procedures for Extracting and Classifying Subcategorisation Properties of Compounds.
IMS, University of Stuttgart
Heinrich, Christian ALC: Alcohol Language Corpus
BAS Bavarian Archive for Speech Signals
Hemsen, Holmer Unsupervised Relation Extraction From Web Documents
DFKI GmbH
Henderer, Joe What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting
USC Institute for Creative Technologies
Hendrickx, Iris A Coreference Corpus and Resolution System for Dutch
University of Antwerp
Hennoste, Tiit From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian
University of Helsinki
Henriksen, Lina Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet
University of Copenhagen, Centre for Language Technology (CST)
Hepple, Mark Cross-Domain Dialogue Act Tagging
Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation
University of Sheffield
Hering, Horst The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
EUROCONTROL Experimental Centre
Hermet, Matthieu Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors
University of Ottawa
Hernaez, Inma Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Hernáez, Inmaculada Subjective Evaluation of an Emotional Speech Database for Basque
University of the Basque Country
Hernandez, Gregorio Spelling Correction: from Two-Level Morphology to Open Source
Eleka S.L.
Hernández, Luis A. Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases
GAPS, Signals, Systems and Radiocommunications Department, Universidad Politécnica de Madrid
Hernandez-Lopez, Daniel BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
ATVS-UAM
Hess, Michael Dependency-Based Relation Mining for Biomedical Literature
University of Zurich
Heyer, Gerhard Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze
ASV Toolbox: a Modular Collection of Language Exploration Tools
University of Leipzig
Heylen, Kris The Construction and Evaluation of Word Space Models
Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms.
University of Leuven
Hickl, Andrew Unsupervised Resource Creation for Textual Inference Applications
Scaling Answer Type Detection to Large Hierarchies
Language Computer Corporation
Hillard, Dustin The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
University of Washington
Hinrichs, Erhard Foundation of a Component-based Flexible Registry for Language Resources and Technology
In Contrast - A Complex Discourse Connective
University of Tuebingen
Hitzeman, Janet SpatialML: Annotation Scheme, Corpora, and Tools
A Corpus for Cross-Document Co-reference
MITRE Corporation
Hobbs, Reginald MTriage: Web-enabled Software for the Creation, Machine Translation, and Annotation of Smart Documents
Army Research Laboratory
Hockey, Beth Ann Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
UCSC UARC
Hofbauer, Konrad The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
Graz University of Technology
Hoffmann, Holger The PIT Corpus of German Multi-Party Dialogues
University of Ulm
Hofmann, Hansjörg Emotion Recognition from Speech: Stress Experiment
Institute of Information Technology
Höge, Harald Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework
Siemens AG
Hollink, Laura A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections
Free University, Amsterdam
Holz, Florian ASV Toolbox: a Modular Collection of Language Exploration Tools
University of Leipzig
Hong, Jia-Fei The Extended Architecture of Hantology for Japan Kanji
Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System
National Taiwan University
Horiuchi, Hiroaki A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Hoste, Veronique Learning-based Detection of Scientific Terms in Patient Information
A Coreference Corpus and Resolution System for Dutch
LT3, University College Ghent
Hou, Yuexian Exploiting the Role of Position Feature in Chinese Relation Extraction
School of Computer Science and Technology, Tianjin Unversity
Hovy, Eduard A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture
USC’s Information Sciences Institute
Hoyt, Jeffrey An Exchange Format for Multimodal Annotations
MITRE Corporation
Hrúz, Marek Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen
Hsieh, Shu-kai KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Adapting International Standard for Asian Language Technologies
Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies
National Taiwan University
Huang, Chu-Ren KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Adapting International Standard for Asian Language Technologies
The Extended Architecture of Hantology for Japan Kanji
Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies
Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System
Academia Sinica
Huckstorf, Axel The European Thesaurus on International Relations and Area Studies - a Multilingual Resource for Indexing, Retrieval, and Translation
Stiftung Wissenschaft und Politik (SWP) - German Institute for International and Security Affairs
Huet, Stéphane Morphosyntactic Resources for Automatic Speech Recognition
IRISA / Universite Rennes 1
Huijbregts, Marijn Evaluation of Spoken Document Retrieval for Historic Speech Collections
University of Twente
Hulden, Mans Parallel Multi-Theory Annotations of Syntactic Structure
The University of Arizona
Hunt, Steve From Field Notes towards a Knowledge Base
Tilburg University
Hunter, D. Statistical Evaluation of Information Distillation Systems
BAE Systems
Hurtado, Lluís F. Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques
DSIC - UPV
Husain, Samar Developing Verb Frames for Hindi
Language Technologies Research Centre, IIIT, Hyderabad
Husarciuc, Maria Romanian Semantic Role Resource
Faculty of Letters
Huynh, Cong-phap SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora
GETALP, LIG, INPG

 

I
Ibekwe-SanJuan, Fidelia Identifying Strategic Information from Scientific Articles through Sentence Classification
University of Lyon 3
Ibrahim, Hossam Automatic Extraction of Textual Elements from News Web Pages
Cairo University
Ide, Nancy A Bilingual Corpus of Inter-linked Events
MASC: the Manually Annotated Sub-Corpus of American English
Department of Computer Science, Vassar College, Poughkeepsie, New York
Ienco, Dino Automatic extraction of subcategorization frames for Italian
Università di Torino
Iftene, Adrian Named Entity Relation Mining using Wikipedia
Al.I.Cuza University of Iasi
Iida, Hitoshi Automatic Emotional Degree Labeling for Speakers’ Anger Utterance during Natural Japanese Dialog
School of Media Science, Tokyo University of Technology
Inácio, Susana What’s in a Colour? Studying and Contrasting Colours with COMPARA
FCCN
Inkpen, Diana Combining Multiple Models for Speech Information Retrieval
Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution
University of Ottawa
Ion, Radu Unsupervised Lexical Acquisition for Part of Speech Tagging
RACAI’s Linguistic Web Services
RACAI, Romanian Academy, Bucharest
Ircing, Pavel Dialogue, Speech and Images: the Companions Project Data Set
University of West Bohemia
Iria, José Saxon: an Extensible Multimedia Annotator
A Comparative Evaluation of Term Recognition Algorithms
An Approach to Modeling Heterogeneous Resources for Information Extraction
University of Sheffield
Irimia, Elena Unsupervised Lexical Acquisition for Part of Speech Tagging
RACAI, Romanian Academy, Bucharest
Isahara, Hitoshi Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Extraction of Informative Expressions from Domain-specific Documents
Boot-Strapping a WordNet Using Multiple Existing WordNets
Construction of a Metadata Database for Efficient Development and Use of Language Resources
Extraction of Attribute Concepts from Japanese Adjectives
Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
A Dependency Parser for Thai
Development of the Japanese WordNet
Application of Resource-based Machine Translation to Real Business Scenes
National Institute of Information and Communications Technology
Ishikawa, Shogo A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Ishizaki, Shun A Contextual Dynamic Network Model for WSD Using Associative Concept Dictionary
Keio University
Ismael, Safa New Resources for Document Classification, Analysis and Translation Technologies
Linguistic Data Consortium
Itagaki, Masaki Post-MT Term Swapper: Supplementing a Statistical Machine Translation System with a User Dictionary
Microsoft
Itahashi, Shuichi The 2008 Oriental COCOSDA Book Project: in Commemoration of the First Decade of Sustained Activities in Asia
National Institute of Informatics
Itai, Alon Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora
Computer Science Dept. Technion
Itamar, Einav Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora
Computer Science Dept. Technion
Itoh Ozaku, Hiromi Relationships between Nursing Converstaions and Activities
ATR Knowledge Science Labs.
Itoh, Yoshiaki Test Collections for Spoken Document Retrieval from Lecture Audio Data
Iwate Prefectural University
Itou, Katunobu Test Collections for Spoken Document Retrieval from Lecture Audio Data
In-car Speech Data Collection along with Various Multimodal Signals
Hosei University
Ittycheriah, Midhun What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting
USC Information Sciences Institute
Ivanova, Kremena Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
IMS, University of Stuttgart
Ivanova, Steliana POS Tagging for German: how important is the Right Context?
Umbria Inc.
Iwano, Koji Thai Broadcast News Corpus Construction and Evaluation
Tokyo Institute of Technology

 

J
Jabbari, Sanaz Using a Probabilistic Model of Context to Detect Word Obfuscation
University of Sheffield
Jaeger, T. Florian Production in a Multimodal Corpus: how Speakers Communicate Complex Actions
University of Rochester
Janíček, Miroslav CzEng 0.7: Parallel Corpus with Community-Supplied Translations
Charles University, Prague
Janssen, Maarten Spock - a Spoken Corpus Client
ILTEC
Jelinek, Frederick Linguistic Resources for Reconstructing Spontaneous Speech Text
Johns Hopkins University, Center for Language and Speech Processing
Jern, Alan BART: A modular toolkit for coreference resolution
University of California Los Angeles
Jha, Girish Nath A Common Parts-of-Speech Tagset Framework for Indian Languages
Jawaharlal Nehru University
Ji, Donghong A Research on Automatic Chinese Catchword Extraction
Wuhan University
Joan, Anna Turning a Term Extractor into a new Domain: first Experiences
Applied Linguistic Institute
Jochim, Charles A Simple Method for Tagset Comparision
Indiana University
Johannessen, Janne Bondi Evaluation of Linguistics-Based Translation
Glossa: a Multilingual, Multimodal, Configurable User Interface
University of Oslo
Johansson, Richard Comparing Dependency and Constituent Syntax for Frame-semantic Analysis
Lund University
Johnson, Aaron Elicited Imitation as an Oral Proficiency Measure with ASR Scoring
BYU Linguistics
Jon, Chamberlain ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation
University of Essex
Jones, Rhys James Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
Bangor University
Jongejan, Bart Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retrieval Setting
CST, Copenhagen
Jongtaveesataporn, Markpong Thai Broadcast News Corpus Construction and Evaluation
Tokyo Institute of Technology
Jönsson, Arne Using Random Indexing to improve Singular Value Decomposition for Latent Semantic Analysis
Department of Computer and Information Science
Joseph, Mark The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
University of Michigan
Joshi, Aravind The Penn Discourse TreeBank 2.0.
University of Pennsylvania
Joubert, Alain Evolutionary Basic Notions for a Thematic Representation of General Knowledge
LIRMM
Jouis, Christophe Representation of Atypical Entities in Ontologies
LIP6 - Université Pierre et Marie Curie
Juan-Císcar, Alfons Bilingual Text Classification using the IBM 1 Translation Model
Departamento de Sistemas Informáticos y Computación - Universidad Politécnica de Valencia
Judge, John Linguistically Light Lexical Extensions for Ontologies
IBM LanguageWare

 

K
Kaalep, Heiki-Jaan Experiments on Processing Overlapping Parallel Corpora
University of Tartu
Kacic, Zdravko Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework
University of Maribor
Kageura, Kyo Constructing a Corpus that Indicates Patterns of Modification between Draft and Final Translations by Human Translators
Graduate School of Education, University of Tokyo
Kaggal, Vinod System Evaluation on a Named Entity Corpus from Clinical Notes
Mayo Clinic College of Medicine
Kaisser, Michael Creating a Research Collection of Question Answer Sentence Pairs with Amazon’s Mechanical Turk
University of Edinburgh / Powerset
Kaji, Hiroyuki Automatic Construction of a Japanese-Chinese Dictionary via English
Shizuoka University
Kaji, Reiko Constructing a Database of Non-Japanese Pronunciations of Different Japanese Romanizations
Tokyo University of Foreign Studies
Kaljurand, Kaarel Dependency-Based Relation Mining for Biomedical Literature
University of Zurich
Kallmeyer, Laura Developing a TT-MCTAG for German with an RCG-based Parser
University of Tübingen
Kan, Min-Yen ParsCit: an Open-source CRF Reference String Parsing Package
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
National University of Singapore
Kando, Noriko A Japanese-English Technical Lexicon for Translation and Language Research
National Institute of Informatics
Kanzaki, Kyoko KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Boot-Strapping a WordNet Using Multiple Existing WordNets
Extraction of Attribute Concepts from Japanese Adjectives
Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Development of the Japanese WordNet
National Institute of Information and Communications Technology
Kaplan, Dain Adapting International Standard for Asian Language Technologies
Tokyo Institute of Technology
Karaiskos, Vasilis A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Edinburgh
Karkaletsis, Vangelis BOEMIE Ontology-Based Text Annotation Tool
N.C.S.R. Demokritos
Karlgren, Jussi Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retrieval Setting
SICS, Stockholm
Karra, Vassia Condensing Sentences for Subtitle Generation
University of Athens
Kasami, Tomohiko A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Kassner, Laura Acquiring a Taxonomy from the German Wikipedia
University of Tuebingen
Kasterpalu, Riina From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian
University of Tartu
Katrin, Tomanek Approximating Learning Curves for Active-Learning-Driven Annotation
FSU Jena
Kawahara, Daisuke A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
A Method for Automatically Constructing Case Frames for English
National Institute of Information and Communications Technology
Kawahara, Tatsuya Test Collections for Spoken Document Retrieval from Lecture Audio Data
Kyoto University of Technology
Kawtrakul, Asanee Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Department of Computer Engineering, Kasetsart University, Bangkok
Kawtrakul, Asanee Building an Annotated Corpus for Text Summarization and Question Answering
Kasetsart University
Kay, Martin Improving Statistical Machine Translation Efficiency by Triangulation
University of Saarland
Kellermann, Walter WOZ Acoustic Data Collection for Interactive TV
University of Erlangen-Nuremberg
Kemper, Brian Connecting Text Mining and Pathways using the PathText Resource
University of Tokyo
Kemps-Snijders, Marc Ensuring Semantic Interoperability on Lexical Resources
Exploring and Enriching a Language Resource Archive via the Web
ISOcat: Corralling Data Categories in the Wild
Max Planck Institute for Psycholinguistics
Kennington, Casey Elicited Imitation as an Oral Proficiency Measure with ASR Scoring
BYU Linguistics
Kermanidis, Katia Lida Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text
Department of Informatics, Ionian University
Keyser, Paul A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
IBM Research
Kiefer, Bernd Some Fine Points of Hybrid Natural Language Parsing
DFKI GmbH
Kikuchi, Norihiro Connecting Text Mining and Pathways using the PathText Resource
Mitsui Knowledge Industry Co., Ltd.
Kilgarriff, Adam Cleaneval: a Competition for Cleaning Web Pages
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
Lexical Computing Ltd
Kim, Dong-Il Annotation Guidelines for Chinese-Korean Word Alignment
YUST
Kim, Jin-Dong Challenges in Pronoun Resolution System for Biomedical Text
The University of Tokyo
King, Maghi Improving Contextual Quality Models for MT Evaluation Based on Evaluators’ Feedback
ISSCO/TIM/ETI - University of Geneva
Kipp, Michael Spatiotemporal Coding in ANVIL
An Exchange Format for Multimodal Annotations
DFKI GmbH
Kiriyama, Shinya A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Kitamura, Keisuke Creation of Learner Corpus and Its Application to Speech Recognition
Doshisha University
Kitano, Hiroaki Connecting Text Mining and Pathways using the PathText Resource
Okinawa Institute of Science and Technology
Kitaoka, Norihide Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University
Kitazawa, Shigeyoshi A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Kiyota, Yoji Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings
University of Tokyo
Klakow, Dietrich Cost-Sensitive Learning in Answer Extraction
University of Saarland
Klassmann, Alex Exploring and Enriching a Language Resource Archive via the Web
Max Planck Institute for Psycholinguistics
Klavans, Judith Relation between Agreement Measures on Human Labeling and Machine Learning Performance: Results from an Art History Domain
University of Maryland
Kleiner, Stefan German Today: a really extensive Corpus of Spoken Standard German
Institut für Deutsche Sprache, Mannheim
Klessa, Katarzyna JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Institute of Linguistics, Adam Mickiewicz University, Poznań
Klingenstein, Sara Building a Corpus of Temporal-Causal Structure
University of Colorado
Kloosterman, Geert A Coreference Corpus and Resolution System for Dutch
University of Groningen
Kluck, Michael The European Thesaurus on International Relations and Area Studies - a Multilingual Resource for Indexing, Retrieval, and Translation
Stiftung Wissenschaft und Politik (SWP) - German Institute for International and Security Affairs
Knight, Dawn Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis
University of Nottingham
Knöbl, Ralf German Today: a really extensive Corpus of Spoken Standard German
Institut für Deutsche Sprache, Mannheim
Knopp, Johannes Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration
Department of Computational Linguistics, Heidelberg University
Koehler, Florian A Question Answering System for German. Experiments with Morphological Linguistic Resources
IMS, University of Stuttgart
Koeva, Svetla Chooser: a Multi-Task Annotation Tool
Bulgarian Academy of Sciences
Kogure, Kiyoshi Relationships between Nursing Converstaions and Activities
ATR Knowledge Science Labs.
Kogure, Satoru Developing Corpus of Japanese Classroom Lecture Speech Contents
Faculty of Informatics, Shizuoka University
Köhler, Joachim The MoveOn Motorcycle Speech Corpus
Fraunhofer IAIS
Koit, Mare From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian
University of Tartu
Kokkinakis, Dimitrios MeSH©: from a Controlled Vocabulary to a Processable Resource
A Semantically Annotated Swedish Medical Corpus
University of Gothenburg
Kolář, Jáchym Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations
University of West Bohemia
Kondoh, Yohsuke Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus
Nagoya University
Konings, Nanneke The AUTONOMATA Spoken Names Corpus
CLST, Radboud University, Nijmegen, the Netherlands
Kopotev, Mikhail Designing and Evaluating a Russian Tagset
University of Helsinki
Kordoni, Valia Robust Parsing with a Large HPSG Grammar
Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German
University of Saarland & DFKI GmbH
Korhonen, Anna LexSchem: a Large Subcategorization Lexicon for French Verbs
University of Cambridge
Kornai, András Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report
BME MOKK
Koskenniemi, Kimmo CLARIN: Common Language Resources and Technology Infrastructure
University of Helsinki
Kosseim, Leila Answering List Questions using Co-occurrence and Clustering
Concordia University
Kostoulas, Theodoros The MoveOn Motorcycle Speech Corpus
A Real-World Emotional Speech Corpus for Modern Greek
University of Patras
Kotnik, Bojan Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework
University of Maribor
Kountz, Manuel A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations.
Graduate Programme GRK 609, Uni Stuttgart
Kouylekov, Milen The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Fondazione Bruno Kessler - FBK, Trento
Kozawa, Shunsuke Automatic Acquisition of Usage Information for Language Resources
Construction of a Metadata Database for Efficient Development and Use of Language Resources
Graduate School of Information Science, Nagoya University
Krahmer, Emiel Controlling Redundancy in Referring Expressions
University of Tilburg
Krauwer, Steven CLARIN: Common Language Resources and Technology Infrastructure
MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic
University of Utrecht
Krek, Simon Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema
The JOS Morphosyntactically Tagged Corpus of Slovene
Jozef Stefan Institute
Krestel, Ralf Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles
L3S Research Center
Kronenthal, Melissa A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Edinburgh
Krstev, Cvetana The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
Faculty of Philology, Belgrade
Kruschwitz, Udo ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation
University of Essex
Kübler, Sandra POS Tagging for German: how important is the Right Context?
How to Compare Treebanks
Indiana University
Kubo, Junko Temporal Aspects of Terminology for Automatic Term Recognition: Case Study on Women’s Studies Terms
Graduate School of Library, Information and Media Studies, University of Tsukuba
Kuboya, Shunta The Japanese FrameNet Software Tools
Keio University
Kuhn, Jonas Identification of Comparable Argument-Head Relations in Parallel Corpora
University of Potsdam
Kulick, Seth Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation
Enhancing the Arabic Treebank: a Collaborative Effort toward New Annotation Guidelines
Linguistic Data Consortium
Kunst, Jan Pieter Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization
Meertens Institute
Kurella, Svitlana Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages
University of Leeds
Kurohashi, Sadao A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
Kyoto University
Kuroiwa, Shingo Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Chiba University
Kusakawa, Takashi In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University

 

L
Lê, Hồng Phương Word Segmentation of Vietnamese Texts: a Comparison of Approaches
LORIA, University of Nancy
Lafourcade, Mathieu Evolutionary Basic Notions for a Thematic Representation of General Knowledge
LIRMM
Lamel, Lori Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
LIMSI-CNRS
Lammie Glenn, Meghan Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
Linguistic Data Consortium
Lampmann, Malte Emotion Recognition from Speech: Stress Experiment
Institute of Information Technology
Lanchantin, Pierre Automatic Phoneme Segmentation with Relaxed Textual Constraints
IRCAM
Lange, Marek JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Laboratory of Speech and Language Technology , Adam Mickiewicz University Foundation, Poznan
Langlais, Philippe MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices
Université de Montréal
Langlois, David Phrase-Based Machine Translation based on Simulated Annealing
LORIA, University of Nancy
Laoudi, Jamal MTriage: Web-enabled Software for the Creation, Machine Translation, and Annotation of Smart Documents
Exploitation of an Arabic Language Resource for Machine Translation Evaluation: using Buckwalter-based Lookup Tool to Augment CMU Alignment Algorithm
Advanced Resource Technology, Inc.
Laparra, Egoitz Complete and Consistent Annotation of WordNet using the Top Concept Ontology
University of the Basque Country
Lapshinova-Koltunski, Ekaterina Head or Non-head? Semi-automatic Procedures for Extracting and Classifying Subcategorisation Properties of Compounds.
IMS, University of Stuttgart
Lashevskaja, Olga N. Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives
VINITI RAN, Moscow
Laskowski, Kornel A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora
CMU
Lau, Monica In Contrast - A Complex Discourse Connective
University of Tuebingen
Laublet, Philippe Automatic Identification of Temporal Information in Tourism Web Pages
Lalic - Université Paris Sorbonne
Lauc, Tomislava Generating a Morphological Lexicon of Organization Entity Names
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences
Laurent, Antoine Combined Systems for Automatic Phonetic Transcription of Proper Nouns
Université du Maine
Lavecchia, Caroline Phrase-Based Machine Translation based on Simulated Annealing
LORIA, University of Nancy
Lavelli, Alberto Comparing Italian parsers on a common Treebank: the EVALITA experience
Fondazione Bruno Kessler - FBK, Trento
Lavie, Alon Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Language Technologies Institute, Carnegie Mellon University
Łaziński, Marek Towards the National Corpus of Polish
Polish Scientific Publishers PWN and Warsaw University
Le Meur, André Presentation of the New ISO-Standard for the Representation of Entries in Dictionaries: ISO 1951
Université du Québec en Outaouais
Lecorvé, Gwénolé On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems
IRISA / Universite Rennes 1
Lee, Alan A Study of Parentheticals in Discourse Corpora - Implications for NLG Systems
The Penn Discourse TreeBank 2.0.
University of Pennsylvania
Lee, Chong Min Detecting Errors in Semantic Annotation
Georgetown University
Lee, Dongwon The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
Pennsylvania State University
Lee, Haejoong Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
Linguistic Data Consortium
Lee, Jong-Hyeok Annotation Guidelines for Chinese-Korean Word Alignment
POSTECH
Lee, Lung-Hao Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System
Institute of Linguistics, Academia Sinica
Lefever, Els Learning-based Detection of Scientific Terms in Patient Information
LT3, University College Ghent
Lefevre, Fabrice Semantic Frame Annotation on the French MEDIA corpus
LIA - University of Avignon
Lehmberg, Timm The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Hamburg
Lehnen, Patrick A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding
RWTH Aachen University
Leidner, Jochen L. Cost-Sensitive Learning in Answer Extraction
Research & Development, Thomson Legal & Regulatory
Lemnitzer, Lothar Enriching GermaNet with verb-noun relations - a case study of lexical acquisition
Extraction and Evaluation of Keywords from Learning Objects: a Multilingual Approach
University of Tübingen
Lemon, Oliver Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation
Edinburgh University
Lenci, Alessandro Computational Models for Event Type Classification in Context
Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora
University of Pisa, Department of Linguistics
Lendvai, Piroska From Field Notes towards a Knowledge Base
Tilburg University
Leseva, Svetlozara Chooser: a Multi-Task Annotation Tool
Bulgarian Academy of Sciences
Leshtanska, Magdalena The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Lesmo, Leonardo Comparing Italian parsers on a common Treebank: the EVALITA experience
Università di Torino
Leturia, Igor Analysis and Performance of Morphological Query Expansion and Language-Filtering Words on Basque Web Searching
Elhuyar Fundazioa, R&D
Levas, Anthony A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
IBM Research
Levin, Lori Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation
Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Carnegie Mellon University
Lewandowska-Tomaszyk, Barbara Towards the National Corpus of Polish
University of Łódź
Li, Hong Adaptation of Relation Extraction Rules to New Domains
DFKI GmbH
Li, Jin-Ji Annotation Guidelines for Chinese-Korean Word Alignment
POSTECH
Li, Wenjie Opinion Annotation in On-line Chinese Product Reviews
Exploiting the Role of Position Feature in Chinese Relation Extraction
Corpus Exploitation from Wikipedia for Ontology Construction
Chinese Core Ontology Construction from a Bilingual Term Bank
The Hong Kong Polytechnic University
Li, Yaoyong Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection
University of Sheffield
Liberman, Mark 15 Years of Language Resource Creation and Sharing: a Progress Report on LDC Activities
Linguistic Data Consortium
Lichte, Timm Developing a TT-MCTAG for German with an RCG-based Parser
University of Tübingen
Lima, Vera Keywords, k-NN and Neural Networks: a Support for Hierarchical Categorization of Texts in Brazilian Portuguese
PUCRS
Lin, Wei-Hao Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments
Carnegie Mellon University
Linares, Georges Local Methods for On-Demand Out-of-Vocabulary Word Retrieval
University of Avignon
Lippincott, Tom Relation between Agreement Measures on Human Labeling and Machine Learning Performance: Results from an Art History Domain
Columbia University
Litman, Diane Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems
University of Pittsburgh
Liu, Chen Borrowing Language Resources for Development of Automatic Speech Recognition for Low- and Middle-Density Languages
Motorola Labs
Liu, Ting Cross-Domain Dialogue Act Tagging
University at Albany, SUNY
Liu, Wei Professor or Screaming Beast? Detecting Anomalous Words in Chinese
University of Sheffield
Ljubesic, Nikola Generating a Morphological Lexicon of Organization Entity Names
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences
Llorens, D. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Loehr, Dan An Exchange Format for Multimodal Annotations
MITRE Corporation
Logie, Robert A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Edinburgh
Lombardo, Vincenzo Comparing Italian parsers on a common Treebank: the EVALITA experience
Evaluation of Natural Language Tools for Italian: EVALITA 2007
Università di Torino
Lönneker-Rodman, Birte Integrating Metaphor Information into RDF/OWL EuroWordNet
International Computer Science Institute
Lonsdale, Deryle Elicited Imitation as an Oral Proficiency Measure with ASR Scoring
Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
BYU Linguistics
Loos, Berenike A Semantic Memory for Incremental Ontology Population
German National Library
López, Eduardo Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases
GAPS, Signals, Systems and Radiocommunications Department, Universidad Politécnica de Madrid
Lorente, Mercè Turning a Term Extractor into a new Domain: first Experiences
Applied Linguistic Institute
Lounela, Mikko Process Model for Composing High-quality Text Corpora
Research Institute for the Languages of Finland
Lowe, John Creating a Research Collection of Question Answer Sentence Pairs with Amazon’s Mechanical Turk
Powerset
Lu, Qin Exploiting the Role of Position Feature in Chinese Relation Extraction
Corpus Exploitation from Wikipedia for Ontology Construction
Chinese Core Ontology Construction from a Bilingual Term Bank
Chinese Term Extraction Based on Delimiters
Department of Computing, The Hong Kong Polytechnic University
Luengo, Iker Subjective Evaluation of an Emotional Speech Database for Basque
Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Luengo, Juan Carlos Methodology for Evaluating the Usability of User Interfaces in Mobile Services
Telefónica España
Luján Mares, Míriam Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation.
Instituto Tecnológico de Informática, Universidad Politécnica de Valencia
Luyckx, Kim Personae: a Corpus for Author and Personality Prediction from Text
CNTS Language Technology Group, University of Antwerp
Luzzati, Daniel Manual vs Assisted Transcription of Prepared and Spontaneous Speech
Université du Maine

 

M
Màrquez, Lluis Towards Heterogeneous Automatic MT Error Analysis
UPC-TALP
Ma, Qing Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Ryukoku University
Ma, Xiaoyi Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Linguistic Data Consortium
Maamouri, Mohamed Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation
Enhancing the Arabic Treebank: a Collaborative Effort toward New Annotation Guidelines
A Pilot Arabic Propbank
Linguistic Data Consortium & University of Penn
Macken, Lieve Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort
Gent Hogeschool
Madany, Abdel-Rahim Automatic Extraction of Textual Elements from News Web Pages
Cairo University
Madsen, Bodil Nistrup A Taxonomy of Lexical Metadata Categories
Copenhagen Business School
Maeda, Kazuaki Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Linguistic Data Consortium
Maegaard, Bente MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic
University of Copenhagen, Centre for Language Technology (CST)
Magnini, Bernardo The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Evaluation of Natural Language Tools for Italian: EVALITA 2007
Fondazione Bruno Kessler - FBK, Trento
Magnusson, Magnus An Exchange Format for Multimodal Annotations
Human Behavior Laboratory, Reykjavik
Maier, Wolfgang Developing a TT-MCTAG for German with an RCG-based Parser
How to Compare Treebanks
University of Tübingen
Maks, Isa Adjectives in the Dutch Semantic Lexical Database CORNETTO
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database
Standardising Bilingual Lexical Resources According to the Lexicon Markup Framework
Faculteit der Letteren, Vrije Universiteit Amsterdam
Malaisé, Véronique A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections
Free University, Amsterdam
Maleki, Jalal Converting Romanized Persian to the Arabic Writing Systems
Linkoping University, Sweden
Mamede, Nuno J. Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database
L2F INESC-ID/IST, Lisboa
Mandl, Thomas An Evaluation Resource for Geographic Information Retrieval
University of Hildesheim
Mani, Inderjeet SpatialML: Annotation Scheme, Corpora, and Tools
MITRE Corporation
Manning, Christopher Lexicon Schemas and Related Data Models: when Standards Meet Users
Stanford University
Mansouri, Aous A Pilot Arabic Propbank
University of Colorado, Boulder
Manzano-Macho, David Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence
Facultad de Informatica, Departamento de Inteligencia Artificial, Universidad Politécnica de Madrid, Spain
Mapelli, Valérie A Guide for the Production of Reusable Language Resources
Latest Developments in ELRA’s Services
ELDA
Maragoudakis, Manolis Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text
Department of Information and Communication Systems Engineering, University of the Aegean
Marasek, Krzysztof Design and Data Collection for Spoken Polish Dialogs Database
Polish-Japaneese Institute of Information Technology, Warsaw,Poland
Marchetti, Andrea KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
CNR-IIT
Marchi, Simone Ontology Learning and Semantic Annotation: a Necessary Symbiosis
ILC-CNR, Pisa
Marcińczuk, Michał Definition Extraction Using a Sequential Combination of Baseline Grammars and Machine Learning Classifiers
Wrocław University of Technology
Marek, Torsten Extracting and Querying Relations in Scientific Papers on Language Technology
DFKI GmbH
Marimon, Montserrat Automatic Acquisition for low frequency lexical items
COLDIC, a Lexicographic Platform for LMF compliant lexica
Pompeu Fabra University
Marinelli, Rita Encoding Terms from a Scientific Domain in a Terminological Database: Methodology and Criteria
ILC-CNR, Pisa
Markantonatou, Stella Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
ILSP - Athens
Marocco, Paolo Towards a Vector Space Model for FrameNet-like Resources
University of Roma Tor Vergata
Marquardt, Lutz WOZ Acoustic Data Collection for Interactive TV
University of Erlangen-Nuremberg
Martens, Jean-Pierre The AUTONOMATA Spoken Names Corpus
ELIS, Gent University, Belgium
Martí, M.Antònia AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora
AnCora: Multilevel Annotated Corpora for Catalan and Spanish
Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
CLiC-University of Barcelona
Martin, James H. Building a Corpus of Temporal-Causal Structure
Annotating Students’ Understanding of Science Concepts
University of Colorado
Martin, Jean-Claude Coding Emotional Events in Audiovisual Corpora
LIMSI-CNRS
Martínez, Paloma An Empirical Approach to a Preliminary Successful Identification and Resolution of Temporal Expressions in Spanish News Corpora
Universidad Carlos III de Madrid
Martínez-Hinarejos, Carlos D. Evaluation of Different Segmentation Techniques for Dialogue Turns
Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation.
DSIC - UPV
Martins, Pedro LX-Service: Web Services of Language Technology for Portuguese
University of Lisbon
Martins, Rui The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese
L2F INESC-ID/IST, Lisboa
Marzal, A. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Marzelou, Evi Building a Greek corpus for Textual Entailment
ILSP - Athens
Masanz, James System Evaluation on a Named Entity Corpus from Clinical Notes
Mayo Clinic College of Medicine
Massó, Guillem User-Centred Design of Error Correction Tools
Pompeu Fabra University
Masuda, Hidetaka Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings
Tokyo Denki University
Mata, Ana Isabel The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese
CLUL
Matoušek, Jindřich Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis
University of West Bohemia
Matsubara, Shigeki Automatic Acquisition of Usage Information for Language Resources
Construction of a Metadata Database for Efficient Development and Use of Language Resources
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus
Information Technology Center, Nagoya University
Matsuoka, Yukiko Connecting Text Mining and Pathways using the PathText Resource
The Systems Biology Institute
Matsuyoshi, Suguru Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus
Nagoya University
Maurel, Denis Prolexbase: a Multilingual Relational Lexical Database of Proper Names
Université François Rabelais, Tours
Mauser, Arne Automatic Evaluation Measures for Statistical Machine Translation System Optimization
RWTH Aachen University
Max, Aurélien An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System
LIMSI-CNRS
Maxwell, Michael Lexicon Schemas and Related Data Models: when Standards Meet Users
University of Maryland
Maynard, Diana Benchmarking Textual Annotation Tools for the Semantic Web
Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection
University of Sheffield
Mayo, Neil A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Edinburgh
Mazo, Hélène Latest Developments in ELRA’s Services
ELDA
Mazzei, Alessandro Comparing Italian parsers on a common Treebank: the EVALITA experience
Evaluation of Natural Language Tools for Italian: EVALITA 2007
Universit di Torino
McCarthy, Diana Lexical Substitution as a Framework for Multiword Evaluation
University of Sussex
McClanahan, Peter Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
McConville, Mark Evaluating Complement-Modifier Distinctions in a Semantically Annotated Corpus
University of Edinburgh
McGhee, Jeremiah Elicited Imitation as an Oral Proficiency Measure with ASR Scoring
BYU Linguistics
McGillivray, Barbara Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora
ILC-CNR, Pisa
McNaught, John Clustering Related Terms with Definitions
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
The University of Manchester
Medero, Julie Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
Linguistic Data Consortium
Medero, Shawn Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
Linguistic Data Consortium
Megerdoomian, Karine Low-Density Language Bootstrapping: the Case of Tajiki Persian
MITRE Corporation
Megyesi, Beáta Swedish-Turkish Parallel Treebank
Language Resources and Tools for Swedish: A Survey
Department of Linguistics and Philology, Uppsala University
Mehler, Alexander A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
University of Bielefeld
Meignier, Sylvain Combined Systems for Automatic Phonetic Transcription of Proper Nouns
Université du Maine
Meister, Einar Strengthening the Estonian Language Technology
Tallinn University of Technology
Melero, Maite Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
Rapid Deployment of a New METIS Language Pair: Catalan-English
GliCom, Fundació Barcelona Media, UPF
Melnar, Lynette LILA: Cellular Telephone Speech Databases from Asia
Borrowing Language Resources for Development of Automatic Speech Recognition for Low- and Middle-Density Languages
Motorola
Mencke, Myriam Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication
DERI/NUIG
Mendes, Carlos Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis
L2F INESC-ID/IST, Lisboa
Merlin, Téva Combined Systems for Automatic Phonetic Transcription of Proper Nouns
Université du Maine
Messiant, Cédric LexSchem: a Large Subcategorization Lexicon for French Verbs
Do we Still Need Gold Standards for Evaluation?
LIPN-CNRS
Meurs, Marie-Jean Semantic Frame Annotation on the French MEDIA corpus
LIA - University of Avignon
Micher, Jeffrey Exploitation of an Arabic Language Resource for Machine Translation Evaluation: using Buckwalter-based Lookup Tool to Augment CMU Alignment Algorithm
ARL
Mieskes, Margot Parameters for Topic Boundary Detection in Multi-Party Dialogues
A Three-stage Disfluency Classifier for Multi Party Dialogues
Knowledge Sources for Bridging Resolution in Multi-Party Dialog
European Media Laboratory GmbH, Heidelberg
Mihalcea, Rada Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources
University of North Texas
Miháltz, Márton Knowledge-based Coreference Resolution for Hungarian
MorphoLogic
Mille, Simon Making Text Resources Accessible to the Reader: the Case of Patent Claims
Pompeu Fabra University
Miller, Keith J. A Ground Truth Dataset for Matching Culturally Diverse Romanized Person Names
Adjudicator Agreement and System Rankings for Person Name Search
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Miltsakaki, Eleni The Penn Discourse TreeBank 2.0.
University of Pennsylvania
Minel, Jean-Luc Automatic Identification of Temporal Information in Tourism Web Pages
MoDyCo, CNRS
Mineur, Anne-Marie A Coreference Corpus and Resolution System for Dutch
University of Groningen
Minker, Wolfgang The PIT Corpus of German Multi-Party Dialogues
University of Ulm
Mírovský, Jiří Does Netgraph Fit Prague Dependency Treebank?
Charles University, Prague
Mitkov, Ruslan Mutual Bilingual Terminology Extraction
Anaphora Resolution Exercise: an Overview
Smarty - Extendable Framework for Bilingual and Multilingual Comprehension Assistants
University of Wolverhampton
Mival, Oli Dialogue, Speech and Images: the Companions Project Data Set
Napier University
Miyajima, Chiyomi Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University
Miyao, Yusuke GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain
University of Tokyo
Mladová, Lucie From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank
Charles University, Prague
Mochales Palau, Raquel Language Resources for Studying Argument
Katholieke Universiteit Leuven
Mochizuki, Hajime Constructing a Database of Non-Japanese Pronunciations of Different Japanese Romanizations
Tokyo University of Foreign Studies
Moens, Marie-Francine Language Resources for Studying Argument
Katholieke Universiteit Leuven
Mögele, Hannes Talking and Looking: the SmartWeb Multimodal Interaction Corpus
BAS Bavarian Archive for Speech Signals
Mohanty, Rajat Lexical Resources for Semantics Extraction
Indian Institute of Technology Bombay
Mohler, Michael Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
University of North Texas
Mokbel, Chafic MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic
University of Balamand
Mokrane, Abdenour Automatic Rich Annotation of Large Corpus of Conversational transcribed speech: the Chunking Task of the EPAC Project
Université François Rabelais, Tours
Moldovan, Dan Causal Relation Extraction
Lymba Corporation
Möller, Sebastian Corpus Analysis of Spoken Smart-Home Interactions with Older Users
Deutsche Telekom Labs, Berlin University of Technology
Monachesi, Paola From D-Coi to SoNaR: a reference corpus for Dutch
Extraction and Evaluation of Keywords from Learning Objects: a Multilingual Approach
Creating Glossaries Using Pattern-Based and Machine Learning Techniques
University of Utrecht
Monachini, Monica Named Entity WordNet
Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Adapting International Standard for Asian Language Technologies
A lexicon for biology and bioinformatics: the BOOTStrep experience.
UFRA: a UIMA-based Approach to Federated Language Resource Architecture
ILC-CNR, Pisa
Moniz, Helena The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese
L2F INESC-ID/CLUL, Lisboa
Monson, Christian Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Language Technologies Institute, Carnegie Mellon University
Monte, Enric Using Reordering in Statistical Machine Translation based on Alignment Block Classification
UPC-TALP
Montemagni, Simonetta Ontology Learning and Semantic Annotation: a Necessary Symbiosis
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora
ILC-CNR, Pisa
Moore, Johanna A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Edinburgh
Moraes, Silvia Keywords, k-NN and Neural Networks: a Support for Hierarchical Categorization of Texts in Brazilian Portuguese
PUCRS
Morales, Nicolas STC-TIMIT: Generation of a Single-channel Telephone Corpus
HCTLab, Universidad Autonoma de Madrid
Moran, Steve Lexicon Schemas and Related Data Models: when Standards Meet Users
University of Washington
Morante, Roser Semantic Role Labeling Tools Trained on the Cast3LB-CoNNL-SemRol Corpus
Tilburg University
Moreau, Nicolas Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
ELDA
Moreno Sandoval, Antonio Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories
Universidad Autonoma Madrid
Moreno, Asuncion LILA: Cellular Telephone Speech Databases from Asia
LC-STAR II: Starring more Lexica
Corpus and Voices for Catalan Speech Synthesis
UPC-TALP
Morris, Andrew C. Automatic Phoneme Segmentation with Relaxed Textual Constraints
SPINVOX
Morrissey, Sara The ATIS Sign Language Corpus
Dublin City University
Mörth, Karlheinz Words in Contexts: Digital Editions of Literary Journals in the “AAC - Austrian Academy Corpus”
Austrian Academy of Sciences
Moschitti, Alessandro BART: A modular toolkit for coreference resolution
University of Trento
Mossel, Eelco Language Resources for Semantic Document Annotation and Crosslingual Retrieval
University of Hamburg
Mostefa, Djamel Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
An Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation
New Telephone Speech Databases for French: a Children Database and an optimized Adult Corpus
The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign
PASSAGE: from French Parser Evaluation to Large Sized Treebank
Quick Rich Transcriptions of Arabic Broadcast News Speech Data
ELDA
Moszczyński, Radosław Enhancing an English-Polish Electronic Dictionary for Multiword Expression Research
University of Warsaw
Mota, Cristina Is this NE tagger getting old?
L2F INESC-ID/IST, Lisboa & New York University
Mporas, Iosif A Real-World Emotional Speech Corpus for Modern Greek
University of Patras
Mueller, Mark-Christoph Knowledge Sources for Bridging Resolution in Multi-Party Dialog
TU Darmstadt
Muhr, Rudolf The Pronouncing Dictionary of Austrian German (AGPD) and the Austrian Phonetic Database (ADABA): Report on a large Phonetic Resources Database of the three Major Varieties of German
Graz University
Mukherjee, Animesh Unsupervised Parts-of-Speech Induction for Bengali
Indian Institute of Technology Kharagpur
Müller, Christof Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
Technische Universität Darmstadt
Muller, Philippe Evaluation Metrics for Automatic Temporal Annotation of Texts
Toulouse University
Muñoz, Rafael Named Entity WordNet
University of Alicante
Murata, Masaki Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data
National Institute of Information and Communications Technology
Murray-Rust, Peter Language Resources and Chemical Informatics
Unilever Centre for Molecular Informatics, University of Cambridge
Murthi, Vijay Similar Term Discovery using Web Search
Yahoo, Inc.

 

N
Nakagawa, Hiroshi Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings
University of Tokyo
Nakagawa, Seiichi Developing Corpus of Japanese Classroom Lecture Speech Contents
Department of Information and Computer Sciences, Toyohashi University of Technology
Nakamura, Junpei A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation
Tokyo University of Agriculture and Technology
Nakamura, Satoshi Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
National Institute of Information and Communications Technology & ATR
Nakao, Koichi Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data
Ryukoku University
Nakao, Yukie Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Nantes University
Nakayama, Masato Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Ritsumeikan University
Nallasamy, Udhyakumar NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls
Language Technologies Institute, Carnegie Mellon University
Nanjo, Hiroaki Test Collections for Spoken Document Retrieval from Lecture Audio Data
Ryukoku University
Narawa, Chiharu Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy
Kyoto University
Naroua, Harouna Evaluation of Virtual Keyboards for West-African Languages
Université Abdou Moumouni
Nastase, Vivi Acquiring a Taxonomy from the German Wikipedia
European Media Laboratory GmbH, Heidelberg
Nath, Joydeep Unsupervised Parts-of-Speech Induction for Bengali
Indian Institute of Technology Kharagpur
Nathan, David Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN.
SOAS University of London
Navarretta, Costanza Annotating Abstract Pronominal Anaphora in the DAD Project
University of Copenhagen, Centre for Language Technology (CST)
Navas, Eva Subjective Evaluation of an Emotional Speech Database for Basque
Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Nayak, Amiya Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution
University of Ottawa
Nazar, Rogelio A Suite to Compile and Analyze an LSP Corpus
Pompeu Fabra University
Neely, Abby Speaker Recognition: Building the Mixer 4 and 5 Corpora
Linguistic Data Consortium
Neff, Mary Navigating through Dense Annotation Spaces
IBM Research
Negri, Matteo Development and Alignment of a Domain-Specific Ontology for Question Answering
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Fondazione Bruno Kessler - FBK, Trento
Neidle, Carol Benchmark Databases for Video-Based Automatic Sign Language Recognition
Boston University
Nelson, Peter C. From Extracting to Abstracting: Generating Quasi-abstractive Summaries
University of Illinois at Chicago
Németh, Géza Multimodal Spontaneous Expressive Speech Corpus for Hungarian
Budapest University of Technology and Economics
Németh, Péter Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report
Kitchen Budapest
Nemoto, Rena Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification
LIMSI-CNRS
Neri, Federico KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Synthema
Nerima, Luka Generating Bilingual Dictionaries by Transitivity
University of Geneva
Netzer, Yael Tagging a Hebrew Corpus: the Case of Participles
Ben Gurion University of the Negev
Neumann, Günter Unsupervised Relation Extraction From Web Documents
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
DFKI GmbH
Neumann, Heiko The PIT Corpus of German Multi-Party Dialogues
University of Ulm
Newbold, Neil Automatic Document Quality Control
Lexical Ontology Extraction using Terminology Analysis: Automating Video Annotation
University of Surrey
Ney, Hermann Benchmark Databases for Video-Based Automatic Sign Language Recognition
A Multi-Genre SMT System for Arabic to French
The ATIS Sign Language Corpus
A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding
Automatic Evaluation Measures for Statistical Machine Translation System Optimization
RWTH Aachen University
Nguyễn, Thi Minh Huyền Word Segmentation of Vietnamese Texts: a Comparison of Approaches
Vietnam National University of Hanoi
Nguyễn, Cẩm Tú Word Segmentation of Vietnamese Texts: a Comparison of Approaches
Vietnam National University of Hanoi
Nguyen, Ngan Challenges in Pronoun Resolution System for Biomedical Text
The University of Tokyo
Nicholson, Jeremy Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German
University of Melbourne
Nielsen, Rodney D. Annotating Students’ Understanding of Science Concepts
University of Colorado, Boulder
Nilsson, Jens MaltEval: an Evaluation and Visualization Tool for Dependency Parsing
Växjö University
Nioche, Julien The BNC Parsed with RASP4UIMA
Digital Pebble Ltd
Nishino, Takanori In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University
Nishiura, Takanobu Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Ritsumeikan University
Nishizaki, Hiromitsu Test Collections for Spoken Document Retrieval from Lecture Audio Data
Developing Corpus of Japanese Classroom Lecture Speech Contents
University of Yamanashi
Nissim, Malvina The Italian Particle “ne”: Corpus Construction and Analysis
University of Bologna
Nivre, Joakim MaltEval: an Evaluation and Visualization Tool for Dependency Parsing
Växjö University & Uppsala University
Nivre, Joakim Swedish-Turkish Parallel Treebank
Department of Linguistics and Philology, Uppsala University
Noikongka, Daoyos Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Department of Computer Engineering, Kasetsart University, Bangkok
Nøklestad, Anders Glossa: a Multilingual, Multimodal, Configurable User Interface
University of Oslo, Norway
Nordgård, Torbjørn Evaluation of Linguistics-Based Translation
Lingit, Trondheim, Norway
Novák, Václav Inter-sentential Coreferences in Semantic Networks: An Evaluation of Manual Annotation
Charles University, Prague
Ntalampiras, Stavros Audio Database in Support of Potentiel Threat and Crisis Situation Management
University of Patras
Nugues, Pierre Comparing Dependency and Constituent Syntax for Frame-semantic Analysis
Lund University
Nunes, Ana Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis
INOV
Nunes, Filipe LX-Service: Web Services of Language Technology for Portuguese
University of Lisbon
Nürnberger, Andreas A Comparative Study on Language Identification Methods
University of Magdeburg
Nygaard, Lars Evaluation of Linguistics-Based Translation
Glossa: a Multilingual, Multimodal, Configurable User Interface
University of Oslo, Norway

 

O
Obrębski, Tomasz Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach
Adam Mickiewicz University in Poznań
Obradović, Ivan The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
Faculty of Mining and geology, Belgrade
Oda, Kanae Connecting Text Mining and Pathways using the PathText Resource
University of Tokyo
Odriozola, Igor Subjective Evaluation of an Emotional Speech Database for Basque
Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Oepen, Stephan Some Fine Points of Hybrid Natural Language Parsing
Universitetet i Oslo & CSLI Stanford
Oflazer, Kemal BLEU+: a Tool for Fine-Grained BLEU Computation
Sabanci University
Oger, Stanislas Local Methods for On-Demand Out-of-Vocabulary Word Retrieval
University of Avignon
Ogiso, Toshinobu A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation
The National Institute for Japanese Language
Ogórkiewicz, Jerzy JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Laboratory of Speech and Language Technology , Adam Mickiewicz University Foundation, Poznan
Ogren, Philip System Evaluation on a Named Entity Corpus from Clinical Notes
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition
University of Colorado
Ogura, Hideki A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation
The National Institute for Japanese Language
Ohara, Kyoko The Japanese FrameNet Software Tools
Lexicon, Grammar, and Multilinguality in the Japanese FrameNet
Keio University
Ohno, Sumio Automatic Emotional Degree Labeling for Speakers’ Anger Utterance during Natural Japanese Dialog
School of Computer Science, Tokyo University of Technology
Ohta, Kengo Developing Corpus of Japanese Classroom Lecture Speech Contents
Department of Information and Computer Sciences, Toyohashi University of Technology
Okamoto, Jun A Contextual Dynamic Network Model for WSD Using Associative Concept Dictionary
Keio University
Okazaki, Naoaki Building Bilingual Lexicons using Lexical Translation Probabilities via Pivot Languages
Connecting Text Mining and Pathways using the PathText Resource
University of Tokyo
Oliveira, Luís Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis
L2F INESC-ID/IST, Lisboa
Oliver, Antoni Complete and Consistent Annotation of WordNet using the Top Concept Ontology
UOC
Olsen, Sussi Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet
Annotating Abstract Pronominal Anaphora in the DAD Project
University of Copenhagen, Centre for Language Technology (CST)
Omologo, Maurizio WOZ Acoustic Data Collection for Interactive TV
Fondazione Bruno Kessler - FBK, Trento
Ono, Takahiro Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus
Graduate School of Information Science, Nagoya University
Oostdijk, Nelleke From D-Coi to SoNaR: a reference corpus for Dutch
Radboud University Nijmegen
Orasan, Constantin Evaluation of a Cross-lingual Romanian-English Multi-document Summariser
Development and Alignment of a Domain-Specific Ontology for Question Answering
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
Anaphora Resolution Exercise: an Overview
University of Wolverhampton
Ordelman, Roeland From D-Coi to SoNaR: a reference corpus for Dutch
Evaluation of Spoken Document Retrieval for Historic Speech Collections
University of Twente
Ormándi, Róbert Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Ortega-Garcia, Javier BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
ATVS-UAM
Osenova, Petya Language Resources for Semantic Document Annotation and Crosslingual Retrieval
Bulgarian Academy of Sciences
Oshika, Beatrice Applying Automated Metrics to Speech Translation Dialogs
MITRE Corporation
Otani, Naofumi A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Otterbacher, Jahna Modeling Document Dynamics: an Evolutionary Approach
University of Cyprus
Ou, Shiyan Development and Alignment of a Domain-Specific Ontology for Question Answering
University of Wolverhampton
Overbeeke, Chwhynny Towards Formal Interpretation of Semantic Annotation
Tilburg University
Ozaki, Akira In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University
Ozdowska, Sylwia Cross-Corpus Evaluation of Word Alignment
NCLT, Dublin City University

 

P
Pala, Karel Czech MWE Database
Masaryk University Brno
Pala, Kiran Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Language Technologies Research Centre, IIIT, Hyderabad
Palazón, V. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Palm, Günther Emotion Recognition from Speech: Stress Experiment
The PIT Corpus of German Multi-Party Dialogues
Institute of Neural Information Processing
Palmer, Martha Annotating Students’ Understanding of Science Concepts
A Pilot Arabic Propbank
University of Colorado, Boulder
Panckhurst, Rachel Classification Procedures for Software Evaluation
Praxiling UMR 5267 CNRS, Université Paul-Valéry Montpellier 3,
Panunzi, Alessandro Integration of a Multilingual Keyword Extractor in a Document Management System
University of Florence
Papagianopoulou, Aggeliki Condensing Sentences for Subtitle Generation
University of Athens
Pareja-Lora, Antonio Ontology-Based Interface Specifications for a NLP Pipeline Architecture
OEG (UPM) / DSIC (UCM), Madrid
Parker, Robert Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
Linguistic Data Consortium
Parmentier, Yannick Developing a TT-MCTAG for German with an RCG-based Parser
University of Tübingen
Paroubek, Patrick EASY, Evaluation of Parsers of French: what are the Results?
Annotation and analysis of overlapping speech in political interviews
PASSAGE: from French Parser Evaluation to Large Sized Treebank
LIMSI-CNRS
Parvaz, Dan Applying Automated Metrics to Speech Translation Dialogs
Low-Density Language Bootstrapping: the Case of Tajiki Persian
Performance Evaluation of Speech Translation Systems
MITRE Corporation
Pasca, Marius Low-Complexity Heuristics for Deriving Fine-Grained Classes of Named Entities from Web Textual Data
Google Inc.
Passarotti, Marco The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin
Università Cattolica del Sacro Cuore, Milan
Passonneau, Rebecca MASC: the Manually Annotated Sub-Corpus of American English
Relation between Agreement Measures on Human Labeling and Machine Learning Performance: Results from an Art History Domain
Columbia University
Patry, Alexandre MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices
Université de Montréal
Patsis, Yorgos A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments
Vrije Universiteit Brussel, dept. ETRO-DSSP, Brussels
Paulo Pardal, Joana Building a Golden Collection of Parallel Multi-Language Word Alignment
L2F INESC-ID/IST, Lisboa
Paulo, Sérgio Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis
L2F INESC-ID/IST, Lisboa
Paulsson, Niklas LILA: Cellular Telephone Speech Databases from Asia
Quick Rich Transcriptions of Arabic Broadcast News Speech Data
ELDA
Paulussen, Hans Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort
Katholieke Universiteit Leuven
Pavaputanont Na Mahasarakham, Puwarat Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Department of Computer Engineering, Kasetsart University, Bangkok
Pazienza, Maria Teresa JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages
A Web Browser Extension for Growing-up Ontological Knowledge from Traditional Web Content
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations
Clustering of Terms from Translation Dictionaries and Synonyms Lists to Automatically Build more Structured Linguistic Resources
University of Rome Tor Vergata
Pechsiri, Chaveevan Building an Annotated Corpus for Text Summarization and Question Answering
Dhurakijpundit University
Pecina, Pavel Validating the Quality of Full Morphological Annotation
Charles University, Prague
Pedersen, Bolette Sandford Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet
University of Copenhagen, Centre for Language Technology (CST)
Peirsman, Yves The Construction and Evaluation of Word Space Models
Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms.
University of Leuven
Pekar, Viktor Development and Alignment of a Domain-Specific Ontology for Question Answering
University of Wolverhampton
Pellegrini, Thomas Developments of “Lëtzebuergesch” Resources for Automatic Speech Processing and Linguistic Studies
LIMSI-CNRS
Pennacchiotti, Marco FATE: a FrameNet-Annotated Corpus for Textual Entailment
Towards a Vector Space Model for FrameNet-like Resources
A Web Browser Extension for Growing-up Ontological Knowledge from Traditional Web Content
University of Saarland
Perboni, Sara The Italian Particle “ne”: Corpus Construction and Analysis
University of Bologna
Perera, Nadine CLIoS: Cross-lingual Induction of Speech Recognition Grammars
University of Saarland
Pérez, Javier Corpus and Voices for Catalan Speech Synthesis
UPC-TALP
Peris, G. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Petasis, Georgios BOEMIE Ontology-Based Text Annotation Tool
N.C.S.R. Demokritos
Peters, Carol From Research to Application in Multilingual Information Access: the Contribution of Evaluation
Istituto di Scienza e Tecnologie dell’Informazione
Peters, Wim Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection
University of Sheffield
Peterson, Erik Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Accenture Technology Labs
Peterson, Kay Translation Adequacy and Preference Evaluation Tool (TAP-ET)
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction
National Institute of Standards and Technology
Petrik, Stefan The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
Graz University of Technology
Pettersson, Eva Swedish-Turkish Parallel Treebank
Department of Linguistics and Philology, Uppsala University
Petukhova, Volha LIRICS Semantic Role Annotation: Design and Evaluation of a Set of Data Categories
Evaluating Dialogue Act Tagging with Naive and Expert Annotators
Tilburg University
Petzell, Malin Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)
Gothenburg University
Pfeil, Martin Emotion Recognition from Speech: Stress Experiment
Institute of Information Technology
Phillips, Jon Applying Automated Metrics to Speech Translation Dialogs
Performance Evaluation of Speech Translation Systems
MITRE Corporation
Pianta, Emanuele Frame Information Transfer from English to Italian
L-ISA: Learning Domain Specific Isa-Relations from the Web
The TextPro Tool Suite
Fondazione Bruno Kessler - FBK, Trento
Piao, Scott Clustering Related Terms with Definitions
The University of Manchester
Piasecki, Maciej Corpus-based Semantic Relatedness for the Construction of Polish WordNet
Institute of Applied Informatics, Wrocław University of Technology
Picca, Davide Supersense Tagger for Italian
LMM: an OWL-DL MetaModel to Represent Heterogeneous Lexical Knowledge
University of Lausanne
Picchi, Eugenio Semantic Press
ILC-CNR, Pisa
Pinho, Roberto Identifying Strategic Information from Scientific Articles through Sentence Classification
University of São Paulo
Pinkal, Manfred CLIoS: Cross-lingual Induction of Speech Recognition Grammars
University of Saarland
Pinto, Hugo Information Extraction Tools and Methods for Understanding Dialogue in a Companion
University of Sheffield
Piperidis, Stelios Condensing Sentences for Subtitle Generation
Foundation of a Component-based Flexible Registry for Language Resources and Technology
Building a Greek corpus for Textual Entailment
ILSP - Athens
Pirrelli, Vito Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora
ILC-CNR, Pisa
Pitel, Guillaume Semi-automatic Building Method for a Multidimensional Affect Dictionary for a New Language
CEA LIST
Pitz, Michael CLIoS: Cross-lingual Induction of Speech Recognition Grammars
Department of Human Machine Interaction, BMW Group Research and Technology, Munich
Plank, Barbara Subdomain Sensitive Statistical Parsing using Raw Corpora
University of Groningen
Pociello, Eli Analysis and Performance of Morphological Query Expansion and Language-Filtering Words on Basque Web Searching
WNTERM: Enriching the MCR with a Terminological Dictionary
Elhuyar Fundazioa, R&D
Podile, Kholisa Experimental Fast-Tracking of Morphological Analysers for Nguni Languages
University of South Africa
Poesio, Massimo BART: A modular toolkit for coreference resolution
Anaphoric Annotation in the ARRAU Corpus
ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation
A Corpus for Cross-Document Co-reference
University of Essex
Poibeau, Thierry LexSchem: a Large Subcategorization Lexicon for French Verbs
Do we Still Need Gold Standards for Evaluation?
LIPN-CNRS
Polk, John An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Pollák, Petr Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database.
Czech Technical University in Prague
Pomikálek, Jan Detecting Co-Derivative Documents in Large Text Collections
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
Faculty of Informatics, Masaryk University
Ponzetto, Simone BART: A modular toolkit for coreference resolution
European Media Laboratory GmbH, Heidelberg
Popescu, Adrian A Conceptual Approach to Web Image Retrieval
CEA LIST
Popescu, Marius Authorship Identification of Romanian Texts with Controversial Paternity
University of Bucharest
Popescu-Belis, Andrei Improving Contextual Quality Models for MT Evaluation Based on Evaluators’ Feedback
Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis
IDIAP Research Institute
Poprat, Michael Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab
Jena University, JULIE Lab
Portillo, Guillermo Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases
Respiratory Department. Unidad de Trastornos Respiratorios del Sueño. Hospital Clínico Universitario Málaga
Potamitis, Ilyas Audio Database in Support of Potentiel Threat and Crisis Situation Management
Technological Educational Institute of Crete
Potrich, Alessandra L-ISA: Learning Domain Specific Isa-Relations from the Web
Fondazione Bruno Kessler - FBK, Trento
Povlsen, Claus Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet
University of Copenhagen, Centre for Language Technology (CST)
Power, Richard Deriving Rhetorical Complexity Data from the RST-DT Corpus
The Open University
Powley, Brett The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
Macquarie University
Prasad, Rashmi The Penn Discourse TreeBank 2.0.
University of Pennsylvania
Prat, F. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Pretorius, Laurette Experimental Fast-Tracking of Morphological Analysers for Nguni Languages
University of South Africa
Prévot, Laurent Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies
Université de Toulouse
Priestley, Joel Glossa: a Multilingual, Multimodal, Configurable User Interface
University of Oslo, Norway
Prince, Cambell Lexicon Schemas and Related Data Models: when Standards Meet Users
SIL
Prince, Violaine Building a Bilingual Representation of the Roget Thesaurus for French to English Machine Translation
University of Montpellier 2 and LIRMM-CNRS
Probst, Katharina Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages
Language Technologies Institute, Carnegie Mellon University
Prokopidis, Prokopis Condensing Sentences for Subtitle Generation
ILSP - Athens
Przepiórkowski, Adam Towards the National Corpus of Polish
♠ Demo: An Open Source Tool for Partial Parsing and Morphosyntactic Disambiguation
Definition Extraction Using a Sequential Combination of Baseline Grammars and Machine Learning Classifiers
Institute of Computer Science at the Polish Academy of Sciences and Warsaw University
Przybocki, Mark Translation Adequacy and Preference Evaluation Tool (TAP-ET)
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction
National Institute of Standards and Technology
Puche, Javier Tagging Spanish Texts: the Problem of Problem of “SE”
Universidad Politécnica de Madrid
Purandare, Amruta Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems
University of Pittsburgh
Purpura, Stephen The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
Cornell University
Puscasu, Georgiana Annotation of WordNet Verbs with TimeML Event Classes
University of Wolverhampton
Pustylnikov, Olga A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data
University of Bielefeld

 

Q
Qu, Wei-guang Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System
Institute of Computational Linguistics, Peking University
Qu, Weiruo Targeting Chinese Nominal Compounds in Corpora
CIS, University of Munich (LMU)
Quaresma, Paulo A Framework for Multilingual Ontology Mapping
University of Evora
Quasthoff, Uwe UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging
ASV Toolbox: a Modular Collection of Language Exploration Tools
University of Leipzig
Quimby, Rob SpatialML: Annotation Scheme, Corpora, and Tools
MITRE Corporation
Quixal, Martí User-Centred Design of Error Correction Tools
Pompeu Fabra University
Quochi, Valeria A lexicon for biology and bioinformatics: the BOOTStrep experience.
Learning properties of Noun Phrases: from data to functions
ILC-CNR, Pisa

 

R
R. Costa-jussà, Marta Using Reordering in Statistical Machine Translation based on Alignment Block Classification
UPC-TALP
R. Fonollosa, José A. Using Reordering in Statistical Machine Translation based on Alignment Block Classification
UPC-TALP
Rääbis, Andriela From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian
University of Tartu
Radev, Dragomir Modeling Document Dynamics: an Evolutionary Approach
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
University of Michigan
Raffaelli, Remo KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Synthema
Ragheb, Ahmed A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields
RDI
Rajbhandari, Sachit Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Food and Agriculture Organization of the United Nations, Rome
Rajendran, S. A Common Parts-of-Speech Tagset Framework for Indian Languages
Tamil University
Rambow, Owen Using Semantically Annotated Corpora to Build Collocation Resources
Improving NER in Arabic Using a Morphological Tagger
Columbia University
Ramos, Daniel BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
ATVS-UAM
Ramos, José Ángel Tagging Spanish Texts: the Problem of Problem of “SE”
Universidad Politécnica de Madrid
Ramos-Garijo, R. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
ES
Răschip, Marius How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach
Al.I.Cuza University of Iasi
Rashwan, Mohsen A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields
RDI
Rawding, Matt An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
Cornell University
Raymond, Christian Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues
A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding
LIA - University of Avignon
Rayner, Manny Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Building Mobile Spoken Dialogue Applications Using Regulus
Université de Genève/ETI/TIM
Razmara, Majid Answering List Questions using Co-occurrence and Clustering
Concordia University
Recasens, Marta AnCora: Multilevel Annotated Corpora for Catalan and Spanish
University of Barcelona
Reed, Chris Language Resources for Studying Argument
University of Dundee
Reed, Marian The Linguistic Data Consortium Member Survey: Purpose, Execution and Results
Linguistic Data Consortium
Refice, Mario Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different
Department of Electrical Engineering and Electronics, Polytechnics of Bari
Rehbein, Ines How to Compare Treebanks
Dublin City University
Rehm, Georg Ontology-Based XQuery’ing of XML-Encoded Language Resources on Multiple Annotation Layers
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Ren, Han A Research on Automatic Chinese Catchword Extraction
Wuhan University
Reynaert, Martin From D-Coi to SoNaR: a reference corpus for Dutch
All, and only, the Errors: more Complete and Consistent Spelling and OCR-Error Correction Evaluation
Tilburg University
Rhinow, Steffen Emotion Recognition from Speech: Stress Experiment
Institute of Information Technology
Riccardi, Giuseppe Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues
Department of Information Engineering and Computer Science, University of Trento
Richer, Justin SpatialML: Annotation Scheme, Corpora, and Tools
MITRE Corporation
Richter, Matthias Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze
University of Leipzig
Rieser, Verena Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation
Edinburgh University
Rigau, German KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Complete and Consistent Annotation of WordNet using the Top Concept Ontology
WNTERM: Enriching the MCR with a Terminological Dictionary
University of the Basque Country
Rilliard, Albert Multimodal Spontaneous Expressive Speech Corpus for Hungarian
LIMSI-CNRS
Rinaldi, Fabio Dependency-Based Relation Mining for Biomedical Literature
University of Zurich
Ringersma, Jacquelijn Ensuring Semantic Interoperability on Lexical Resources
Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems
Max Planck Institute for Psycholinguistics
Ringger, Eric Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
Ringlstetter, Christoph Targeting Chinese Nominal Compounds in Corpora
AICML, University of Alberta
Ritz, Julia Annotation of Information Structure: an Evaluation across different Types of Texts
University of Potsdam
Rizov, Borislav Chooser: a Multi-Task Annotation Tool
Hydra: a Modal Logic Tool for Wordnet Development, Validation and Exploration
Bulgarian Academy of Sciences
Robaldo, Livio The Penn Discourse TreeBank 2.0.
University of Torino
Robba, Isabelle EASY, Evaluation of Parsers of French: what are the Results?
LIMSI-CNRS
Roberts, Angus Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation
ANNALIST - ANNotation ALIgnment and Scoring Tool
University of Sheffield
Roberts, Kirk Scaling Answer Type Detection to Large Hierarchies
Language Computer Corporation
Robinson, Susan What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture
USC Institute for Creative Technologies
Rodet, Xavier Automatic Phoneme Segmentation with Relaxed Textual Constraints
IrcamCorpusTools: an Extensible Platform for Spoken Corpora Exploitation
IRCAM
Rodet, Xavier IrcamCorpusTools: an Extensible Platform for Spoken Corpora Exploitation
IRCAM
Rodríguez, Juan José Methodology for Evaluating the Usability of User Interfaces in Mobile Services
Telefónica Investigación y Desarrollo
Rodriguez, Kepa Joseba Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues
Piedmont Consortium for Information Systems
Rodríquez, Horacio Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Universitat Politécnica de Catalunya
Roesner, Dietmar On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System
Otto-von-Guericke-University Magdeburg, Department of Knowledge Processing and Language Engineering
Rojc, Matej Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework
University of Maribor
Romanelli, Massimo Semiotic-based Ontology Evaluation Tool (S-OntoEval)
DFKI GmbH
Romary, Laurent Foundation of a Component-based Flexible Registry for Language Resources and Technology
Max Planck Institute for Psycholinguistics
Romportl, Jan Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis
University of West Bohemia
Rooth, Mats Induction of Treebank-Aligned Lexical Resources
Cornell University
Rose, Travis An Exchange Format for Multimodal Annotations
Virginia Tech
Rosell, Magnus Revealing Relations between Open and Closed Answers in Questionnaires through Text Clustering Evaluation
KTH CSC
Rosner, Michael ODL: an Object Description Language for Lexical Information
University of Malta
Rosset, Sophie Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System
LIMSI-CNRS
Rossignol, Mathias Word Segmentation of Vietnamese Texts: a Comparison of Approaches
MICA, Hanoi University of Technology
Rosso, Paolo Geo-WordNet: Automatic Georeferencing of WordNet
NLE Lab, Universidad Politécnica de Valencia
Roth, Michael Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information
University of Saarland
Roth, Ryan Identification of Naturally Occurring Numerical Expressions in Arabic
Columbia University
Rousselot, François A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
INSA Strasbourg
Roventini, Adriana Mapping Events and Abstract Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet
ILC-CNR, Pisa
Rowe, Glenn Language Resources for Studying Argument
University of Dundee
Ruangrajitpakorn, Taneth OpenCCG Workbench and Visualization Tool
National Electronics and Computer Technology Center
Rubenstein, Alan An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Ruimy, Nilda Mapping Events and Abstract Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet
Simple-Clips ongoing research: more information with less data by implementing inheritance
More Semantic Links in the SIMPLE-CLIPS Database
ILC-CNR, Pisa
Rupp, C.J. Language Resources and Chemical Informatics
Computer Laboratory, University of Cambridge
Ruppenhofer, Josef Finding the Sources and Targets of Subjective Expressions
University of Pittsburgh
Russ, Thomas A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture
USC’s Information Sciences Institute
Russel, Albert Exploring and Enriching a Language Resource Archive via the Web
Max Planck Institute for Psycholinguistics
Rychlý, Pavel Detecting Co-Derivative Documents in Large Text Collections
Faculty of Informatics, Masaryk University

 

S
Sabin, Roberta Creating and Using a Correlated Corpus to Glean Communicative Commonalities
Loyola College in Maryland
Sadamitsu, Kugatsu Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information
University of Tsukuba
Saenz Perez, Fernando Conceptual Modeling of Ontology-based Linguistic Resources with a Focus on Semantic Relations
Universidad Complutense de Madrid
Sætre, Rune Connecting Text Mining and Pathways using the PathText Resource
University of Tokyo
Sagae, Kenji GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain
University of Tokyo
Sagara, Kaoru Relationships between Nursing Converstaions and Activities
Seinan Jo Gakuen University
Saggion, Horacio A Framework for Identity Resolution and Merging for Multi-source Information Extraction
University of Sheffield
Saint-Dizier, Patrick Investigating the Structure of Procedural Texts for Answering How-to Questions
IRIT-CNRS
Sainz, Iñaki Subjective Evaluation of an Emotional Speech Database for Basque
Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Saito, Hiroaki The Japanese FrameNet Software Tools
Keio University
Sakai, Satoshi Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings
Tokyo Denki University
Sam, Sethserey First Broadcast News Transcription System for Khmer Language
Laboratoire d’Informatique de Grenoble (LIG)
Samuel, Kenneth An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Samy, Doaa An Empirical Approach to a Preliminary Successful Identification and Resolution of Temporal Expressions in Spanish News Corpora
Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English)
Universidad Autónoma de Madrid & Cairo University
Sánchez, Joan Andreu Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars
Instituto Tecnológico de Informática, Universidad Politécnica de Valencia
Sanchez, Jon Subjective Evaluation of an Emotional Speech Database for Basque
Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Sánchez, Sebastián Methodology for Evaluating the Usability of User Interfaces in Mobile Services
Universidad de Alcalá
Sanchis, Emilio Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques
DSIC - UPV
Sanchis, Germán Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars
Instituto Tecnológico de Informática, Universidad Politécnica de Valencia
Sanders, Eric The IFADV Corpus: a Free Dialog Video Corpus
LILA: Cellular Telephone Speech Databases from Asia
Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus.
SPEX/CLST Radboud University Nijmegen
Sanders, Greg Performance Evaluation of Speech Translation Systems
National Institute of Standards and Technology
Sanders, Gregory Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA’s TRANSTAC Program
Applying Automated Metrics to Speech Translation Dialogs
National Institute of Standards and Technology
Sanderson, Mark An Evaluation Resource for Geographic Information Retrieval
From Research to Application in Multilingual Information Access: the Contribution of Evaluation
University of Sheffield
Sankaran, Baskaran A Common Parts-of-Speech Tagset Framework for Indian Languages
Microsoft Research Lab India
Santaholma, Marianne A Knowledge-Modeling Approach for Multilingual Regulus Lexica
Geneva University
Santini, Marina Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
DSV/KTH-Stockholm University
Santos, Diana What’s in a Colour? Studying and Contrasting Colours with COMPARA
Portuguese-English Word Alignment: some Experiments
An Evaluation Resource for Geographic Information Retrieval
SINTEF ICT
Santos, Fabíola CORP-ORAL: Spontaneous Speech Corpus for European Portuguese
ILTEC
Saratxaga, Ibon Subjective Evaluation of an Emotional Speech Database for Basque
Text Independent Speaker Identification in Multilingual Environments
University of the Basque Country
Saravanan, K. A Common Parts-of-Speech Tagset Framework for Indian Languages
Microsoft Research Lab India
Sasaki, Minoru Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering
Ping-pong Document Clustering using NMF and Linkage-Based Refinement
Spectral Clustering for a Large Data Set by Reducing the Similarity Matrix Size
Ibaraki University
Sassolini, Eva Semantic Press
ILC-CNR, Pisa
Satayamas, Vee Building an Annotated Corpus for Text Summarization and Question Answering
Kasetsart University
Sato, Hiroaki New Functions of FrameSQL for Multilingual FrameNets
Senshu University
Sato, Satoshi Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus
Nagoya University
Satta, Giorgio Comparing Italian parsers on a common Treebank: the EVALITA experience
Università di Padova
Savino, Michelina Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different
Department of Psychology, University of Bari
Savkov, Aleksandar The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Savova, Guergana System Evaluation on a Named Entity Corpus from Clinical Notes
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition
Mayo Clinic College of Medicine
Scerri, Simon Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication
DERI/NUIG
Schäfer, Ulrich Extracting and Querying Relations in Scientific Papers on Language Technology
DFKI GmbH
Scheible, Silke Annotating Superlatives
University of Edinburgh
Scherer, Stefan A Flexible Wizard of Oz Environment for Rapid Prototyping
Emotion Recognition from Speech: Stress Experiment
The PIT Corpus of German Multi-Party Dialogues
Institute of Neural Information Processing
Scheuermann, Gerik Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze
University of Leipzig
Schiel, Florian ALC: Alcohol Language Corpus
Talking and Looking: the SmartWeb Multimodal Interaction Corpus
F0 of Adolescent Speakers - First Results for the German Ph@ttSessionz Database
BAS Bavarian Archive for Speech Signals
Schlenoff, Craig Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA’s TRANSTAC Program
Applying Automated Metrics to Speech Translation Dialogs
Performance Evaluation of Speech Translation Systems
National Institute of Standards and Technology
Schluter, Natalie Treebank-Based Acquisition of LFG Parsing Resources for French
Dublin City University
Schmidt, Paul Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
IAI Saarbrücken
Schmidt, Thomas An Exchange Format for Multimodal Annotations
University of Hamburg
Schneider, Gerold Dependency-Based Relation Mining for Biomedical Literature
University of Zurich
Schonefeld, Oliver The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Schossland, Sidney Evaluating Summaries Automatically - A system Proposal
Department of Foreign Trade, University of Joinville
Schrader, Bettina Identification of Comparable Argument-Head Relations in Parallel Corpora
University of Potsdam
Schroeder, Elizabeth An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Schuler, Karin System Evaluation on a Named Entity Corpus from Clinical Notes
Mayo Clinic College of Medicine
Schulte im Walde, Sabine Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
IMS, University of Stuttgart
Schultz, Tanja NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls
Language Technologies Institute, Carnegie Mellon University
Schütze, Hinrich An Inverted Index for Storing and Retrieving Grammatical Dependencies
A Question Answering System for German. Experiments with Morphological Linguistic Resources
IMS, University of Stuttgart
Schuurman, Ineke Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
From D-Coi to SoNaR: a reference corpus for Dutch
Spatiotemporal Annotation Using MiniSTEx: how to deal with Alternative, Foreign, Vague and/or Obsolete Names?
Katholieke Universiteit Leuven
Schwarten, Lasse A Semantic Memory for Incremental Ontology Population
University of Bremen
Schwartz, Reva Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
USSS
Schwenker, Friedhelm Emotion Recognition from Speech: Stress Experiment
Institute of Neural Information Processing
Scivetti, Laura Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different
Department of Psychology, University of Bari
Sclaroff, Stan Benchmark Databases for Video-Based Automatic Sign Language Recognition
Boston University
Scott, Donia Can we Evaluate the Quality of Generated Text?
The Open University
Sebastian, Shaji Similar Term Discovery using Web Search
Yahoo, Inc.
Sebastiani, Fabrizio Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank
ISTI - CNR
Sébillot, Pascale On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems
Morphosyntactic Resources for Automatic Speech Recognition
IRISA / Universite Rennes 1
Segarra, Encarna Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques
DSIC - UPV
Segers, Roxane Adjectives in the Dutch Semantic Lexical Database CORNETTO
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database
Vrije Universiteit Amsterdam
Sekine, Satoshi Extended Named Entity Ontology with Attribute Information
Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information
New York University
Sellberg, Linus Using Random Indexing to improve Singular Value Decomposition for Latent Semantic Analysis
Department of Computer and Information Science
Seng, Sopheap First Broadcast News Transcription System for Khmer Language
Laboratoire d’Informatique de Grenoble (LIG)
Seppi, Kevin Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Brigham Young University
Shamsfard, Mehrnoush Towards Semi Automatic Construction of a Lexical Ontology for Persian
A Hybrid Morphology-Based POS Tagger for Persian
NLP Research Lab,Shahid Beheshti university
Sharma, Dipti Misra Developing Verb Frames for Hindi
Language Technologies Research Centre, IIIT, Hyderabad
Sharoff, Serge Cleaneval: a Competition for Cleaning Web Pages
Generalising Lexical Translation Strategies for MT Using Comparable Corpora
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages
Designing and Evaluating a Russian Tagset
University of Leeds
Shemanaeva, Olga Yu. Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives
RSUH, Moscow
Shen, Wade Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
MIT-LL
Shinnou, Hiroyuki Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering
Ping-pong Document Clustering using NMF and Linkage-Based Refinement
Spectral Clustering for a Large Data Set by Reducing the Similarity Matrix Size
Ibaraki University
Shinzato, Keiji A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
Kyoto University
Shirai, Kiyoaki Adapting International Standard for Asian Language Technologies
JAIST
Shockley, Darla Magdalena SCARE: a Situated Corpus with Annotated Referring Expressions
The Ohio State University
Siddharthan, Advaith Language Resources and Chemical Informatics
Computer Laboratory, University of Cambridge
Silberer, Carina Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration
Department of Computational Linguistics, Heidelberg University
Silliman, Scott Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems
University of Pittsburgh
Silva, João LX-Service: Web Services of Language Technology for Portuguese
University of Lisbon
Silva, Maria do Rosário What’s in a Colour? Studying and Contrasting Colours with COMPARA
FCCN
Silveira, Sara LX-Service: Web Services of Language Technology for Portuguese
University of Lisbon
Sima’an, Khalil Subdomain Sensitive Statistical Parsing using Raw Corpora
University of Amsterdam (UvA)
Simi, Maria Comparing Italian parsers on a common Treebank: the EVALITA experience
Università di Pisa
Simões, Alberto Portuguese-English Word Alignment: some Experiments
Universidade do Minho
Simov, Kiril Language Resources for Semantic Document Annotation and Crosslingual Retrieval
Bulgarian Academy of Sciences
Singh, Anil Kumar Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Language Technologies Research Centre, IIIT, Hyderabad
Singh, Suchinder OpenCCG Workbench and Visualization Tool
National Electronics and Computer Technology Center
Sini, Margherita Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Food and Agriculture Organization of the United Nations, Rome
Sitbon, Laurianne Evaluating Robustness Of A QA System Through A Corpus Of Real-Life Questions
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations
LIA - University of Avignon
Sjöbergh, Jonas A Multi-Lingual Dictionary of Dirty Words
What is poorly Said is a Little Funny
Hokkaido University
Skadina, Inguna Dictionary of Multiword Expressions for Translation into highly Inflected Languages
Tilde
Skadins, Raivis Dictionary of Multiword Expressions for Translation into highly Inflected Languages
Tilde
Skarnitzl, Radek Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database.
Charles University, Prague
Śledziński, Daniel JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Institute of Linguistics, Adam Mickiewicz University, Poznań
Sloetjes, Han Annotation by Category: ELAN and ISO DCR
An Exchange Format for Multimodal Annotations
Max Planck Institute for Psycholinguistics
Smaïli, Kamel Phrase-Based Machine Translation based on Simulated Annealing
LORIA, University of Nancy
Šmerk, Pavel Czech MWE Database
Masaryk University Brno
Smith, Jason BART: A modular toolkit for coreference resolution
Johns Hopkins University, Center for Language and Speech Processing
Smrž, Otakar Building the Valency Lexicon of Arabic Verbs
Charles University, Prague
Smrz, Pavel KnoFusius: a New Knowledge Fusion System for Interpretation of Gene Expression Data
Brno University of Technology
Sobha, L. A Common Parts-of-Speech Tagset Framework for Indian Languages
AU-KBC Research Centre
Soehn, Jan-Philipp A Multilingual Database of Polarity Items
University of Tuebingen
Sofianopoulos, Sokratis Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
ILSP - Athens
Sogrin, Mikhail Linguistically Light Lexical Extensions for Ontologies
IBM LanguageWare
Somasundaran, Swapna Finding the Sources and Targets of Subjective Expressions
University of Pittsburgh
Sone, Takaaki The Japanese FrameNet Software Tools
Keio University
Song, Zhiyi Entity Translation and Alignment in the ACE-07 ET Task
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction
Linguistic Data Consortium & University of Pennsylvania
Sonntag, Daniel Semiotic-based Ontology Evaluation Tool (S-OntoEval)
DFKI GmbH
Soria, Claudia Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy
Adapting International Standard for Asian Language Technologies
UFRA: a UIMA-based Approach to Federated Language Resource Architecture
ILC-CNR, Pisa
Sornlertlamvanich, Virach Adapting International Standard for Asian Language Technologies
A Dependency Parser for Thai
TCL/NICT
Soroa, Aitor Spelling Correction: from Two-Level Morphology to Open Source
Using the Multilingual Central Repository for Graph-Based Word Sense Disambiguation
University of the Basque Country
Speelman, Dirk Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms.
University of Leuven
Speranza, Manuela Evaluation of Natural Language Tools for Italian: EVALITA 2007
Fondazione Bruno Kessler - FBK, Trento
Spohr, Dennis A General Methodology for Mapping EuroWordNets to the Suggested Upper Merged Ontology
Institut fuer Linguistik/Romanistik, Universitaet Stuttgart
Spousta, Miroslav Validating the Quality of Full Morphological Annotation
Charles University, Prague
Spoustová, Drahomíra johanka Validating the Quality of Full Morphological Annotation
Charles University, Prague
Spracklin, Leanne Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution
University of Ottawa
Spreyer, Kathrin Identification of Comparable Argument-Head Relations in Parallel Corpora
University of Potsdam
Sprugnoli, Rachele Evaluation of Natural Language Tools for Italian: EVALITA 2007
CELCT
Spurk, Christian Development and Alignment of a Domain-Specific Ontology for Question Answering
DFKI GmbH
Spyns, Peter The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond
Nederlandse Taalunie
Spyropoulos, Constantine BOEMIE Ontology-Based Text Annotation Tool
N.C.S.R. Demokritos
Stanković, Ranka The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
Faculty of Mining and geology, Belgrade
Stark, Matthias The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Starlander, Marianne Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Université de Genève/ETI/TIM
Ştefanescu, Dan A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
RACAI’s Linguistic Web Services
RACAI, Romanian Academy, Bucharest
Stein, Daniel The ATIS Sign Language Corpus
RWTH Aachen University
Stellato, Armando JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages
A Web Browser Extension for Growing-up Ontological Knowledge from Traditional Web Content
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations
Clustering of Terms from Translation Dictionaries and Synonyms Lists to Automatically Build more Structured Linguistic Resources
University of Rome Tor Vergata
Steves, Michelle Performance Evaluation of Speech Translation Systems
National Institute of Standards and Technology
Stiefelhagen, Rainer Data Collection for the CHIL CLEAR 2007 Evaluation Campaign
UKA-ISL
Stock, Oliviero Resources for Persuasion
Valentino: A Tool for Valence Shifting of Natural Language Texts
Fondazione Bruno Kessler - FBK, Trento
Stoia, Laura SCARE: a Situated Corpus with Annotated Referring Expressions
The Ohio State University
Stoyanov, Veselin Annotating Topics of Opinions
Cornell University
Strand, Ole Morten RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
Norwegian Defence Research Establishment
Strandson, Krista From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian
University of Tartu
Strapparava, Carlo Resources for Persuasion
Valentino: A Tool for Valence Shifting of Natural Language Texts
Fondazione Bruno Kessler - FBK, Trento
Strassel, Stephanie Entity Translation and Alignment in the ACE-07 ET Task
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction
Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing
New Resources for Document Classification, Analysis and Translation Technologies
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition
Quick Rich Transcriptions of Arabic Broadcast News Speech Data
Linguistic Data Consortium & University of Pennsylvania
Strauß, Petra-Maria A Flexible Wizard of Oz Environment for Rapid Prototyping
The PIT Corpus of German Multi-Party Dialogues
Institute of Information Technology
Strömqvist, Sven Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN.
Lund University
Strube, Michael Acquiring a Taxonomy from the German Wikipedia
Parameters for Topic Boundary Detection in Multi-Party Dialogues
A Three-stage Disfluency Classifier for Multi Party Dialogues
Knowledge Sources for Bridging Resolution in Multi-Party Dialog
European Media Laboratory GmbH, Heidelberg
Stubbe, Andrea Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
Conject AG
Stührenberg, Maik Influence of Text Type and Text Length on Anaphoric Annotation
University of Bielefeld
Suárez-Figueroa, Mari Carmen Towards a Glossary of Activities in the Ontology Engineering Field
Ontology Engineering Group (Universidad Politécnica de Madrid)
Subbarao, K.V. A Common Parts-of-Speech Tagset Framework for Indian Languages
Jawaharlal Nehru University
Sugimoto, Shigeo Temporal Aspects of Terminology for Automatic Term Recognition: Case Study on Women’s Studies Terms
Graduate School of Library, Information and Media Studies, University of Tsukuba
Suktarachan, Mukda Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Department of Computer Engineering, Kasetsart University, Bangkok
Sukvari, Thana Building an Annotated Corpus for Text Summarization and Question Answering
Sriprathum University
Sumida, Asuka Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia
Japan Advanced Institute of Science and Technology
Sun, Haotian ANNALIST - ANNotation ALIgnment and Scoring Tool
Department of Computer Science, University of Sheffield
Supnithi, Thepchai OpenCCG Workbench and Visualization Tool
National Electronics and Computer Technology Center
Surana, Harshit Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language
Language Technologies Research Centre, IIIT, Hyderabad
Švec, Jan Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations
University of West Bohemia
Svendsen, Torbjørn RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
Norwegian University of Science and Technology
Svoboda, Lukáš Czech MWE Database
Masaryk University Brno
Swift, Mary Production in a Multimodal Corpus: how Speakers Communicate Complex Actions
University of Rochester
Symonenko, Svetlana Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
Nitol, LLC
Szabó, János Multimodal Spontaneous Expressive Speech Corpus for Hungarian
Budapest University of Technology and Economics
Szarvas, György Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Szauter, Dóra Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Szpakowicz, Stan Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors
Corpus-based Semantic Relatedness for the Construction of Polish WordNet
University of Ottawa

 

T
T. Toledano, Doroteo STC-TIMIT: Generation of a Single-channel Telephone Corpus
BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition
Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories
ATVS Lab, Universidad Autonoma de Madrid
Tablan, Valentin A Text-based Query Interface to OWL Ontologies
University of Sheffield
Tadic, Marko Rule-Based Chunker for Croatian
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences
Tagami, Hayato The Japanese FrameNet Software Tools
Keio University
Takebayashi, Yoichi A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions
Shizuoka University
Takeda, Kazuya Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
In-car Speech Data Collection along with Various Multimodal Signals
Nagoya University
Takenobu, Tokunaga Adapting International Standard for Asian Language Technologies
Tokyo Institute of Technology
Takiguchi, Tetsuya Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Kobe University
Tamamura, Shin’ichi Automatic Construction of a Japanese-Chinese Dictionary via English
Shizuoka University
Tamarit, Vicent Evaluation of Different Segmentation Techniques for Dialogue Turns
DSIC - UPV
Tamburini, Fabio Evaluation of Natural Language Tools for Italian: EVALITA 2007
DSLO - University of Bologna
Tamura, Noriyuki Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings
Tokyo Denki University
Tamura, Satoshi Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Gifu University
Tan, Yee Fan The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
National University of Singapore
Tannier, Xavier Evaluation Metrics for Automatic Temporal Annotation of Texts
LIMSI-CNRS
Tantug, A. Cuneyd BLEU+: a Tool for Fine-Grained BLEU Computation
Istanbul Technical University
Tapias, Daniel Methodology for Evaluating the Usability of User Interfaces in Mobile Services
Telefónica España
Tateisi, Yuka GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain
Kogakuin University
Taulé, Mariona AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora
AnCora: Multilevel Annotated Corpora for Catalan and Spanish
CLiC-University of Barcelona
Tavernier, Jean Holy Moses! Leveraging Existing Tools and Resources for Entity Translation
CACI International Inc.
Tavosanis, Mirko Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
Universita di Pisa
Tejedor, Javier STC-TIMIT: Generation of a Single-channel Telephone Corpus
HCTLab, Universidad Autonoma de Madrid
Ten Bosch, Louis Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization
Radboud University Nijmegen
Teng, Chun-Yuan Event Detection and Summarization in Weblogs with Temporal Collocations
Department of Computer Science and Information Engineering, National Taiwan University
Tennent, Paul Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis
University of Nottingham
Terada, Akira Extraction of Informative Expressions from Domain-specific Documents
Application of Resource-based Machine Translation to Real Business Scenes
Japan Airlines
Tescon, Maurizio KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
CNR-IIT
Teufel, Simone Language Resources and Chemical Informatics
Computer Laboratory, University of Cambridge
Thamvijit, Dussadee Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Department of Computer Engineering, Kasetsart University, Bangkok
Thanopoulos, Aristomenis Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text
Wire Communications Laboratory, Department of Electrical and Computer Engineering, University of Patras
Theodorakos, Aris BOEMIE Ontology-Based Text Annotation Tool
N.C.S.R. Demokritos
Theune, Mariet Controlling Redundancy in Referring Expressions
University of Twente
Thompson, Paul Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
National Centre for Text Mining, University of Manchester
Thomsen, Hanne Erdman A Taxonomy of Lexical Metadata Categories
Copenhagen Business School
Thornell, Christina Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)
Gothenburg University
Tiberi, Melissa Encoding Terms from a Scientific Domain in a Terminological Database: Methodology and Criteria
Biblioteca Nazionale Centrale di Firenze
Tiberius, Carole Standardising Bilingual Lexical Resources According to the Lexicon Markup Framework
TST-centrale, Institute for Dutch Lexicology (INL), Leiden
Tiedemann, Jörg Synchronizing Translated Movie Subtitles
University of Groningen
Tihelka, Daniel Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis
University of West Bohemia
Timimi, Ismail The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign
Lille 3 - GERIICO
Tobin, Richard Named Entity Recognition for Digitised Historical Texts
University of Edinburgh
Todiraşcu, Amalia A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
LILPA, Université Marc Bloch Strasbourg
Tohyama, Hitomi Automatic Acquisition of Usage Information for Language Resources
Construction of a Metadata Database for Efficient Development and Use of Language Resources
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus
Information Technology Center, Nagoya University
Toledano, Doroteo t. Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases
ATVS Biometric Recognition Group Universidad Autónoma de Madrid, Spain
Tomanek, Katrin Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab
Jena University, JULIE Lab
Tomas, David The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
University of Alicante
Tomuro, Noriko Extraction of Attribute Concepts from Japanese Adjectives
DePaul University
Tonelli, Sara Enriching the Venice Italian Treebank with Dependency and Grammatical Relations
Frame Information Transfer from English to Italian
Fondazione Bruno Kessler - FBK, Trento
Toney, Dave An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System
LIMSI-CNRS
Tongchim, Shisanu A Dependency Parser for Thai
Thai Computational Linguistics Laboratory
Toral, Antonio Named Entity WordNet
Simple-Clips ongoing research: more information with less data by implementing inheritance
Evaluation of Natural Language Tools for Italian: EVALITA 2007
More Semantic Links in the SIMPLE-CLIPS Database
ILC-CNR, Pisa
Torisawa, Kentaro Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia
Japan Advanced Institute of Science and Technology
Torrens, Edson Wilson Evaluating Summaries Automatically - A system Proposal
Department of Informatics, University of Joinville
Touset, Pascal Controlling Redundancy in Referring Expressions
University of Twente
Trabucco, Andrea Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
ILC-CNR, Pisa
Trancoso, Isabel The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese
L2F INESC-ID/IST, Lisboa
Trandabat, Diana Romanian Semantic Role Resource
Faculty of Computer Science
Traue, Harald The PIT Corpus of German Multi-Party Dialogues
University of Ulm
Traum, David What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture
USC Institute for Creative Technologies
Trawinski, Beata A Multilingual Database of Polarity Items
University of Tuebingen
Trilsbeek, Paul A Grid of Regional Language Archives
Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems
Max Planck Institute for Psycholinguistics
Trippel, Thorsten Lexicon Schemas and Related Data Models: when Standards Meet Users
University of Bielefeld
Trojahn, Cássia A Framework for Multilingual Ontology Mapping
University of Evora
Trojanová, Jana Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen
Tron, Viktor On the Durational Reduction of Repeated Mentions: Recency and Speaker Effects
Viktortron.org
Tropf, Herbert LILA: Cellular Telephone Speech Databases from Asia
Siemens AG
Troussov, Alexander Linguistically Light Lexical Extensions for Ontologies
IBM LanguageWare
Trushkina, Julia Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort
Katholieke Universiteit Leuven
Tsarfaty, Reut Word-Based or Morpheme-Based? Annotation Strategies for Modern Hebrew Clitics
Institute for Logic Language and Computation, University of Amsterdam
Tseng, Chiu-yu The 2008 Oriental COCOSDA Book Project: in Commemoration of the First Decade of Sustained Activities in Asia
Academia Sinica
Tsourakis, Nikos Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System
Building Mobile Spoken Dialogue Applications Using Regulus
Université de Genève/ETI/TIM
Tsuchiya, Masatoshi Developing Corpus of Japanese Classroom Lecture Speech Contents
Information and Media Center, Toyohashi University of Technology
Tsuge, Satoru Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
University of Tokushima
Tsuji, Keita Temporal Aspects of Terminology for Automatic Term Recognition: Case Study on Women’s Studies Terms
Graduate School of Library, Information and Media Studies, University of Tsukuba
Tsujii, Jun’ichi Building Bilingual Lexicons using Lexical Translation Probabilities via Pivot Languages
Connecting Text Mining and Pathways using the PathText Resource
GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain
Challenges in Pronoun Resolution System for Biomedical Text
University of Tokyo & National Centre for Text Mining, University of Manchester
Tsunakawa, Takashi Building Bilingual Lexicons using Lexical Translation Probabilities via Pivot Languages
University of Tokyo
Tsuruoka, Yoshimasa Connecting Text Mining and Pathways using the PathText Resource
National Centre for Text Mining, University of Manchester
Tudorache, Alexandra JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations
Academy of Economic Studies Bucharest
Tufiş, Dan A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
DIAC+: a Professional Diacritics Recovering System
Unsupervised Lexical Acquisition for Part of Speech Tagging
RACAI’s Linguistic Web Services
RACAI, Romanian Academy, Bucharest
Turmo, Jordi Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
UPC-TALP

 

U
Uchimoto, Kiyotaka Automatic Acquisition of Usage Information for Language Resources
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Boot-Strapping a WordNet Using Multiple Existing WordNets
Construction of a Metadata Database for Efficient Development and Use of Language Resources
Development of the Japanese WordNet
Word-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and its Application
A Method for Automatically Constructing Case Frames for English
National Institute of Information and Communications Technology
Uchiyama, Kiyoko A Contextual Dynamic Network Model for WSD Using Associative Concept Dictionary
Keio University
Urciuoli, Ilaria Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank
CELCT
Uryupina, Olga Error Analysis for Learning-based Coreference Resolution
Institute of Linguistics, Russian Academy of Science
Uszkoreit, Hans Adaptation of Relation Extraction Rules to New Domains
Extracting and Querying Relations in Scientific Papers on Language Technology
DFKI GmbH
Utiyama, Masao Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
Development of the Japanese WordNet
Application of Resource-based Machine Translation to Real Business Scenes
National Institute of Information and Communications Technology
Utsuro, Takehito Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
University of Tsukuba

 

V
Vũ, Xuẩn Lương Word Segmentation of Vietnamese Texts: a Comparison of Approaches
VietLex, Hanoi
Valentín, Oriol Rapid Deployment of a New METIS Language Pair: Catalan-English
User-Centred Design of Error Correction Tools
Grup de Lingüística Computacional (Barcelona Media-UPF), Barcelona
Vallee, Arnaud New Telephone Speech Databases for French: a Children Database and an optimized Adult Corpus
Telisma
Van den Heuvel, Henk The IFADV Corpus: a Free Dialog Video Corpus
LC-STAR II: Starring more Lexica
The AUTONOMATA Spoken Names Corpus
SPEX/CLST Radboud University Nijmegen
Van der Vliet, Hennie Adjectives in the Dutch Semantic Lexical Database CORNETTO
Faculteit der Letteren, Vrije Universiteit Amsterdam
Van der Werff, Laurens Evaluation of Spoken Document Retrieval for Historic Speech Collections
University of Twente
Van Genabith, Josef Learning Morphology with Morfette
Treebank-Based Acquisition of LFG Parsing Resources for French
Parser Evaluation and the BNC: Evaluating 4 constituency parsers with 3 metrics
National Center for Language Technology, Dublin City University
Van hamme, Hugo Children’s Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement
Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus.
Katholieke Universiteit Leuven
Van Hout, Roeland Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization
Radboud University Nijmegen
Van Noord, Gertjan From D-Coi to SoNaR: a reference corpus for Dutch
University of Groningen
Van Son, Rob The IFADV Corpus: a Free Dialog Video Corpus
ACLC/IFA, University of Amsterdam
Van Uytvanck, Dieter Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems
Max Planck Institute for Psycholinguistics
Van Valkenhoef, Tobias A Grid of Regional Language Archives
Max Planck Institute for Psycholinguistics
Van Veenendaal, Remco Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN.
Standardising Bilingual Lexical Resources According to the Lexicon Markup Framework
Dutch Institute for Lexicographyy
Van Zijl, Lynette The ATIS Sign Language Corpus
Stellenbosch University
Vandeghinste, Vincent Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
From D-Coi to SoNaR: a reference corpus for Dutch
Centre for Computational Linguistics - KULeuven
VanderVliet, Hennie Integrating Lexical Units, Synsets and Ontology in the Cornetto Database
Vrije Universiteit Amsterdam
VanGent, Joop KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Irion technologies
Vanni, Michelle Holy Moses! Leveraging Existing Tools and Resources for Entity Translation
US Army Research Laboratory
Vanni, Stephan CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
VECSYS
Vanopstal, Klaar Learning-based Detection of Scientific Terms in Patient Information
LT3, University College Ghent
Vaquero Sanchez, Antonio Conceptual Modeling of Ontology-based Linguistic Resources with a Focus on Semantic Relations
Universidad Complutense de Madrid
Váradi, Tamás CLARIN: Common Language Resources and Technology Infrastructure
Linguistics Institute, Hungarian Academy of Sciences
Varasai, Patcharee Building an Annotated Corpus for Text Summarization and Question Answering
Kasetsart University
Varga, Dániel Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report
BME MOKK
Vasilescu, Ioana Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification
LIMSI-CNRS
Vassiliou, Marina Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
ILSP - Athens
Vaz, Paula Cristina Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database
L2F INESC-ID/IST, Lisboa
Veale, Tony Acquiring Naturalistic Concept Descriptions from the Web
University College Dublin
Veaux, Christophe Automatic Phoneme Segmentation with Relaxed Textual Constraints
IrcamCorpusTools: an Extensible Platform for Spoken Corpora Exploitation
IRCAM
Vecchi, Eva An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Velupillai, Sumithra Revealing Relations between Open and Closed Answers in Questionnaires through Text Clustering Evaluation
DSV/KTH-Stockholm University
Venturi, Giulia Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
ILC-CNR, Pisa
Verhelst, Werner A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments
Vrije Universiteit Brussel, dept. ETRO-DSSP, Brussels
Versley, Yannick BART: A modular toolkit for coreference resolution
How to Compare Treebanks
University of Tuebingen
Vetulani, Grazyna Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach
Adam Mickiewicz University in Poznań
Vetulani, Zygmunt Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach
Adam Mickiewicz University in Poznań
Viana, M. Céu The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese
CLUL
Vicedo, Jose Luis The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
University of Alicante
Vicente-Díez, María Teresa An Empirical Approach to a Preliminary Successful Identification and Resolution of Temporal Expressions in Spanish News Corpora
Universidad Carlos III de Madrid
Vidal, Gaëlle Automatic Phone Segmentation of Expressive Speech
IRISA / Universite Rennes 1 - Enssat
Vidulin, Vedrana Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
Jozef Stefan Institute
Vieira, Renata A Framework for Multilingual Ontology Mapping
Pontifícia Universidade Católica do Rio Grande do Sul
Viethen, Jette Controlling Redundancy in Referring Expressions
Macquarie University
Vilar, J. M. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universitat Jaume I, Castellón
Villata, Serena Automatic extraction of subcategorization frames for Italian
Università di Torino
Villegas, Marta COLDIC, a Lexicographic Platform for LMF compliant lexica
Pompeu Fabra University
Villemonte de la Clergerie, Eric PASSAGE: from French Parser Evaluation to Large Sized Treebank
INRIA-Rocquencourt
Vilnat, Anne EASY, Evaluation of Parsers of French: what are the Results?
PASSAGE: from French Parser Evaluation to Large Sized Treebank
LIMSI-CNRS
Vilo, Jaak Strengthening the Estonian Language Technology
Tartu University
Vincze, Veronika Hungarian Word-Sense Disambiguated Corpus
University of Szeged
Vintar, Špela Harvesting Multi-Word Expressions from Parallel Corpora
University of Ljubljana, Faculty of Arts
Vitas, Duško The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines
Faculty of Mathematics, Belgrade
Vivaldi, Jorge Turning a Term Extractor into a new Domain: first Experiences
A Suite to Compile and Analyze an LSP Corpus
Applied Linguistic Institute
Vogel, Stephan Communicating Unknown Words in Machine Translation
Carnegie Mellon University
Volín, Jan Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database.
Charles University, Prague
Volpe Nunes, Maria das Graças The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database
NILC/USP
Voss, Clare MTriage: Web-enabled Software for the Creation, Machine Translation, and Annotation of Smart Documents
Exploitation of an Arabic Language Resource for Machine Translation Evaluation: using Buckwalter-based Lookup Tool to Augment CMU Alignment Algorithm
Army Research Laboratory
Vossen, Piek Adjectives in the Dutch Semantic Lexical Database CORNETTO
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures
Faculteit der Letteren, Vrije Universiteit Amsterdam
Vrusias, Bogdan Lexical Ontology Extraction using Terminology Analysis: Automating Video Annotation
University of Surrey
Vuckovic, Kristina Rule-Based Chunker for Croatian
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences

 

W
Waast-Richard, Claire CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content
EDF R&D
Wagner, Agnieszka JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts
Institute of Linguistics, Adam Mickiewicz University, Poznań
Waibel, Alex Communicating Unknown Words in Machine Translation
Carnegie Mellon University
Waldron, Benjamin Language Resources and Chemical Informatics
Computer Laboratory, University of Cambridge
Walker, Kevin Speaker Recognition: Building the Mixer 4 and 5 Corpora
Linguistic Data Consortium
Walter, Stephan Linguistic Description and Automatic Extraction of Definitions from German Court Decisions
University of Saarland
Wang, Xinglong Learning the Species of Biomedical Named Entities from Annotated Corpora
University of Edinburgh
Wang, Zhulong Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Fujitsu R&D Center LTD
Wanner, Leo Using Semantically Annotated Corpora to Build Collocation Resources
Making Text Resources Accessible to the Reader: the Case of Patent Claims
ICREA and Pompeu Fabra University
Ward, Wayne Annotating Students’ Understanding of Science Concepts
University of Colorado, Boulder
Watson, Matt A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
University of Sunderland
Way, Andy The ATIS Sign Language Corpus
Dublin City University
Webb, Nick Cross-Domain Dialogue Act Tagging
University at Albany, SUNY
Webber, Bonnie The Penn Discourse TreeBank 2.0.
University of Edinburgh
Weber, Corinna The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering
DFKI GmbH
Wehrli, Eric Generating Bilingual Dictionaries by Transitivity
University of Geneva
Wei, Furu Exploiting the Role of Position Feature in Chinese Relation Extraction
Department of Computing, The Hong Kong Polytechnic University
Weidenbacher, Ulrich The PIT Corpus of German Multi-Party Dialogues
University of Ulm
Weikum, Gerhard Mapping Roget’s Thesaurus and WordNet to French
Max Planck Institute for Informatics
Weiser, Stéphanie Automatic Identification of Temporal Information in Tourism Web Pages
MoDyCo, CNRS
Weiss, Brian Performance Evaluation of Speech Translation Systems
National Institute of Standards and Technology
Weller, Marion Tools for Collocation Extraction: Preferences for Active vs. Passive
A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions
IMS, University of Stuttgart
Wellner, Ben SpatialML: Annotation Scheme, Corpora, and Tools
MITRE Corporation
Wellner, Pierre Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis
IDIAP Research Institute
Wentland, Wolodja Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration
Department of Computational Linguistics, Heidelberg University
Wermter, Joachim Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab
Jena University, JULIE Lab
Wesseling, Wieneke The IFADV Corpus: a Free Dialog Video Corpus
ACLC/IFA, University of Amsterdam
Westerhout, Eline Creating Glossaries Using Pattern-Based and Machine Learning Techniques
University of Utrecht
Westerlund, Torbjörn Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic)
Uppsala University
Whistlecroft, Lisa Classification Procedures for Software Evaluation
PALATINE, The Higher Education Academy/Lancaster University
White, J.V. Statistical Evaluation of Information Distillation Systems
BAE Systems
White, Michael Projecting Propbank Roles onto the CCGbank
The Ohio State University
Wick, Michael A Corpus for Cross-Document Co-reference
University of Massachusetts
Widdows, Dominic Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application
Google Inc.
Wiebe, Janyce A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources
Finding the Sources and Targets of Subjective Expressions
University of Pittsburgh
Wiegand, Michael Cost-Sensitive Learning in Answer Extraction
University of Saarland
Wilkerson, John The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
University of Washington
Wilks, Yorick Cross-Domain Dialogue Act Tagging
Dialogue, Speech and Images: the Companions Project Data Set
Information Extraction Tools and Methods for Understanding Dialogue in a Companion
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora
University of Sheffield
Williams, Briony Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
Bangor University
Williams, Sandra Deriving Rhetorical Complexity Data from the RST-DT Corpus
The Open University
Wilson, Theresa Annotating Subjective Content in Meetings
University of Edinburgh
Winder, Ransom Creating and Using a Correlated Corpus to Glean Communicative Commonalities
MITRE Corporation
Windhouwer, Menzo Ensuring Semantic Interoperability on Lexical Resources
ISOcat: Corralling Data Categories in the Wild
Max Planck Institute for Psycholinguistics
Winkler, Thomas The MoveOn Motorcycle Speech Corpus
Fraunhofer IAIS
Witt, Andreas Influence of Text Type and Text Length on Anaphoric Annotation
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources
University of Tübingen
Witte, René Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles
University of Karlsruhe
Wittenburg, Peter Exploring and Enriching a Language Resource Archive via the Web
Annotation by Category: ELAN and ISO DCR
ISOcat: Corralling Data Categories in the Wild
CLARIN: Common Language Resources and Technology Infrastructure
Foundation of a Component-based Flexible Registry for Language Resources and Technology
A Grid of Regional Language Archives
Max Planck Institute for Psycholinguistics
Woelfel, Matthias A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora
UKA
Wolf, Chris Adjudicator Agreement and System Rankings for Person Name Search
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
MITRE Corporation
Wolters, Maria Corpus Analysis of Spoken Smart-Home Interactions with Older Users
A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users’ Interactions with Spoken Dialogue Systems
CSTR, University of Edinburgh
Womser-Hacker, Christa An Evaluation Resource for Geographic Information Retrieval
University of Hildesheim
Wong, Kam-Fai Opinion Annotation in On-line Chinese Product Reviews
Chinese University of Hong Kong
Wright, Sue Ellen ISOcat: Corralling Data Categories in the Wild
Kent State University
Wu, Dekai Evaluation of Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation
Hong Kong University of Science and Technology
Wunsch, Holger Enriching GermaNet with verb-noun relations - a case study of lexical acquisition
University of Tübingen
Wutiwiwatchai, Chai Thai Broadcast News Corpus Construction and Evaluation
National Electronics and Computer Technology Center
Wynne, Martin CLARIN: Common Language Resources and Technology Infrastructure
Oxford Text archive

 

X
Xia, Lei An Approach to Modeling Heterogeneous Resources for Information Extraction
University of Sheffield
Xia, Yunqing Opinion Annotation in On-line Chinese Product Reviews
Tsinghua University
Xie, Zhuli From Extracting to Abstracting: Generating Quasi-abstractive Summaries
Applications and Software Research Center, Motorola
Xu, Feiyu Adaptation of Relation Extraction Rules to New Domains
Fine-grained Opinion Topic and Polarity Identification
DFKI GmbH
Xu, Mingwei Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies
Academia Sinica
Xu, Ruifeng Opinion Annotation in On-line Chinese Product Reviews
The Hong Kong Polytechnic University
Xue, Nianwen Annotating “tense” in a Tense-less Language
University of Colorado

 

Y
Yamada, Takeshi Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
University of Tsukuba
Yamamoto, Eiko Extraction of Informative Expressions from Domain-specific Documents
Application of Resource-based Machine Translation to Real Business Scenes
Kobe University
Yamamoto, Kazumasa Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -
Toyohashi University of Technology
Yamamoto, Mikio Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information
University of Tsukuba
Yamamoto, Seiichi Creation of Learner Corpus and Its Application to Speech Recognition
Doshisha University
Yamashita, Yoichi Test Collections for Spoken Document Retrieval from Lecture Audio Data
Ritsumeikan University
Yamazaki, Hiroki Creation of Learner Corpus and Its Application to Speech Recognition
Doshisha University
Yang, Xiaofeng BART: A modular toolkit for coreference resolution
Inst. for Infocomm Research
Yang, Yuhang Chinese Term Extraction Based on Delimiters
School of Computer Science and Technology, Harbin Institute of Technology
Yankova, Milena A Framework for Identity Resolution and Merging for Multi-source Information Extraction
University of Sheffield
Yannoutsou, Olga Evaluation of a Machine Translation System for Low Resource Languages: METIS-II
ILSP - Athens
Yano, Tae Relation between Agreement Measures on Human Labeling and Machine Learning Performance: Results from an Art History Domain
Carnegie Mellon University
Yaseen, Mustafa MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic
Amman University
Yasuda, Norihito Test Collections for Spoken Document Retrieval from Lecture Audio Data
NTT Corporation
YingJu, Xia Adapting International Standard for Asian Language Technologies
Fujitsu R&D Center LTD
Yongyuth, Panita Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance
Department of Computer Engineering, Kasetsart University, Bangkok
Yoshinaga, Naoki Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia
University of Tokyo
Yu, Shiwen Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System
Institute of Computational Linguistics, Peking University

 

Z
žabokrtský, Zdeněk CzEng 0.7: Parallel Corpus with Community-Supplied Translations
Charles University, Prague
Zaenen, Annie The Encoding of lexical implications in VerbNet Predicates of change of locations
PARC
Zaghouani, Wajdi A Pilot Arabic Propbank
Linguistic Data Consortium
Zamora, F. The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters
Universidad Politécnica de Valencia
Zanoli, Roberto The TextPro Tool Suite
Fondazione Bruno Kessler - FBK, Trento
Zanzotto, Fabio Massimo Yet another Platform for Extracting Knowledge from Corpora
DISP, University of Rome Tor Vergata
Zaragoza, Hugo Semantically Annotated Snapshot of the English Wikipedia
Yahoo! Research Barcelona
Zarcone, Alessandra Computational Models for Event Type Classification in Context
Scuola Normale Superiore, Pisa
Zelezny, Milos Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen
Zeman, Daniel Reusable Tagset Conversion Using Tagset Drivers
Univerzita Karlova
Zesch, Torsten Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
Technische Universität Darmstadt
Zhang, Peng Exploiting the Role of Position Feature in Chinese Relation Extraction
School of Computer Science and Technology, Tianjin Unversity
Zhang, Yajing Identifying Foreign Person Names in Chinese Text
Extracting and Querying Relations in Scientific Papers on Language Technology
DFKI GmbH
Zhang, Yi Robust Parsing with a Large HPSG Grammar
Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German
University of Saarland & DFKI GmbH
Zhang, Yujie Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
National Institute of Information and Communications Technology
Zhang, Ziqi A Comparative Evaluation of Term Recognition Algorithms
University of Sheffield
Zhao, Tiejun Chinese Term Extraction Based on Delimiters
School of Computer Science and Technology, Harbin Institute of Technology
Zhong, Hua Annotating “tense” in a Tense-less Language
University of Colorado
Zidraşco, Tatiana Estimating Word Phonosemantics
Technical University of Moldova
Ziegenhain, Ute LC-STAR II: Starring more Lexica
Siemens AG
Zikánová, Sárka From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank
Charles University, Prague
Zini, Manuel Integration of a Multilingual Keyword Extractor in a Document Management System
DrWolf
Zinn, Claus Ensuring Semantic Interoperability on Lexical Resources
Exploring and Enriching a Language Resource Archive via the Web
Max Planck Institute for Psycholinguistics
Zock, Michael How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach
Universite de Marseille
Zourari, Maria Building a Greek corpus for Textual Entailment
ILSP - Athens

Powered by ELDA © 2008 ELDA/ELRA