|
AFFILIATIONS: Browse articles of the conference sorted by institution
A |
Aalto University School of Science, FINLAND |
Comparing computer vision analysis of signed language video with motion capture recordings
|
Adam Mickiewicz University, Faculty of Mathematics and Computer Science, Department of Computer Linguistics and Artificial Intelligence, POLAND |
Wordnet Based Lexicon Grammar for Polish
|
Adam Mickiewicz University, Poznan, Institute of Linguistics, POLAND |
Developing and evaluating an emergency scenario dialogue corpus
|
Advocate, Member of the Johannesburg Bar, SOUTH AFRICA |
Aspects of a Legal Framework for Language Resource Management
|
Agence France-Presse, FRANCE |
Aleda, a free large-scale entity database for French
|
Alexandru Philippide Institute of Philology, Romanian Academy, the Iași, ROMANIA |
Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
|
Allameh Tabatabaiee University, Linguistic Department, IRAN, ISLAMIC REPUBLIC OF |
Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
|
Alpage, INRIA Paris-Rocquencourt & Université Paris 7, FRANCE |
Applying cross-lingual WSD to wordnet development
Evaluating and improving syntactic lexica by plugging them within a parser
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations
|
Amirkabir University of Technology, IRAN, ISLAMIC REPUBLIC OF |
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
|
Applied Language Solutions Ltd., UNITED KINGDOM |
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
|
Arabic Textware, JORDAN |
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
|
Association Francophone de la Communication Parlée, FRANCE |
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
|
Athens Technology Center, GREECE |
Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
|
ATILF Nancy, FRANCE |
Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary
|
AT&T Labs ― Research, UNITED STATES |
Building Text-To-Speech Voices in the Cloud
|
Austrian Academy of Sciences, AUSTRIA |
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies.
|
Austrian Research Institute for Artificial Intelligence, AUSTRIA |
Applying Random Indexing to Structured Data to Find Contextually Similar Words
|
Autónoma University Madrid, Computational Linguistics Laboratory, SPAIN |
Medical Term Extraction in an Arabic Medical Corpus
|
B |
BAE Systems, UNITED STATES |
Identifying Nuggets of Information in GALE Distillation Evaluation
|
Barcelona Media Research Center, SPAIN |
Holaaa!! writin like u talk is kewl but kinda hard 4 NLP
BUCEADOR, a multi-language search engine for digital libraries
|
Barcelona Media, SPAIN |
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
|
BME MOKK, HUNGARY |
Rapid creation of large-scale corpora and frequency dictionaries
|
Brandeis University, Computer Science Department, UNITED STATES |
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
|
Brandeis University, UNITED STATES |
Word Sense Inventories by Non-Experts.
ATLIS: Identifying Locational Information in Text Automatically
The Role of Model Testing in Standards Development: The Case of ISO-Space
Annotating dropped pronouns in Chinese newswire text
The TARSQI Toolkit
|
Brigham Young University, UNITED STATES |
Item Development and Scoring for Japanese Oral Proficiency Testing
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis
|
Bulgarian Academy of Sciences, BULGARIA |
Bulgarian X-language Parallel Corpus
Expanding Parallel Resources for Medium-Density Languages for Free
Linguistic Analysis Processing Line for Bulgarian
|
C |
Cairo University, Faculty of Arts, EGYPT |
Medical Term Extraction in an Arabic Medical Corpus
|
Campus de Espinardo, SPAIN |
A New Method for Evaluating Automatically Learned Terminological Taxonomies
|
Carnegie Mellon University, Language Technologies Institute, UNITED STATES |
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
|
Carnegie Mellon University, UNITED STATES |
A Universal Part-of-Speech Tagset
Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
Morphosyntactic Analysis of the CHILDES and TalkBank Corpora
Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource
|
CASSIDIAN, FRANCE |
New language resources for the Pashto language
|
CEA - LIST, FRANCE |
Evaluation of Unsupervised Information Extraction
Evaluation of a Complex Information Extraction Application in Specific Domain
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
|
CEHUM/UM, PORTUGAL |
Págico: Evaluating Wikipedia-based information retrieval in Portuguese
|
CELCT, ITALY |
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
CAT: the CELCT Annotation Tool
Evaluating Machine Reading Systems through Comprehension Tests
Chinese Whispers: Cooperative Paraphrase Acquisition
|
CELI, ITALY |
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.
|
Centro de Estudios Lingüísticos y Literarios, El Colegio de México, MEXICO |
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
|
Centro de Linguística da Universidade de Lisboa, PORTUGAL |
Project FLY: a multidisciplinary project within Linguistics
Modality in Text: a Proposal for Corpus Annotation
|
Centro de Linguística da Universidade Nova de Lisboa, PORTUGAL |
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Dealing with unknown words in statistical machine translation
|
Centro Ramon Pineiro para a Investigacion en Humanidades. Xunta de Galicia, SPAIN |
A Galician Syntactic Corpus with Application to Intonation Modeling
|
Chalmers University, SWEDEN |
An Open Source Persian Computational Grammar
|
Charles University in Prague, CZECH REPUBLIC |
A database of semantic clusters of verb usages
Alternative Lexicalizations of Discourse Connectives in Czech
HamleDT: To Parse or Not to Parse?
Terra: a Collection of Translation Error-Annotated Corpora
Announcing Prague Czech-English Dependency Treebank 2.0
The Joy of Parallelism with CzEng 1.0
Getting more data -- Schoolkids as annotators
Building a learner corpus
|
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC |
Building a multilingual parallel corpus for human users
|
Charles University in Prague, Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, CZECH REPUBLIC |
Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component
Language Richness of the Web
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Prague Dependency Style Treebank for Tamil
Arabic Word Generation and Modelling for Spell Checking
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
|
Charles University in Prague, Institute of the Czech National Corpus, CZECH REPUBLIC |
Balanced data repository of spontaneous spoken Czech
|
Charles University in Prague, MFF, UFAL, CZECH REPUBLIC |
A High-Quality Web Corpus of Czech
Automatic MT Error Analysis: Hjerson Helping Addicter
IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus
|
Chiba Institute of Technology, JAPAN |
Method for Collection of Acted Speech Using Various Situation Scripts
|
Chiba University, JAPAN |
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Annotation of anaphoric relations and topic continuity in Japanese conversation
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
|
Chinese University of Hong Kong, HONG KONG |
Quantising Opinions for Political Tweets Analysis
|
City University of New York, UNITED STATES |
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
|
CLEO, Centre for Open Electronic Publishing, FRANCE |
Annotated Bibliographical Reference Corpora in Digital Humanities
|
CLiC-Universitat de Barcelona, SPAIN |
Annotating Near-Identity from Coreference Disagreements
|
CLUL, PORTUGAL |
Introducing the Reference Corpus of Contemporary Portuguese Online
|
CNAM-CRTD, FRANCE |
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
|
CNR, ITALY |
Towards a User-Friendly Platform for Building Language Resources based on Web Services
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
|
CNRS, FRANCE |
DISLOG: A logic-based language for processing discourse structures
ANALEC: a New Tool for the Dynamic Annotation of Textual Data
|
Columbia University, UNITED STATES |
Annotations for Power Relations on Email Threads
The Dependency-Parsed FrameNet Corpus
AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis
Annotating Agreement and Disagreement in Threaded Discussion
Conventional Orthography for Dialectal Arabic
The MASC Word Sense Corpus
Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations
DramaBank: Annotating Agency in Narrative Discourse
Empirical Comparisons of MASC Word Sense Annotations
|
Computational & Information Sciences Directorate, Army Research Lab (ARL), UNITED STATES |
Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System
|
Computer Research Center of Islamic Sciences, IRAN, ISLAMIC REPUBLIC OF |
A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
|
Copenhagen Business School, DENMARK |
Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
SMALLWorlds -- Multilingual Content-Controlled Monologues
Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research
|
D |
DANS, Data Archiving & Networked Services, NETHERLANDS |
An Oral History Annotation Tool for INTER-VIEWs
|
De La Salle University, Center for Empathic Human-Computer Interactions, PHILIPPINES |
Building a Multimodal Laughter Database for Emotion Recognition
|
Delft University of Technology, NETHERLANDS |
The BladeMistress Corpus: From Talk to Action in Virtual Worlds
Creating a Data Collection for Evaluating Rich Speech Retrieval
|
Deluxe Digital Studios, UNITED KINGDOM |
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
|
Department of Speech, Music and Hearing, School of Computer Science and Communication, KTH, SWEDEN |
3rd party observer gaze as a continuous measure of dialogue flow
|
Deutsche Telekom Labs, GERMANY |
You Seem Aggressive! Monitoring Anger in a Practical Application
Fast Labeling and Transcription with the Speechalyzer Toolkit
|
DFKI GmbH, GERMANY |
Constructing a Question Corpus for Textual Semantic Relations
Using Verb Subcategorization for Word Sense Disambiguation
Involving Language Professionals in the Evaluation of Machine Translation
Automatic MT Error Analysis: Hjerson Helping Addicter
Joint Grammar and Treebank Development for Mandarin Chinese with HPSG
An Adaptive Framework for Named Entity Combination
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Terra: a Collection of Translation Error-Annotated Corpora
Evaluating the Impact of Phrase Recognition on Concept Tagging
Annotating Opinions in German Political News
MultiUN v2: UN Documents with Multilingual Alignments
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World
A Graphical Citation Browser for the ACL Anthology
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
CLIMB grammars: three projects using metagrammar engineering
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
The META-SHARE Metadata Schema for the Description of Language Resources
|
Direction Générale de l'Armement, FRANCE |
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
The REPERE Corpus: a multimodal corpus for person recognition
New language resources for the Pashto language
|
Doshisha University, JAPAN |
Multimodal Corpus of Multi-party Conversations in Second Language
|
Dräger Safety AG & Co. KGaA, GERMANY |
Adapting and evaluating a generic term extraction tool
|
Dublin City University, IRELAND |
Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner
Irish Treebanking and Parsing: A Preliminary Evaluation
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation
Towards a User-Friendly Platform for Building Language Resources based on Web Services
Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
Creating a Data Collection for Evaluating Rich Speech Retrieval
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
|
Duke University, UNITED STATES |
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
|
Duquesne University, EVL Lab, UNITED STATES |
Distractorless Authorship Verification
|
E |
Eastern Michigan University, UNITED STATES |
Rendering Endangered Lexicons Interoperable through Standards Harmonization: the RELISH project
|
Ecole Polytechnique de Montréal, CANADA |
A disambiguation resource extracted from Wikipedia for semantic annotation
|
ELDA/ELRA, FRANCE |
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Towards a User-Friendly Platform for Building Language Resources based on Web Services
ELRA in the heart of a cooperative HLT world
The REPERE Corpus: a multimodal corpus for person recognition
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
VERTa: Linguistic features in MT evaluation
The FLaReNet Strategic Language Resource Agenda
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
New language resources for the Pashto language
Using the International Standard Language Resource Number: Practical and Technical Aspects
An Analytical Model of Language Resource Sustainability
On the Way to a Legal Sharing of Web Applications in NLP
The META-SHARE Metadata Schema for the Description of Language Resources
|
Elhuyar Foundation, SPAIN |
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
|
Elhuyar R&D, SPAIN |
Building a Basque-Chinese Dictionary by Using English as Pivot
|
Eolaistriu Technologies, IRELAND |
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
|
Erasmus Medical Center / Linguamatics, NETHERLANDS |
Centroids: Gold standards with distributional variation
|
Erasmus University Medical Centre, NETHERLANDS |
CALBC: Releasing the Final Corpora
|
European Bioinformatics Institute, UNITED KINGDOM |
Reclassifying subcategorization frames for experimental analysis and stimulus generation
Annotating progressive aspect constructions in the spoken section of the British National Corpus
Centroids: Gold standards with distributional variation
CALBC: Releasing the Final Corpora
|
European Commission, Directorate General for Translation (DGT), LUXEMBOURG |
MultiUN v2: UN Documents with Multilingual Alignments
DGT-TM: A freely available Translation Memory in 22 languages
|
European Commission, Joint Research Centre (JRC), ITALY |
DGT-TM: A freely available Translation Memory in 22 languages
JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
|
Euroscript Deutschland, GERMANY |
Involving Language Professionals in the Evaluation of Machine Translation
|
Euskal Herriko Unibertsitatea, Informatika Fakultatea, SPAIN |
Multilingual Central Repository version 3.0
A proposal for improving WordNet Domains
|
Euskal Herriko Unibertsitatea, SPAIN |
Matching Cultural Heritage items to Wikipedia
Versatile Speech Databases for High Quality Synthesis for Basque
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
Strategies to Improve a Speaker Diarisation Tool
Using an ASR database to design a pronunciation evaluation system in Basque
|
F |
Fachhochschule Potsdam, University of Applied Science, GERMANY |
Is it Useful to Support Users with Lexical Resources? A User Study.
|
Federal University of Rio Grande do Sul, BRAZIL |
A large scale annotated child language construction database
|
Florida Institute of Human and Machine Cognition, UNITED STATES |
LIE: Leadership, Influence and Expertise
|
Fondazione Bruno Kessler, Trento, ITALY |
Building a Resource of Patterns Using Semantic Types
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction
A treebank-based study on the influence of Italian word order on parsing performance
Brand Pitt: A Corpus to Explore the Art of Naming
A Parallel Corpus of Music and Lyrics Annotated with Emotions
NgramQuery - Smart Information Extraction from Google N-gram using External Resources
Chinese Whispers: Cooperative Paraphrase Acquisition
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
The KnowledgeStore: an Entity-Based Storage System
|
Fondazione Ugo Bordoni, ITALY |
Intelligibility assessment in forensic applications
|
Fordham University, Computer and Information Science Department, UNITED STATES |
Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing
|
Fraunhofer-Heinrich Hertz Institute, GERMANY |
AVATecH ― automated annotation through audio and video analysis
|
Fraunhofer IAIS Institute, GERMANY |
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel
AVATecH ― automated annotation through audio and video analysis
|
Freelance translator, CROATIA |
BLEU Evaluation of Machine-Translated English-Croatian Legislation
|
FreeLing project developer, RUSSIAN FEDERATION |
FreeLing 3.0: Towards Wider Multilinguality
|
Freie Universität Berlin, GERMANY |
Building Large Corpora from the Web Using a New Efficient Tool Chain
From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach
|
Freie Universität Berlin, Web Based Systems Group, GERMANY |
Evaluating the Impact of Phrase Recognition on Concept Tagging
DBpedia: A Multilingual Cross-domain Knowledge Base
|
Friedrich-Schiller Universität, Jena, GERMANY |
Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena
|
FTW Telecommunications Research Center Vienna, AUSTRIA |
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis
|
Fundació Barcelona Media, SPAIN |
Spell Checking in Spanish: The Case of Diacritic Accents
VERTa: Linguistic features in MT evaluation
|
Fundación CTIC, SPAIN |
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
|
G |
GEIE ERCIM, FRANCE |
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
|
Georgetown University, UNITED STATES |
A New Twitter Verb Lexicon for Natural Language Processing
Using semi-experts to derive judgments on word sense alignment: a pilot study
|
German Research Center for Artificial Intelligence, GERMANY |
Evaluating the Impact of Phrase Recognition on Concept Tagging
|
Goethe Universität Frankfurt am Main, GERMANY |
Customization of the Europarl Corpus for Translation Studies
Rendering Endangered Lexicons Interoperable through Standards Harmonization: the RELISH project
|
Google Inc., UNITED STATES |
A Universal Part-of-Speech Tagset
A light way to collect comparable corpora from the Web
|
Google Research, Google Inc., UNITED STATES |
A Cross-Lingual Dictionary for English Wikipedia Concepts
|
Goss Interactive Limited, UNITED KINGDOM |
Evaluating the Impact of Phrase Recognition on Concept Tagging
|
Gothenburg University, SWEDEN |
An Open Source Persian Computational Grammar
Cloud Logic Programming for Integrating Language Technology Resources
Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
The Rocky Road towards a Swedish FrameNet - Creating SweFN
Combining Language Resources Into A Grammar-Driven Swedish Parser
Semantic Role Labeling with the Swedish FrameNet
Feedback in Nordic First-Encounters: a Comparative Study
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
|
Greedy Intelligence Ltd., CHINA |
Constructing a Question Corpus for Textual Semantic Relations
|
Griffith University, AUSTRALIA |
The Australian National Corpus: National Infrastructure for Language Resources
|
H |
Heinrich-Heine-Universität, GERMANY |
Arabic Word Generation and Modelling for Spell Checking
|
Hiroshima City University, JAPAN |
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
|
HITS gGmbH, GERMANY |
Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
|
Hochschule Augsburg, GERMANY |
Annotation Facilities for the Reliable Analysis of Human Motion
|
Hogeschool Gent, BELGIUM |
Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Beyond SoNaR: towards the facilitation of large corpus building efforts
|
Hogeschool Gent, LT3, BELGIUM |
Collection of a corpus of Dutch SMS
|
Hogeschool Gent, LT3 / Universiteit Gent, BELGIUM |
Evaluating automatic cross-domain Dutch semantic role annotation
|
Hokkaido University, JAPAN |
Two Database Resources for Processing Social Media English Text
|
Holzweg e-Commerce Solutions, AUSTRIA |
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
|
Hong Kong Polytechnic University, Department of Computing, HONG KONG |
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
|
Hong Kong Polytechnic University, HONG KONG |
Assigning Connotation Values to Events
Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies
|
Hong Kong University of Science and Technology, HONG KONG |
A Multilingual Natural Stress Emotion Database
|
Hong Kong University of Science and Technology, Human Language Technology Center, Department of Computer Science and Engineering, HONG KONG |
A Mandarin-English Code-Switching Corpus
|
Human Speech and Language Technologies, Vicomtech-IK4, SPAIN |
Using DiAML and ANVIL for multimodal dialogue annotations
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
The coding and annotation of multimodal dialogue acts
Highlighting relevant concepts from Topic Signatures
|
Humboldt-Universität Berlin, GERMANY |
Multi-Layer Discourse Annotation of a Dutch Text Corpus
|
Hungarian Academy of Sciences, HUNGARY |
Light Verb Constructions in the SzegedParalellFX English--Hungarian Parallel Corpus
Rapid creation of large-scale corpora and frequency dictionaries
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies
|
Hungarian Academy of Sciences, Research Group on Artificial Intelligence, HUNGARY |
HunOr: A Hungarian―Russian Parallel Corpus
Automatically Generated Online Dictionaries
|
I |
Ibaraki University, JAPAN |
Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples
|
IBM Research India, INDIA |
A Study of Word-Classing for MT Reordering
|
ICLTT, AUSTRIA |
Federated Search: Towards a Common Search Infrastructure
|
ICSI, UNITED STATES |
The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
|
ID GmbH & Co. KGaA, GERMANY |
SemScribe: Natural Language Generation for Medical Reports
|
IDIAP Research Institute, SWITZERLAND |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
|
IICT-BAS, Linguistic Modelling Department, BULGARIA |
A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian
The Political Speech Corpus of Bulgarian
|
IIT Bombay, INDIA |
Experiences in Resource Generation for Machine Translation through Crowdsourcing
Re-ordering Source Sentences for SMT
Cost and Benefit of Using WordNet Senses for Sentiment Analysis
|
IIT - CNR, ITALY |
Visualizing word senses in WordNet Atlas
|
ILC ― CNR, ITALY |
Assigning Connotation Values to Events
Customizable SCF Acquisition in Italian
The Language Library: supporting community effort for collective resource production
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
In the same boat and other idiomatic seafaring expressions
Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access
From medical language processing to BioNLP domain
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser
Creation of a bottom-up corpus-based ontology for Italian Linguistics
The LRE Map. Harmonising Community Descriptions of Resources
The FLaReNet Strategic Language Resource Agenda
Enriching the ISST-TANL Corpus with Semantic Frames
The META-SHARE Metadata Schema for the Description of Language Resources
|
ILSP - Athena RC, GREECE |
The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions
Standardizing a Component Metadata Infrastructure
The FLaReNet Strategic Language Resource Agenda
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools
Collecting and Using Comparable Corpora for Statistical Machine Translation
The META-SHARE Metadata Schema for the Description of Language Resources
|
ILTEC, PORTUGAL |
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
|
IMC AG, GERMANY |
An Adaptive Framework for Named Entity Combination
|
IMMI - CNRS, FRANCE |
The FLaReNet Strategic Language Resource Agenda
|
Indiana University, UNITED STATES |
A finite-state morphological transducer for Kyrgyz
Using semi-experts to derive judgments on word sense alignment: a pilot study
Annotating Errors in a Hungarian Learner Corpus
|
INESC-ID/L2F, PORTUGAL |
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Extending a wordnet framework for simplicity and scalability
|
INESC-IST, PORTUGAL |
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
|
INIST - CNRS, FRANCE |
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
Annotating Football Matches: Influence of the Source Medium on Manual Annotation
The Quaero Evaluation Initiative on Term Extraction
|
INRIA, FRANCE |
Aleda, a free large-scale entity database for French
Cleaning noisy wordnets
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources
|
INRIA / LORIA / Université Henri Poincaré, FRANCE |
CoALT: A Software for Comparing Automatic Labelling Tools
|
INRIA-Nancy, FRANCE |
Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
|
Institute for System Analysis, Siberian State Aerospace University, RUSSIAN FEDERATION |
Speech and Language Resources for LVCSR of Russian
|
Institute for Balkan Studies of Serbian Academy of Sciences and Arts, SERBIA |
Multimedia database of the cultural heritage of the Balkans
|
Institute for Information Industry, TAIWAN, PROVINCE OF CHINA |
NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
|
Institute for Research in Fundamental Sciences, IRAN, ISLAMIC REPUBLIC OF |
Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
|
Institute for Specialised Communication and Multilingualism, European Academy of Bozen/Bolzano, ITALY |
Unsupervised acquisition of concatenative morphology
|
Institute for System Analysis of RAS, RUSSIAN FEDERATION |
Speech and Language Resources for LVCSR of Russian
|
Institute of Informatics and Telecommunications, NCSR Demokritos, GREECE |
Task-Driven Linguistic Analysis based on an Underspecified Features Representation
|
Institute of Technology of Cambodia, CAMBODIA |
Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
|
Institut für Deutsche Sprache, GERMANY |
EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language
The New IDS Corpus Analysis Platform: Challenges and Prospects
Evaluating Query Languages for a Corpus Processing System
|
Institut Jožef Stefan, Department of Knowledge Technologies, SLOVENIA |
The goo300k corpus of historical Slovene
Irregularity Detection in Categorized Document Corpora
|
Instituto Internacional da Língua Portuguesa, CAPE VERDE |
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
|
Intelligent Robotics and Communication Laboratories, Advanced Telecommunications Research Institute International, JAPAN |
Body-conductive acoustic sensors in human-robot communication
|
International Institute of Information Technology, INDIA |
Parsing Any Domain English text to CoNLL dependencies
Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification
|
Interverbum Technology, SWEDEN |
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
|
Iran University of Science and Technology, IRAN, ISLAMIC REPUBLIC OF |
A Framework for Spelling Correction in Persian Language Using Noisy Channel Model
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
|
IRISA ― CNRS, FRANCE |
Annotating Football Matches: Influence of the Source Medium on Manual Annotation
|
IRISA, FRANCE |
A French Fairy Tale Corpus syntactically and semantically annotated
|
IRIT ― CNRS, FRANCE |
A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring
|
IRIT, FRANCE |
Clause-based Discourse Segmentation of Arabic Texts
|
ISI / USC, UNITED STATES |
Evaluating Machine Reading Systems through Comprehension Tests
|
Islamic Azad University, IRAN, ISLAMIC REPUBLIC OF |
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification
|
Istanbul Technical University, TURKEY |
The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish
Word Alignment for English-Turkish Language Pair
|
ISTI - CNR, ITALY |
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
From medical language processing to BioNLP domain
|
IXA NLP Group, University of the Basque Country, SPAIN |
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
Highlighting relevant concepts from Topic Signatures
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques
Mapping WordNet to the Kyoto ontology
|
K |
Katholieke Universiteit Leuven, BELGIUM |
Annotating Story Timelines as Temporal Dependency Structures
Large aligned treebanks for syntax-based machine translation
|
Katholieke Universiteit Leuven, CCL, BELGIUM |
Beyond SoNaR: towards the facilitation of large corpus building efforts
Example-Based Treebank Querying
|
Keio University, JAPAN |
Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English
|
KIIT College of Engineering, INDIA |
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
|
KIIT University, School of Computer Engineering, INDIA |
A GUI to Detect and Correct Errors in Hindi Dependency Treebank
|
Konstanz Universität, GERMANY |
The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
A Rule-based Morphological Analyzer for Murrinh-Patha
A Reference Dependency Bank for Analyzing Complex Predicates
|
Korea University, KOREA, REPUBLIC OF |
ISO 24617-2: A semantically-based standard for dialogue annotation
|
Kyoto University, JAPAN |
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Two Phase Evaluation for Selecting Machine Translation Services
Service Composition Scenarios for Task-Oriented Translation
Annotation of anaphoric relations and topic continuity in Japanese conversation
|
L |
L2F/INESC-ID Lisboa, PORTUGAL |
Building and Exploring Semantic Equivalences Resources
|
Laboratoire National de métrologie et d'Essais (LNE), FRANCE |
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign
The REPERE Corpus: a multimodal corpus for person recognition
|
Lancaster University, UNITED KINGDOM |
Document Attrition in Web Corpora: an Exploration
|
Language Computer Corporation, UNITED STATES |
Pursuing power in Arabic on-line discussion forums
|
Language Technologies Research Centre, IIIT, Hyderabad, INDIA |
A GUI to Detect and Correct Errors in Hindi Dependency Treebank
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation
|
LDC, UNITED STATES |
LDC Forced Aligner
Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure
Linguistic Resources for Handwriting Recognition and Translation Evaluation
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
LDC Language Resource Database: Building a Bibliographic Database
|
LDC / University of Pennsylvania, UNITED STATES |
Further Developments in Treebank Error Detection Using Derivation Trees
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
Expanding Arabic Treebank to Speech: Results from Broadcast News
|
Leshan Normal University, CHINA |
CLTC: A Chinese-English Cross-lingual Topic Corpus
|
Lexical Computing Ltd, UNITED KINGDOM |
Word Sketches for Turkish
|
LIF - CNRS, FRANCE |
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
|
LIG, FRANCE |
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
|
LIMSI ― CNRS, FRANCE |
Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
WebAnnotator, an Annotation Tool for Web Pages
Cross-lingual studies of ASR errors: paradigms for perceptual evaluations
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
Evolution of Event Designation in Media: Preliminary Study
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign
A Rough Set Formalization of Quantitative Evaluation with Ambiguity
Kitten: a tool for normalizing HTML and extracting its textual content
Evaluation of Unsupervised Information Extraction
A contrastive review of paraphrase acquisition techniques
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
The LRE Map. Harmonising Community Descriptions of Resources
Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis
Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier
Designing French Tale Corpora for Entertaining Text To Speech Synthesis
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation
The META-SHARE Metadata Schema for the Description of Language Resources
|
LIMSI ― CNRS & Université Paris-Sud, FRANCE |
Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French
|
Linguamatics LTD, UNITED KINGDOM |
CALBC: Releasing the Final Corpora
|
Linguateca/FCCN, PORTUGAL |
Págico: Evaluating Wikipedia-based information retrieval in Portuguese
|
Linguatec, GERMANY |
Large Scale Lexical Analysis
|
LinkedIn Corp., UNITED STATES |
Annotations for Power Relations on Email Threads
|
Linköping University, SWEDEN |
Alignment-based reordering for SMT
Eye Tracking as a Tool for Machine Translation Error Analysis
On the practice of error analysis for machine translation evaluation
Error profiling for evaluation of machine-translated text: a Polish-English case study
|
LIPN ― CNRS, FRANCE |
The Quaero Evaluation Initiative on Term Extraction
|
LORIA ― CNRS, FRANCE |
Representation of linguistic and domain knowledge for second language learning in virtual worlds
|
Loyola College in Maryland, UNITED STATES |
Creating and Curating a Cross-Language Person-Entity Linking Collection
|
LPP ― CNRS, FRANCE |
Designing French Tale Corpora for Entertaining Text To Speech Synthesis
|
Ludwig-Maximilians-Universität München, Bavarian Archive for Speech Signals, GERMANY |
Statistical Evaluation of Pronunciation Encoding
|
Lund University, SWEDEN |
SMALLWorlds -- Multilingual Content-Controlled Monologues
Constructing Large Proposition Databases
Visualizing Sentiment Analysis on a User Forum
|
Lymba Corporation, UNITED STATES |
A Tool for Extracting Conversational Implicatures
Polaris: Lymba's Semantic Parser
|
M |
Macquarie University, AUSTRALIA |
Irish Treebanking and Parsing: A Preliminary Evaluation
The Australian National Corpus: National Infrastructure for Language Resources
A Framework for Evaluating Text Correction
|
Masarykova Univerzita, Brno, CZECH REPUBLIC |
Building a 70 billion word corpus of English from ClueWeb
A database of semantic clusters of verb usages
Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification
Legal electronic dictionary for Czech
|
Max Planck Institute for Evolutionary Anthropology, GERMANY |
Glottolog/Langdoc: Increasing the visibility of grey literature for low-density languages
The Open Linguistics Working Group
|
Max Planck Institute for Psycholinguistics, NETHERLANDS |
AVATecH ― automated annotation through audio and video analysis
ELAN development, keeping pace with communities' needs
Semantic metadata mapping in practice: the Virtual Language Observatory
Towards Automatic Gesture Stroke Detection
Federated Search: Towards a Common Search Infrastructure
Proper Language Resource Centers
Standardizing a Component Metadata Infrastructure
Citing on-line Language Resources
Rendering Endangered Lexicons Interoperable through Standards Harmonization: the RELISH project
The Language Archive ― a new hub for language resources
RELcat: a Relation Registry for ISOcat data categories
|
Meertens Institute, NETHERLANDS |
Dynamic web service deployment in a cloud environment
|
Meraka Institute, CSIR, SOUTH AFRICA |
Aspects of a Legal Framework for Language Resource Management
|
Mercedes-Benz R&D North America, UNITED STATES |
Textual Characteristics for Language Engineering
A Survey of Text Mining Architectures and the UIMA Standard
|
Mersin University, TURKEY |
Construction of the Turkish National Corpus (TNC)
|
MICA Institute, HUST, VIET NAM |
Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu
|
MICA Institute, VIET NAM |
MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels
|
Microsoft Research Asia, CHINA |
Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
|
Microsoft Research India, INDIA |
An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
|
Microsoft Research, UNITED STATES |
Statistical Section Segmentation in Free-Text Clinical Records
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
|
Middle East Technical University, TURKEY |
METU Turkish Discourse Bank Browser
|
MIRACL, TUNISIA |
Clause-based Discourse Segmentation of Arabic Texts
|
MIT Computer Science and Artificial Intelligence Lab, UNITED STATES |
Word Sense Inventories by Non-Experts
Representing General Relational Knowledge in ConceptNet 5
A large scale annotated child language construction database
|
MITRE Corporation, UNITED STATES |
International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned
|
MODUL University Vienna, Department of New Media Technology, AUSTRIA |
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
|
Møller-Trøndelag Kompetansesenter, NORWAY |
BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users
|
N |
Nagoya University, JAPAN |
Causal analysis of task completion errors in spoken music retrieval interactions
Dictionary Look-up with Katakana Variant Recognition
|
Nara Institute of Science and Technology, JAPAN |
Causal analysis of task completion errors in spoken music retrieval interactions
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
|
National Institute for Japanese Language, JAPAN |
Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations
|
National Institute of Informatics, JAPAN |
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques
Annotating Factive Verbs
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure
|
National Institute of Information and Communications Technology, JAPAN |
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
|
National Institute of Standards and Technology (NIST), UNITED STATES |
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
|
National Taiwan University, TAIWAN, PROVINCE OF CHINA |
Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition
NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks
Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
|
National Technical University of Athens, Department of Electrical and Computer Engineering, GREECE |
A hierarchical approach with feature selection for emotion recognition from speech
|
National University of Ireland, IRELAND |
Semi-Supervised Technical Term Tagging With Minimal User Feedback
Expertise Mining for Enterprise Content Management
|
National University of Singapore, SINGAPORE |
Affective Common Sense Knowledge Acquisition for Sentiment Analysis
|
Natural Language Processing Research Group, UNITED KINGDOM |
Unsupervised document zone identification using probabilistic graphical models
|
NCSR Demokritos Institute, GREECE |
Evaluation of Online Dialogue Policy Learning Techniques
The SYNC3 Collaborative Annotation Tool
|
Nederlandse Taalunie, NETHERLANDS |
Smooth Sailing for STEVIN
|
Neofonie GmbH, GERMANY |
DBpedia: A Multilingual Cross-domain Knowledge Base
|
NETCONOMY Software & Consulting, AUSTRIA |
EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems
|
New York University, UNITED STATES |
Challenges in the Knowledge Base Population Slot Filling Task
|
NILC, BRAZIL |
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
|
NI Systems (India) Pvt. Ltd. Bangalore, INDIA |
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics
|
Nokia Research Center, CHINA |
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment
|
North-West University, SOUTH AFRICA |
Aspects of a Legal Framework for Language Resource Management
|
Norwegian University of Science and Technology, NORWAY |
The Nordic Dialect Corpus
|
NTT Cyber Space Laboratories, NTT Corporation, JAPAN |
Constructing a Class-Based Lexical Dictionary using Interactive Topic Models
|
P |
Paideia Inc, UNITED STATES |
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
|
Pattern Science AG, GERMANY |
Textual Characteristics for Language Engineering
A Survey of Text Mining Architectures and the UIMA Standard
|
Polish Academy of Sciences, Institute of Computer Science, POLAND |
Creating a Coreference Resolution System for Polish
Towards a comprehensive open repository of Polish language resources
Towards an LFG parser for Polish: An exercise in parasitic grammar development
PoliMorf: a (not so) new open morphological dictionary for Polish
Web Service integration platform for Polish linguistic resources
The Polish Sejm Corpus
|
Politecnico di Milano, DEI, ITALY |
Evaluating Multi-focus Natural Language Queries over Data Services
|
Politecnico di Torino, Dipartimento di Automatica e Informatica, ITALY |
A platform-independent user-friendly dictionary from Italian to LIS
|
Postech, KOREA, REPUBLIC OF |
Grammatical Error Annotation for Korean Learners of Spoken English
|
PRESTO, JST, JAPAN |
Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako
|
Princeton University, UNITED STATES |
The MASC Word Sense Corpus
Empirical Comparisons of MASC Word Sense Annotations
|
PUC-Rio, BRAZIL |
Págico: Evaluating Wikipedia-based information retrieval in Portuguese
|
PUCRS, BRAZIL |
Corpus+WordNet thesaurus generation for ontology enriching
Combining Formal Concept Analysis and semantic information for building ontological structures from texts: an exploratory study
A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever
|
R |
RACAI, Romanian Academy Bucharest, ROMANIA |
ROMBAC: The Romanian Balanced Annotated Corpus
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora
Collecting and Using Comparable Corpora for Statistical Machine Translation
|
Radboud University Nijmegen, CLST, NETHERLANDS |
An Oral History Annotation Tool for INTER-VIEWs
Collecting and Analysing Chats and Tweets in SoNaR
Collection of a corpus of Dutch SMS
|
Radboud University Nijmegen, NETHERLANDS |
DutchSemCor: Targeting the ideal sense-tagged corpus
Beyond SoNaR: towards the facilitation of large corpus building efforts
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
|
RATP, FRANCE |
DECODA: a call-centre human-human spoken conversation corpus
|
Research Computing Center Moscow State University Vorobievy Gory, RUSSIAN FEDERATION |
Automatic Term Recognition Needs Multiple Evidence
|
Rijksuniversiteit Groningen, NETHERLANDS |
Developing a large semantically annotated corpus
Multi-Layer Discourse Annotation of a Dutch Text Corpus
Large aligned treebanks for syntax-based machine translation
|
Romanian Academy, Research Institute for Artificial Intelligence, ROMANIA |
Adding Morpho-semantic Relations to the Romanian Wordnet
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information
|
Ruhr-Universität Bochum, GERMANY |
The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
|
Rutgers University, UNITED STATES |
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
|
RWTH Aachen University, GERMANY |
Arabic-Segmentation Combination Strategies for Statistical Machine Translation
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
|
S |
Saarland University, Department of Applied Linguistics, GERMANY |
Coreference in Spoken vs. Written Texts: a Corpus-based Analysis
|
Saarland University, Department of Computational Linguistics, GERMANY |
Using Verb Subcategorization for Word Sense Disambiguation
German and English Treebanks and Lexica for Tree-Adjoining Grammars
CLIMB grammars: three projects using metagrammar engineering
|
Saarland University, Spoken Language Systems, GERMANY |
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
A Gold Standard for Relation Extraction in the Food Domain
|
Sabanci University, TURKEY |
SUTAV: A Turkish Audio-Visual Database
|
Santa Anna IT Research Institute AB & Linköping University, SWEDEN |
A good space: Lexical predictors in word space evaluation
This also affects the context - Errors in extraction based summaries
|
SAP AG, GERMANY |
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
|
Sarah M. Taylor Consulting, UNITED STATES |
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
|
Sara, NETHERLANDS |
Dynamic web service deployment in a cloud environment
|
SDL Language Weaver, UNITED STATES |
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
|
Senshu University, JAPAN |
A Search Tool for FrameNet Constructicon
|
Seoul National University, Center for Humanities and Information, KOREA, REPUBLIC OF |
Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
|
Seoul National University, Department of Linguistics, KOREA, REPUBLIC OF |
Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
|
Seoul National University, KOREA, REPUBLIC OF |
Dysarthric Speech Database for Development of QoLT Software Technology
|
Seoul National University of Education, KOREA, REPUBLIC OF |
Grammatical Error Annotation for Korean Learners of Spoken English
|
Shahid Beheshti University, IRAN, ISLAMIC REPUBLIC OF |
Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
|
Shanghai Jiao Tong University, CHINA |
Semi-Supervised Technical Term Tagging With Minimal User Feedback
Spell Checking for Chinese
|
Shiraz University, IRAN, ISLAMIC REPUBLIC OF |
Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions
|
Siena College, Department of Computer Science, UNITED STATES |
Revealing Contentious Concepts Across Social Groups
|
Simon Fraser University, CANADA |
A review corpus annotated for negation, speculation and their scope
|
Social Media Research Group, Institute of Technology Tallaght, IRELAND |
Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
|
Society for Danish Language and Literature, DENMARK |
Towards a richer wordnet representation of properties
|
Sofia University, BULGARIA |
NgramQuery - Smart Information Extraction from Google N-gram using External Resources
|
Sonear, FRANCE |
DECODA: a call-centre human-human spoken conversation corpus
|
Sorano Ltd., DENMARK |
Creation and use of Language Resources in a Question-Answering eHealth System
|
Stanford University, UNITED STATES |
SUTime: A library for recognizing and normalizing time expressions
Suffix Trees as Language Models
The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
|
State University of Maringa, Department of Informatics, BRAZIL |
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
|
Stockholm University, Department of Linguistics, SWEDEN |
3rd party observer gaze as a continuous measure of dialogue flow
|
Stockholm University, SWEDEN |
Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text
|
Systems Plus, Inc., UNITED STATES |
Creating HAVIC: Heterogeneous Audio Visual Internet Collection
|
SYSTRAN SA, FRANCE |
Automatic Translation of Scientific Documents in the HAL Archive
|
T |
TAUS, NETHERLANDS |
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
|
Technical University Braunschweig, GERMANY |
Association Norms of German Noun Compounds
|
Technical University of Crete, GREECE |
SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks
Associative and Semantic Features Extracted From Web-Harvested Corpora
|
Technical University of Denmark, DENMARK |
Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies
|
Technische Universität Berlin, DAI-Labor, GERMANY |
GerNED: A German Corpus for Named Entity Disambiguation
|
Technische Universität Darmstadt, GERMANY |
Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution
UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF
|
Technische Universität Darmstadt, Ubiquitous Knowledge Processing Lab, GERMANY |
The Open Linguistics Working Group
|
Technische Universität Wien, AUSTRIA |
Applying Random Indexing to Structured Data to Find Contextually Similar Words
|
Telefónica Investigación y Desarrollo, SPAIN |
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech
|
TextShuttle GmbH, SWITZERLAND |
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
|
The British University in Dubai, UNITED ARAB EMIRATES |
Arabic Word Generation and Modelling for Spell Checking
Automatic Extraction and Evaluation of Arabic LFG Resources
|
The Catholic University of Korea, KOREA, REPUBLIC OF |
Dysarthric Speech Database for Development of QoLT Software Technology
|
The Communication Centre for the Deaf and Hard-of-Hearing, ICELAND |
BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users
|
The Institute of Statistical Mathematics, JAPAN |
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
|
The Netherlands Cancer Institute, NETHERLANDS |
NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
|
Tilde, LATVIA |
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Collecting and Using Comparable Corpora for Statistical Machine Translation
Latvian and Lithuanian Named Entity Recognition with TildeNER
|
Tokyo Institute of Technology, JAPAN |
The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues
Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions
|
Tokyo University of Foreign Studies, JAPAN |
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
|
Tokyo University of Technology, JAPAN |
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
|
Toyohashi University of Technology, JAPAN |
Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions
|
Trinity College Dublin, IRELAND |
An audiovisual political speech analysis incorporating eye-tracking and perception data
Irish Treebanking and Parsing: A Preliminary Evaluation
The Herme Database of Spontaneous Multimodal Human-Robot Dialogues
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
|
Tsinghua University, CHINA |
Affective Common Sense Knowledge Acquisition for Sentiment Analysis
CLTC: A Chinese-English Cross-lingual Topic Corpus
|
Tübitak-Bilgem, TURKEY |
Turkish Paraphrase Corpus
|
U |
UCSC UARC, UNITED STATES |
A Corpus for Research on Deliberation and Debate
|
UJF-Grenoble1, FRANCE |
Dbnary: Wiktionary as a LMF based Multilingual RDF network
Collection of a Large Database of French-English SMT Output Corrections
|
UNDL Foundation, SWITZERLAND |
Le Petit Prince in UNL
|
Union College, UNITED STATES |
Revealing Contentious Concepts Across Social Groups
|
Universidad Autónoma de Madrid, Departamento de Lingüística, SPAIN |
Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation
|
Universidad Autónoma de Madrid, SPAIN |
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
|
Universidad Complutense de Madrid, SPAIN |
SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
MaltOptimizer: A System for MaltParser Optimization
|
Universidad de Alicante, Departamento Lenguajes y Sistemas Informáticos, SPAIN |
Massively Increasing TIMEX3 Resources: A Transduction Approach
|
Universidad de Alicante (UA), SPAIN |
TIMEN: An Open Temporal Expression Normalisation Resource
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text
|
Universidad de Barcelona (UB), SPAIN |
VERTa: Linguistic features in MT evaluation
|
Universidad de Granada, SPAIN |
Linguistic knowledge for specialized text production
Medical Term Extraction in an Arabic Medical Corpus
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
|
Universidad de Huelva, SPAIN |
A review corpus annotated for negation, speculation and their scope
|
Universidad del País Vasco, SPAIN |
BUCEADOR, a multi-language search engine for digital libraries
|
Universidad de Santiago de Compostela, SPAIN |
A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
|
Universidad de Vigo, SPAIN |
BUCEADOR, a multi-language search engine for digital libraries
|
Universidad de Zaragoza, GTC, I3A, SPAIN |
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
|
Universidade Aberta, PORTUGAL |
Building and Exploring Semantic Equivalences Resources
|
Universidade de Lisboa, PORTUGAL |
TimeBankPT: A TimeML Annotated Corpus of Portuguese
A PropBank for Portuguese: the CINTIL-PropBank
Rembrandt - a named-entity recognition framework
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese
|
Universidade de São Paulo, BRAZIL |
Propbank-Br: a Brazilian Treebank annotated with semantic role labels
|
Universidade de São Paulo, Center of Computational Linguistics (NILC), BRAZIL |
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
|
Universidade de São Paulo, Department of Informatics, BRAZIL |
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
|
Universidade de São Paulo, School of Arts, Sciences and Humanities, BRAZIL |
Corpus-based Referring Expressions Generation
Portuguese Text Generation from Large Corpora
|
Universidade de Vigo, SPAIN |
A Galician Syntactic Corpus with Application to Intonation Modeling
|
Universidade do Minho, PORTUGAL |
Structural alignment of plain text books
|
Universidade Federal de Minas Gerais, BRAZIL |
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese
|
Universidade Técnica de Lisboa, Instituto Superior Técnico, PORTUGAL |
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT
Dealing with unknown words in statistical machine translation
|
Universidad Nacional Autónoma de México, Instituto de Ingeniería, MEXICO |
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
|
Universidad Nacional de Córdoba, FaMAF, ARGENTINA |
Evaluating and improving syntactic lexica by plugging them within a parser
|
Universidad Nacional de Córdoba, Medina Allende s/n, Ciudad Universitaria, FaMAF, ARGENTINA |
Extending the adverbial coverage of a French morphological lexicon
|
Universidad Nacional de Educación a Distancia (UNED), Madrid, NLP&IR group, SPAIN |
Evaluating Machine Reading Systems through Comprehension Tests
|
Universidad Politécnica de Cataluña, SPAIN |
Building Synthetic Voices in the META-NET Framework
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output
FreeLing 3.0: Towards Wider Multilinguality
BUCEADOR, a multi-language search engine for digital libraries
|
Universidad Politécnica de Madrid, E.T.S.I. de Telecomunicacion, SPAIN |
Investigating Verbal Intelligence Using the TF-IDF Approach
|
Universidad Politécnica de Madrid, Ontology Engineering Group, SPAIN |
Collaborative semantic editing of linked data lexica
|
Universidad Politécnica de Valencia, SPAIN |
Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System
|
Universidad Pompeu Fabra, SPAIN |
Iula2Standoff: a tool for creating standoff documents for the IULACT
The IULA Treebank
Towards a User-Friendly Platform for Building Language Resources based on Web Services
Spell Checking in Spanish: The Case of Diacritic Accents
The FLaReNet Strategic Language Resource Agenda
|
Università Cattolica del Sacro Cuore, Milan, ITALY |
First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin
|
Università degli Studi di Siena, ITALY |
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
|
Università degli Studi di Verona, Department of Foreign Languages and Literatures, ITALY |
A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
|
Università degli Studi Roma Tre, Department of Linguistics, ITALY |
A topologic view of Topic and Focus marking in Italian
|
Università degli Studi Roma Tre, Dipartimento di Scienze dell'Educazione, ITALY |
Pedagogical stances and their multimodal signals.
|
Università di Bologna, Department of Computer Science, ITALY |
AnIta: a powerful morphological analyser for Italian
|
Università di Bologna, Department of Linguistics and Oriental Studies, ITALY |
AnIta: a powerful morphological analyser for Italian
A topologic view of Topic and Focus marking in Italian
|
Università di Firenze, Department of Italian Studies, ITALY |
A topologic view of Topic and Focus marking in Italian
|
Università di Firenze, ITALY |
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
|
Università di Macerata, ITALY |
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty
|
Università di Padova, Department of Information Engineering, ITALY |
A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties
|
Università di Pavia, ITALY |
Annotating Qualia Relations in Italian and French Complex Nominals
|
Università di Pisa, Dipartimento di Linguistica, ITALY |
LexIt: A Computational Resource on Italian Argument Structure
|
Università di Pisa, ITALY |
Creation of a bottom-up corpus-based ontology for Italian Linguistics
Enriching the ISST-TANL Corpus with Semantic Frames
|
Università di Roma La Sapienza, Dipartimento di Comunicazione e Ricerca Sociale, ITALY |
Pedagogical stances and their multimodal signals.
|
Università di Roma La Sapienza, ITALY |
A New Method for Evaluating Automatically Learned Terminological Taxonomies
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
|
Università di Roma Tor Vergata, ART Research Group, ITALY |
PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases
|
Università di Roma Tor Vergata, ITALY |
Intelligibility assessment in forensic applications
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
|
Università di Torino, Dipartimento di Informatica, ITALY |
The Parallel-TUT: a multilingual and multiformat treebank
A treebank-based study on the influence of Italian word order on parsing performance
|
Università di Torino, ITALY |
NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
|
Università di Trento, Dipartimento Ingegneria e Scienza dell'Informazione, ITALY |
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing
A Parallel Corpus of Music and Lyrics Annotated with Emotions
Domain-specific vs. Uniform Modeling for Coreference Resolution
|
Università di Trento, ITALY |
DeCour: a corpus of DEceptive statements in Italian COURts
|
Università di Verona, ITALY |
Annotating Qualia Relations in Italian and French Complex Nominals
|
Università Federico II di Napoli, Dipartimento di Scienze Fisiche, ITALY |
A treebank-based study on the influence of Italian word order on parsing performance
|
Università Federico II di Napoli, ITALY |
W-PhAMT: A web tool for phonetic multilevel timeline visualization
Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabification
|
Università San Raffaele di Roma, ITALY |
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
|
Universität Augsburg, Institut für Informatik, GERMANY |
Using DiAML and ANVIL for multimodal dialogue annotations
|
Universitat Autònoma de Barcelona (UAB), SPAIN |
Using Language Resources in Humanities research
|
Universität Bielefeld, Applied Informatics Group, GERMANY |
PAMOCAT: Automatic retrieval of specified postures
|
Universität Bielefeld, Artificial Intelligence Group, GERMANY |
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
|
Universität Bielefeld, CITEC Central Lab, GERMANY |
PAMOCAT: Automatic retrieval of specified postures
Collaborative semantic editing of linked data lexica
|
Universität Bielefeld, GERMANY |
Suffix Trees as Language Models
ULex: new data models and a mobile environment for corpus enrichment.
|
Universität Bremen, GERMANY |
A Speech and Gesture Spatial Corpus in Assisted Living
An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use
|
Universitat d'Alacant, DLSI, SPAIN |
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese
|
Universitat d'Alacant (UA), SPAIN |
A finite-state morphological transducer for Kyrgyz
Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation
|
Universitat de Barcelona (UB), SPAIN |
The IULA Treebank
|
Universität des Saarlandes, GERMANY |
Suffix Trees as Language Models
A methodology for the extraction of information about the usage of formulaic expressions in scientific texts
|
Universitatea Alexandru Ioan Cuza, ROMANIA |
Evaluating Machine Reading Systems through Comprehension Tests
Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations
|
Universitatea din București, ROMANIA |
The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System
|
Universität Erlangen-Nuremberg, GERMANY |
Efficient Dependency Graph Matching with the IMS Open Corpus Workbench
|
Universität Freiburg, GERMANY |
Creation and use of Language Resources in a Question-Answering eHealth System
|
Universität Hamburg, GERMANY |
Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation
|
Universität Heidelberg, GERMANY |
Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards
|
Universität Heidelberg, ICL, GERMANY |
French and German Corpora for Audience-based Text Type Classification
|
Universität Hildesheim, GERMANY |
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content
Yes we can!? Annotating English modal verbs
A Tool/Database Interface for Multi-Level Analyses
|
Universität Innsbruck, AUSTRIA |
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus
|
Universitat Jaume I, SPAIN |
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
|
Universität Jena, Julie Lab, GERMANY |
CALBC: Releasing the Final Corpora
|
Universität Karlsruhe, GERMANY |
The KIT Lecture Corpus for Speech Translation
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation
|
Universität Leipzig, Department of Computer Science, Natural Language Processing Group, GERMANY |
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis
The Open Linguistics Working Group
|
Universität Leipzig, GERMANY |
Learning Categories and their Instances by Contextual Features
Textual Characteristics for Language Engineering
Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages
|
Universität München, Research Unit Quantitative Language Comparison, GERMANY |
The Open Linguistics Working Group
|
Universitat Politècnica de Catalunya, TALP center, SPAIN |
Highlighting relevant concepts from Topic Signatures
Summarizing a multimodal set of documents in a Smart Room
|
Universitat Politècnica de Valencia, SPAIN |
The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
|
Universitat Pompeu Fabra, Instituto Universitario de Lingüística Aplicada, SPAIN |
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language
NeoTag: a POS Tagger for Grammatical Neologism Detection
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks
|
Universitat Pompeu Fabra, SPAIN |
A Classification of Adjectives for Polarity Lexicons Enhancement
A voting scheme to detect semantic underspecification
A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality
Automatic lexical semantic classification of nouns
The IULA Treebank
The CONCISUS Corpus of Event Summaries
Using Language Resources in Humanities research
Text Simplification Tools for Spanish
|
Universität Potsdam, GERMANY |
Yes we can!? Annotating English modal verbs
Fine-grained German Sentiment Analysis on Social Media
|
Universität Saarland, Computational Linguistics Department, GERMANY |
The Open Linguistics Working Group
|
Universität Saarland, Department of Computational Linguistics and Phonetics, GERMANY |
Task-Driven Linguistic Analysis based on an Underspecified Features Representation
|
Universität Saarland, GERMANY |
Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach
Evaluating Machine Reading Systems through Comprehension Tests
|
Universität Saarland, Spoken Language Systems, GERMANY |
Task-Driven Linguistic Analysis based on an Underspecified Features Representation
|
Universität Stuttgart, GERMANY |
The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles
German "nach"-Particle Verbs in Semantic Theory and Corpus Data
Association Norms of German Noun Compounds
Analyzing and Aligning German compound nouns
A Tool/Database Interface for Multi-Level Analyses
|
Universität Stuttgart, IMS, GERMANY |
Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank
Automatic classification of German "an" particle verbs
Making Ellipses Explicit in Dependency Conversion for a German Treebank
Logical metonymies and qualia structures: an annotated database of logical metonymies for German
A Corpus-based Study of the German Recipient Passive
Irish Treebanking and Parsing: A Preliminary Evaluation
French and German Corpora for Audience-based Text Type Classification
Adapting and evaluating a generic term extraction tool
|
Universität Tübingen, GERMANY |
Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC
A Metadata Editor to Support the Description of Linguistic Resources
A Repository for the Sustainable Management of Research Data
Standardizing a Component Metadata Infrastructure
Large aligned treebanks for syntax-based machine translation
|
Universität Tübingen - SFS, GERMANY |
A Comparative Evaluation of Word Sense Disambiguation Algorithms for German
|
Universität Ulm, GERMANY |
A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System
|
Universität Ulm, Institute of Communications Engineering, GERMANY |
Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues
Using multimodal resources for explanation approaches in intelligent systems
Investigating Verbal Intelligence Using the TF-IDF Approach
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores
Speech and Language Resources for LVCSR of Russian
|
Universität Ulm, Institute of Neural Information Processing, GERMANY |
An audiovisual political speech analysis incorporating eye-tracking and perception data
|
Universität Wien, Zentrum für Translationswissenschaft, AUSTRIA |
Knowledge-Rich Context Extraction and Ranking with KnowPipe
|
Universität Zürich, SWITZERLAND |
German Verb Patterns and Their Implementation in an Electronic Dictionary
Automatic MT Error Analysis: Hjerson Helping Addicter
Terra: a Collection of Translation Error-Annotated Corpora
|
Université Aix-Marseille, FRANCE |
Annotated Bibliographical Reference Corpora in Digital Humanities
|
Université Aix-Marseille, Laboratoire Parole et Langage, FRANCE |
SPPAS: a tool for the phonetic segmentation of speech
Orthographic Transcription: which enrichment is required for phonetization?
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
|
Université Aix-Marseille, LIF-CNRS, FRANCE |
Syntactic annotation of spontaneous speech: application to call-center conversation data
DECODA: a call-centre human-human spoken conversation corpus
|
Université catholique de Louvain, BELGIUM |
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends
|
Université catholique de Louvain, CENTAL, Institut Langage et Communication, BELGIUM |
Extraction of unmarked quotations in Newspapers
|
Université d'Avignon, CERI/LIA, FRANCE |
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
|
Université d'Avignon, FRANCE |
DECODA: a call-centre human-human spoken conversation corpus
|
Université d'Avignon, LIA, FRANCE |
Annotated Bibliographical Reference Corpora in Digital Humanities
|
Université de Genève, Département de Linguistique, SWITZERLAND |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns
The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction
|
Université de Genève, FTI/TIM, SWITZERLAND |
Acquisition of Syntactic Simplification Rules for French
Evaluating Appropriateness Of System Responses In A Spoken CALL Game
|
Université de Genève, SWITZERLAND |
Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification
Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
|
Université de Genève, TIM/ISSCO, SWITZERLAND |
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems
Annotating Qualia Relations in Italian and French Complex Nominals
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System
|
Université de Montréal, CANADA |
Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology
Texto4Science: a Quebec French Database of Annotated Short Text Messages
|
Université de Montréal, OLST, CANADA |
Logic Based Methods for Terminological Assessment
Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation
|
Université de Nancy, LORIA, FRANCE |
Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents
Representation of linguistic and domain knowledge for second language learning in virtual worlds
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
|
Université de Nantes, LINA, FRANCE |
Tackling interoperability issues within UIMA workflows
Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
|
Université de Nice Sophia-Antipolis, équipe RL, Laboratoire I3S, FRANCE |
Unsupervised acquisition of concatenative morphology
|
Université de Provence, FRANCE |
An ontological approach to model and query multimodal concurrent linguistic annotations
|
Université de Rennes 1, IRISA, FRANCE |
Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora
Towards Fully Automatic Annotation of Audio Books for TTS
|
Université de Savoie, FRANCE |
Ontoterminology: How to unify terminology and ontology into a single paradigm
|
Université de Strasbourg, LiLPA, FRANCE |
French and German Corpora for Audience-based Text Type Classification
|
Université de Toulouse, CLLE, FRANCE |
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
|
Université de Toulouse, IRIT, FRANCE |
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
|
Université du Maine, LIUM, FRANCE |
TED-LIUM: an Automatic Speech Recognition dedicated corpus
Automatic Translation of Scientific Documents in the HAL Archive
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora
|
Université du Sud Toulon Var, FRANCE |
An ontological approach to model and query multimodal concurrent linguistic annotations
|
Universiteit Antwerp, BELGIUM |
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Irregularity Detection in Categorized Document Corpora
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language
|
Universiteit Antwerp, CliPS, BELGIUM |
ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories
|
Universiteit Antwerpen, BELGIUM |
Vreselijk mooi! (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives.
|
Universiteit Gent, BELGIUM |
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
|
Universiteit Twente, NETHERLANDS |
Evaluating automatic cross-domain Dutch semantic role annotation
Creating a Data Collection for Evaluating Rich Speech Retrieval
|
Universiteit Utrecht, NETHERLANDS |
The FLaReNet Strategic Language Resource Agenda
|
Universiteit Utrecht, UiL-OTS, NETHERLANDS |
Recent Developments in CLARIN-NL
Evaluating automatic cross-domain Dutch semantic role annotation
|
Universiteit van Amsterdam, NETHERLANDS |
Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions
|
Universiteit van Tilburg, NETHERLANDS |
Using DiAML and ANVIL for multimodal dialogue annotations
The coding and annotation of multimodal dialogue acts
DutchSemCor: Targeting the ideal sense-tagged corpus
ISO 24617-2: A semantically-based standard for dialogue annotation
|
Universiteit van Tilburg, TiCC, NETHERLANDS |
Beyond SoNaR: towards the facilitation of large corpus building efforts
|
Université Montpellier 2, LIRMM, FRANCE |
A new dynamic approach for lexical networks evaluation
|
Université Paris 13, LDI, FRANCE |
Building a database of French frozen adverbial phrases
|
Université Paris 3 - Sorbonne Nouvelle, LaTTiCe, CNRS, FRANCE |
Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
|
Université Paris 7, FRANCE |
A Phonemic Corpus of Polish Child-Directed Speech
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus
Semantic annotation of French corpora: animacy and verb semantic classes
|
Université Paris 8 Saint Denis, FRANCE |
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
|
Université Paris-Est, FRANCE |
Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger
|
Université Paris-Est, LIGM, FRANCE |
A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
Extending the adverbial coverage of a French morphological lexicon
|
Université Paris Ouest Nanterre La Défense, MoDyCo-CNRS, FRANCE |
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement
|
Université Paul Sabatier, IRIT, FRANCE |
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber)
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs
|
Universitetet i Bergen, Department of Linguistic, NORWAY |
A Morphological Analyzer For Wolof Using Finite-State Techniques
|
Universitetet i Bergen, NORWAY |
Design and compilation of a specialized Spanish-German parallel corpus
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
|
Universitetet i Oslo, Institutt for lingvistiske og nordiske studier, NORWAY |
The Nordic Dialect Corpus
|
Universitetet i Oslo, NORWAY |
Págico: Evaluating Wikipedia-based information retrieval in Portuguese
The Nordic Dialect Corpus
The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content
|
Université Toulouse II, U.R.I Octogone-Lordat, FRANCE |
MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions.
|
University at Albany, SUNY, UNITED STATES |
Revealing Contentious Concepts Across Social Groups
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language
|
University College Dublin, IRELAND |
English to Indonesian Transliteration to Support English Pronunciation Practice
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
|
University College London, UNITED KINGDOM |
A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
|
University of Amsterdam, ACLC, NETHERLANDS |
NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy
|
University of Applied Sciences Chur, Faculty of Information Science, SWITZERLAND |
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources
|
University of Applied Sciences Vern, Zagreb, CROATIA |
Addressing polysemy in bilingual lexicon extraction from comparable corpora
|
University of Aston, UNITED KINGDOM |
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
|
University of Athens, GREECE |
Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue
|
University of Belgrade, Faculty of Mathematics, SERBIA |
Multimedia database of the cultural heritage of the Balkans
|
University of Belgrade, SERBIA |
A tool for enhanced search of multilingual digital libraries of e-journals
|
University of Birmingham, School of Computer Science, UNITED KINGDOM |
Building Text-to-Speech Systems for Resource Poor Languages
|
University of Boston, UNITED STATES |
Detecting Reduplication in Videos of American Sign Language
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking
|
University of Brighton, UNITED KINGDOM |
A Repository of Data and Evaluation Resources for Natural Language Generation
LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments
|
University of California, Santa Cruz, UNITED STATES |
An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style
|
University of Cambridge, UNITED KINGDOM |
Reclassifying subcategorization frames for experimental analysis and stimulus generation
Annotating progressive aspect constructions in the spoken section of the British National Corpus
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora
|
University of Colorado at Boulder, UNITED STATES |
Empty Argument Insertion in the Hindi PropBank
|
University of Colorado, UNITED STATES |
Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events
Annotating Story Timelines as Temporal Dependency Structures
|
University of Copenhagen, CST, DENMARK |
Automatic annotation of head velocity and acceleration in Anvil
DSim, a Danish Parallel Corpus for Text Simplification
Creation and use of Language Resources in a Question-Answering eHealth System
|
University of Copenhagen, DENMARK |
A voting scheme to detect semantic underspecification
Towards a richer wordnet representation of properties
A Distributed Resource Repository for Cloud-Based Machine Translation
Creation and use of Language Resources in a Question-Answering eHealth System
Multimodal Behaviour and Feedback in Different Types of Interaction
Feedback in Nordic First-Encounters: a Comparative Study
Polish Multimodal Corpus ― a collection of referential gestures
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
|
University of Düsseldorf, Institut für Sprache und Information, GERMANY |
German and English Treebanks and Lexica for Tree-Adjoining Grammars
|
University of Edinburgh, UNITED KINGDOM |
The BladeMistress Corpus: From Talk to Action in Virtual Worlds
A Database of Attribution Relations
|
University of Essex, UNITED KINGDOM |
DeCour: a corpus of DEceptive statements in Italian COURts
Assessing Crowdsourcing Quality through Objective Tasks
Applying Random Indexing to Structured Data to Find Contextually Similar Words
Domain-specific vs. Uniform Modeling for Coreference Resolution
|
University of Gothenburg, Språkbanken, SWEDEN |
Korp ― the corpus infrastructure of Språkbanken
The open lexical infrastructure of Språkbanken
|
University of Groningen, Institute of Artificial Intelligence, NETHERLANDS |
Pragmatic identification of the witness sets
|
University of Helsinki, FINLAND |
Boosting statistical tagger accuracy with simple rule-based grammars
Constructive Interaction for Talking about Interesting Topics
Representing the Translation Relation in a Bilingual Wordnet
Multimodal Corpus of Multi-party Conversations in Second Language
Rule-Based Detection of Clausal Coordinate Ellipsis
Improving corpus annotation productivity: a method and experiment with interactive tagging
Feedback in Nordic First-Encounters: a Comparative Study
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3
|
University of Iceland, ICELAND |
The Icelandic Parsed Historical Corpus (IcePaHC)
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries
|
University of Illinois at Urbana-Champaign, UNITED STATES |
YADAC: Yet another Dialectal Arabic Corpus
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
|
University of Jyväskylä, FINLAND |
Comparing computer vision analysis of signed language video with motion capture recordings
|
University of Latvia, LATVIA |
An implementation of a Latvian resource grammar in Grammatical Framework
|
University of Leeds, UNITED KINGDOM |
QurAna: Corpus of the Quran annotated with Pronominal Anaphora
QurSim: A corpus for evaluation of relatedness in short texts
Predicting Phrase Breaks in Classical and Modern Standard Arabic Text
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis
Development and Application of a Cross-language Document Comparability Metric
Identifying Word Translations from Comparable Documents Without a Seed Lexicon
Collecting and Using Comparable Corpora for Statistical Machine Translation
|
University of Leipzig, Natural Language Processing Group, GERMANY |
The Influence of Corpus Quality on Statistical Measurements on Language Resources
|
University of Limerick, IRELAND |
Evaluating Machine Reading Systems through Comprehension Tests
|
University of Ljubljana, Faculty of Arts, SLOVENIA |
Addressing polysemy in bilingual lexicon extraction from comparable corpora
Irregularity Detection in Categorized Document Corpora
|
University of Ljubljana, SLOVENIA |
Cleaning noisy wordnets
|
University of Łódź, POLAND |
Towards a comprehensive open repository of Polish language resources
|
University of Luxembourg, LUXEMBOURG |
NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge
|
University of Malta, MALTA |
Incorporating an Error Corpus into a Spellchecker for Maltese
Multimodal Behaviour and Feedback in Different Types of Interaction
A Repository of Data and Evaluation Resources for Natural Language Generation
|
University of Manchester, UNITED KINGDOM |
A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic.
Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries
Identification of Manner in Bio-Events
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench
GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
|
University of Maribor, SLOVENIA |
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles
|
University of Maryland, UNITED STATES |
Typing Race Games as a Method to Create Spelling Error Corpora
Creating and Curating a Cross-Language Person-Entity Linking Collection
Linguistic Resources for Handwriting Recognition and Translation Evaluation
|
University of Minho, LIAAD-INESC, PORTUGAL |
The Minho Quotation Resource
|
University of Newcastle, AUSTRALIA |
The Icelandic Parsed Historical Corpus (IcePaHC)
|
University of North Texas, UNITED STATES |
Learning Sentiment Lexicons in Spanish
Unsupervised Word Sense Disambiguation with Multilingual Representations
A Parallel Corpus of Music and Lyrics Annotated with Emotions
|
University of Osnabrück, Institute for Cognitive Science, GERMANY |
LexIt: A Computational Resource on Italian Argument Structure
|
University of Pennsylvania, IRCS, UNITED STATES |
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
|
University of Pennsylvania, UNITED STATES |
Experiences in Resource Generation for Machine Translation through Crowdsourcing
The Icelandic Parsed Historical Corpus (IcePaHC)
Automatic word alignment tools to scale production of manually aligned parallel texts
A corpus of general and specific sentences from news
|
University of Potsdam, Applied Computational Linguistics Lab, GERMANY |
SemScribe: Natural Language Generation for Medical Reports
|
University of Rijeka, Department of Informatics, CROATIA |
BLEU Evaluation of Machine-Translated English-Croatian Legislation
|
University of Rochester, UNITED STATES |
An Annotation Scheme for Quantifier Scope Disambiguation
|
University of Sheffield, DCS, UNITED KINGDOM |
Mapping WordNet synsets to Wikipedia articles
|
University of Sheffield, UNITED KINGDOM |
Matching Cultural Heritage items to Wikipedia
Assessing the Comparability of News Texts
Large Scale Semantic Annotation, Indexing and Search at The National Archives
TIMEN: An Open Temporal Expression Normalisation Resource
LIE: Leadership, Influence and Expertise
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing
Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles
Massively Increasing TIMEX3 Resources: A Transduction Approach
A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English
Assessing Crowdsourcing Quality through Objective Tasks
A light way to collect comparable corpora from the Web
Applying Random Indexing to Structured Data to Find Contextually Similar Words
Collecting and Using Comparable Corpora for Statistical Machine Translation
|
University of Southern California, Information Sciences Institute, UNITED STATES |
Ontologies of Linguistic Annotation: Survey and perspectives
The Open Linguistics Working Group
A generic formalism to represent linguistic corpora in RDF and OWL/DL
|
University of Southern California, Institute for Creative Technologies, UNITED STATES |
An audiovisual political speech analysis incorporating eye-tracking and perception data
The BladeMistress Corpus: From Talk to Action in Virtual Worlds
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems
The Twins Corpus of Museum Visitor Questions
|
University of Southern California, UNITED STATES |
The Twins Corpus of Museum Visitor Questions
|
University of Southern Denmark, DENMARK |
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser
|
University of Stirling, UNITED KINGDOM |
Affective Common Sense Knowledge Acquisition for Sentiment Analysis
|
University of Sussex, UNITED KINGDOM |
Corpus Annotation as a Scientific Task
Linguistic Analysis Processing Line for Bulgarian
|
University of Szeged, Department of Hungarian Linguistics, HUNGARY |
HunOr: A Hungarian-Russian Parallel Corpus
|
University of Szeged, Department of Informatics, HUNGARY |
HunOr: A Hungarian-Russian Parallel Corpus
|
University of Tartu, ESTONIA |
Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data
Constructive Interaction for Talking about Interesting Topics
|
University of Tartu, Institute of Computer Science, ESTONIA |
Robust clause boundary identification for corpus annotation
|
University of Texas at Arlington, UNITED STATES |
Evaluation of Online Dialogue Policy Learning Techniques
|
University of Texas at Dallas, Human Language Technology Research Institute, UNITED STATES |
EmpaTweet: Annotating and Detecting Emotions on Twitter
|
University of Texas at Dallas, UNITED STATES |
Tajik-Farsi Persian Transliteration Using Statistical Machine Translation
Annotating Spatial Containment Relations Between Events
|
University of Thessaly, Department of Computer and Communication Engineering, GREECE |
A hierarchical approach with feature selection for emotion recognition from speech
|
University of Toronto, CANADA |
Measuring Interlanguage: Native Language Identification with L1-influence Metrics
Detecting Reduplication in Videos of American Sign Language
|
University of Tsukuba, JAPAN |
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
|
University of Turin, Department of Computer Science, ITALY |
Pragmatic identification of the witness sets
|
University of Washington, Biomedical and Health Informatics, UNITED STATES |
Statistical Section Segmentation in Free-Text Clinical Records
|
University of Washington, Department of Linguistics, UNITED STATES |
Statistical Section Segmentation in Free-Text Clinical Records
|
University of Washington, UNITED STATES |
Effort of Genre Variation and Prediction of System Performance
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms
Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation
|
University of Wisconsin-Milwaukee, HIA, UNITED STATES |
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
|
University of Wolverhampton, Research Institute in Information and Language Processing, UNITED KINGDOM |
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations
|
University of Wolverhampton, UNITED KINGDOM |
Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach
A review corpus annotated for negation, speculation and their scope
Annotating Near-Identity from Coreference Disagreements
PET: a Tool for Post-editing and Assessing Machine Translation
|
University of Yamanashi, JAPAN |
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation
|
University of Zagreb, CROATIA |
Croatian Dependency Treebank: Recent Development and Initial Experiments
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian
|
University of Zagreb, Faculty of Humanities and Social Sciences, Department of Information Sciences, CROATIA |
Generation of Verbal Stems in Derivationally Rich Language
BLEU Evaluation of Machine-Translated English-Croatian Legislation
Addressing polysemy in bilingual lexicon extraction from comparable corpora
|
University of Zurich, Institute of Computational Linguistics, SWITZERLAND |
Dependency parsing for interaction detection in pharmacogenomics
MLSA - A Multi-layered Reference Corpus for German Sentiment Analysis
A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank
|
UPMF-Grenoble2, FRANCE |
Collection of a Large Database of French-English SMT Output Corrections
|
Uppsala University, SWEDEN |
A Basic Language Resource Kit for Persian
A Distributed Resource Repository for Cloud-Based Machine Translation
Parallel Data, Tools and Interfaces in OPUS
MaltOptimizer: A System for MaltParser Optimization
Large aligned treebanks for syntax-based machine translation
|
US Department of Defense, UNITED STATES |
Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine?
|
UTEM, CHILE |
A proposal for improving WordNet Domains
|