|
AFFILIATIONS: Browse articles of the conference sorted by institution
A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z -
A |
Aalto University School of Business, FINLAND |
Gold-standard for Topic-specific Sentiment Analysis of Economic Texts
|
Acapela Group S.A., BELGIUM |
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
|
Access for All Foundation, SWITZERLAND |
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
|
AgroKnow, GREECE |
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain
|
Aholab, University of the Basque Country, SPAIN |
New Bilingual Speech Databases for Audio Diarization
|
Aix Marseille University, FRANCE |
Automatically Enriching Spoken Corpora with Syntactic Information for Linguistic Studies
|
Alexandru Ioan Cuza University of Iaşi, ROMANIA |
Transliteration and Alignment of Parallel Texts from Cyrillic to Latin
How Could Veins Speed Up the Process of Discourse Parsing
|
ALM, UNITED STATES |
Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
|
Alpage (Univ Paris Diderot / INRIA), FRANCE |
Developing a French FrameNet: Methodology and First results
|
Alpheios Project, GREECE |
The Making of Ancient Greek WordNet
|
AMI Software, FRANCE |
Investigating the Image of Entities in Social Media: Dataset Design and First Results
|
ARL, UNITED STATES |
Finding Romanized Arabic Dialect in Code-Mixed Tweets
|
ARTI, INDIA |
Finding Romanized Arabic Dialect in Code-Mixed Tweets
|
EHU, SPAIN |
A Stream Computing Approach Towards Scalable NLP
|
Athena, Research and Innovation Center in Information, Communication and Knowledge Technologies, Institute for Language and Speech Processing, GREECE |
The Strategic Impact of META-NET on the Regional, National and International Level
META-SHARE: One Year After
The DIRHA simulated corpus
Word Semantic Similarity for Morphologically Rich Languages
|
Augsburg University of Applied Sciences, GERMANY |
Single-Person and Multi-Party 3D Visualizations for Nonverbal Communication Analysis
|
Austrian Academy of Sciences, AUSTRIA |
Towards Automatic Quality Assessment of Component Metadata
|
B |
Barcelona Media Innovació, SPAIN |
Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese
|
Bavarian Archive for Speech Signals, GERMANY |
Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS
German Alcohol Language Corpus - the Question of Dialect
|
Bavarian Archive for Speech Signals, University of Munich, GERMANY |
German Alcohol Language Corpus - the Question of Dialect
|
Beaute-test.com, FRANCE |
Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.
|
Beijing Language and Culture University, CHINA |
Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems
|
Bibliotheca Alexandria, EGYPT |
MUHIT: A Multilingual Harmonized Dictionary
|
Boğaziçi Üniversitesi, TURKEY |
Self-training a Constituency Parser using n-gram Trees
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
|
Brigham Young University, UNITED STATES |
Combining Elicited Imitation and Fluency Features for Oral Proficiency Measurement
Momresp: A Bayesian Model for Multi-Annotator Document Labeling
Evaluating Lemmatization Models for Machine-Assisted Corpus-Dictionary Linkage
Using Transfer Learning to Assist Exploratory Corpus Annotation
Student Achievement and French Sentence Repetition Test Scores
LexTerm Manager: Design for an Integrated Lexicography and Terminology System
|
Brno University of Technology, CZECH REPUBLIC |
Semantic Search in Documents Enriched by LOD-based Annotations
|
Budapest University of Technology, HUNGARY |
DCEP -Digital Corpus of the European Parliament
|
Bulgarian Academy of Sciences, BULGARIA |
A System for Experiments with Dependency Parsers
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing
|
C |
CA, GERMANY |
Conceptual Transfer: Using Local Classifiers for Transfer Selection
|
Capita TI, UNITED KINGDOM |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
Carnegie Mellon University, UNITED STATES |
Augmenting English Adjective Senses with Supersenses
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness
Dual Subtitles as Parallel Corpora
Two Approaches to Metaphor Detection
Can Crowdsourcing be used for Effective Annotation of Arabic?
Resources for the Detection of Conventionalized Metaphors in Four Languages
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus
A Multidialectal Parallel Corpus of Arabic
YouDACC: the Youtube Dialectal Arabic Comment Corpus
The CMU METAL Farsi NLP Approach
Supervised Within-Document Event Coreference using Information Propagation
Morphological Parsing of Swahili using Crowdsourced Lexical Resources
The Language Application Grid
Large Scale Arabic Error Annotation: Guidelines and Framework
Detecting Subevent Structure for Event Coreference Resolution
|
Catalogue et Index des Sites Médicaux de langue Française, FRANCE |
Language Resources for French in the Biomedical Domain
|
CCLS, Columbia, UNITED STATES |
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
|
CEA ― LIST, FRANCE |
Adapting VerbNet to French using Existing Resources
The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction
Developing a French FrameNet: Methodology and First results
Evaluation of Different Strategies for Domain Adaptation in Opinion Mining
Compounds and Distributional Thesauri
|
CEISIC, Royal Institut of Amazigh Culture, MOROCCO |
Amazigh Verb Conjugator
|
CELI, ITALY |
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain
|
Center for Language Engineering, KICS, UET, PAKISTAN |
The CLE Urdu POS Tagset
|
Center of Linguistics of the University of Porto, PORTUGAL |
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
|
Central Institute of Indian Languages, INDIA |
Towards Building a Kashmiri Treebank: Setting up the Annotation Pipeline
|
Centre de Recherche Informatique de Montréal, CANADA |
Linked Open Data and Web Corpus Data for noun compound bracketing
|
Centro de Linguística Universidade de Lisboa, PORTUGAL |
LexTec - a Rich Language Resource for Technical Domains in Portuguese
|
Charles University in Prague, CZECH REPUBLIC |
Votter Corpus: A Corpus of Social Polling Language
Genres in the Prague Discourse Treebank
Mapping Diatopic and Diachronic Variation in Spoken Czech: the Ortofon and Dialekt Corpora
Corpus of 19th-century Czech Texts: Problems and Solutions
Resources in Conflict: A Bilingual Valency Lexicon vs. a Bilingual Treebank vs. a Linguistic Theory
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Three Dimensions of the so-called "Interoperability" of Annotation Schemes
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Valency and Word Order in Czech ― A Corpus Probe
Free English and Czech Telephone Speech Corpus Shared Under the CC-BY-SA 3.0 License
Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone
To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses
Verbs of Saying with a Textual Connecting Function in the Prague Discourse Treebank
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
HamleDT 2.0: Thirty Dependency Treebanks Stanfordized
Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain
Votter Corpus: A Corpus of Social Polling Language
The MERLIN corpus: Learner Language and the CEFR
|
Charles University in Prague, Faculty of Arts, CZECH REPUBLIC |
Improvements to Dependency Parsing Using Automatic Simplification of Data
|
Charles University in Prague, Faculty of Mathematics and Physics, CZECH REPUBLIC |
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Word-Formation Network for Czech
A Tagged Corpus and a Tagger for Urdu
Two-Step Machine Translation with Lattices
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
HamleDT 2.0: Thirty Dependency Treebanks Stanfordized
Improving Evaluation of English-Czech MT through Paraphrasing
|
Charles University in Prague, Institute of Formal and Applied Linguistics, CZECH REPUBLIC |
A Tagged Corpus and a Tagger for Urdu
Two-Step Machine Translation with Lattices
|
Charles University in Prague, Institute of the Czech National Corpus, CZECH REPUBLIC |
Aligning Parallel Texts with InterText
The SYN-series Corpora of Written Czech
|
Charles University in Prague, Institute of Theoretical and Computational Linguistics, CZECH REPUBLIC |
The SYN-series Corpora of Written Czech
|
City University of Hong Kong, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA |
Narrowing the Gap Between Termbases and Corpora in Commercial Environments
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis
Clustering Tweets using Wikipedia Concepts
|
City University of New York, UNITED STATES |
Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese
|
CLE, KICS, UET, PAKISTAN |
The CLE Urdu POS Tagset
|
CLILLAC-ARP (Univ Paris Diderot), FRANCE |
Developing a French FrameNet: Methodology and First results
|
CLLE-ERSS/CNRS&University of Toulouse 2, FRANCE |
GLÀFF, a Large Versatile French Lexicon
|
CLS/CLST, Radboud University Nijmegen, NETHERLANDS |
The Evolving Infrastructure for Language Resources and the Role for Data Scientists
The Dutch LESLLA Corpus
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
|
CLUL, PORTUGAL |
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
|
CMU-LTI & INESC-ID, UNITED STATES |
Dual Subtitles as Parallel Corpora
|
CNGL at Trinity College Dublin, IRELAND |
Global Intelligent Content: Active Curation of Language Resources using Linked Data
|
CNR/ILC, ITALY |
META-SHARE: One Year After
|
CNRS LTCI, Télécom ParisTech, FRANCE |
Emilya: Emotional Body Expression in Daily Actions Database
|
CNRS/LIMSI, ENSIIE, FRANCE |
A Hierarchical Taxonomy for Classifying Hardness of Inference Tasks
Multiple Choice Question Corpus Analysis for Distractor Characterization
|
CNRS, FRANCE |
Language Resources for French in the Biomedical Domain
|
CNRS/LIMSI, FRANCE |
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
TVD: A Reproducible and Multiply Aligned TV Series Dataset
|
Cognitive Science Department, METU Informatics, TURKEY |
Turkish Resources for Visual Word Recognition
|
Cognizant Setcon, UNITED STATES |
Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
|
Columbia University, Center for Computational Learning Systems, UNITED STATES |
Large Scale Arabic Error Annotation: Guidelines and Framework
|
Common Language Resources and Technology Infrastructure, EUROPE |
The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars
|
Computer Science Department, Technion, ISRAEL |
How to Construct a Multi-Lingual Domain Ontology
|
Cooperative State University Baden-Württemberg, GERMANY |
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification
|
Copenhagen Business School, DENMARK |
CFT13: a Resource for Research into the Post-editing Process
Evaluating the Effects of Interactivity in a Post-Editing Workbench
|
Czech Technical University in Prague, CZECH REPUBLIC |
The Nijmegen Corpus of Casual Czech
|
D |
DANS, NETHERLANDS |
The CMD Cloud
|
Delux Media, UNITED KINGDOM |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
Department of Computational linguistics, Institute for Bulgarian, BAS, BULGARIA |
The Strategic Impact of META-NET on the Regional, National and International Level
|
Department of Computer Science and Engineering, Indian Institute of Technology, INDIA |
Efficient Reuse of Structured and Unstructured Resources for Ontology Population
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control
|
Department of Computer Science, University of Bath, UNITED KINGDOM |
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
|
Department of Linguistics, Radboud University Nijmegen, NETHERLANDS |
ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning
|
Department of Telecommunications & Media Informatics, Budapest University of Technology & Economics, HUNGARY |
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
|
DePaul University, UNITED STATES |
Expanding N-gram Analytics in ELAN and a Case Study for Sign Synthesis
|
Deutsche Forschungszentrum für Künstliche Intelligenz - Language Techology Laboratory, GERMANY |
Annotating Relation Mentions in Tabloid Press
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
|
Deutsche Forschungszentrum für Künstliche Intelligenz, GERMANY |
A SKOS-based Schema for TEI encoded Dictionaries at ICLTT
Annotating Relation Mentions in Tabloid Press
The taraXÜ Corpus of Human-Annotated Machine Translations
The Strategic Impact of META-NET on the Regional, National and International Level
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
Harmonization of German Lexical Resources for Opinion Mining
META-SHARE: One Year After
MAT: a Tool for L2 Pronunciation Errors Annotation
|
Deutsche Forschungszentrum für Künstliche Intelligenz, Saarland University, GERMANY |
Annotating Relation Mentions in Tabloid Press
The taraXÜ Corpus of Human-Annotated Machine Translations
The Strategic Impact of META-NET on the Regional, National and International Level
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
|
DFKI / W3C Fellow, GERMANY |
Global Intelligent Content: Active Curation of Language Resources using Linked Data
|
DFKI LT-lab, GERMANY |
Annotating Relation Mentions in Tabloid Press
|
DHA Suffa University, PAKISTAN |
The CLE Urdu POS Tagset
Automatic Acquisition of Urdu Nouns (along with Gender and Irregular Plurals))
|
Dipartimento Studi Umanistici - Università di Salerno, ITALY |
VOLIP: a Corpus of Spoken Italian and a Virtuous Example of Reuse of Linguistic Resources
|
Documenta, UNITED STATES |
Croatian Memories
|
Doshisha University, JAPAN |
Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems
|
Dr. Bhim Rao Ambedkar University, Department of Linguistics, INDIA |
Developing Politeness Annotated Corpus of Hindi Blogs
|
Dublin City University, IRELAND |
TLAXCALA: a Multilingual Corpus of Independent News
The Strategic Impact of META-NET on the Regional, National and International Level
Quality Estimation for Synthetic Parallel Data Generation
caWaC - a Web Corpus of Catalan and its Application to Language Modeling and Machine Translation
|
Dublin City University, Centre for Next Generation Localisation - University of North Texas, IRELAND |
Modeling Language Proficiency Using Implicit Feedback
|
Dublin Institute of Technology, IRELAND |
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
|
Dundee University, UNITED KINGDOM |
A Model for Processing Illocutionary Structures and Argumentation in Debates
|
E |
Eberhard Karls Universität Tübingen, GERMANY |
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers
|
Ecole des Hautes Etudes en Sciences Sociales, FRANCE |
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
|
Ecole Nationale d'Ingénieurs de Brest, FRANCE |
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
|
Ecole Nationale Supérieure de l'Electronique et de ses Applications, CANADA |
An Effortless Way To Create Large-Scale Datasets For Famous Speakers
|
Ecole Polytechnique Fédérale de Lausanne, SWITZERLAND |
Collaboration in the Production of a Massively Multilingual Lexicon
|
Ecole Polytechnique Fédérale de Zurich, SWITZERLAND |
Semantic Approaches to Software Component Retrieval with English Queries
|
Ecole Polytechnique Montreal, CANADA |
Improving Entity Linking using Surface Form Refinement
|
EDF R&D, FRANCE |
Investigating the Image of Entities in Social Media: Dataset Design and First Results
|
ELDA/ELRA, FRANCE |
The ETAPE Speech Processing Evaluation
VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation
META-SHARE: One Year After
ELRA's Consolidated Services for the HLT Community
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
|
Elhuyar Foundation, IXA NLP Group, University of the Basque Country, SPAIN |
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
|
Eptica-Lingway, FRANCE |
Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
|
Erasmus University of Rotterdam, NETHERLANDS |
Croatian Memories
|
ERIC lab., Université Lyon 2, FRANCE |
Investigating the Image of Entities in Social Media: Dataset Design and First Results
|
Etudes Interculturelles de Langues Appliquées, Université Paris Diderot, FRANCE |
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises
|
EURECOM, FRANCE |
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web
|
European Academy of Bozen/Bolzano, ITALY |
'interHist' - an Interactive Visual Interface for Corpus Exploration
The MERLIN corpus: Learner Language and the CEFR
KoKo: an L1 Learner Corpus for German
|
European Commission, Joint Research Centre, ITALY |
Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts
Named Entity Recognition on Turkish Tweets
|
European Parliament - Intrasoft, LUXEMBOURG |
DCEP -Digital Corpus of the European Parliament
|
Euroscript Deutschland GmbH, GERMANY |
Pre-ordering of Phrase-based Machine Translation Input in Translation Workflow
|
Euroscript Germany, GERMANY |
The taraXÜ Corpus of Human-Annotated Machine Translations
|
Euroscript Luxembourg s.à r.l., LUXEMBOURG |
Pre-ordering of Phrase-based Machine Translation Input in Translation Workflow
|
Euskal Herriko Unibertsitatae, SPAIN |
NewsReader: Recording History from Daily News Streams
|
ExB Group, DENMARK |
PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone
|
Expert System, ITALY |
A Modular System for Rule-based Text Categorisation
|
F |
Faculdade de Letras da Universidade de Lisboa / Centro de Linguistica da Universidade de Lisboa, PORTUGAL |
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
|
Faculdade de Letras da Universidade de Lisboa / Instituto de Engenharia de Sistemas e Computadores ― Investigação e Desenvolvimento em Lisboa, PORTUGAL |
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
El-WOZ: a Client-Server Wizard-of-Oz Interface
|
FamilySearch, UNITED STATES |
Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records
|
Federal University of Juiz de Fora, SPAIN |
NIF4OGGD - NLP Interchange Format for Open German Governmental Data
|
Federal University of Rio Grande do Sul - Institute of Informatics, BRAZIL |
Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them
Boosting the Creation of a Treebank
Icelandic Quirks: Testing Linguistic Theories and Language Technology
Comparing Similarity Measures for Distributional Thesauri
|
Federal University of Rio Grande do Sul, BRAZIL |
Icelandic Quirks: Testing Linguistic Theories and Language Technology
|
Federal University of Sâo Carlos, BRAZIL |
Automatic Semantic Relation Extraction from Portuguese Texts
|
FG Language Technology, Technische Universität, Darmstadt, GERMANY |
Distributed Distributional Similarities of Google Books over the Centuries
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
|
Finnish Association of the Deaf, FINLAND |
S-pot - a Benchmark in Spotting Signs Within Continuous Signing
|
Flemish Department of Economy, Science and Innovation, BELGIUM |
A Decade of HLT Agency Activities in the Low Countries: from Resource Maintenance (BLARK) to Service Offerings (BLAISE)
|
Fondazione Bruno Kessler, ITALY |
Crowdsourcing for the Identification of Event Nominals: an Experiment
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources
The Strategic Impact of META-NET on the Regional, National and International Level
T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing
Creative Language Explorations through a high-Expressivity N-grams Query Language
Enriching the "Senso Comune" Platform with Automatically Acquired Data
Mapping CPA Patterns onto OntoNotes Senses
Euronews: a Multilingual Speech Corpus for ASR
CROMER: a Tool for Cross-Document Event and Entity Coreference
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain
META-SHARE: One Year After
A SICK Cure for the Evaluation of Compositional Distributional Semantic Models
NewsReader: Recording History from Daily News Streams
Automatic Annotation of Machine Translation Datasets with Binary Quality Judgements
The DIRHA simulated corpus
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool
An Efficient and User-friendly Tool for Machine Translation Quality Estimation
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts
|
Fraunhofer IAIS, GERMANY |
Extracting Information for Context-aware Meeting Preparation
Exploiting the Large-Scale German Broadcast Corpus to Boost the Fraunhofer IAIS Speech Recognition System
|
Freelance Annotator, JAPAN |
Annotation of Computer Science Papers for Semantic Relation Extrac-tion
|
Freie Universitaet Berlin, GERMANY |
Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing
|
Friedrich Alexander Universität, Erlangen-Nürnberg, GERMANY |
New Spanish Speech Corpus Database for the Analysis of People Suffering from Parkinson's Disease
Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate
|
Friedrich-Schiller-Universität Jena, GERMANY |
Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs
Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data
|
FTW Telecommunications Research Center Vienna, AUSTRIA |
The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech
|
Fujitsu Limited, JAPAN |
A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data
|
Future University of Hakodate, Department of Complex Systems, JAPAN |
Japanese Conversation Corpus for Training and Evaluation of Backchannel Prediction Model.
|
G |
George Washington University , UNITED STATES |
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis
|
German Research Center for Artificial Intelligence, GERMANY |
TMO ― The Federated Ontology of the TrendMiner Project
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies
Annotating Relation Mentions in Tabloid Press
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction
Harmonization of German Lexical Resources for Opinion Mining
The taraXÜ Corpus of Human-Annotated Machine Translations
|
Gjøvik University College, NORWAY |
An Open Source Part-of-Speech Tagger for Norwegian: Building on Existing Language Resources
|
Goethe University, Text Technology Laboratory, GERMANY |
ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English
|
Google Inc., UNITED STATES |
On the Importance of Text Analysis for Stock Price Prediction
A Gold Standard Dependency Corpus for English
Momresp: A Bayesian Model for Multi-Annotator Document Labeling
Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary
A Database for Measuring Linguistic Information Content
|
Graduate Institute of Linguistics, National Taiwan University, TAIWAN |
Why Chinese Web-as-Corpus is Wacky? Or: How Big Data is Killing Chinese Corpus Linguistics
|
Graduate School of Advanced Integration Science, Chiba University, JAPAN |
Japanese Conversation Corpus for Training and Evaluation of Backchannel Prediction Model.
|
Graduate School of Informatics, Kyoto University, JAPAN |
Constructing a Chinese―Japanese Parallel Corpus from Wikipedia
|
Graduate School of Maritime Sciences, Kobe University, JAPAN |
Bilingual Dictionary Induction as an Optimization Problem
|
Graz University of Technology, AUSTRIA |
The DIRHA simulated corpus
|
Graz University of Technology, Signal Processing and Speech Communication Laboratory, AUSTRIA |
GRASS: the Graz corpus of Read And Spontaneous Speech
|
Göteborgs Universitet, SWEDEN |
Sharing Resources Between Free/Open-Source Rule-based Machine Translation Systems: Grammatical Framework and Apertium
|
H |
Hapara, UNITED STATES |
ClearTK 2.0: Design Patterns for Machine Learning in UIMA
|
Heidelberg University, GERMANY |
Computational Narratology: Extracting Tense Clusters from Narrative Texts
Extending HeidelTime for Temporal Expressions Referring to Historic Dates
|
Heidelberg University, Institute of Computer Science, GERMANY |
Computational Narratology: Extracting Tense Clusters from Narrative Texts
Extending HeidelTime for Temporal Expressions Referring to Historic Dates
|
Heriot - Watt University, UNITED KINGDOM |
An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis
|
Historical Archives of the Pontifical Gregorian University, ITALY |
Sharing Cultural Heritage: the Clavius on the Web Project
|
Holmes Semantic Solutions, FRANCE |
The Dangerous Myth of the Star System
Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.
|
Hong Kong University of Science & Technology, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA |
Evaluating Improvised Hip Hop Lyrics - Challenges and Observations
On the Reliability and Inter-Annotator Agreement of Human Semantic MT Evaluation via HMEANT
A Hindi-English Code-Switching Corpus
|
Humboldt University Berlin, GERMANY |
Lexical Substitution Dataset for German
Multiword Expressions in Machine Translation
|
Hungarian Academy of Sciences, HUNGARY |
Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Pivot-based Multilingual Dictionary Building using Wiktionary
|
I |
IBM Research and EMAp / FGV, UNITED STATES |
A Study on Expert Sourcing Enterprise Question Collection and Classification
Enriching the "Senso Comune" Platform with Automatically Acquired Data
NomLex-PT: A Lexicon of Portuguese Nominalizations
|
IBM Research India, INDIA |
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control
|
IBM Research Lab, UNITED STATES |
Author-Specific Sentiment Aggregation for Polarity Prediction of Reviews
|
IDA Center for Computing Sciences, UNITED STATES |
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
|
IDIAP Research Institute / École Polytechnique Fédérale de Lausanne (EPFL), SWITZERLAND |
Cross-Linguistic Annotation of Narrativity for English/French Verb Tense Disambiguation
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
|
IDIAP Research Institute, SWITZERLAND |
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling
|
IDS Mannheim, GERMANY |
The Database for Spoken German ― DGD2
Exploring and Visualizing Variation in Language Resources
The Liability of Service Providers in e-Research Infrastructures: Killing the Messenger?
The Research and Teaching Corpus of Spoken German ― FOLK
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers
Access Control by Query Rewriting: the Case of KorAP
Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers
|
IIIT-Hyderabad, INDIA |
Enrichment of Bilingual Dictionary through News Stream Data
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT
|
IIT Bombay, INDIA |
Efficient Reuse of Structured and Unstructured Resources for Ontology Population
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control
|
IIT Kharagpur, INDIA |
Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages
|
IITB, Mumbai, INDIA |
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages
|
IITB-Monash Research Academy, INDIA |
Efficient Reuse of Structured and Unstructured Resources for Ontology Population
|
ILC CNR - Pisa, ITALY |
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
Presenting a System of Human-Machine Interaction for Performing Map Tasks.
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
From Synsets to Videos: Enriching ItalWordNet Multimodally
|
ILS Institute, University at Albany, SUNY, UNITED STATES |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
|
IMS, Universität Stuttgart, GERMANY |
Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
An Out-of-Domain Test Suite for Dependency Parsing of German
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality
|
Independent Consultant, QATAR |
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests
|
Indian Institute of Technology of Guwahati, Department of Computer Science and Engineering, INDIA |
MTWatch: A Tool for the Analysis of Noisy Parallel Data
|
INESC-ID, ISCTE-IUL, PORTUGAL |
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
|
INESC-ID/IST, PORTUGAL |
JUST.ASK, a QA System that Learns to Answer New Questions from Previous Interactions
The DIRHA simulated corpus
|
Innoetics ltd, GREECE |
Using Audio Books for Training a Text-to-Speech System
|
INRA-SenS, IFRIS & ISC-PIF, FRANCE |
Reconstructing the Semantic Landscape of Natural Language Processing
|
INRIA - Alpage, FRANCE |
Deep Syntax Annotation of the Sequoia French Treebank
|
Inria Nancy Grand-Est/LORIA, FRANCE |
Deep Syntax Annotation of the Sequoia French Treebank
|
Inria, FRANCE |
DeLex, a Freely-available, Large-scale and Linguistically Grounded Morphological Lexicon for German
A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon
Developing a French FrameNet: Methodology and First results
An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora
A Language-independent and fully Unsupervised Approach to Lexicon Induction and Part-of-Speech Tagging for Closely Related Languages
|
Insight Data Centre for Analytics, Galway, IRELAND |
Extracting Information for Context-aware Meeting Preparation
|
INSIGHT, National University of Ireland, Galway, IRELAND |
Missed Opportunities in Translation Memory Matching
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings
A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data
|
Institut für Deutsche Sprache, GERMANY |
Extending Standoff Annotation
Access Control by Query Rewriting: the Case of KorAP
Recent Developments in DeReKo
|
Institut für Maschinelle Sprachverarbeitung, University of Stuttgart, GERMANY |
A Corpus of Comparisons in Product Reviews
|
Institut für Medizinische Informatik/Charite, GERMANY |
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies
|
Institut Mines -Télécom, Télécom ParisTech, CNRS LTCI, FRANCE |
Comparative Analysis of Verbal Alignment in Human-Human and Human-Agent Interactions
Mining a Multimodal Corpus for Non-Verbal Behavior Sequences Conveying Attitudes
Emilya: Emotional Body Expression in Daily Actions Database
|
Institut National de l'Audiovisuel, FRANCE |
An Effortless Way To Create Large-Scale Datasets For Famous Speakers
|
Institut National de Recherche en Informatique et en Automatique , FRANCE |
Towards an Environment for the Production and the Validation of Lexical Semantic Resources
Classifying Inconsistencies in DBpedia Language Specific Chapters
|
Institute for Computational Linguistics, National Research Council CNR, ITALY |
Sharing Cultural Heritage: the Clavius on the Web Project
|
Institute for Corpus Linguistics and Text Technology (Austrian Academy of Sciences), AUSTRIA |
A SKOS-based Schema for TEI encoded Dictionaries at ICLTT
|
Institute for Corpus Linguistics and Text Technology, AUSTRIA |
The CMD Cloud
|
Institute for Dutch Lexicology, NETHERLANDS |
Taalportaal: an Online Grammar of Dutch and Frisian
|
Institute for German Language (IDS), GERMANY |
GenitivDB ― a Corpus-Generated Database for German Genitive Classification
|
Institute for Infocomm Research, SINGAPORE |
TaLAPi ― A Thai Linguistically Annotated Corpus for Language Processing
|
Institute for Infocommunication Technologies, Bay Zoltán Nonprofit Ltd. for Applied Research, |
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
|
Institute for Language and Speech Processing / Innoetics LTD, GREECE |
Using Audio Books for Training a Text-to-Speech System
|
Institute for Language and Speech Processing / RC Athens, GREECE |
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
|
Institute for Language and Speech Processing of Athena, GREECE |
Using Audio Books for Training a Text-to-Speech System
Developing a Framework for Describing Relations among Language Resources
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
|
Institute for Specialised Communication and Multilingualism, European Academy of Bozen / Bolzano, ITALY |
'interHist' - an Interactive Visual Interface for Corpus Exploration
The MERLIN corpus: Learner Language and the CEFR
KoKo: an L1 Learner Corpus for German
|
Institute of Communication Acoustics, Ruhr-Universität Bochum, GERMANY |
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
|
Institute of Computational Linguistics - CNR, FRANCE |
The Making of Ancient Greek WordNet
The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action
The Strategic Impact of META-NET on the Regional, National and International Level
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
Presenting a System of Human-Machine Interaction for Performing Map Tasks.
From Synsets to Videos: Enriching ItalWordNet Multimodally
|
Institute of Computer Science at the Polish Academy of Sciences, POLAND |
Walenty: Towards a Comprehensive Valence Dictionary of Polish
The Strategic Impact of META-NET on the Regional, National and International Level
Projection-based Annotation of a Polish Dependency Treebank
PoliTa: a Multitagger for Polish
Morfeusz Reloaded
|
Institute of Formal and Applied Linguistics, Charles University in Prague, CZECH REPUBLIC |
To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses
|
Institute of Informatics and Telematics, National Research Council CNR, ITALY |
Sharing Cultural Heritage: the Clavius on the Web Project
Accommodations in Tuscany as Linked Data
|
Institute of Linguistics, Academia Sinica, TAIWAN |
Phone Boundary Annotation in Conversational Speech
|
Institute of Mathematics and Computer Science of Academy of Sciences of Moldova, Chișinău, RUSSIAN FEDERATION |
Transliteration and Alignment of Parallel Texts from Cyrillic to Latin
|
Institute of the Lithuanian Language, LITHUANIA |
The Strategic Impact of META-NET on the Regional, National and International Level
|
Instituto de Engenharia de Sistemas e Computadores ― Investigação e Desenvolvimento em Lisboa, Faculdade de Letras da Universidade de Lisboa, PORTUGAL |
OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
JUST.ASK, a QA System that Learns to Answer New Questions from Previous Interactions
Translation Errors from English to Portuguese: an Annotated Corpus
|
Instituto de Engenharia de Sistemas e Computadores ― Investigação e Desenvolvimento em Lisboa, PORTUGAL |
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries
JUST.ASK, a QA System that Learns to Answer New Questions from Previous Interactions
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
Translation Errors from English to Portuguese: an Annotated Corpus
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
El-WOZ: a Client-Server Wizard-of-Oz Interface
|
Instituto de Linguística Teórica e Computacional / Universidade de Lisboa, PORTUGAL |
Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
|
Instituto de Telecomunicações, University of Porto, PORTUGAL |
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
|
Instituto de Telecomunicações, University of Porto, Faculty of Engineering, PORTUGAL |
HESITA(te) in Portuguese
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
|
Interinstitutional Center for Research and Development in Computational Linguistics, BRAZIL |
A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words
|
International Institute of Information Technology, Hyderabad, INDIA |
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT
|
Intersect, UNITED KINGDOM |
The Alveo Virtual Laboratory: A Web based Repository API
|
Invision Ondertiteling, GERMANY |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
IRCAM, UMR STMS UPMC-CNRS, FRANCE |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
|
IRISA - CNRS, FRANCE |
Generating and using Probabilistic Morphological Resources for the Biomedical Domain
|
IRISA, Université de Rennes 1, FRANCE |
The ETAPE Speech Processing Evaluation
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
ROOTS: a Toolkit for Easy, Fast and Consistent Processing of Large Sequential Annotated Data Collections
Named Entity Corpus Construction using Wikipedia and DBpedia Ontology
Generating and using Probabilistic Morphological Resources for the Biomedical Domain
|
IRIT ― CNRS, FRANCE |
A Model for Processing Illocutionary Structures and Argumentation in Debates
Developing a French FrameNet: Methodology and First results
Enriching the "Senso Comune" Platform with Automatically Acquired Data
|
ISC-PIF, FRANCE |
Reconstructing the Semantic Landscape of Natural Language Processing
|
ISCTE-IUL - Instituto Universitário de Lisboa, |
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
|
ISCTE/INESC-ID, PORTUGAL |
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
|
iSOCO, S.A., SPAIN |
Language Processing Infrastructure in the XLike Project
|
ISTC CNR, ITALY |
Discovering the Italian Literature: Interactive Access to Audio-indexed Text Resources
|
Istituto di Linguistica Computazionale "Antonio Zampolli" ― Consiglio Nazionale delle Ricerche, ITALY |
The Making of Ancient Greek WordNet
The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action
The Strategic Impact of META-NET on the Regional, National and International Level
The LRE Map disclosed
From Synsets to Videos: Enriching ItalWordNet Multimodally
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
T2K2: a System for Automatically Extracting and Organizing Knowledge from Texts
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies
|
Italian Institute of Technology, ITALY |
The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech
|
IXA NLP Group, University of the Basque Country (UPV/EHU), SPAIN |
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
|
J |
Jadavpur University, INDIA |
Word Alignment-Based Reordering of Source Chunks in PB-SMT
|
Japan Science and Technology Agency, JAPAN |
Bilingual Dictionary Construction with Transliteration Filtering
|
Jobandtalent, UNITED KINGDOM |
Collection of a Simultaneous Translation Corpus for Comparative Analysis
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese
Language Resource Addition: Dictionary or Corpus?
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
Ranking Job Offers for Candidates: Learning Hidden Knowledge from Big Data
Parsing Chinese Synthetic Words with a Character-based Dependency Model
|
Johns Hopkins University - Human Language Technology Center of Excellence, UNITED STATES |
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
|
Johns Hopkins University, UNITED STATES |
A Wikipedia-based Corpus for Contextualized Machine Translation
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic
The Multilingual Paraphrase Database
The American Local News Corpus
|
Joint Research Centre (JRC), ITALY |
Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts
|
Jožef Stefan Institute, Department of Knowledge Technologies, SLOVENIA |
sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages
|
Jožef Stefan Institute, SLOVENIA |
sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks
The Strategic Impact of META-NET on the Regional, National and International Level
Language Processing Infrastructure in the XLike Project
|
JRC European Commission, ITALY |
DCEP -Digital Corpus of the European Parliament
|
JSPS/ National Institute of Informatics, JAPAN |
A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
|
JULIE Lab, Friedrich Schiller Universität Jena, GERMANY |
Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs
Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data
|
K |
Karlsruhe Institute of Technology, GERMANY |
GlobalPhone: Pronunciation Dictionaries in 20 Languages
xLiD-Lexica: Cross-lingual Linked Data Lexica
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification
A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation
Manual Analysis of Structurally Informed Reordering in German-English Machine Translation
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
|
Kazan Federal University, RUSSIAN FEDERATION |
Finite-State Morphological Transducers for Three Kypchak Languages
|
Keio University, JAPAN |
Relating Frames and Constructions in Japanese FrameNet
|
KIIT College of Engineering, Gurgaon, INDIA |
Statistical Analysis of Multilingual Text Corpus and Development of Language Models
|
Know-Center, AUSTRIA |
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool
|
KnowBel Technologies, BELGIUM |
FLELex: a graded Lexical Resource for French Foreign Learners
|
Knowledge Society Association, POLAND |
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
|
KOEMEI SA, SWITZERLAND |
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
|
Korea advanced Institute of Science and Technology, KOREA |
Named Entity Corpus Construction using Wikipedia and DBpedia Ontology
|
KTH, School of Computer Science and Communication, Department of Speech Music and Hearing, SWEDEN |
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
The WaveSurfer Automatic Speech Recognition Plugin
|
KULeuven, BELGIUM |
HiEve: A Corpus for Extracting Event Hierarchies from News Stories
|
Kyoto University, JAPAN |
Bilingual Dictionary Construction with Transliteration Filtering
Crowdsourcing for Evaluating Machine Translation Quality
Integration of Workflow and Pipeline for Language Service Composition
A Large Scale Database of Strongly-related Events in Japanese
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations
Bilingual Dictionary Induction as an Optimization Problem
A Japanese Word Dependency Corpus
Language Resource Addition: Dictionary or Corpus?
Flow Graph Corpus from Recipe Texts
A Framework for Compiling High Quality Knowledge Resources From Raw Corpora
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features
|
L |
Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, SLOVAKIA |
The Strategic Impact of META-NET on the Regional, National and International Level
|
L2F@INESC-ID and Department of Computer Science, University of Évora, PORTUGAL |
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
|
L3I - Université de La Rochelle, FRANCE |
Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
|
Laboratoire d'Informatique de Grenoble, FRANCE |
The Sweet-Home Speech and Multimodal Corpus for Home Automation Interaction
|
Laboratoire d'Informatique, de Robotique et de Microélectronique de Montpellier, FRANCE |
Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose
Relation Inference in Lexical Networks ... with Refinements
New Functions for a Multipurpose Multimodal Tool for Phonetic and Linguistic Analysis of Very Large Speech Corpora
|
Laboratoire de Sciences Cognitives et Psycholinguistique - Ecole des Hautes Etudes en Sciences Sociales / Ecole Nationale Supérieure / CNRS, FRANCE |
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
|
Laboratoire National de métrologie et d'Essais, FRANCE |
The ETAPE Speech Processing Evaluation
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
|
Laboratoire Parole et Langage, CNRS, FRANCE |
Representing Multimodal Linguistic Annotated Data
Automatic Detection of Other-Repetition Occurrences: Application to French Conversational Speech
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
|
Laboratoire Parole et Langage, Université Aix-Marseille, FRANCE |
Automatic Detection of Other-Repetition Occurrences: Application to French Conversational Speech
|
Language Computer Corporation, UNITED STATES |
Semi-Supervised Methods for Expanding Psycholinguistics Norms by Integrating Distributional Similarity with the Structure of WordNet
#mygoal: Finding Motivations on Twitter
|
Language Technologies Institute, Carnegie Mellon University, UNITED STATES |
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
|
LAS, PORTUGAL |
Construction and Annotation of a French Folkstale Corpus
|
LATTICE - CNRS, FRANCE |
Reconstructing the Semantic Landscape of Natural Language Processing
|
LaTTiCe, FRANCE |
Evaluation of Different Strategies for Domain Adaptation in Opinion Mining
|
Lexical Computing Ltd., UNITED KINGDOM |
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
Extrinsic Corpus Evaluation with a Collocation Dictionary Task
|
Lexiques, Dictionnaires, Informatique ― Université Paris 13, FRANCE |
Developing a French FrameNet: Methodology and First results
|
Lexis Nexis BIS, FRANCE |
NewsReader: Recording History from Daily News Streams
|
LIA ― Université d'Avignon, FRANCE |
Characterizing and Predicting Bursty Events: the Buzz Case Study on Twitter
Investigating the Image of Entities in Social Media: Dataset Design and First Results
A LDA-based Topic Classification Approach from highly Imperfect Automatic Transcriptions
|
LIA-CERI (Laboratoire d'Informatique d'Avignon-CERI), FRANCE |
NASTIA: Negotiating Appointment Setting Interface
|
LIF, Aix-Marseille Université, FRANCE |
Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them
Comparing Similarity Measures for Distributional Thesauri
|
LIF-CNRS (Aix Marseille Université), FRANCE |
FLELex: a graded Lexical Resource for French Foreign Learners
|
LiLPa, Université de Strasbourg, FRANCE |
From Non Word to New Word: Automatically Identifying Neologisms in French Newspapers
|
LIMSI - CNRS & ENSIIE, FRANCE |
Construction and Annotation of a French Folkstale Corpus
Annotation of Specialized Corpora using a Comprehensive Entity and Relation Scheme
Multiple Choice Question Corpus Analysis for Distractor Characterization
|
LIMSI - CNRS & IMMI - CNRS, FRANCE |
The ETAPE Speech Processing Evaluation
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
The Strategic Impact of META-NET on the Regional, National and International Level
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
|
LIMSI - CNRS et Université Paris-Sud, FRANCE |
Construction and Annotation of a French Folkstale Corpus
A Hierarchical Taxonomy for Classifying Hardness of Inference Tasks
Extracting News Web Page Creation Time with DCTFinder
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime
Multiple Choice Question Corpus Analysis for Distractor Characterization
Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory
Thematic Cohesion: Measuring Terms Discriminatory Power Toward Themes
Bidirectionnal Converter Between Syntactic Annotations: from French Treebank Dependencies to PASSAGE Annotations, and back
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
|
LIMSI - CNRS, FRANCE |
Toward a Unifying Model for Opinion, Sentiment and Emotion Information Extraction
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises
Biomedical Entity Extraction using Machine-Learning Based Approaches
Morpho-Syntactic Study of Errors from Speech Recognition System
Use of Unsupervised Word Classes for Entity Recognition: Application to the Detection of Disorders in Clinical Reports
Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime
Semantic Clustering of Pivot Paraphrases
Annotation of Specialized Corpora using a Comprehensive Entity and Relation Scheme
Language Resources for French in the Biomedical Domain
Multiple Choice Question Corpus Analysis for Distractor Characterization
Rule-based Reordering Space in Statistical Machine Translation
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Smile and Laughter in Human-Machine Interaction: a Study of Engagement
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
Bidirectionnal Converter Between Syntactic Annotations: from French Treebank Dependencies to PASSAGE Annotations, and back
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
|
LINA ― Université de Nantes, FRANCE |
Validation Issues induced by an Automatic Pre-Annotation Mechanism in the Building of Non-projective Dependency Treebanks
|
LINA, FRANCE |
Semi-Compositional Method for Synonym Extraction of Multi-Word Terms
|
LingoChamp, JAPAN |
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
|
LinguaTV GmbH, GERMANY |
Sprinter: Language Technologies for Interactive and Multimedia Language Learning
|
Linguistic Data Consortium, University of Pennsylvania, UNITED STATES |
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
The RATS Collection: Supporting HLT Research with Degraded Audio Data
Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development
Incorporating Alternate Translations into English Translation Treebank
New Directions for Language Resource Development and Distribution
The Language Application Grid
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Developing a Framework for Describing Relations among Language Resources
|
Linköping University, SWEDEN |
The Impact of Cohesion Errors in Extraction Based Summaries
|
LIUM, FRANCE |
Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks
|
LLF ― Université Paris Diderot / CNRS, FRANCE |
Developing a French FrameNet: Methodology and First results
|
LLL, Université François Rabelais & CNRS, Tours-Orléans, FRANCE |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
|
Logos Institute, FRANCE |
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
|
Lomonosov Moscow State University, RUSSIAN FEDERATION |
Summarizing News Clusters on the Basis of Thematic Chains
|
LORIA / Inria Nancy Grand-Est, FRANCE |
Mapping the Lexique des Verbes du français (Lexicon of French Verbs) to a NLP Lexicon using Examples
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
|
LORIA, FRANCE |
Building and Modelling Multilingual Subjective Corpora
|
LORIA/CNRS, FRANCE |
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
|
LPP (Laboratoire Phonétique & Phonologie) / LIMSI-CNRS, FRANCE |
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
"Human Annotation of ASR Error Regions: is ""gravity"" a Sharable Concept for Human Annotators?"
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
|
LPP (Laboratoire Phonétique & Phonologie) / Université Paris Sorbonne Nouvelle, FRANCE |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
|
LSCP, EHESS-ENS-CNRS, |
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
|
LTRC, IIIT Hyderabad, INDIA |
Benchmarking of English-Hindi Parallel Corpora
Towards Building a Kashmiri Treebank: Setting up the Annotation Pipeline
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT
|
Lund University, SWEDEN |
REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations
|
LUSI-Lab @DIETI University "Federico II", Naples, ITALY |
VOLIP: a Corpus of Spoken Italian and a Virtuous Example of Reuse of Linguistic Resources
|
M |
Macquarie University, AUSTRALIA |
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems
|
Macquarie University, Department of Computing, AUSTRALIA |
AusTalk: an Audio-Visual Corpus of Australian English
The Alveo Virtual Laboratory: A Web based Repository API
|
Macquarie University, Department of Linguistics, AUSTRALIA |
AusTalk: an Audio-Visual Corpus of Australian English
|
Manchester University, UNITED KINGDOM |
Detecting Document Structure in a Very Large Corpus of UK Financial Reports
|
Masaryk University, CZECH REPUBLIC |
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
|
Masaryk University, Natural Language Processing Centre, CZECH REPUBLIC |
Extrinsic Corpus Evaluation with a Collocation Dictionary Task
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
|
Massachusetts Institute of Technology, IBM, UNITED STATES |
A Study on Expert Sourcing Enterprise Question Collection and Classification
|
Massachusetts Institute of Technology, Lincoln Laboratory, UNITED STATES |
A Wikipedia-based Corpus for Contextualized Machine Translation
|
Massachusetts Institute of Technology, UNITED STATES |
The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories
|
Max Planck Institut for Psycholinguistics, GERMANY |
Towards Automatic Quality Assessment of Component Metadata
Improving the Exploitation of Linguistic Annotations in ELAN
VarClass: An Open-source Language Identification Tool for Language Varieties
|
Max Planck Institute for Evolutionary Anthropology, GERMANY |
Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics
|
Max Planck Institute for Psycholinguistics, Nijmegen, GERMANY |
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
|
Microsoft Language Development Center / IEETA, Universidade de Aveiro, UNITED STATES |
Multimodal Corpora for Silent Speech Interaction
|
Microsoft Language Development Center, UNITED STATES |
Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
Multimodal Corpora for Silent Speech Interaction
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
|
Microsoft Research India, INDIA |
Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
|
Microsoft Research, UNITED STATES |
Enriching ODIN
Annotating Clinical Events in Text Snippets for Phenotype Detection
|
Microsoft, UNITED STATES |
Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties
|
Middle East Technical University, TURKEY |
Turkish Resources for Visual Word Recognition
|
Mipumi Games GmbH, AUSTRIA |
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
|
MLDC/ADETTI-ISCTE, PORTUGAL |
Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese
|
Modyco, Université Paris Ouest Nanterre & CNRS, FRANCE |
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie
|
Modyco, Université Paris Ouest Nanterre, FRANCE |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
|
MTA-PPKE Hungarian Language Technology Research Group, Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, HUNGARY |
A New Form of Humor ― Mapping Constraint-Based Computational Morphologies to a Finite-State Representation
|
N |
Nagoya University, JAPAN |
Text Readability and Word Distribution in Japanese
|
Nanyang Technological University, SINGAPORE |
Identifying Idioms in Chinese Translations
Building The Sense-Tagged Multilingual Parallel Corpus
|
Nat'l Institute for Japanese Language and Linguistics, JAPAN |
Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
|
National Institute of Informatics, JAPAN |
Annotation of Computer Science Papers for Semantic Relation Extrac-tion
|
National Institute for Japanese Language and Linguistics, JAPAN |
Building a Database of Japanese Adjective Examples from Special Purpose Web Corpora
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
|
National Institute of Informatics; The University of Tokyo, |
Corpus for Coreference Resolution on Scientific Papers
|
National Library of Medicine, UNITED STATES |
Annotating Question Decomposition on Complex Medical Questions
|
National Library of Norway, NORWAY |
Towards Automatic Quality Assessment of Component Metadata
|
National Taiwan University, TAIWAN |
Sentence Rephrasing for Parsing Sentences with OOV Words
|
National Technical University of Athens, School of ECE, GREECE |
Word Semantic Similarity for Morphologically Rich Languages
|
National Tsing Hua University, Institute of Information Systems and Applications, TAIWAN |
Phone Boundary Annotation in Conversational Speech
|
National University of Ireland, Altocloud, Galway, IRELAND |
Extracting Information for Context-aware Meeting Preparation
|
National University of Ireland, Galway, IRELAND |
Evaluation of Technology Term Recognition with Random Indexing
|
National University of Ireland, Maynooth, Department of Computer Science, IRELAND |
A Deep Context Grammatical Model For Authorship Attribution
|
National University of Taiwan, Department of Computer Science & Information Engineering, TAIWAN |
Phone Boundary Annotation in Conversational Speech
Sentence Rephrasing for Parsing Sentences with OOV Words
|
NCSR Demokritos, |
NOMAD: Linguistic Resources and Tools Aimed at Policy Formulation and Validation The Ellogon Pattern Engine: Context-free Grammars over Annotations
Annotating Arguments: The NOMAD Collaborative Annotation Tool
|
NEC Corporation, JAPAN |
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
|
Nederlandse Taalunie / HLT Agency, NETHERLANDS |
A Decade of HLT Agency Activities in the Low Countries: from Resource Maintenance (BLARK) to Service Offerings (BLAISE)
|
New York University, UNITED STATES |
Corpus and Method for Identifying Citations in Non-Academic Text
Annotating Relations in Scientific Articles
|
Newcastle University, UNITED KINGDOM |
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
|
NLP Technologies, CANADA |
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
|
NLP2CT Lab / Department of Computer and Information Science, MOROCCO |
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
|
Nomotika SRL, ITALY |
Exploiting networks in Law
|
Norges teknisk-naturvitenskapelige universitet, NORWAY |
MultiVal - Towards a Multilingual Valence Lexicon
|
North-West University, UNITED STATES |
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.
|
North-West University,Centre for Text Technology, UNITED STATES |
Developing Text Resources for Ten South African Languages
|
Northwestern University, UNITED STATES |
Using Resource-Rich Languages to Improve Morphological Analysis of Under-Resourced Languages
Discosuite - A Parser Test Suite for German Discontinuous Structures
|
Norwegian School of Economics, NORWAY |
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
|
Norwegian University of Science and Technology, NORWAY |
A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence
|
NTT, JAPAN |
Extraction of Daily Changing Words for Question Answering
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations
|
Nuance Communications, BRAZIL |
NomLex-PT: A Lexicon of Portuguese Nominalizations
|
O |
Oberlin College, UNITED STATES |
Morphological Parsing of Swahili using Crowdsourced Lexical Resources
|
Objet Direct, FRANCE |
Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain.
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
|
OntoChem GmbH, GERMANY |
Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts
|
Ontology Engineering Group, Universidad Politécnica de Madrid, SPAIN |
Enabling Language Resources to Expose Translations as Linked Data on the Web
|
Open University of Catalonia, SPAIN |
Machine Translationness: Machine-likeness in Machine Translation Evaluation
|
Orange Labs, FRANCE |
NASTIA: Negotiating Appointment Setting Interface
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
|
Ortolang, FRANCE |
Representing Multimodal Linguistic Annotated Data
|
Otto-von-Guericke Universität, GERMANY |
Modeling and Evaluating Dialog Success in the LAST MINUTE Corpus
|
Owlin, NETHERLANDS |
N-gram Counts and Language Models from the Common Crawl
|
P |
PAVIS, Italian Institute of Technology, ITALY |
The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech
|
Peking University, CHINA |
Construction of Diachronic Ontologies from People's Daily of Fifty Years
|
Peking University, Key Laboratory of Computational Linguistics, Ministry of Education, CHINA |
Construction of Diachronic Ontologies from People's Daily of Fifty Years
|
Philipps-University of Marburg, GERMANY |
Creating a Massively Parallel Bible Corpus
A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology
|
Pixformance GmbH, GERMANY |
The taraXÜ Corpus of Human-Annotated Machine Translations
|
Polish Academy of Science, POLAND |
A Model for Processing Illocutionary Structures and Argumentation in Debates
|
Polytechnique Montréal, CANADA |
A Multimodal Interpreter for 3D Visualization and Animation of Verbal Concepts
Improving Entity Linking using Surface Form Refinement
|
Pontifícia Universidade Católica do Rio Grande do Sul, BRAZIL |
Building Domain Specific Bilingual Dictionaries
Comparative Analysis of Portuguese Named Entities Recognition Tools
VOAR: A Visual and Integrated Ontology Alignment Environment
|
Praxiling UMR 5267 CNRS & Université Paul-Valéry Montpellier 3, FRANCE |
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
|
Priberam, Instituto de Telecomunicacoes, PORTUGAL |
Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese
|
Prompsit Language Engineering, SPAIN |
Quality Estimation for Synthetic Parallel Data Generation
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
|
PSD/LDS Church, UNITED STATES |
Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records
|
PUCRS, BRAZIL |
Boosting Open Information Extraction with Noun-Based Relations
|
Pázmány Péter Catholic University, HUNGARY |
An Efficient Language Independent Toolkit for Complete Morphological Disambiguation
|
R |
RACAI, ROMANIA |
CoRoLa ― The Reference Corpus of Contemporary Romanian Language
|
Radboud University Nijmegen, Center for Language Studies, NETHERLANDS |
The Dutch LESLLA Corpus
The Gulf of Guinea Creole Corpora
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
|
Radboud University Nijmegen, Department of Linguistics, NETHERLANDS |
ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning
|
Radboud University Nijmegen, NETHERLANDS |
Creating and Using Large Monolingual Parallel Corpora for Sentential Paraphrase Generation
The Nijmegen Corpus of Casual Czech
ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning
Semi-Automatic Annotation of the UCU Accents Speech Corpus
Improving the Exploitation of Linguistic Annotations in ELAN
|
Rakuten Institute of Technology, New York, UNITED STATES |
Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese
|
RALI, CANADA |
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
|
Rensselaer Polytechnic Institute, UNITED STATES |
Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese
|
Research Institute for Artificial Intelligence, Romanian Academy, ROMANIA |
Large SMT Data-sets Extracted from Wikipedia
CoRoLa ― The Reference Corpus of Contemporary Romanian Language
The Strategic Impact of META-NET on the Regional, National and International Level
RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus
|
Research institute for Linguistics, Hungarian Academy of Sciences, HUNGARY |
The Strategic Impact of META-NET on the Regional, National and International Level
The Hungarian Gigaword Corpus
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
|
Research Scholar, Indian Institute of Technology Kharagpur, INDIA |
Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages
|
Reykjavik University, ICELAND |
Correcting Errors in a New Gold Standard for Tagging Icelandic Text
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
|
Rheinisch-Westfaelische Technische Hochschule Aachen , GERMANY |
Extensions of the Sign Language Recognition and Translation Corpus RWTH-PHOENIX-Weather
|
Riken Brain Science Institute, JAPAN |
Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
|
Ritsumeikan University, JAPAN |
A Japanese Word Dependency Corpus
|
Royal Institute of Technology, AUSTRALIA |
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
|
Royal Institute of Technology, School of Computer Science and Communication, Department of Speech, Music and Hearing, AUSTRALIA |
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish
The WaveSurfer Automatic Speech Recognition Plugin
|
Royal Melbourne Institute of Technology, AUSTRALIA |
The Alveo Virtual Laboratory: A Web based Repository API
|
Ruhr - Universität Bochum, GERMANY |
Building a Reference Lexicon for Countability in English
|
Rutgers University, UNITED STATES |
A New Framework for Sign Language Recognition Based on 3D Handshape Identification and Linguistic Modeling
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL
|
S |
Saarland University, GERMANY |
A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction
Interoperability of Dialogue Corpora through ISO 24617-2-based Querying
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization
Multiword Expressions in Machine Translation
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers
|
Saint John's College, Santa Fe, UNITED STATES |
Identifying Idioms in Chinese Translations
|
Samsung Advanced Institute of Technology, KOREA |
Can the Crowd be Controlled?: A Case Study on Crowd Sourcing and Automatic Validation of Completed Tasks based on User Modeling
|
Samsung Research America, UNITED STATES |
Building a Dataset for Summarization and Keyword Extraction from Emails
|
Sapienza University of Rome, ITALY |
Annotating the MASC Corpus with BabelNet
Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian
Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation
HuRIC: a Human Robot Interaction Corpus
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
An Analysis of Ambiguity in Word Sense Annotations
|
Sarah M. Taylor Consulting, LLC, UNITED STATES |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors
|
ScraperWiki, UNITED KINGDOM |
NewsReader: Recording History from Daily News Streams
|
SfS Uni Tübingen, GERMANY |
Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D
|
Siberian State Aerospace University named after academician M. F. Reshetnev, RUSSIAN FEDERATION |
Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm
|
Siemens AG, GERMANY |
LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain
|
Sikom Software GmbH, |
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
|
Simon Fraser University, CANADA |
Building a Reference Lexicon for Countability in English
|
Slovak Academy of Sciences, SLOVAKIA |
Alert!... Calm Down, There is Nothing to Worry About. Warning and Soothing Speech Synthesis
|
Sofia University and IICT-BAS, BULGARIA |
A System for Experiments with Dependency Parsers
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing
|
Soochow University, TAIWAN |
Annotating Events in an Emotion Corpus
|
Speech Group - LORIA CNRS, FRANCE |
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
|
Spoken Language Systems Laboratory, INESC ID, PORTUGAL |
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue
|
Spräkbanken, The National Library of Norway, NORWAY |
The Norwegian Dependency Treebank
|
Spräkbanken, University of Gothenburg, SWEDEN |
Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics
HFST-SweNER ― A New NER Resource for Swedish
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation
The Strategic Impact of META-NET on the Regional, National and International Level
Computer-aided Morphology Expansion for Old Swedish
A Flexible Language Learning Platform Based on Language Resources and Web Services
|
Stanford Universiy, UNITED STATES |
A Gold Standard Dependency Corpus for English
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
|
Stofnun Árna Magnússonar í íslenskum fræðum, ICELAND |
Mörkum Njálu. An Annotated Corpus to Analyse and Explain Grammatical Divergences Between 14th-century Manuscripts of Njál's Saga.
|
Syllabs, FRANCE |
Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory
Thematic Cohesion: Measuring Terms Discriminatory Power Toward Themes
|
Synerscope, NETHERLANDS |
NewsReader: Recording History from Daily News Streams
|
Synthema, RUSSIAN FEDERATION |
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
|
T |
Tagmatica + IMMI-CNRS, FRANCE |
Facing the Identification Problem in Language-Related Scientific Data Analysis.
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis
|
Taif University, SAUDI ARABIA |
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment
|
Tallinn University of Technology, ESTONIA |
Dense Components in the Structure of WordNet
|
Technical University Munich, GERMANY |
LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain
|
Technical University of Cluj-Napoca, Speech Processing Group, ROMANIA |
GRASS: the Graz corpus of Read And Spontaneous Speech
RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus
|
Technical University of Kosice, SLOVAKIA |
The Slovak Categorized News Corpus
TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation
|
Technische Universität Berlin, GERMANY |
The Weltmodell: A Data-Driven Commonsense Knowledge Base
Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction
|
Technische Universität Darmstadt, GERMANY |
Distributed Distributional Similarities of Google Books over the Centuries
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
Lexical Substitution Dataset for German
|
Technische Universität Dresden, GERMANY |
The MERLIN corpus: Learner Language and the CEFR
|
Technische Universität München / Imperial College of London, GERMANY |
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production
|
Technische Universität München, GERMANY |
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production
|
Telecommunication Laboratories, Chunghwa Telecom Co., Ltd., TAIWAN |
Sentence Rephrasing for Parsing Sentences with OOV Words
|
TETIS, Cirad, Irstea, AgroParisTech, FRANCE |
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
|
Text Shuttle GmbH, FRANCE |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
Text Shuttle GmbH, SWITZERLAND |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
The Chinese University of Hong Kong, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA |
The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis
|
The Hong Kong Polytechnic University, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA |
Annotating Events in an Emotion Corpus
|
The Hong Kong University of Science and Technology, HONG KONG SPECIAL ADMINISTRATIVE REGION OF CHINA |
Co-Training for Classification of Live or Studio Music Recordings
|
The Language Archive - DANS, GERMANY |
Experiences with the ISOcat Data Category Registry
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework
Linguistic Resources and Cats: How to Use ISOcat, RELcat and SCHEMAcat
|
The Language Archive, Max Planck Institute for Psycholinguistics, GERMANY |
The DWAN Framework: Application of a Web Annotation Framework for the General Humanities to the Domain of Language Resources
Experiences with the ISOcat Data Category Registry
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
|
The Long Now Foundation, UNITED STATES |
PanLex: Building a Resource for Panlingual Lexical Translation
|
The National Institute for Japanese Language and Linguistics, JAPAN |
Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
|
The Ohio State University, UNITED STATES |
Universal Stanford Dependencies: a Cross-Linguistic Typology
A Gold Standard Dependency Corpus for English
|
The Open University, UNITED KINGDOM |
On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter
|
The Telecommunications Research Center Vienna (FTW), AUSTRIA |
The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech
|
The University of Electro-Communications, JAPAN |
A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing JapaneseWord Vectors
|
The Árni Magnússon Institute for Icelandic Studies, ICELAND |
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Annotation Pro + TGA: Automation of Speech Timing Analysis
ISLEX ― a Multilingual Web Dictionary
Correcting Errors in a New Gold Standard for Tagging Icelandic Text
A 500 Million Word POS-Tagged Icelandic Corpus
Utilizing Constituent Structure for Compound Analysis
|
Tilburg Center for Cognition and Communication, NETHERLANDS |
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.
|
Tilburg University, NETHERLANDS |
Semantic Approaches to Software Component Retrieval with English Queries
Creating and Using Large Monolingual Parallel Corpora for Sentential Paraphrase Generation
Interoperability of Dialogue Corpora through ISO 24617-2-based Querying
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis.
Synergy of Nederlab and @PhilosTEI: Diachronic and Multilingual Text-induced Corpus Clean-up
|
Tilde, LATVIA |
Terminology Localization Guidelines for the National Scenario
Designing the Latvian Speech Recognition Corpus
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Bilingual dictionaries for all EU languages
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus
Terminology Resources and Terminology Work Benefit from Cloud Services
|
Titelbild Subtitling and Translation, GERMANY |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
Tokyo Institute of Technology, JAPAN |
Building a Corpus of Manually Revised Texts from Discourse Perspective
|
Toulouse Université & CNRS, Octogone, FRANCE |
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
|
Trendiction S.A., GERMANY |
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction
|
Trento-Rise, ITALY |
Creative Language Explorations through a high-Expressivity N-grams Query Language
Enriching the "Senso Comune" Platform with Automatically Acquired Data
|
Trinity College of Dublin, Centre for Global Intelligent Content, IRELAND |
Global Intelligent Content: Active Curation of Language Resources using Linked Data
|
Trinity College of Dublin, Centre for Next Generation Localisation, IRELAND |
Global Intelligent Content: Active Curation of Language Resources using Linked Data
|
Trojina, Institute for Applied Slovene Studies, SLOVENIA |
The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech
The Slovene BNSI Broadcast News Database and Reference Speech Corpus GOS: Towards the Uniform Guidelines for Future Work
|
Université Technologique de Troyes, FRANCE |
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech
|
Tsinghua University, CHINA |
NomLex-PT: A Lexicon of Portuguese Nominalizations
Etymological WordNet: Tracing the History of Words
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation
Language Processing Infrastructure in the XLike Project
Clustering Tweets using Wikipedia Concepts
|
Tsukuba University of Technology, JAPAN |
A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
|
Télécom ParisTech, FRANCE |
A Model to Generate Adaptive Multimodal Job Interviews with a Virtual Recruiter
|
U |
Ubiquitous Knowledge Processing (UKP) Lab, GERMANY |
WordNet―Wikipedia―Wiktionary: Construction of a Three-way Alignment
Lexical Substitution Dataset for German
|
Ugo Bordoni Foundation, ITALY |
EMOVO Corpus: an Italian Emotional Speech Database
|
UKP Lab, Technische Universität Darmstadt, GERMANY |
WordNet―Wikipedia―Wiktionary: Construction of a Three-way Alignment
|
Ulm University, GERMANY |
First Insight into Quality-Adaptive Dialogue
Comparison of Gender- and Speaker-adaptive Emotion Recognition
Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm
|
UMass CS, IBM, UNITED STATES |
A Study on Expert Sourcing Enterprise Question Collection and Classification
|
UMI 2958 CNRS - GeorgiaTech, FRANCE |
NASTIA: Negotiating Appointment Setting Interface
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
|
UMR CLLE-ERSS 5263, Université Toulouse 2/CNRS, |
TALC-Sef a Manually-revised POS-Tagged Literary Corpus in Serbian, English and French
|
UMR STL 8163, Université Lille 3, FRANCE |
TALC-Sef a Manually-revised POS-Tagged Literary Corpus in Serbian, English and French
|
Union College, UNITED KINGDOM |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
|
Unister GmbH, Leipzig, Germany, |
NIF4OGGD - NLP Interchange Format for Open German Governmental Data
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
|
Universidad Católica "Nuestra Señora de la Asunción", BRAZIL |
Guampa: a Toolkit for Collaborative Translation
|
Universidad Complutense de Madrid (UCM) / ATLAS (UNED), SPAIN |
Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations
|
Universidad Complutense de Madrid (UCM), SPAIN |
Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations
NoSta-D Named Entity Annotation for German: Guidelines and Dataset
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish
|
Universidad de Antioquia, COLOMBIA |
New Spanish Speech Corpus Database for the Analysis of People Suffering from Parkinson's Disease
|
Universidad de Valladolid, SPAIN |
On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus
Assessment of Non-native Prosody for Spanish as L2 using Quantitative Scores and Perceptual Evaluation
|
Universidad Nacional Autónoma de México, MEXICO |
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus
|
Universidad Nacional de Educación a Distancia, SPAIN |
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
|
Universidad Nacional de Guinea Ecuatorial, EQUATORIAL GUINEA |
The Gulf of Guinea Creole Corpora
|
Universidad Politécnica de Cataluña, SPAIN |
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
|
Universidad Politécnica de Madrid, SPAIN |
Enabling Language Resources to Expose Translations as Linked Data on the Web
|
Universidade de Lisboa, Centro de Linguística, PORTUGAL |
An Evaluation of the Role of Statistical Measures and Frequency for MWE Identification
LexTec - a Rich Language Resource for Technical Domains in Portuguese
The Gulf of Guinea Creole Corpora
A Cascade Approach for Complex-type Classification
Combining Dependency Information and Generalization in a Pattern-based Approach to the Classification of Lexical-Semantic Relation Instances
Extracting Semantic Relations from Portuguese Corpora using Lexical-Syntactic Patterns
|
Universidade de Lisboa, PORTUGAL |
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora
The Strategic Impact of META-NET on the Regional, National and International Level
The Gulf of Guinea Creole Corpora
A Corpus of European Portuguese Child and Child-directed Speech
|
Universidade de São Paulo, BRAZIL |
Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners
|
Universidade de Vigo, SPAIN |
Introducing a Framework for the Evaluation of Music Detection Tools
|
Universidade Federal do Paraná, BRAZIL |
NomLex-PT: A Lexicon of Portuguese Nominalizations
|
Universitaet Bielefeld, GERMANY |
ALICO: a Multimodal Corpus for the Study of Active Listening
|
Universitaet Frankfurt am Main, GERMANY |
Towards Interoperable Discourse Annotation. Discourse Features in the Ontologies of Linguistic Annotation
|
Universitat Autònoma de Barcelona, SPAIN |
On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus
DysList: An Annotated Resource of Dyslexic Errors
Discovering Frames in Specialized Domains
|
Universitat d'Alacant, SPAIN |
On the Annotation of TMX Translation Memories for Advanced Leveraging in Computer-aided Translation
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
Sharing Resources Between Free/Open-Source Rule-based Machine Translation Systems: Grammatical Framework and Apertium
|
Universitat de Barcelona, SPAIN |
VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation
MultiVal - Towards a Multilingual Valence Lexicon
The IULA Spanish LSP Treebank
|
Universitat Oberta de Catalunya, SPAIN |
Automatic Creation of WordNets from Parallel Corpora
The NewSoMe Corpus: a Unifying Opinion Annotation Framework Across Genres and in Multiple Languages
|
Universitat Politecnica de Valencia, SPAIN |
Enrichment of Bilingual Dictionary through News Stream Data
|
Universitat Politècnica de Catalunya, SPAIN |
The Strategic Impact of META-NET on the Regional, National and International Level
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Language Processing Infrastructure in the XLike Project
|
Universitat Politècnica de València, SPAIN |
Enrichment of Bilingual Dictionary through News Stream Data
Evaluating the Effects of Interactivity in a Post-Editing Workbench
Online Optimisation of Log-linear Weights in Interactive Machine Translation
|
University Pompeu Fabra, SPAIN |
How to Use Less Features and Reach Better Performance in Author Gender Identification
Boosting the Creation of a Treebank
Creating Summarization Systems with SUMMA
Modelling Irony in Twitter: Feature Analysis and Evaluation
Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish
The NewSoMe Corpus: a Unifying Opinion Annotation Framework Across Genres and in Multiple Languages
The IULA Spanish LSP Treebank
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Crowdsourcing as a Preprocessing for Complex Semantic Annotation Tasks
A Cascade Approach for Complex-type Classification
Choosing which to Use? A Study of Distributional Models for Nominal Lexical Semantic Classification
DysList: An Annotated Resource of Dyslexic Errors
Metadata as Linked Open Data: mapping disparate XML metadata registries into one RDF/OWL registry.
Ranking Job Offers for Candidates: Learning Hidden Knowledge from Big Data
Combining Dependency Information and Generalization in a Pattern-based Approach to the Classification of Lexical-Semantic Relation Instances
An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation
|
University Pompeu Fabra, Department of Information and Communication Technologies, SPAIN |
An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation
|
University Pompeu Fabra, Department of Translation and Language Sciences, SPAIN |
TexAFon 2.0: a Text Processing Tool for the Generation of Expressive Speech in TTS Applications
|
Universitatea Bucuresti, Facultatea de Matematica si Informatica, ROMANIA |
On the Romance Languages Mutual Intelligibility
Using a Machine Learning Model to Assess the Complexity of Stress Systems
Building a Dataset of Multilingual Cognates for the Romanian Lexicon
|
Universite de Bretagne Sud, IRISA, FRANCE |
Variations on Quantitative Comparability Measures and their Evaluations on Synthetic French-English Comparable Corpora
Co-clustering of Bilingual Datasets as a Mean for Assisting the Construction of Thematic Bilingual Comparable Corpora
|
Universite de Geneve, SWITZERLAND |
C-PhonoGenre: a 7-hour Corpus of 7 Speaking Styles in French: Relations between Situational Features and Prosodic Properties
|
Universiteit Antwerp, BELGIUM |
Legal Aspects of Text Mining
|
Universiteit Antwerp, CliPS, BELGIUM |
CLIPS Stylometry Investigation (CSI) Corpus: a Dutch Corpus for the Detection of Age, Gender, Personality, Sentiment and Deception in Text
The Strategic Impact of META-NET on the Regional, National and International Level
|
Universitetet i Oslo, Department of Informatics, NORWAY |
The Norwegian Dependency Treebank
|
Universitetet i Oslo, Department of Linguistics and Scandinavian Studies, NORWAY |
The Norwegian Dependency Treebank
|
Universitetet i Oslo, NORWAY |
Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows
|
Universitetet i Tromsø, NORWAY |
Finite-State Morphological Transducers for Three Kypchak Languages
|
University at Albany, Institute for Informatics, Logics and Security Studies, UNITED STATES |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors
|
University at Albany, UNITED STATES |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors
|
University College Utrecht, NETHERLANDS |
Semi-Automatic Annotation of the UCU Accents Speech Corpus
|
University Lille 1 - LIFL (CNRS/Lille 1) - SequeL team, FRANCE |
NASTIA: Negotiating Appointment Setting Interface
DINASTI: Dialogues with a Negotiating Appointment Setting Interface
|
University of Aalto, FINLAND |
SLMotion - an Extensible Sign Language Oriented Video Analysis Tool
S-pot - a Benchmark in Spotting Signs Within Continuous Signing
A Toolkit for Efficient Learning of Lexical Units for Speech Recognition
|
University of Aalto, School of Business, FINLAND |
S-pot - a Benchmark in Spotting Signs Within Continuous Signing
|
University of Alabama at Birmingham, UNITED STATES |
ClearTK 2.0: Design Patterns for Machine Learning in UIMA
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities
|
University of Alberta, UNITED STATES |
Improving Open Relation Extraction via Sentence Re-Structuring
|
University of Amsterdam, Institute for Logic, Language, and Computation, NETHERLANDS |
All Fragments Count in Parser Evaluation
|
University of Amsterdam, NETHERLANDS |
Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS
All Fragments Count in Parser Evaluation
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
|
University of Arizona, UNITED STATES |
ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain
Eliciting and Annotating Uncertainty in Spoken Language
Event Extraction Using Distant Supervision
On the Importance of Text Analysis for Stock Price Prediction
The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories
|
University of Aston, UNITED KINGDOM |
On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter
|
University of Austin, Texas, UNITED STATES |
Exploring Factors that Contribute to Successful Fingerspelling Comprehension
|
University of Aveiro, PORTUGAL |
Multimodal Corpora for Silent Speech Interaction
|
University of Bamberg, GERMANY |
PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone
|
University of Belgrade, SERBIA |
The Strategic Impact of META-NET on the Regional, National and International Level
|
University of Bergen, Department of Linguistic, Literary, and Aesthetic Studies, NORWAY |
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows
|
University of Bergen, NORWAY |
The Interplay Between Lexical and Syntactic Resources in Incremental Parsebanking
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser
Chasing the Perfect Splitter: A Comparison of Different Compound Splitting Tools
|
University of Birmingham, UNITED KINGDOM |
Mining Online Discussion Forums for Metaphors
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment
|
University of Bologna, ITALY |
A Modular System for Rule-based Text Categorisation
|
University of Boston, UNITED STATES |
A New Framework for Sign Language Recognition Based on 3D Handshape Identification and Linguistic Modeling
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL
|
University of Brandeis, UNITED STATES |
Image Annotation with ISO-Space: Distinguishing Content from Structure
Buy One Get One Free: Distant Annotation of Chinese Tense, Event Type and Modality
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Identification of Multiword Expressions in the brWaC
The Language Application Grid
|
University of Brighton, UNITED KINGDOM |
A Comparative Evaluation Methodology for NLG in Interactive Systems
|
University of British Columbia, CANADA |
Morphological Parsing of Swahili using Crowdsourced Lexical Resources
|
University of Bucharest, ROMANIA |
On the Romance Languages Mutual Intelligibility
Aggregation Methods for Efficient Collocation Detection
Using a Machine Learning Model to Assess the Complexity of Stress Systems
Building a Dataset of Multilingual Cognates for the Romanian Lexicon
|
University of California Santa Cruz, UNITED STATES |
Getting Reliable Annotations for Sarcasm in Online Dialogues
|
University of Cambridge (DTAL), UNITED KINGDOM |
Semantic Clustering of Pivot Paraphrases
|
University of Cambridge, UNITED KINGDOM |
Native Language Identification Using Large, Longitudinal Data
Evaluation of Simple Distributional Compositional Operations on Longer Texts
TagNText: a Parallel Corpus for the Induction of Resource-specific non-Taxonomical Relations from Tagged Images
|
University of Chiba, Faculty of Letters, JAPAN |
Design and Development of an RDB Version of the Corpus of Spontaneous Japanese
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features
Japanese Conversation Corpus for Training and Evaluation of Backchannel Prediction Model.
|
University of Coastal Carolina, UNITED STATES |
The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories
|
University of Coimbra, Centre for Informatics and Systems, PORTUGAL |
Exploiting Portuguese Lexical Knowledge Bases for Answering Open Domain Cloze Questions Automatically
|
University of Coimbra, PORTUGAL |
HESITA(te) in Portuguese
|
University of Cologne, GERMANY |
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages
Learning from Domain Complexity
VarClass: An Open-source Language Identification Tool for Language Varieties
|
University of Colorado at Boulder, UNITED STATES |
PropBank: Semantics of New Predicate Types
ClearTK 2.0: Design Patterns for Machine Learning in UIMA
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
Mapping CPA Patterns onto OntoNotes Senses
|
University of Colorado School of Medicine, UNITED STATES |
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
|
University of Colorado, UNITED STATES |
PropBank: Semantics of New Predicate Types
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features
Focusing Annotation for Semantic Role Labeling
|
University of Columbia, Center for Computational Learning Systems, UNITED STATES |
Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
|
University of Columbia, UNITED STATES |
Mapping Between English Strings and Reentrant Semantic Graphs
Towards Automatic Detection of Narrative Structure
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese
A Conventional Orthography for Tunisian Arabic
Annotating the MASC Corpus with BabelNet
A Multidialectal Parallel Corpus of Arabic
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
Large Scale Arabic Error Annotation: Guidelines and Framework
|
University of Concordia, UNITED STATES |
Improving Entity Linking using Surface Form Refinement
|
University of Copenhagen, Center for Language Technology, DENMARK |
Augmenting English Adjective Senses with Supersenses
Using TEI, CMDI and ISOcat in CLARIN-DK
Crowdsourcing and Annotating NER for Twitter #drift
When POS Data Sets Don't Add Up: Combatting Sample Bias
|
University of Copenhagen, DENMARK |
SenTube: A Corpus for Sentiment Analysis on YouTube Social Media
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Crowdsourcing and Annotating NER for Twitter #drift
Crowdsourcing as a Preprocessing for Complex Semantic Annotation Tasks
When POS Data Sets Don't Add Up: Combatting Sample Bias
Transfer Learning of Feedback Head Expressions in Danish and Polish Comparable Multimodal Corpora
Encompassing a Spectrum of LT Users in the CLARIN-DK Infrastructure
|
University of Cornell, UNITED STATES |
Annotating Relations in Scientific Articles
|
University of Coventry, UNITED KINGDOM |
The Pragmatic Annotation of a Corpus of Academic Lectures
|
University of Dallas, Texas, UNITED STATES |
SinoCoreferencer: An End-to-End Chinese Event Coreference Resolver
Annotating Inter-Sentence Temporal Relations in Clinical Notes
Clinical Data-Driven Probabilistic Graph Processing
Multilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses
|
University of Deusto, LinguaMedia Group, SPAIN |
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
|
University of Dublin City, Centre for Global Intelligent Content, IRELAND |
Cross-Language Authorship Attribution
|
University of Dublin City, School of Computing, Centre for Global Intelligent Content, IRELAND |
MTWatch: A Tool for the Analysis of Noisy Parallel Data
|
University of Dundee, UNITED KINGDOM |
A Model for Processing Illocutionary Structures and Argumentation in Debates
|
University of Düsseldorf, GERMANY |
Discosuite - A Parser Test Suite for German Discontinuous Structures
On Complex Word Alignment Configurations
|
University of Eastern Michigan ― LINGUIST List, UNITED STATES |
Visualization of Language Relations and Families: MultiTree
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework
|
University of Economics and Business, Athens, GREECE/a> |
A Tool Suite for Creating Question Answering Benchmarks
|
University of Economics, Prague, CZECH REPUBLIC |
Towards Linked Hypernyms Dataset 2.0: Complementing DBpedia with Hypernym Discovery
|
University of Edinburgh, Centre for Speech Technology Research, UNITED KINGDOM |
RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus
|
University of Edinburgh, PPLS and HCRC, UNITED KINGDOM |
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
|
University of Edinburgh, Schoolf of Informatics, UNITED KINGDOM |
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections
|
University of Edinburgh, UNITED KINGDOM |
N-gram Counts and Language Models from the Common Crawl
ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT
An Analysis of Older Users' Interactions with Spoken Dialogue Systems
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections
|
University of Education Karlsruhe, GERMANY |
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification
|
University of Essex, UNITED KINGDOM |
AraNLP: a Java-based Library for the Processing of Arabic Text.
|
University of Florence, ITALY |
The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action
|
University of Gallaudet, UNITED STATES |
Expanding N-gram Analytics in ELAN and a Case Study for Sign Synthesis
The Use of a FileMaker Pro Database in Evaluating Sign Language Notation Systems
|
University of Geneva, SWITZERLAND |
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users
Cross-Linguistic Annotation of Narrativity for English/French Verb Tense Disambiguation
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation
C-PhonoGenre: a 7-hour Corpus of 7 Speaking Styles in French: Relations between Situational Features and Prosodic Properties
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
|
University of Georgetown, UNITED STATES |
A Framework for Public Health Surveillance
|
University of Ghana, INDIA |
MultiVal - Towards a Multilingual Valence Lexicon
|
University of Ghent, BELGIUM |
Speech Recognition Web Services for Dutch
On the Origin of Errors: a Fine-Grained Analysis of MT and PE Errors and their Relationship
Recognising Suicidal Messages in Dutch Social Media
|
University of Ghent, Language and Translation Technology Team, BELGIUM |
Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch
Towards Shared Datasets for Normalization Research
|
University of Glasgow, UNITED KINGDOM |
The SSPNet-Mobile Corpus: Social Signal Processing Over Mobile Phones.
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard
|
University of Gothenburg and Chalmers University of Technology, SWEDEN |
Bootstrapping Open-Source English-Bulgarian Computational Dictionary
|
University of Gothenburg, SWEDEN |
The DWAN Framework: Application of a Web Annotation Framework for the General Humanities to the Domain of Language Resources
Extracting a bilingual semantic grammar from FrameNet-annotated corpora
HFST-SweNER ― A New NER Resource for Swedish
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation
Computer-aided Morphology Expansion for Old Swedish
A Flexible Language Learning Platform Based on Language Resources and Web Services
Reusing Swedish FrameNet for Training Semantic Roles
|
University of Gothenburg/ JSPS/ National Institute of Informatics, SWEDEN |
A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
|
University of Granada, SPAIN |
Definition Patterns for Predicative Terms in Specialized Lexical Resources
A Model to Generate Adaptive Multimodal Job Interviews with a Virtual Recruiter
|
University of Groningen, NETHERLANDS |
A Set of Open Source Tools for Turkish Natural Language Processing
Treelet Probabilities for HPSG Parsing and Error Correction
|
University of Hamburg, GERMANY |
Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS
Because Size Does Matter: The Hamburg Dependency Treebank
|
University of Harvard, UNITED STATES |
Eliciting and Annotating Uncertainty in Spoken Language
|
University of Helsinki, Department of Modern Languages, FINLAND |
HFST-SweNER ― A New NER Resource for Swedish
|
University of Helsinki, FINLAND |
Open-domain Interaction and Online Content in the Sami Language
The Strategic Impact of META-NET on the Regional, National and International Level
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
Computer-aided Morphology Expansion for Old Swedish
ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain
Heuristic Hyper-minimization of Finite State Lexicons
|
University of Hildesheim, GERMANY |
Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS
|
University of Iceland, ICELAND |
The Strategic Impact of META-NET on the Regional, National and International Level
Correcting Errors in a New Gold Standard for Tagging Icelandic Text
An Innovative World Language Centre : Challenges for the Use of Language Technology
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
|
University of Illinois at Urbana-Champaign, UNITED STATES |
ILLINOISCLOUDNLP: Text Analytics Services in the Cloud
|
University of Illinois, UNITED STATES |
Development of a TV Broadcasts Speech Recognition System for Qatari Arabic
Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech
|
University of Indiana, UNITED STATES |
Finite-State Morphological Transducers for Three Kypchak Languages
Guampa: a Toolkit for Collaborative Translation
Discosuite - A Parser Test Suite for German Discontinuous Structures
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer
SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis
|
University of International Relations, CHINA |
The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank
|
University of Jordan, JORDAN |
Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody
|
University of Jyväskylä, FINLAND |
SLMotion - an Extensible Sign Language Oriented Video Analysis Tool
S-pot - a Benchmark in Spotting Signs Within Continuous Signing
|
University of Konstanz, GERMANY |
The CLE Urdu POS Tagset
|
University of Lancaster, UNITED KINGDOM |
Detecting Document Structure in a Very Large Corpus of UK Financial Reports
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard
|
University of Latvia, Faculty of Computing, LATVIA |
Coreference Resolution for Latvian
|
University of Latvia, Institute of Mathematics and Computer Science, LATVIA |
Designing the Latvian Speech Recognition Corpus
Dependency Parsing Representation Effects on the Accuracy of Semantic Applications - an Example of an Inflective Language
|
University of Latvia, LATVIA |
Extracting a bilingual semantic grammar from FrameNet-annotated corpora
Terminology Localization Guidelines for the National Scenario
Using C5.0 and Exhaustive Search for Boosting Frame-Semantic Parsing Accuracy
|
University of Leeds, School of Computing, UNITED KINGDOM |
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing
|
University of Leeds, School of Modern Languages and Cultures, UNITED KINGDOM |
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing
|
University of Leeds, UNITED KINGDOM |
Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody
Can Crowdsourcing be used for Effective Annotation of Arabic?
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing
|
University of Ljubljana, SLOVENIA |
sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages
|
University of Lorraine, FRANCE |
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process
|
University of Macau, MOROCCO |
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation
|
University of Malta, MALTA |
Automatic Methods for the Extension of a Bilingual Dictionary using Comparable Corpora
The Strategic Impact of META-NET on the Regional, National and International Level
Crowd-sourcing Evaluation of Automatically Acquired, Morphologically Related Word Groupings
|
University of Manchester ― National Centre for Text Mining, UNITED KINGDOM |
Interoperability and Customisation of Annotation Schemata in Argo
The Meta-knowledge of Causality in Biomedical Scientific Discourse
The Strategic Impact of META-NET on the Regional, National and International Level
Locating Requests among Open Source Software Communication Messages
|
University of Manchester, UNITED KINGDOM |
Interoperability and Customisation of Annotation Schemata in Argo
The Strategic Impact of META-NET on the Regional, National and International Level
Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline
Locating Requests among Open Source Software Communication Messages
|
University of Mannheim, GERMANY |
DBpedia Domains: Augmenting DBpedia with Domain Information
|
University of Maribor, SLOVENIA |
Machine Translation for Subtitling: A Large-Scale Evaluation
The Slovene BNSI Broadcast News Database and Reference Speech Corpus GOS: Towards the Uniform Guidelines for Future Work
|
University of Melbourne, AUSTRALIA |
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources
|
University of Memphis, Department of Computer Science, UNITED STATES |
Latent Semantic Analysis Models on Wikipedia and TASA
|
University of Memphis, UNITED STATES |
On Paraphrase Identification Corpora
The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems
Latent Semantic Analysis Models on Wikipedia and TASA
|
University of Michigan - Flint, UNITED STATES |
A Multimodal Dataset for Deception Detection
|
University of Michigan Ann Arbor, UNITED STATES |
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
|
University of Michigan, UNITED STATES |
Building a Dataset for Summarization and Keyword Extraction from Emails
Modeling Language Proficiency Using Implicit Feedback
A Multimodal Dataset for Deception Detection
|
University of Mons, BELGIUM |
The AV-LASYN Database: a Synchronous Corpus of Audio and 3D Facial Marker Data for Audio-Visual Laughter Synthesis
|
University of North Texas, UNITED STATES |
Building a Dataset for Summarization and Keyword Extraction from Emails
Modeling Language Proficiency Using Implicit Feedback
A Multimodal Dataset for Deception Detection
|
University of Osaka, JAPAN |
Web-imageability of the Behavioral Features of Basic-level Concepts
|
University of Oslo, NORWAY |
Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data
Treelet Probabilities for HPSG Parsing and Error Correction
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows
|
University of Padua, ITALY |
A Vector Space Model for Syntactic Distances Between Dialects
|
University of Pennsylvania, Linguistic Data Consortium, UNITED STATES |
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
The RATS Collection: Supporting HLT Research with Degraded Audio Data
New Directions for Language Resource Development and Distribution
|
University of Pennsylvania, UNITED STATES |
RESTful Annotation and Efficient Collaboration
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus
New Directions for Language Resource Development and Distribution
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic
The Multilingual Paraphrase Database
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian
The American Local News Corpus
|
University of Pisa, ITALY |
The Making of Ancient Greek WordNet
Crowdsourcing for the Identification of Event Nominals: an Experiment
Bootstrapping an Italian VerbNet: data-driven analysis of verb alternations
Choosing which to Use? A Study of Distributional Models for Nominal Lexical Semantic Classification
|
University of Pittsburgh, Computer Science Department, UNITED STATES |
A Comparison of MT Errors and ESL Errors
|
University of Pittsburgh, Intelligent Systems Program, UNITED STATES |
A Comparison of MT Errors and ESL Errors
|
University of Pittsburgh, UNITED STATES |
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness
|
University of Porto - Faculty of Law, School of Criminology, PORTUGAL |
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
|
University of Porto - Faculty of Psychology and Educational Sciences, PORTUGAL |
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments
|
University of Rochester, UNITED KINGDOM |
The CMU METAL Farsi NLP Approach
|
University of Rome Tor Vergata, ITALY |
A Meta-data Driven Platform for Semi-automatic Configuration of Ontology Mediators
HuRIC: a Human Robot Interaction Corpus
EMOVO Corpus: an Italian Emotional Speech Database
|
University of Rome Tor Vergata, Ugo Bordoni Foundation, ITALY |
EMOVO Corpus: an Italian Emotional Speech Database
|
University of Saarbrücken, GERMANY |
Exploring and Visualizing Variation in Language Resources
|
University of Salerno, ITALY |
From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach.
|
University of Santiago de Compostela, SPAIN |
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis
Multilingual Corpora with Coreferential Annotation of Person Entities
|
University of Sassari, ITALY |
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
|
University of Sfax, ANLP Research group, MIRACL Lab., TUNISIA |
A Conventional Orthography for Tunisian Arabic
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
|
University of Sheffield, UNITED KINGDOM |
Bootstrapping Term Extractors for Multiple Languages
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests
A Quality-based Active Sample Selection Strategy for Statistical Machine Translation
Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis.
Bilingual dictionaries for all EU languages
An Efficient and User-friendly Tool for Machine Translation Quality Estimation
Creating Summarization Systems with SUMMA
Modelling Irony in Twitter: Feature Analysis and Evaluation
|
University of South Africa, SOUTH AFRICA |
Missed Opportunities in Translation Memory Matching
|
University of Southern California, Information Sciences Institute, UNITED STATES |
Mapping Between English Strings and Reentrant Semantic Graphs
|
University of Southern California, Institute for Creative Technologies, UNITED STATES |
The Distress Analysis Interview Corpus of Human and Computer Interviews
A Multimodal Corpus of Rapid Dialogue Games
|
University of Southern California, SAIL, UNITED STATES |
Word Semantic Similarity for Morphologically Rich Languages
|
University of Southern California, UNITED STATES |
A Corpus of Participant Roles in Contentious Discussions
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
The Distress Analysis Interview Corpus of Human and Computer Interviews
Clustering Tweets using Wikipedia Concepts
|
University of Southern Denmark, DENMARK |
ML-Optimization of Ported Constraint Grammars
A 500 Million Word POS-Tagged Icelandic Corpus
|
University of Southern Denmark, Odense, Department of Language and Communication, DENMARK |
High Quality Word Lists as a Resource for Multiple Purposes
|
University of Stanford, UNITED STATES |
Universal Stanford Dependencies: a Cross-Linguistic Typology
On the Importance of Text Analysis for Stock Price Prediction
A Gold Standard Dependency Corpus for English
N-gram Counts and Language Models from the Common Crawl
Event Extraction Using Distant Supervision
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
|
University of Suttgart, Institut für Maschinelle Sprachverarbeitung, GERMANY |
A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data
|
University of Szeged, HUNGARY |
Szeged Corpus 2.5: Morphological Modifications in a Manually POS-tagged Hungarian Corpus
4FX: Light Verb Constructions in a Multilingual Parallel Corpus
Automatic Error Detection Concerning the Definite and Indefinite Conjugation in the HunLearner Corpus
|
University of São Paulo, BRAZIL |
Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners
A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words
|
University of Tartu, ESTONIA |
The Strategic Impact of META-NET on the Regional, National and International Level
Towards an Integration of Syntactic and Temporal Annotations in Estonian
Dense Components in the Structure of WordNet
|
University of the Basque Country (UPV/EHU), SPAIN |
The Strategic Impact of META-NET on the Regional, National and International Level
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization
Basque Speecon-like and Basque SpeechDAT MDB-600: Speech Databases for the Development of ASR Technology for Basque
First Approach toward Semantic Role Labeling for Basque
KALAKA-3: a Database for the Recognition of Spoken European Languages on YouTube Audios
A Stream Computing Approach Towards Scalable NLP
|
University of Tokyo, JAPAN |
Corpus for Coreference Resolution on Scientific Papers
|
University of Tokyo, National Institute of Informatics, JAPAN |
A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations
Corpus for Coreference Resolution on Scientific Papers
Annotation of Computer Science Papers for Semantic Relation Extrac-tion
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving
|
University of Torino, Department of Computer Science, ITALY |
Exploiting Catenae in a Parallel Treebank Alignment
|
University of Torino, Department of Foreign Languages and Literatures and Modern Cultures, ITALY |
Exploiting Catenae in a Parallel Treebank Alignment
|
University of Torino, ITALY |
Exploiting networks in Law
|
University of Trento, ITALY |
Cross-Language Authorship Attribution
SenTube: A Corpus for Sentiment Analysis on YouTube Social Media
A SICK Cure for the Evaluation of Compositional Distributional Semantic Models
The Development of the Multilingual LUNA Corpus for Spoken Language System Porting
|
University of Turku, FINLAND |
Universal Stanford Dependencies: a Cross-Linguistic Typology
|
University of Twente, NETHERLANDS |
Croatian Memories
Twente Debate Corpus ― A Multimodal Corpus for Head Movement Analysis
|
University of Tübingen, GERMANY |
Towards Automatic Quality Assessment of Component Metadata
Automatic Refinement of Syntactic Categories in Chinese Word Structures
How to Tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars
The MERLIN corpus: Learner Language and the CEFR
Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS
|
University of Valladolid, SPAIN |
On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus
Assessment of Non-native Prosody for Spanish as L2 using Quantitative Scores and Perceptual Evaluation
|
University of Vienna, AUSTRIA |
The Strategic Impact of META-NET on the Regional, National and International Level
|
University of Vienna, MODUL, AUSTRIA |
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
|
University of Vigo, SPAIN |
Introducing a Framework for the Evaluation of Music Detection Tools
The Strategic Impact of META-NET on the Regional, National and International Level
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis
|
University of Virginia, UNITED STATES |
Benchmarking Twitter Sentiment Analysis Tools
|
University of Warsaw, POLAND |
Walenty: Towards a Comprehensive Valence Dictionary of Polish
|
University of Waseda, Faculty of Human Sciences, JAPAN |
Estimation of Speaking Style in Speech Corpora Focusing on Speech Transcriptionss
|
University of Washington, UNITED STATES |
Enriching ODIN
A Finite-State Morphological Analyzer for a Lakota HPSG Grammar
Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties
Annotating Clinical Events in Text Snippets for Phenotype Detection
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar
Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix
|
University of Western Sydney, The MARCS Institute, AUSTRALIA |
AusTalk: an Audio-Visual Corpus of Australian English
The Alveo Virtual Laboratory: A Web based Repository API
|
University of Wolverhampton, Research Institute in Information and Language Processing, UNITED KINGDOM |
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
|
University of Wolverhampton, UNITED KINGDOM |
Using a Machine Learning Model to Assess the Complexity of Stress Systems
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
Mapping CPA Patterns onto OntoNotes Senses
|
University of Yamanashi, JAPAN |
Annotating the Focus of Negation in Japanese Text
|
University of York, UNITED KINGDOM |
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment
|
University of Zagreb, CROATIA |
HiEve: A Corpus for Extracting Event Hierarchies from News Stories
CroDeriV: a New Resource for Processing Croatian Morphology
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites
Language Processing Infrastructure in the XLike Project
The SETimes.HR Linguistically Annotated Corpus of Croatian
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Quality Estimation for Synthetic Parallel Data Generation
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages
caWaC - a Web Corpus of Catalan and its Application to Language Modeling and Machine Translation
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
|
University of Zagreb, Faculty of Electrical Engineering and Computing, CROATIA |
HiEve: A Corpus for Extracting Event Hierarchies from News Stories
DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian
BiographyNet: Methodological Issues when NLP Supports Historical Research
Hope and Fear: How Opinions Influence Factuality
|
University of Zagreb, Faculty of Humanities and Social Sciences, CROATIA |
CroDeriV: a New Resource for Processing Croatian Morphology
The Strategic Impact of META-NET on the Regional, National and International Level
Language Processing Infrastructure in the XLike Project
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation
|
University of Zurich, SWITZERLAND |
Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs
Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction
Zmorge: A German Morphological Lexicon Extracted from Wiktionary
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users
Constructing and Exploiting an Automatically Annotated Resource of Legislative Texts
Innovations in Parallel Corpus Search Tools
|
University of Lódz, POLAND |
The Strategic Impact of META-NET on the Regional, National and International Level
|
Università Cattolica del Sacro Cuore, Centro Interdisciplinare di Ricerche per la Computerizzazione dei Segni dellEspressione, ITALY |
Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D
|
Università Cattolica del Sacro Cuore, ITALY |
A Compact Interactive Visualization of Dependency Treebank Query Results
|
Università degli studi di Torino, ITALY |
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web
|
Università di Modena e Reggio Emilia, ITALY |
Presenting a System of Human-Machine Interaction for Performing Map Tasks.
|
Università di Pavia, ITALY |
T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing
|
Università di Pisa, ITALY |
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies
|
Università di Salerno, DSPSC, ITALY |
From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach.
|
Università di Torino, ITALY |
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies
|
Università di Trento, ITALY |
A SICK Cure for the Evaluation of Compositional Distributional Semantic Models
|
Universität Bielefeld, GERMANY |
ALICO: a Multimodal Corpus for the Study of Active Listening
Annotation Pro + TGA: Automation of Speech Timing Analysis
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
The USAGE Review Corpus for Fine Grained Multi Lingual Opinion Analysis
|
Universität Des Saarlandes, Saarbrücken, |
Word Alignment-Based Reordering of Source Chunks in PB-SMT
|
Universität des Saarlandes, Institut der Gesellschaft zur Förderung der Angewandten Informationsforschung, GERMANY |
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate
|
Universität Frankfurt, GERMANY |
ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English
|
Universität Hamburg, GERMANY |
Because Size Does Matter: The Hamburg Dependency Treebank
|
Universität Hildesheim, GERMANY |
The eIdentity Text Exploration Workbench
|
Universität Leipzig, Agile Knowledge Engineering and Semantic Web research Group, GERMANY |
Global Intelligent Content: Active Curation of Language Resources using Linked Data
NIF4OGGD - NLP Interchange Format for Open German Governmental Data
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
|
Universität Leipzig, Department of English, GERMANY |
High Quality Word Lists as a Resource for Multiple Purposes
|
Universität Leipzig, GERMANY |
The Making of Ancient Greek WordNet
PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone
Vocabulary-Based Language Similarity using Web Corpora
Learning from Domain Complexity
Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts
High Quality Word Lists as a Resource for Multiple Purposes
NIF4OGGD - NLP Interchange Format for Open German Governmental Data
A 500 Million Word POS-Tagged Icelandic Corpus
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
Open Philology at the University of Leipzig
A Tool Suite for Creating Question Answering Benchmarks
|
Universität Leipzig, Natural Language Processing Group, Computer Science Department, GERMANY |
Vocabulary-Based Language Similarity using Web Corpora
High Quality Word Lists as a Resource for Multiple Purposes
A 500 Million Word POS-Tagged Icelandic Corpus
|
Universität Munchen, Institute of Phonetics and Speech Processing, GERMANY |
The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech
Online Experiments with the Percy Software Framework - Experiences and some Early Results
|
Universität München, GERMANY |
Introducing a Web Application for Labeling, Visualizing Speech and Correcting Derived Speech Signals
|
Universität Potsdam, GERMANY |
The KiezDeutsch Korpus (KiDKo) Release 1.0
A German Twitter Snapshot
The eIdentity Text Exploration Workbench
Potsdam Commentary Corpus 2.0: Annotation for Discourse Research
A Model for Processing Illocutionary Structures and Argumentation in Debates
GraPAT: a Tool for Graph Annotations
|
Universität Stuttgart, GERMANY |
Parsing Heterogeneous Corpora with a Rich Dependency Grammar
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Re-Ranking Feature
A Corpus of Comparisons in Product Reviews
Textual Emigration Analysis (TEA)
An Out-of-Domain Test Suite for Dependency Parsing of German
Textual Emigration Analysis (TEA)
The eIdentity Text Exploration Workbench
Mapping Between English Strings and Reentrant Semantic Graphs
Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database
A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions
UnixMan Corpus: A Resource for Language Learning in the Unix Domain
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality
|
Universität Stuttgart, Institute for Natural Language Processing, GERMANY |
Exploring the Utility of Coreference Chains for Improved Identification of Personal Names
|
Universität Tübingen, GERMANY |
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications
The MERLIN corpus: Learner Language and the CEFR
A Compact Interactive Visualization of Dependency Treebank Query Results
|
Universität Bielefeld, GERMANY |
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0
|
Université Aix-Marseille, CNRS, FRANCE |
Representing Multimodal Linguistic Annotated Data
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
Segmentation Evaluation Metrics, a Comparison Grounded on Prosodic and Discourse Units
|
Université Aix-Marseille, FRANCE |
A Graph-Based Approach for Computing Free Word Associations
Corpus-Based Computation of Reverse Associations
Using Word Familiarities and Word Associations to Measure Corpus Representativeness
A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org
|
Université Aix-Marseille, Laboratoire d'Informatique Fondamentale, FRANCE |
A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
|
Université Aix-Marseille, Laboratoire des Sciences de l'Information et des Systèmes, FRANCE |
A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org
|
Université Aix-Marseille, Laboratoire Parole et Langage, CNRS, FRANCE |
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue
|
Université Catholique de Louvain, BELGIUM |
HiEve: A Corpus for Extracting Event Hierarchies from News Stories
FLELex: a graded Lexical Resource for French Foreign Learners
Linking Pictographs to Synsets: Sclera2Cornetto
Speech Recognition Web Services for Dutch
Linguistic Resources and Cats: How to Use ISOcat, RELcat and SCHEMAcat
Multiple Choice Question Corpus Analysis for Distractor Characterization
TermWise: A CAT-tool with Context-Sensitive Terminological Support.
Praaline: Integrating Tools for Speech Corpus Research
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
|
Université Catholique de Louvain, CCL, BELGIUM |
Polish Coreference Corpus in Numbers
The Polish Summaries Corpus
Digital Library 2.0: Source of Knowledge and Research Collaboration Platform
The Strategic Impact of META-NET on the Regional, National and International Level
Measuring Readability of Polish Texts: Baseline Experiments
The Procedure of Lexico-Semantic Annotation of Składnica Treebank
Projection-based Annotation of a Polish Dependency Treebank
|
Université d'Orléans, Laboratoire Ligérien de Linguistique, FRANCE |
ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures
|
Université de Bretagne-Sud, FRANCE |
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
Co-clustering of Bilingual Datasets as a Mean for Assisting the Construction of Thematic Bilingual Comparable Corpora
|
Université de Genève, Laboratoire d'Analyse et de Traitement du Langage ― Centre Universitaire d'Informatique, SWITZERLAND |
SwissAdmin: a Multilingual Tagged Parallel Corpus of Press Releases
A Language-independent and fully Unsupervised Approach to Lexicon Induction and Part-of-Speech Tagging for Closely Related Languages
|
Université de Genève, SWITZERLAND |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation
Using a Serious Game to Collect a Child Learner Speech Corpus
|
Université de Grenoble, Groupe d'Étude pour la Traduction/le Traitement Automatique des Langues et de la Parole ― Laboratoire d'Informatique de Grenoble, FRANCE |
MotàMot Project: Conversion of a French-Khmer Published Dictionary for Building a Multilingual Lexical System
|
Université de Lorraine / LORIA, FRANCE |
Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
Deep Syntax Annotation of the Sequoia French Treebank
Mapping the Lexique des Verbes du français (Lexicon of French Verbs) to a NLP Lexicon using Examples
Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose
|
Université de Montréal, CANADA |
An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
|
Université de Montréal, Observatoire de linguistique Sens-Texte, CANADA |
Definition Patterns for Predicative Terms in Specialized Lexical Resources
Discovering Frames in Specialized Domains
|
Université du Maine, LIUM, FRANCE |
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
"Human Annotation of ASR Error Regions: is ""gravity"" a Sharable Concept for Human Annotators?"
|
Université Européenne de Bretagne, IRISA, FRANCE |
ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures
|
Université François Rabelais de Tours, LI, FRANCE |
ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures
Polish Coreference Corpus in Numbers
|
Université Montpellier 1, Centre dEtudes Politiques de lEurope Latine, FRANCE |
Investigating the Image of Entities in Social Media: Dataset Design and First Results
|
Université Montpellier 2, LIRMM, FRANCE |
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach
|
Université Paris 3 - Sorbonne Nouvelle, LaTTiCe, FRANCE |
ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain
Macrosyntactic Segmenters of a French Spoken Corpus
|
Université Paris Diderot / INRIA ― Alpage, FRANCE |
Deep Syntax Annotation of the Sequoia French Treebank
Developing a French FrameNet: Methodology and First results
|
Université Paris Diderot, Centre de Linguistique Inter-langues, de Lexicologie, de Linguistique Anglaise et de Corpus , FRANCE |
Using a Machine Learning Model to Assess the Complexity of Stress Systems
|
Université Paris Diderot, FRANCE |
Adapting VerbNet to French using Existing Resources
Semantic Clustering of Pivot Paraphrases
Extending the Coverage of a MWE Database for Persian CPs Exploiting Valency Alternations
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
|
Université Paris Ouest Nanterre, FRANCE |
Macrosyntactic Segmenters of a French Spoken Corpus
|
Université Paris Sorbonne (Paris IV), FRANCE |
Deep Syntax Annotation of the Sequoia French Treebank
|
Université Paris Sud / CNRS ― LIMSI, FRANCE |
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises
TVD: A Reproducible and Multiply Aligned TV Series Dataset
|
Université Paris-Sud / LIMSI-CNRS, FRANCE |
Rule-based Reordering Space in Statistical Machine Translation
|
Université Saint-Louis, BELGIUM |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
|
Université Sorbonne Nouvelle, FRANCE |
Extending the Coverage of a MWE Database for Persian CPs Exploiting Valency Alternations
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie
|
Université Toulouse 2 Le Mirail & IRIT, FRANCE |
VOAR: A Visual and Integrated Ontology Alignment Environment
|
Uppsala Universitet, SWEDEN |
Universal Stanford Dependencies: a Cross-Linguistic Typology
Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics
ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus
A Flexible Language Learning Platform Based on Language Resources and Web Services
|
Uppsala University, Department of Linguistics and Philology, SWEDEN |
A Persian Treebank with Stanford Typed Dependencies
|
UPS, FRANCE |
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores
|
UPV/EHU, SPAIN |
Predicate Matrix: extending SemLink through WordNet mappings
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools
|
USC ICT, UNITED STATES |
The Distress Analysis Interview Corpus of Human and Computer Interviews
|
Utrecht University, NETHERLANDS |
The Strategic Impact of META-NET on the Regional, National and International Level
CLARIN-NL: Major Results
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive
|
Utrecht University, Université Catholique de Louvain, NETHERLANDS |
Experiences with the ISOcat Data Category Registry
|
V |
Vassar College, UNITED STATES |
The Language Application Grid
|
Verbatim Analysis - Vera & Inria, FRANCE |
An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora
|
Viavoo & Université Paris-Diderot, FRANCE |
A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon
|
Vicomtech-IK4, SPAIN |
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling
Machine Translation for Subtitling: A Large-Scale Evaluation
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
|
Voice & Script International, UNITED KINGDOM |
Machine Translation for Subtitling: A Large-Scale Evaluation
|
VoiceInteraction, PORTUGAL |
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
|
Vrije Universiteit Amsterdam, NETHERLANDS |
BiographyNet: Methodological Issues when NLP Supports Historical Research
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web
Hope and Fear: How Opinions Influence Factuality
NewsReader: Recording History from Daily News Streams
Discovering and Visualising Stories in News
Using a Sledgehammer to Crack a Nut? Lexical Diversity and Event Coreference Resolution
Generating Polarity Lexicons with WordNet Propagation in 5 Languages
|
VTEX, BRAZIL |
Language Editing Dataset of Academic Texts
|
|
|