|
PAPERS: Browse articles of the conference sorted by title
1 - 4 - A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - Y
A |
|
A Bilingual Discourse Corpus and Its Applications |
Yang Liu, Jiajun Zhang, Chengqing Zong, Yating Yang and Xi Zhou |
Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization |
Hiroki Mori, Atsushi Nagaoka and Yoshiko Arimoto |
Accessing and Elaborating Walenty - a Valence Dictionary of Polish - via Internet Browser |
Bartłomiej Nitoń, Tomasz Bartosiak Elżbieta Hajnicz |
Accurate Deep Syntactic Parsing of Graphs: The Case of French |
Corentin Ribeyre, Eric Villemonte de la Clergerie and Djamé Seddah |
A Classification-based Approach to Economic Event Detection in Dutch News Text |
Els Lefever and Véronique Hoste |
A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems |
Patricia Braunger, Hansjörg Hofmann, Steffen Werner and Maria Schmidt |
A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances |
Roman Sergienko, Muhammad Shan and Wolfgang Minker |
A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings |
Aitor García Pablos, Montse Cuadros and German Rigau |
A comparison of Named-Entity Disambiguation and Word Sense Disambiguation |
Angel Chang, Valentin I. Spitkovsky, Christopher D. Manning and Eneko Agirre |
A Computational Perspective on the Romanian Dialects |
Alina Maria Ciobanu and Liviu P. Dinu |
A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues |
Barbara Konat, John Lawrence, Joonsuk Park, Katarzyna Budzynska and Chris Reed |
A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations |
Jonathon Read, Erik Velldal, Marc Cavazza and Gersende Georg |
A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives |
Zhichao Hu, Michelle Dick, Chung-Ning Chang, Kevin Bowden, Michael Neff, Jean Fox Tree and Marilyn Walker |
A Corpus of Images and Text in Online News |
Laura Hollink, Adriatik Bedjeti, Martin van Harmelen and Desmond Elliott |
A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds |
Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner and Manfred Pinkal |
A Corpus of Native, Non-native and Translated Texts |
Sergiu Nisioi, Ella Rabinovich, Liviu P. Dinu and Shuly Wintner |
A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation |
Robert Herms, Laura Seelig, Stefanie Münch and Maximilian Eibl |
A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults |
Victoria Yaneva, Irina Temnikova and Ruslan Mitkov |
A Corpus of Wikipedia Discussions: Over the Years, with Topic, Power and Gender Labels |
Vinodkumar Prabhakaran and Owen Rambow |
A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System |
Ajda Gokcen, Evan Jaffe, Johnsey Erdmann, Michael White and Douglas Danforth |
Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing |
Anna Feltracco, Simone Magnolini, Elisabetta Jezek and Bernardo Magnini |
A Crowdsourced Database of Event Sequence Descriptions for the Acquisition of High-quality Script Knowledge |
Lilian D. A. Wanzare, Alessandra Zarcone, Stefan Thater and Manfred Pinkal |
A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations |
Laurent Prévot, Jan Gorisch and Roxane Bertrand |
Adapting an Entity Centric Model for Portuguese Coreference Resolution |
Evandro Fonseca, Renata Vieira and Aline Vanin |
Adapting the TANL tool suite to Universal Dependencies |
Maria Simi and Giuseppe Attardi |
A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices |
Philipp Aichinger, Immer Roesner, Matthias Leonhard, Doris-Maria Denk-Linnert, Wolfgang Bigenzahn and Berit Schneider-Stickler |
A Dataset for Detecting Stance in Tweets |
Saif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiaodan Zhu and Colin Cherry |
A Dataset for Open Event Extraction in English |
Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret and Romaric Besançon |
Adding Semantic Relations to a Large-Coverage Connective Lexicon of German |
Tatjana Scheffler and Manfred Stede |
Addressing the MFS Bias in WSD systems |
Marten Postma, Ruben Izquierdo, Eneko Agirre, German Rigau and Piek Vossen |
A Dependency Treebank of the Chinese Buddhist Canon |
Tak-sum Wong and John Lee |
A Document Repository for Social Media and Speech Conversations |
Adam Funk, Robert Gaizauskas and Benoit Favre |
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research |
Emre Yilmaz, Mario Ganzeboom, Lilian Beijer, Catia Cucchiarini and Helmer Strik |
Affective Lexicon Creation for the Greek Language |
Elisavet Palogiannidi, Polychronis Koutsakis, Elias Iosif and Alexandros Potamianos |
A Finite-State Morphological Analyser for Sindhi |
Raveesh Motlani, Francis Tyers and Dipti Sharma |
A Finite-state Morphological Analyser for Tuvan |
Francis Tyers, Aziyana Bayyr-ool, Aelita Salchak and Jonathan Washington |
A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora |
Tanja Samardzic and Maja Miličević |
A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus |
Mauro Nicolao, Heidi Christensen, Stuart Cunningham, Phil Green and Thomas Hain |
A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic Resources |
Yoshihiko Hayashi |
AfriBooms: An Online Treebank for Afrikaans |
Liesbeth Augustinus, Peter Dirix, Daniel Van Niekerk, Ineke Schuurman, Vincent Vandeghinste, Frank Van Eynde and Gerhard Van Huyssteen |
Age and Gender Prediction on Health Forum Data |
Prasha Shrestha, Nicolas Rey-Villamizar, Farig Sadeque, Ted Pedersen, Steven Bethard and Thamar Solorio |
A General Framework for the Annotation of Causality Based on FrameNet |
Laure Vieu, Philippe Muller, Marie Candito and Marianne Djemaa |
A Gold Standard for Scalar Adjectives |
Bryan Wilkinson and Oates Tim |
A Hungarian Sentiment Corpus Manually Annotated at Aspect Level |
Martina Katalin Szabó, Veronika Vincze, Katalin Ilona Simkó, Viktor Varga and Viktor Hangya |
AIMU: Actionable Items for Meeting Understanding |
Yun-Nung Chen and Dilek Hakkani-Tur |
A Japanese Chess Commentary Corpus |
Shinsuke Mori, John Richardson, Atsushi Ushiku, Tetsuro Sasada, Hirotaka Kameko and Yoshimasa Tsuruoka |
A Language Independent Method for Generating Large Scale Polarity Lexicons |
Giuseppe Castellucci, Danilo Croce and Roberto Basili |
A Language Resource of German Errors Written by Children with Dyslexia |
Maria Rauschenberger, Luz Rello, Silke Füchsel and Jörg Thomaschewski |
A Large DataBase of Hypernymy Relations Extracted from the Web. |
Julian Seitner, Christian Bizer, Kai Eckert, Stefano Faralli, Robert Meusel, Heiko Paulheim and Simone Paolo Ponzetto |
A Large Rated Lexicon with French Medical Words |
Natalia Grabar and Thierry Hamon |
A Large Scale Corpus of Gulf Arabic |
Salam Khalifa, Nizar Habash, Dana Abdulrahim and Sara Hassan |
A Large-Scale Multilingual Disambiguation of Glosses |
José Camacho-Collados, Claudio Delli Bovi, Alessandro Raganato and Roberto Navigli |
A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research |
Jun Harashima, Michiaki Ariga, Kenta Murata and Masayuki Ioki |
A Lexical Resource for the Identification of Weak Words in German Specification Documents |
Jennifer Krisch, Melanie Dick, Ronny Jauch and Ulrich Heid |
A Lexical Resource of Hebrew Verb-Noun Multi-Word Expressions |
Chaya Liebeskind and Yaakov HaCohen-Kerner |
A lexicon of perception for the identification of synaesthetic metaphors in corpora |
Francesca Strik Lievers and Chu-Ren Huang |
A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research |
Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits Van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel and David van Leeuwen |
Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF |
Ouafae Nahli, Francesca Frontini, Monica Monachini, Fahad Khan, Arsalan Zarghili and Mustapha Khalfi |
ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas |
Martijn Wieling, Eva Sassolini, Sebastiana Cucurullo and Simonetta Montemagni |
A Machine Learning based Music Retrieval and Recommendation System |
Naziba Mostafa, Yan Wan, Unnayan Amitabh and Pascale Fung |
Ambiguity Diagnosis for Terms in Digital Humanities |
Béatrice Daille, Evelyne Jacquey, Gaël Lejeune, Luis Felipe Melo and Yannick Toussaint |
AMISCO: The Austrian German Multi-Sensor Corpus |
Hannes Pessentheiner, Thomas Pichler and Martin Hagmüller |
A Morphological Lexicon of Esperanto with Morpheme Frequencies |
Eckhard Bick |
A Multi-domain Corpus of Swedish Word Sense Annotation |
Richard Johansson, Yvonne Adesam, Gerlof Bouma and Karin Hedberg |
A Multi-Layered Annotated Corpus of Scientific Papers |
Beatriz Fisas, Francesco Ronzano and Horacio Saggion |
A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection |
Jérémy Ferrero, Frédéric Agnès, Laurent Besacier and Didier Schwab |
A Multilingual Predicate Matrix |
Maddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe and German Rigau |
A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety |
Mathieu Chollet, Torsten Wörtwein, Louis-Philippe Morency and Stefan Scherer |
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs |
Jackson Tolins, Kris Liu, Yingying Wang, Jean Fox Tree, Marilyn Walker and Michael Neff |
A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction |
Kalin Stefanov and Jonas Beskow |
Analysing Constraint Grammars with a SAT-solver |
Inari Listenmaa and Koen Claessen |
Analysis of English Spelling Errors in a Word-Typing Game |
Ryuichi Tachibana and Mamoru Komachi |
Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization |
Muhammad Humayoun and Hwanjo Yu |
Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests |
Takakazu Imada, Yusuke Inoue, Lei Chen, Syunya Doi, Tian Nie, Chen Zhao, Takehito Utsuro and Yasuhide Kawada |
An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text |
Eric Yeh, John Niekrasz, Dayne Freitag and Richard Rohwer |
An Annotated Corpus of Direct Speech |
John Lee and Chak Yan Yeung |
An Arabic-Moroccan Darija Code-Switched Corpus |
Younes Samih and Wolfgang Maier |
An Empirical Exploration of Moral Foundations Theory in Partisan News Sources |
Dean Fulgoni, Jordan Carpenter, Lyle Ungar and Daniel Preoţiuc-Pietro |
An Empirical Study of Arabic Formulaic Sequence Extraction Methods |
Ayman Alghamdi, Eric Atwell and Claire Brierley |
A Neural Lemmatizer for Bengali |
Abhisek Chakrabarty, Akshay Chaturvedi and Utpal Garain |
ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages |
Samira Shaikh, Kit Cho, Tomek Strzalkowski, Laurie Feldman, John Lien, Ting Liu and George Aaron Broadwell |
A New Integrated Open-source Morphological Analyzer for Hungarian |
Attila Novák, Borbála Siklósi and Csaba Oravecz |
An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation |
Peter Viszlay, Ján Staš, Tomáš Koctúr, Martin Lojka and Jozef Juhár |
An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes |
Kai Frederic Engelmann, Patrick Holthaus, Britta Wrede and Sebastian Wrede |
Annotating and Detecting Medical Events in Clinical Notes |
Prescott Klassen, Fei Xia and Meliha Yetisgen |
Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel |
Hardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper and Derek Ruths |
Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks |
Ines Rehbein, Merel Scholman and Vera Demberg |
Annotating Logical Forms for EHR Questions |
Kirk Roberts and Dina Demner-Fushman |
Annotating Named Entities in Consumer Health Questions |
Halil Kilicoglu, Asma Ben Abacha, Yassine Mrabet, Kirk Roberts, Laritza Rodriguez, Sonya Shooshan and Dina Demner-Fushman |
Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola |
Marco Stranisci, Cristina Bosco, Delia Irazú Hernández Farías and Viviana Patti |
Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles |
Alakananda Vempala and Eduardo Blanco |
Annotating Topic Development in Information Seeking Queries |
Marta Andersson, Adnan Ozturel and Silvia Pareti |
An Open Corpus for Named Entity Recognition in Historic Newspapers |
Clemens Neudecker |
A Novel Evaluation Method for Morphological Segmentation |
Javad Nouri and Roman Yangarber |
ANTUSD: A Large Chinese Sentiment Dictionary |
Shih-Ming Wang and Lun-Wei Ku |
AppDialogue: Multi-App Dialogues for Intelligent Assistants |
Ming Sun, Yun-Nung Chen, Zhenhao Hua, Yulian Tamres-Rudnicky, Arnab Dash and Alexander Rudnicky |
Applying Core Scientific Concepts to Context-Based Citation Recommendation |
Daniel Duma, Maria Liakata, Amanda Clare, James Ravenscroft and Ewan Klein |
Applying the Cognitive Machine Translation Evaluation Approach to Arabic |
Irina Temnikova, Wajdi Zaghouani, Stephan Vogel and Nizar Habash |
A Proposal for a Part-of-Speech Tagset for the Albanian Language |
Besim Kabashi and Thomas Proisl |
A Proposition Bank of Urdu |
Maaz Anwar, Riyaz Ahmad Bhat, Dipti Sharma, Ashwini Vaidya, Martha Palmer and Tafseer Ahmed Khan |
A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization |
Fajri Koto |
Arabic Corpora for Credibility Analysis |
Ayman Al Zaatari, Rim El Ballouli, Shady ELbassouni, Wassim El-Hajj, Hazem Hajj, Khaled Shaban, Nizar Habash and Emad Yahya |
Arabic to English Person Name Transliteration using Twitter |
Hamdy Mubarak and Ahmed Abdelali |
ArchiMob - A Corpus of Spoken Swiss German |
Tanja Samardzic, Yves Scherrer and Elvira Glaser |
A Reading Comprehension Corpus for Machine Translation Evaluation |
Carolina Scarton and Lucia Specia |
A Regional News Corpora for Contextualized Entity Discovery and Linking |
Adrian Brasoveanu, Lyndon J.B. Nixon, Albert Weichselbraun and Arno Scharl |
Argument Mining: the Bottleneck of Knowledge and Language Resources |
Patrick Saint-Dizier |
ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions |
Olga Uryupina, Ron Artstein, Antonella Bristot, Federica Cavicchio, Kepa Rodriguez and Massimo Poesio |
A Rule-based Shallow-transfer Machine Translation System for Scots and English |
Gavin Abercrombie |
A Semantically Compositional Annotation Scheme for Time Normalization |
Steven Bethard and Jonathan Parker |
A Semi-Supervised Approach for Gender Identification |
Juan Soler and Leo Wanner |
A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon |
Tibor Kiss, Francis Jeffry Pelletier, Halima Husic, Roman Nino Simunic and Johanna Marie Poppek |
A Sequence Model Approach to Relation Extraction in Portuguese |
Sandra Collovini, Gabriel Machado and Renata Vieira |
A Shared Task for Spoken CALL? |
Claudia Baur, Johanna Gerlach, Manny Rayner, Martin Russell and Helmer Strik |
A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza |
Xabier Sarasola, Eva Navas, David Tavarez, Daniel Erro, Ibon Saratxaga and Inma Hernaez |
ASPEC: Asian Scientific Paper Excerpt Corpus |
Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi and Hitoshi Isahara |
Aspect based Sentiment Analysis in Hindi: Resource Creation and Evaluation |
Md Shad Akhtar, Asif Ekbal and Pushpak Bhattacharyya |
Aspectual Flexibility Increases with Agentivity and Concreteness\\ A Computational Classification Experiment on Polysemous Verbs |
Ingrid Falk and Fabienne Martin |
Assessing the Potential of Metaphoricity of verbs using corpus data |
Marco Del Tredici and Nuria Bel |
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets |
Eduardo Coutinho, Florian Hönig, Yue Zhang, Simone Hantke, Anton Batliner, Elmar Nöth and Björn Schuller |
A Study of Reuse and Plagiarism in LREC papers |
Gil Francopoulo, Joseph Mariani and Patrick Paroubek |
A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers |
Carlos Valmaseda, Juan Martinez-Romo and Lourdes Araujo |
A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C) |
Franco Salvetti, John B. Lowe and James H. Martin |
A Taxonomy of Spanish Nouns, a Statistical Algorithm to Generate it and its Implementation in Open Source Code |
Rogelio Nazar and Irene Renau |
A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance |
Felix Burkhardt and Uwe D. Reichel |
A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability |
Elif Ahsen Acar, Deniz Zeyrek, Murathan Kurfalı and Cem Bozşahin |
A Turkish-German Code-Switching Corpus |
Özlem Çetinoğlu |
Automatically Generated Affective Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German Lemmas |
Maximilian Köper and Sabine Schulte im Walde |
Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech |
Imed Laaridh, Corinne Fredouille and Christine Meunier |
Automatic Biomedical Term Polysemy Detection |
Juan Antonio Lossio-Ventura, Clement Jonquet, Mathieu Roche and Maguelonne Teisseire |
Automatic Classification of Tweets for Analyzing Communication Behavior of Museums |
Nicolas Foucault and Antoine Courtin |
Automatic Construction of Discourse Corpora for Dialogue Translation |
Longyue Wang, Xiaojun Zhang, Zhaopeng Tu, Andy Way and Qun Liu |
Automatic Corpus Extension for Data-driven Natural Language Generation |
Elena Manishina, Bassam Jabaian, Stéphane Huet and Fabrice Lefevre |
Automatic Enrichment of WordNet with Common-Sense Knowledge |
Luigi Di Caro and Guido Boella |
Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions |
Daniela Beltrami, Laura Calzà, Gloria Gagliardi, Enrico Ghidoni, Norina Marcello, Rema Rossini Favretti and Fabio Tamburini |
Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs |
Daniel Couto-Vale, Stella Neumann and Paula Niemietz |
AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis |
Kevin El Haddad, Huseyin Cakmak, Stéphane Dupont and Thierry Dutoit |
A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character |
Jackson Tolins, Kris Liu, Michael Neff, Marilyn Walker and Jean Fox Tree |
A Web Tool for Building Parallel Corpora of Spoken and Sign Languages |
Alex Becker, Fabio Kepler and Sara Candeias |
Axolotl: a Web Accessible Parallel Corpus for Spanish-Nahuatl |
Ximena Gutierrez-Vasques, Gerardo Sierra and Isaac Hernandez Pompa |
B |
|
B2SG: a TOEFL-like Task for Portuguese |
Rodrigo Wilkens, Leonardo Zilio, Eduardo Ferreira and Aline Villavicencio |
BAS Speech Science Web Services - an Update of Current Developments |
Thomas Kisler, Uwe Reichel, Florian Schiel, Christoph Draxler, Bernhard Jackl and Nina Pörner |
Benchmarking Lexical Simplification Systems |
Gustavo Paetzold and Lucia Specia |
Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015 |
Johann Poignant, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau and Thomas Tamisier |
Best of Both Worlds: Making Word Sense Embeddings Interpretable |
Alexander Panchenko |
Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers |
Amal Htait, Sebastien Fournier and Patrice Bellot |
Bilingual Lexicon Extraction at the Morpheme Level Using Distributional Analysis |
Amir Hazem and Béatrice Daille |
Bootstrapping a Hybrid MT System to a New Language Pair |
João António Rodrigues, Nuno Rendeiro, Andreia Querido, Sanja Štajner and António Branco |
BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains |
Necati Cihan Camgöz, Ahmet Alp Kındıroğlu, Serpil Karabüklü, Meltem Kelepir, Ayşe Sumru Özsoy and Lale Akarun |
Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik |
Patrick Littell, David R. Mortensen, Kartik Goyal, Chris Dyer and Lori Levin |
Building A Case-based Semantic English-Chinese Parallel Treebank |
Huaxing Shi, Tiejun Zhao and Keh-Yih Su |
Building a Corpus of Errors and Quality in Machine Translation: Experiments on Error Impact |
Angela Costa, Rui Correia and Luisa Coheur |
Building a Dataset for Possessions Identification in Text |
Carmen Banea, Xi Chen and Rada Mihalcea |
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation |
Wajdi Zaghouani, Nizar Habash, Ossama Obeid, Behrang Mohit, Houda Bouamor and Kemal Oflazer |
Building Concept Graphs from Monolingual Dictionary Entries |
Gábor Recski |
Building Evaluation Datasets for Consumer-Oriented Information Retrieval |
Lorraine Goeuriot, Liadh Kelly, Guido Zuccon and Joao Palotti |
Building Language Resources for Exploring Autism Spectrum Disorders |
Julia Parish-Morris, Christopher Cieri, Mark Liberman, Leila Bateman, Emily Ferguson and Robert T. Schultz |
Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi |
Dipawesh Pawar, Mohammed Hasanuzzaman and Asif Ekbal |
Building the Macedonian-Croatian Parallel Corpus |
Ines Cebović and Marko Tadić |
BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology |
Neli Hateva, Petar Mitankin and Stoyan Mihov |
C |
|
C4Corpus: Multilingual Web-size Corpus with Free License |
Ivan Habernal, Omnia Zayed and Iryna Gurevych |
Can Topic Modelling benefit from Word Sense Information? |
Adriana Ferrugento, Hugo Gonçalo Oliveira, Ana Alves and Filipe Rodrigues |
Can Tweets Predict TV Ratings? |
Bridget Sommerdijk, Eric Sanders and Antal van den Bosch |
Capturing Chat: Annotation and Tools for Multiparty Casual Conversation. |
Emer Gilmartin and Nick Campbell |
CASSAurus: A Resource of Simpler Spanish Synonyms |
Ricardo Baeza-Yates, Luz Rello and Julia Dembowski |
CATaLog Online: Porting a Post-editing Tool to the Web |
Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela and Josef van Genabith |
CEPLEXicon ― A Lexicon of Child European Portuguese |
Ana Lúcia Santos, Maria João Freitas and Aida Cardoso |
Challenges and Solutions for Consistent Annotation of Vietnamese Treebank |
Quy Nguyen, Yusuke Miyao, Ha Le and Ngan Nguyen |
Challenges of Adjective Mapping between plWordNet and Princeton WordNet |
Ewa Rudnicka, Wojciech Witkowski and Katarzyna Podlaska |
Challenges of Evaluating Sentiment Analysis Tools on Social Media |
Diana Maynard and Kalina Bontcheva |
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project |
Guntis Barzdins, Steve Renals and Didzis Gosko |
Chatbot Technology with Synthetic Voices in the Acquisition of an Endangered Language: Motivation, Development and Evaluation of a Platform for Irish |
Neasa Ní Chiaráin and Ailbhe Ní Chasaide |
CHATR the Corpus; a 20-year-old archive of Concatenative Speech Synthesis |
Nick Campbell |
CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese |
Rita de Carvalho, Andreia Querido, Marisa Campos, Rita Valadas Pereira, João Silva and António Branco |
CirdoX: an on/off-line multisource speech and sound analysis software |
Frédéric Aman, Michel Vacher, François Portet, William Duclot and Benjamin Lecouteux |
CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence |
Alessia Barbagli, Pietro Lucisano, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi |
CLARIAH in the Netherlands |
Jan Odijk |
CLARIN-EL Web-based Annotation Tool |
Ioannis Manousos Katakis, Georgios Petasis and Vangelis Karkaletsis |
Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus |
SoHyun Park, Afsaneh Fazly, Annie Lee, Brandon Seibel, Wenjie Zi and Paul Cook |
CodE Alltag: A German-Language E-Mail Corpus |
Ulrike Krieg-Holz, Christian Schuschnig, Franz Matthies, Benjamin Redling and Udo Hahn |
Cognitively Motivated Distributional Representations of Meaning |
Elias Iosif, Spiros Georgiladakis and Alexandros Potamianos |
Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish |
Andre Quispersaravia and Walter Perez and Marco Sobrevilla and Fernando Alva-Manchengo |
Cohere: A Toolkit for Local Coherence |
Karin Sim Smith, Wilker Aziz and Lucia Specia |
Collecting Language Resources for the Latvian e-Government Machine Translation Platform |
Roberts Rozis, Andrejs Vasiļjevs and Raivis Skadiņš |
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof |
Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese and Uriel Pascal Elingui |
Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis |
Sandrine Brognaux, Thomas Francois and Marco Saerens |
Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low German |
Maria Sukhareva and Christian Chiarcos |
Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data |
Éva Mújdricza-Maydt, Silvana Hartmann, Iryna Gurevych and Anette Frank |
CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws |
Roland Schäfer |
Comparing Speech and Text Classification on ICNALE |
Sergiu Nisioi |
Comparing the Level of Code-Switching in Corpora |
Björn Gambäck and Amitava Das |
Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus |
Yoshiko Arimoto and Kazuo Okanoya |
Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts |
Anne-Kathrin Schumann and Stefan Fischer |
Compilation of an Arabic Childrens Corpus |
Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell and Ayman Alghamdi |
Complementarity, F-score, and NLP Evaluation |
Leon Derczynski |
Comprehensive and Consistent PropBank Light Verb Annotation |
Claire Bonial and Martha Palmer |
Concepticon: A Resource for the Linking of Concept Lists |
Johann-Mattis List, Michael Cysouw and Robert Forkel |
Constraint-Based Bilingual Lexicon Induction for Closely Related Languages |
Arbi Haza Nasution, Yohei Murakami and Toru Ishida |
Constructing a Norwegian Academic Wordlist |
Janne M Johannessen, Arash Saidi and Kristin Hagen |
Construction and Analysis of a Large Vietnamese Text Corpus |
Dieu-Thu Le and Uwe Quasthoff |
Construction of an English Dependency Corpus incorporating Compound Function Words |
Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto |
Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition |
Nurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura and Kazuhiro Nakadai |
Context-enhanced Adaptive Entity Linking |
Filip Ilievski, Giuseppe Rizzo, Marieke van Erp, Julien Plu and Raphael Troncy |
Controlled Propagation of Concept Annotations in Textual Corpora |
Cyril Grouin |
Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential Communication |
Kris Liu, Jean Fox Tree and Marilyn Walker |
Coreference Annotation Scheme and Relation Types for Hindi |
Vandan Mujadia, Palash Gupta and Dipti Misra Sharma |
Coreference in Prague Czech-English Dependency Treebank |
Anna Nedoluzhko, Michal Novák, Silvie Cinkova, Marie Mikulová and Jiří Mírovský |
CORILSE: a Spanish Sign Language Repository for Linguistic Analysis |
María del Carmen Cabeza-Pereiro, José Mª García-Miguel, Carmen García Mateo and José Luis Alba Castro |
Corpora for Learning the Mutual Relationship between Semantic Relatedness and Textual Entailment |
Ngoc Phuoc An Vo and Octavian Popescu |
Corpus Analysis based on Structural Phenomena in Texts: Exploiting TEI Encoding for Linguistic Research |
Susanne Haaf |
Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology |
Marianne Djemaa, Marie Candito, Philippe Muller and Laure Vieu |
Corpus-Based Diacritic Restoration for South Slavic Languages |
Nikola Ljubešić, Tomaž Erjavec and Darja Fišer |
Corpus for Childrens Writing with Enhanced Output for Specific Spelling Patterns (2nd and 3rd Grade) |
Kay Berkling |
Corpus for Customer Purchase Behavior Prediction in Social Media |
Shigeyuki Sakaki, Francine Chen, Mandy Korpusik and Yan-Ying Chen |
Corpus Query Lingua Franca (CQLF) |
Piotr Banski, Elena Frick and Andreas Witt |
Corpus Resources for Dispute Mediation Discourse |
Mathilde Janier and Chris Reed |
corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora |
Stephan Druskat, Volker Gast, Thomas Krause and Florian Zipser |
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene |
Nikola Ljubešić and Tomaž Erjavec |
Correcting Errors in a Treebank Based on Tree Mining |
Kanta Suzuki, Yoshihide Kato and Shigeki Matsubara |
CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech |
Tatiana Kachkovskaia, Daniil Kocharov, Pavel Skrelin and Nina Volskaya |
Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition? |
Maxim Sidorov, Alexander Schmitt, Eugene Semenkin and Wolfgang Minker |
Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility |
Anaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol and Delphine Battistelli |
Creating a General Russian Sentiment Lexicon |
Natalia Loukachevitch and Anatolii Levchik |
Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data |
Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani and Mohamed Al-Badrashiny |
Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing |
Manuel Burghardt, Daniel Granvogl and Christian Wolff |
Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification |
Dilafruz Amanova, Volha Petukhova and Dietrich Klakow |
Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory |
Bettina Klimek, Natanael Arndt, Sebastian Krause and Timotheus Arndt |
Creation of comparable corpora for English-{Urdu, Arabic, Persian} |
Murad Abouammoh, Kashif Shah and Ahmet Aker |
Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation |
Domagoj Alagić and Jan Šnajder |
Croatian Error-Annotated Corpus of Non-Professional Written Language |
Vanja Štefanec, Nikola Ljubešić and Jelena Kuvač Kraljević |
Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian |
Lauriane Aufrant, Guillaume Wisniewski and François Yvon |
Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms |
Guillaume Jacquet, Maud Ehrmann, Ralf Steinberger and Jaakko Väyrynen |
Cross-lingual RDF Thesauri Interlinking |
Tatiana Lesnikova, Jérôme David and Jérôme Euzenat |
Crossmodal Network-Based Distributional Semantic Models |
Elias Iosif and Alexandros Potamianos |
Cross-validating Image Description Datasets and Evaluation Metrics |
Josiah Wang and Robert Gaizauskas |
Crosswalking from CMDI to Dublin Core and MARC 21 |
Claus Zinn, Thorsten Trippel, Steve Kaminski and Emanuel Dima |
Crowdsourced Corpus with Entity Salience Annotations |
Milan Dojchinovski, Dinesh Reddy, Tomáš Kliegr, Tomas Vitvar and Harald Sack |
Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations |
Maria Sukhareva, Judith Eckle-Kohler, Ivan Habernal and Iryna Gurevych |
Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora |
Andrew Caines, Christian Bentz, Calbert Graham, Tim Polzehl and Paula Buttery |
Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus |
Simon Clematide, Lenz Furrer and Martin Volk |
Crowdsourcing Ontology Lexicons |
Bettina Lanser, Christina Unger and Philipp Cimiano |
Crowdsourcing Salient Information from News and Tweets |
Oana Inel, Tommaso Caselli and Lora Aroyo |
Curation of Dutch Regional Dictionaries |
Henk van den Heuvel, Eric Sanders and Nicoline van der Sijs |
C-WEP―Rich Annotated Collection of Writing Errors by Professionals |
Cerstin Mahlow |
Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar Checker |
Delyth Prys, Gruffudd Prys and Dewi Bryn Jones |
Czech Legal Text Treebank 1.0 |
Vincent Kríž, Barbora Hladka and Zdenka Uresova |
D |
|
DALILA: The Dialectal Arabic Linguistic Learning Assistant |
Salam Khalifa, Houda Bouamor and Nizar Habash |
DART: a Dataset of Arguments and their Relations on Twitter |
Tom Bosc, Elena Cabrio and Serena Villata |
Database of Mandarin Neighborhood Statistics |
Karl Neergaard, Hongzhi Xu and Chu-Ren Huang |
Data Formats and Management Strategies from the Perspective of Language Resource Producers ― Personal Diachronic and Social Synchronic Data Sharing ― |
Kazushi Ohya |
Data Management Plans and Data Centers |
Denise DiPersio, Christopher Cieri and Daniel Jaquette |
Datasets for Aspect-Based Sentiment Analysis in French |
Marianna Apidianaki, Xavier Tannier and Cécile Richart |
DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus |
Martin Brümmer, Milan Dojchinovski and Sebastian Hellmann |
Deep Learning of Audio and Language Features for Humor Prediction |
Dario Bertero and Pascale Fung |
Defining and Counting Phonological Classes in Cross-linguistic Segment Databases |
Dan Dediu and Scott Moisik |
DeQue: A Lexicon of Complex Prepositions and Conjunctions in French |
Carlos Ramisch, Alexis Nasr, André Valli and José Deulofeu |
Deriving Morphological Analyzers from Example Inflections |
Markus Forsberg and Mans Hulden |
Design and Development of the MERLIN Learner Corpus Platform |
Verena Lyding and Karin Schöne |
Designing A Long Lasting Linguistic Project: The Case Study of ASIt |
Maristella Agosti, Emanuele Di Buccio, Giorgio Maria Di Nunzio, Cecilia Poletto and Esther Rinke |
Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian |
Mārcis Pinnis, Askars Salimbajevs and Ilze Auzina |
Detecting Annotation Scheme Variation in Out-of-Domain Treebanks |
Yannick Versley and Julius Steen |
Detecting Expressions of Blame or Praise in Text |
Udochukwu Orizu and Yulan He |
Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties |
Alexandra Balahur and Hristo Tanev |
Detecting Optional Arguments of Verbs |
Andras Kornai, Dávid Márk Nemeskey and Gábor Recski |
Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language |
Yow-Ting Shiue and Hsin-Hsi Chen |
Detection of Major ASL Sign Types in Continuous Signing For ASL Recognition |
Polina Yanovich, Carol Neidle and Dimitris Metaxas |
Detection of Reformulations in Spoken French |
Natalia Grabar and Iris Eshkol-Taravela |
Developing a Dataset for Evaluating Approaches for Document Expansion with Images |
Debasis Ganguly, Iacer Calixto and Gareth Jones |
D(H)ante: A New Set of Tools for XIII Century Italian |
Angelo Basile and Federico Sangati |
Dialogue System Characterisation by Back-channelling Patterns Extracted from Dialogue Corpus |
Masashi Inoue and Hiroshi Ueno |
Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin |
Edoardo Maria Ponti and Marco Passarotti |
Discontinuous Verb Phrases in Parsing and Machine Translation of English and German |
Sharid Loáiciga and Kristina Gulordava |
Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus |
Nicholas Asher, Julie Hunter, Mathieu Morey, Benamara Farah and Stergos Afantenos |
Discovering Fuzzy Synsets from the Redundancy in Different Lexical-Semantic Resources |
Hugo Gonçalo Oliveira and Fábio Santos |
Discriminating Similar Languages: Evaluations and Explorations |
Cyril Goutte, Serge Léger, Shervin Malmasi and Marcos Zampieri |
Discriminative Analysis of Linguistic Features for Typological Study |
Hiroya Takamura, Ryo Nagata and Yoshifumi Kawasaki |
Distributional Thesauri for Information Retrieval and vice versa |
Vincent Claveau and Ewa Kijak |
Distribution of Valency Complements in Czech Complex Predicates: Between Verb and Noun |
Václava Kettnerová and Eduard Bejček |
Domain Adaptation for Named Entity Recognition Using CRFs |
Tian Tian, Marco Dinarelli, Isabelle Tellier and Pedro Dias Cardoso |
Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation |
Gorka Labaka, Iñaki Alegria and Kepa Sarasola |
Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia |
Liumingjing Xiao, Chong Ruan, An Yang, Junhao Zhang and Junfeng Hu |
Domain-Specific Corpus Expansion with Focused Webcrawling |
Steffen Remus and Chris Biemann |
DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining |
Mauro Dragoni, Andrea Tettamanzi and Célia da Costa Pereira |
DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in Context |
Rajendra Banjade and Vasile Rus |
DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter |
Julian Hough, Ye Tian, Laura de Ruiter, Simon Betz, Spyros Kousidis, David Schlangen and Jonathan Ginzburg |
E |
|
EasyTree: A Graphical Tool for Dependency Tree Annotation |
Alexa Little and Stephen Tratz |
Ecological Gestures for HRI: the GEE Corpus |
Maxence Girard-Rivier, Romain Magnani, Veronique Auberge, Yuko Sasa, Liliya Tsvetanova, Frederic Aman and Clarisse Bayol |
EDISON: Feature Extraction for NLP, Simplified |
Mark Sammons, Christos Christodoulopoulos, Parisa Kordjamshidi, Daniel Khashabi, Vivek Srikumar and Dan Roth |
Edit Categories and Editor Role Identification in Wikipedia |
Diyi Yang, Aaron Halfaker, Robert Kraut and Eduard Hovy |
Effect Functors for Opinion Inference |
Josef Ruppenhofer and Jasper Brandes |
Effects of Sampling on Twitter Trend Detection |
Andrew Yates, Alek Kolcz, Nazli Goharian and Ophir Frieder |
ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain |
Sergio Oramas, Luis Espinosa Anke, Mohamed Sordo, Horacio Saggion and Xavier Serra |
ELRA Activities and Services |
Khalid Choukri, Valérie Mapelli, Hélène Mazo and Vladimir Popescu |
Embedding Open-domain Common-sense Knowledge from Text |
Travis Goodwin and Sanda Harabagiu |
Emotion Analysis on Twitter: The Hidden Challenge |
Luca Dini and André Bittar |
Emotion Corpus Construction Based on Selection from Hashtags |
Minglei Li, Yunfei Long, Lu Qin and Wenjie Li |
EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis |
Jasy Suet Yan Liew, Howard R. Turtle and Elizabeth D. Liddy |
Encoding Adjective Scales for Fine-grained Resources |
Cédric Lopez, Frederique Segond and Christiane Fellbaum |
Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR |
Malgorzata Cavar, Damir Cavar and Hilaria Cruz |
EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis |
David Vilares, Miguel A. Alonso and Carlos Gómez-Rodríguez |
English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting |
Michael Carl, Akiko Aizawa and Masaru Yamada |
Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech |
Roberto Seara, Marta Martinez, Rocio Varela, Carmen García Mateo, Elisa Fernandez Rei and Xose Luis Regueira |
Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks |
Sebastian Schuster and Christopher D. Manning |
Enhancing Access to Online Education: Quality Machine Translation of MOOC Content |
Valia Kordoni, Antal van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx, Matthias Huck and Andy Way |
Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities |
Meritxell Fernández Barrera, Vladimir Popescu, Antonio Toral, Federico Gaspari and Khalid Choukri |
Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks |
Carole Lailler, Anaïs Landeau, Frédéric Béchet, Yannick Estève and Paul Deléglise |
Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary |
Alberto Simões, Xavier Gómez Guinovart and José João Almeida |
Enriching TimeBank: Towards a more precise annotation of temporal relations in a text |
Volker Gast, Lennart Bierkandt, Stephan Druskat and Christoph Rzymski |
Ensemble Classification of Grants using LDA-based Features |
Yannis Korkontzelos, Beverley Thomas, Makoto Miwa and Sophia Ananiadou |
Entity Linking with a Paraphrase Flavor |
Maria Pershina, Yifan He and Ralph Grishman |
Error Typology and Remediation Strategies for Requirements Written in English by Non-Native Speakers |
Marie Garnier and Patrick Saint-Dizier |
EstNLTK - NLP Toolkit for Estonian |
Siim Orasmaa, Timo Petmanson, Alexander Tkachenko, Sven Laur and Heiki-Jaan Kaalep |
Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies |
Kadri Muischnek, Kaili Müürisep and Tiina Puolakainen |
E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses |
Yuval Marton and Kristina Toutanova |
European Union Language Resources in Sketch Engine |
Vít Baisa, Jan Michelfeit, Marek Medveď and Milos Jakubicek |
Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing |
Hao Zhou, Yue Zhang, Shujian Huang, Xin-Yu Dai and Jiajun Chen |
Evaluating a Topic Modelling Approach to Measuring Corpus Similarity |
Richard Fothergill, Paul Cook and Timothy Baldwin |
Evaluating Context Selection Strategies to Build Emotive Vector Space Models |
Lucia C. Passaro and Alessandro Lenci |
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job |
Marieke van Erp, Pablo Mendes, Heiko Paulheim, Filip Ilievski, Julien Plu, Giuseppe Rizzo and Joerg Waitelonis |
Evaluating Interactive System Adaptation |
Edouard Geoffrois |
Evaluating Lexical Similarity to build Sentiment Similarity |
Grégoire Jadi, Vincent Claveau, Béatrice Daille and Laura Monceaux |
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource |
Anaïs Tack, Thomas Francois, Anne-Laure Ligozat and Cédrick Fairon |
Evaluating Machine Translation in a Usage Scenario |
Rosa Gaudio, Aljoscha Burchardt and António Branco |
Evaluating the Impact of Light Post-Editing on Usability |
Sheila Castilho and Sharon O'Brien |
Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene |
Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria and Mans Hulden |
Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities |
Victoria Yaneva, Irina Temnikova and Ruslan Mitkov |
Evaluating Translation Quality and CLIR Performance of Query Sessions |
Xabier Saralegi, Eneko Agirre and Iñaki Alegria |
Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource |
Stephan Tulkens, Chris Emmery and Walter Daelemans |
Evaluation of the KIT Lecture Translation System |
Markus Müller, Sarah Fünfer, Sebastian Stüker and Alex Waibel |
Evaluation Set for Slovak News Information Retrieval |
Daniel Hládek, Ján Staš and Jozef Juhár |
EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs |
Liu Hongchao, Karl Neergaard, Enrico Santus and Chu-Ren Huang |
Event Coreference Resolution with Multi-Pass Sieves |
Jing Lu and Vincent Ng |
Example-based Acquisition of Fine-grained Collocation Resources |
Sara Rodríguez-Fernández, Roberto Carlini, Luis Espinosa Anke and Leo Wanner |
Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic |
Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim and Mona Diab |
Exploitation of Co-reference in Distributional Semantics |
Dominik Schlechtweg |
Exploiting a Large Strongly Comparable Corpus |
Thierry Etchegoyhen, Andoni Azpeitia and Naiara Pérez |
Exploiting Arabic Diacritization for High Quality Automatic Annotation |
Nizar Habash, Anas Shahrour and Muhamed Al-Khalil |
Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics |
Dirk Hovy and Anders Johannsen |
Exploring the Realization of Irony in Twitter Data |
Cynthia Van Hee, Els Lefever and Veronique Hoste |
Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual Settings |
Yoshihiko Hayashi and Wentao Luo |
Extracting Interlinear Glossed Text from LaTeX Documents |
Mathias Schenner and Sebastian Nordhoff |
Extracting Structured Scholarly Information from the Machine Translation Literature |
Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight and Daniel Marcu |
Extracting Weighted Language Lexicons from Wikipedia |
Gregory Grefenstette |
Extractive Summarization under Strict Length Constraints |
Yashar Mehdad, Amanda Stent, Kapil Thadani, Dragomir Radev, Youssef Billawala and Karolina Buchner |
F |
|
FABIOLE, a Speech Database for Forensic Speaker Comparison |
Moez Ajili, Jean-françois Bonastre, Juliette Kahn, Solange Rossato and Guillaume Bernard |
Facilitating Metadata Interoperability in CLARIN-DK |
Lene Offersgaard and Dorte Haltrup Hansen |
Factuality Annotation and Learning in Spanish Texts |
Dina Wonsever, Aiala Rosá and Marisa Malcuori |
Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans |
Henk van den Heuvel and Nelleke Oostdijk |
Farasa: A New Fast and Accurate Arabic Word Segmenter |
Kareem Darwish and Hamdy Mubarak |
Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping |
Fahad Albogamy and Allan Ramsay |
Features for Generic Corpus Querying |
Thomas Eckart, Christoph Kuras and Uwe Quasthoff |
Filtering Wiktionary Triangles by Linear Mbetween Distributed Word Models |
Márton Makrai |
Finding Alternative Translations in a Large Corpus of Movie Subtitle |
Jörg Tiedemann |
Finding Definitions in Large Corpora with Sketch Engine |
Vojtěch Kovář, Monika Močiariková and Pavel Rychlý |
Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus |
Andy Luecking, Alexander Mehler, Désirée Walther, Marcel Mauri and Dennis Kurfürst |
Fine-Grained Chinese Discourse Relation Labelling |
Huan-Yuan Chen, Wan-Shan Liao, Hen-Hsen Huang and Hsin-Hsi Chen |
First Steps Towards Coverage-Based Sentence Alignment |
Luís Gomes and Gabriel Pereira Lopes |
FLAT: Constructing a CLARIN Compatible Home for Language Resources |
Menzo Windhouwer, Marc Kemps-Snijders, Paul Trilsbeek, André Moreira, Bas Van der Veen, Guilherme Silva and Daniel Von Reihn |
FlexTag: A Highly Flexible PoS Tagging Framework |
Torsten Zesch and Tobias Horsmann |
Focus Annotation of Task-based Data: A Comparison of Expert and Crowd-Sourced Annotation in a Reading Comprehension Corpus |
Kordula De Kuthy, Ramon Ziai and Detmar Meurers |
FOLK-Gold ― A Gold Standard for Part-of-Speech-Tagging of Spoken German |
Swantje Westpfahl and Thomas Schmidt |
Forecasting Emerging Trends from Scientific Literature |
Kartik Asooja, Georgeta Bordea, Gabriela Vulcu and Paul Buitelaar |
Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project |
Claudia Soria, Irene Russo, Valeria Quochi, Davyth Hicks, Antton Gurrutxaga, Anneli Sarhimaa and Matti Tuomisto |
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities |
Georg Rehm, Jan Hajic, Josef van Genabith and Andrejs Vasiļjevs |
FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies |
Milan Dojchinovski, Felix Sasaki, Tatjana Gornostaja, Sebastian Hellmann, Erik Mannens, Frank Salliau, Michele Osella, Phil Ritchie, Giannis Stoitsis, Kevin Koidl, Markus Ackermann and Nilesh Chakraborty |
French Learners Audio Corpus of German Speech (FLACGS) |
Jane Wottawa and Martine Adda-Decker |
From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse |
Ekaterina Lapshinova-Koltunski, Kerstin Anna Kunz and Anna Nedoluzhko |
Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments |
Rafiya Begum, Kalika Bali, Monojit Choudhury, Koustav Rudra and Niloy Ganguly |
G |
|
GATE-Time: Extraction of Temporal Expressions and Events |
Leon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood and Manuel Jung |
Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text |
Ravindra Harige and Paul Buitelaar |
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project |
Malgorzata Cavar, Damir Cavar, Dov-Ber Kerler and Anya Quilitzsch |
Generating Task-Pertinent sorted Error Lists for Speech Recognition |
Olivier Galibert, Mohamed Ameur Ben Jannet, Juliette Kahn and Sophie Rosset |
GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds |
Sabine Schulte im Walde, Anna Hätty, Stefan Bott and Nana Khvtisavrishvili |
Giving Lexical Resources a Second Life: Démonette, a Multi-sourced Morpho-semantic Network for French |
Nabil Hathout and Fiammetta Namer |
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA) |
Damir Cavar, Malgorzata Cavar and Lwin Moe |
Government Domain Named Entity Recognition for South African Languages |
Roald Eiselen |
Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study |
Silvie Cinkova, Ema Krejčová, Anna Vernerová and Vít Baisa |
Graph-Based Induction of Word Senses in Croatian |
Marko Bekavac and Jan Šnajder |
Graphical Annotation for Syntax-Semantics Mapping |
Koiti Hasida |
GRaSP: A Multilayered Annotation Scheme for Perspectives |
Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo and Piek Vossen |
Gulf Arabic Linguistic Resource Building for Sentiment Analysis |
Wafia Adouane and Richard Johansson |
H |
|
Happy Accident: A Sentiment Composition Lexicon for Opposing Polarity Phrases |
Svetlana Kiritchenko and Saif Mohammad |
Hard Time Parsing Questions: Building a QuestionBank for French |
Djamé Seddah and Marie Candito |
He Said She Said ― a Male/Female Corpus of Polish |
Filip Graliński, Łukasz Borchmann and Piotr Wierzchoń |
Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives |
Jens Edlund and Joakim Gustafson |
Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations |
Beatrice Alex, Clare Llewellyn, Claire Grover, Jon Oberlander and Richard Tobin |
How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News |
Imran Sheikh, Irina Illina and Dominique Fohr |
How does Dictionary Size Influence Performance of Vietnamese Word Segmentation? |
Wuying Liu and Lin Wang |
How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment |
Patrick Holthaus, Christian Leichsenring, Jasmin Bernotat, Viktor Richter, Marian Pohling, Birte Carlmeyer, Norman Köster, Sebastian Meyer zu Borgsen, René Zorn, Birte Schiffhauer, Kai Frederic Engelmann, Florian Lier, Simon Schulz, Philipp Cimiano, Friederike Eyssel, Thomas Hermann, Franz Kummert, David Schlangen, Sven Wachsmuth, Petra Wagner, Britta Wrede and Sebastian Wrede |
Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest |
Dragomir Radev, Amanda Stent, Joel Tetreault, Aasish Pappu, Aikaterini Iliakopoulou, Agustin Chanfreau, Paloma de Juan, Jordi Vallmitjana, Alejandro Jaimes, Rahul Jha and Robert Mankoff |
Hypergraph Modelization of a Syntactically Annotated English Wikipedia Dump |
Edmundo Pavel Soriano Morales, Julien Ah-Pine and Sabine Loudcher |
I |
|
Identification of Drug-Related Medical Conditions in Social Media |
François Morlane-Hondère, Cyril Grouin and Pierre Zweigenbaum |
Identifying Content Types of Messages Related to Open Source Software Projects |
Yannis Korkontzelos, Paul Thompson and Sophia Ananiadou |
If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers |
Zhiwei Yu, David Mareček, Zdeněk Žabokrtský and Daniel Zeman |
Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles |
Aitor Alvarez, Marina Balenciaga, Arantza del Pozo, Haritz Arzelus, Anna Matamala and Carlos-D. Martínez-Hinarejos |
Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models |
Amir Hazem and Emmanuel Morin |
Improving corpus search via parsing |
Natalia Klyueva and Pavel Straňák |
Improving Information Extraction from Wikipedia Texts using Basic English |
Teresa Rodriguez-Ferreira, Adrian Rabadan, Raquel Hervas and Alberto Diaz |
Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario |
Lena Keiper, Andrea Horbach and Stefan Thater |
Improving the Annotation of Sentence Specificity |
Junyi Jessy Li, Bridget O'Daniel, Yi Wu, Wenli Zhao and Ani Nenkova |
IMS HotCoref DE: A Data-driven Co-reference Resolver for German |
Ina Roesiger and Jonas Kuhn |
Inconsistency Detection in Semantic Annotation |
Nora Hollenstein, Nathan Schneider and Bonnie Webber |
Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level |
Marcos Garcia |
Information structure in the Potsdam Commentary Corpus: Topics |
Manfred Stede and Sara Mamprin |
InScript: Narrative texts annotated with script information |
Ashutosh Modi, Tatjana Anikina, Simon Ostermann and Manfred Pinkal |
Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMS |
Wejdene Khiari, Mathieu Roche and Asma Bouhafs Hafsia |
Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it |
Rob Abbott, Brian Ecker, Pranav Anand and Marilyn Walker |
Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface |
Talvany Carlotto, Zuhaitz Beloki, Xabier Artola and Aitor Soroa |
Introducing the Asian Language Treebank (ALT) |
Ye Kyaw Thu, Win Pa Pa, Masao Utiyama, Andrew Finch and Eiichiro Sumita |
Introducing the LCC Metaphor Datasets |
Michael Mohler, Mary Brunson, Bryan Rink and Marc Tomlinson |
Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis |
Marta Martinez, Rocio Varela, Carmen García Mateo, Elisa Fernandez Rei and Adela Martinez Calvo |
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification |
Simone Hantke, Erik Marchi and Björn Schuller |
IRIS: English-Irish Machine Translation System |
Mihael Arcan, Caoilfhionn Lane, Eoin Ó Droighneáin and Paul Buitelaar |
Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform |
Sharmin Muzaffar, Pitambar Behera and Girish Jha |
Italian VerbNet: A Construction-based Approach to Italian Verb Classification |
Lucia Busso and Alessandro Lenci |
L |
|
LanguageCrawl: A Generic Tool for Building Language Models Upon Common-Crawl |
Szymon Roziewski and Wojciech Stokowiec |
Language Resource Addition Strategies for Raw Text Parsing |
Atsushi Ushiku, Tetsuro Sasada and Shinsuke Mori |
Language Resource Citation: the ISLRN Dissemination and Further Developments |
Valérie Mapelli, Vladimir Popescu, Lin Liu and Khalid Choukri |
Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus |
Xuansong Li, Martha Palmer, Nianwen Xue, Lance Ramshaw, Mohamed Maamouri, Ann Bies, Kathryn Conger, Stephen Grimes and Stephanie Strassel |
Guidelines and Framework for a Large Scale Arabic Diacritized Corpus |
Wajdi Zaghouani, Houda Bouamor, Abdelati Hawwari, Mona Diab, Ossama Obeid, Mahmoud Ghoneim, Sawsan Alqahtani and Kemal Oflazer |
Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin |
Marco Passarotti, Berta González Saavedra and Christophe Onambele |
Laughter in French Spontaneous Conversational Dialogs |
Brigitte Bigi and Roxane Bertrand |
Learning from Within? Comparing PoS Tagging Approaches for Historical Text |
Sarah Schulz and Jonas Kuhn |
Learning Thesaurus Relations from Distributional Features |
Rosa Tsegaye Aga, Christian Wartena, Lucas Drumond and Lars Schmidt-Thieme |
Learning Tone and Attribution for Financial Text Mining |
Mahmoud El-Haj, Paul Rayson, Steve Young, Andrew Moore, Martin Walker, Thomas Schleicher and Vasiliki Athanasakou |
Legacy language atlas data mining: mapping Kru languages |
Dafydd Gibbon |
Legal Text Interpretation: Identifying Hohfeldian Relations from Text |
Wim Peters and Adam Wyner |
LELIO: An Auto-Adaptative System to Acquire Domain Lexical Knowledge in Technical Texts |
Patrick Saint-Dizier |
Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art |
Steffen Eger, Rüdiger Gleim and Alexander Mehler |
Leveraging Native Data to Correct Preposition Errors in Learners' Dutch |
Lennart Kloppenburg and Malvina Nissim |
Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries |
Marta Villegas, Maite Melero, Núria Bel and Jorge Gracia |
LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon |
Giulia Rambelli, Gianluca Lebani, Laurent Prévot and Alessandro Lenci |
Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages |
Scott Piao, Paul Rayson, Dawn Archer, Francesca Bianchi, Carmen Dayrell, Mahmoud El-Haj, Ricardo-María Jiménez, Dawn Knight, Michal Křen, Laura Löfberg, Rao Muhammad Adeel Nawab, Jawad Shafi, Phoey Lee Teh and Olga Mudraya |
Lexical Resources to Enrich English Malayalam Machine Translation |
Sreelekha S and Pushpak Bhattacharyya |
LibN3L:A Lightweight Package for Neural NLP |
Meishan Zhang, Jie Yang, Zhiyang Teng and Yue Zhang |
Linguistically Inspired Language Model Augmentation for MT |
George Tambouratzis and Vasiliki Pouli |
Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data |
Christian Chiarcos, Christian Fäth, Heike Renner-Westermann, Frank Abromeit and Vanya Dimitrova |
LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages |
Stephanie Strassel and Jennifer Tracey |
LREC as a Graph: People and Resources in a Network |
Riccardo Del Gratta, Francesca Frontini, Monica Monachini, Gabriella Pardelli, Irene Russo, Roberto Bartolini, Fahad Khan, Claudia Soria and Nicoletta Calzolari |
LVF-lemon ― Towards a Linked Data Representation of Les Verbes français |
Ingrid Falk and Achim Stein |
M |
|
MADAD: A Readability Annotation Tool for Arabic Text |
Nora Al-Twairesh, Abeer Al-Dayel, Hend Al-Khalifa, Maha Al-Yahya, Sinaa Alageel, Nora Abanmy and Nouf Al-Shenaifi |
Managing Linguistic and Terminological Variation in a Medical Dialogue System |
Leonardo Campillos Llanos, Dhouha Bouamor, Pierre Zweigenbaum and Sophie Rosset |
Manual and Automatic Paraphrases for MT Evaluation |
Aleš Tamchyna and Petra Barancikova |
Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information Transfer |
Balázs Indig, Márton Miháltz and András Simonyi |
Markov Logic Networks for Text Mining: A Qualitative and Empirical Comparison with Integer Linear Programming |
Luis Gerardo Mojica de la Vega and Vincent Ng |
MARMOT: A Toolkit for Translation Quality Estimation at the Word Level |
Varvara Logacheva, Chris Hokamp and Lucia Specia |
MarsaGram: an excursion in the forests of parsing trees |
Philippe Blache, Stéphane Rauzy and Grégoire Montcheuil |
MEANTIME, the NewsReader Multilingual Event and Time Corpus |
Anne-Lyse Minard, Manuela Speranza, Ruben Urizar, Begoña Altuna, Marieke van Erp, Anneleen Schoen and Chantal van Son |
Measuring Lexical Quality of a Historical Finnish Newspaper Collection ― Analysis of Garbled OCR Data with Basic Language Technology Tools and Means |
Kimmo Kettunen and Tuula Pääkkönen |
Medical Concept Embeddings via Labeled Background Corpora |
Eneldo Loza Mencía, Gerard de Melo and Jinseok Nam |
Merging Data Resources for Inflectional and Derivational Morphology in Czech |
Zdeněk Žabokrtský, Magda Sevcikova, Milan Straka, Jonáš Vidra and Adéla Limburská |
metaTED: a Corpus of Metadiscourse for Spoken Language |
Rui Correia, Nuno Mamede, Jorge Baptista and Maxine Eskenazi |
Metonymy Analysis Using Associative Relations between Words |
Takehiro Teraoka |
Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation |
Borja Navarro, María Ribes-Lafoz and Noelia Sánchez |
Mining the Spoken Wikipedia for Speech Data and Beyond |
Arne Köhn, Florian Stegen and Timo Baumann |
Mirroring Facial Expressions and Emotions in Dyadic Conversations |
Costanza Navarretta |
MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment |
Yu Yuan, Serge Sharoff and Bogdan Babych |
Modeling Language Change in Historical Corpora: The Case of Portuguese |
Marcos Zampieri, Shervin Malmasi and Mark Dras |
Modelling a Parallel Corpus of French and French Belgian Sign Language |
Laurence Meurant, Maxime Gobert and Anthony Cleve |
Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus |
Volha Petukhova, Christopher Stevens, Harmen de Weerd, Niels Taatgen, Fokie Cnossen and Andrei Malchanau |
Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge |
Elena Arsevska, Mathieu Roche, Sylvain Falala, Renaud Lancelot, David Chavernac, Pascal Hendrikx and Barbara Dufour |
Monolingual Social Media Datasets for Detecting Contradiction and Entailment |
Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva and Thierry Declerck |
More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing |
Pablo Ruiz, Clément Plancq and Thierry Poibeau |
Morphological Analysis of Sahidic Coptic for Automatic Glossing |
Daniel Smith and Mans Hulden |
Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic |
Faisal Al-Shargi, Aidan Kaplan, Ramy Eskander, Nizar Habash and Owen Rambow |
Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus |
James Ravenscroft, Anika Oellrich, Shyamasree Saha and Maria Liakata |
Multi-language Speech Collection for NIST LRE |
Karen Jones, Stephanie Strassel, Kevin Walker, David Graff and Jonathan Wright |
Multilevel Annotation of Agreement and Disagreement in Italian News Blogs |
Fabio Celli, Giuseppe Riccardi and Firoj Alam |
Multimodal Resources for Human-Robot Communication Modelling |
Stavroula―Evita Fotinea, Eleni Efthimiou, Maria Koutsombogera, Athanasia-Lida Dimou, Theodore Goulas and Kyriaki Vasilaki |
Multi-prototype Chinese Character Embedding |
Yanan Lu, Yue Zhang and Donghong Ji |
MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP |
Alexandre Berard, Christophe Servan, Olivier Pietquin and Laurent Besacier |
Multiword Expressions Dataset for Indian Languages |
Dhirendra Singh, Sudha Bhingardive and Pushpak Bhattacharya |
Multiword Expressions in Child Language |
Rodrigo Wilkens, Marco Idiart and Aline Villavicencio |
MWEs in Treebanks: From Survey to Guidelines |
Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary and Petya Osenova |
mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing |
Silvio Cordeiro, Carlos Ramisch and Aline Villavicencio |
N |
|
Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings |
Eda Okur, Hakan Demir and Arzucan Özgür |
Named Entity Resources - Overview and Outlook |
Maud Ehrmann, Damien Nouvel and Sophie Rosset |
Name Translation based on Fine-grained Named Entity Recognition in a Single Language |
Kugatsu Sadamitsu, Itsumi Saito, Taichi Katayama, Hisako Asano and Yoshihiro Matsuo |
Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora |
Hennie Brugman, Martin Reynaert, Nicoline van der Sijs, René van Stipriaan, Erik Tjong Kim Sang and Antal van den Bosch |
Neural Embedding Language Models in Semantic Clustering of Web Search Results |
Andrey Kutuzov and Elizaveta Kuzmenko |
Neural Scoring Function for MST Parser |
Jindřich Libovický |
New Developments in the LRE Map |
Vladimir Popescu, Lin Liu, Riccardo Del Gratta, Khalid Choukri and Nicoletta Calzolari |
New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian |
Nikola Ljubešić, Filip Klubička, Željko Agić and Ivo-Pavao Jazbec |
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification |
Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur and John Godfrey |
NileULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard Arabic |
Samhaa R. El-Beltagy |
Nine Features in a Random Forest to Learn Taxonomical Semantic Relations |
Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang |
NLP and Public Engagement: The Case of the Italian School Reform |
Tommaso Caselli, Giovanni Moretti, Rachele Sprugnoli, Sara Tonelli, Damien Lanfrey and Donatella Solda Kutzmann |
NLP Infrastructure for the Lithuanian Language |
Daiva Vitkutė-Adžgauskienė, Andrius Utka, Darius Amilevičius and Tomas Krilavičius |
NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models |
Frederico Tommasi Caroli, André Freitas, João Carlos Pereira da Silva and Siegfried Handschuh |
NorGramBank: A Deep Treebank for Norwegian |
Helge Dyvik, Paul Meurer, Victoria Rosén, Koenraad De Smedt, Petter Haugereid, Gyri Smørdal Losnegaard, Gunn Inger Lyse and Martha Thunes |
Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts |
Yong Xu and François Yvon |
O |
|
OCR Post-Correction Evaluation of Early Dutch Books Online - Revisited |
Martin Reynaert |
Odin's Runes: A Rule Language for Information Extraction |
Marco A. Valenzuela-Escárcega, Gus Hahn-Powell and Mihai Surdeanu |
Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View |
Achim Stein |
On Developing Resources for Patient-level Information Retrieval |
Stephen Wu, Tamara Timmons, Amy Yates, Meikun Wang, Steven Bedrick, William Hersh and Hongfang Liu |
On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities |
Mario Corrales-Astorgano, David Escudero-Mancebo, Yurena Gutiérrez-González, Valle Flores-Lucas, César González-Ferreras and Valentín Cardeñoso-Payo |
Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects |
David Lewis, Kaniz Fatema, Alfredo Maldonado, Brian Walshe and Arturo Calvo |
OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles |
Pierre Lison and Jörg Tiedemann |
Operational Assessment of Keyword Search on Oral History |
Elizabeth Salesky, Jessica Ray and Wade Shen |
OPFI: A Tool for Opinion Finding in Polish |
Aleksander Wawer |
Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces |
Matthias Sperber, Graham Neubig, Satoshi Nakamura and Alex Waibel |
Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility |
Andrea Fischer, Klara Jagrova, Irina Stenger, Tania Avgustinova, Dietrich Klakow and Roland Marti |
OSMAN ― A Novel Arabic Readability Metric |
Mahmoud El-Haj and Paul Rayson |
P |
|
Palabras: Crowdsourcing Transcriptions of L2 Speech |
Eric Sanders, Pepi Burgos, Catia Cucchiarini and Roeland van Hout |
Parallel Chinese-English Entities, Relations and Events Corpora |
Justin Mott, Ann Bies, Zhiyi Song and Stephanie Strassel |
Parallel Discourse Annotations on a Corpus of Short Texts |
Manfred Stede, Stergos Afantenos, Andreas Peldszus, Nicholas Asher and Jérémy Perret |
Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories |
Prokopis Prokopidis, Vassilis Papavassiliou and Stelios Piperidis |
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features |
Chenhui Chu, Raj Dabre and Sadao Kurohashi |
Parallel Speech Corpora of Japanese Dialects |
Koichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama and Hiroshi G. Okuno |
Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation |
Chenhui Chu and Sadao Kurohashi |
PARC 3.0: A Corpus of Attribution Relations |
Silvia Pareti |
PARSEME Survey on MWE Resources |
Gyri Smørdal Losnegaard, Federico Sangati, Carla Parra Escartín, Agata Savary, Sascha Bargmann and Johanna Monti |
Passing a USA National Bar Exam: a First Corpus for Experimentation |
Biralatei Fawei, Adam Wyner and Jeff Pan |
PE2rr Corpus: Manual Error Annotation of Automatically Pre-annotated MT Post-edits |
Maja Popović and Mihael Arcan |
PentoRef: A Corpus of Spoken References in Task-oriented Dialogues |
Sina Zarrieß, Julian Hough, Casey Kennington, Ramesh Manuvinakurike, David DeVault, Raquel Fernandez and David Schlangen |
Persian Proposition Bank |
Azadeh Mirzaei and Amirsaeid Moloodi |
PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs |
Stephanie Lukin, Kevin Bowden, Casey Barackman and Marilyn Walker |
Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech |
Daniil Kocharov |
Phonetic Inventory for an Arabic Speech Corpus |
Nawar Halabi and Mike Wald |
Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference. |
Jon Chamberlain, Massimo Poesio and Udo Kruschwitz |
Phrase Level Segmentation and Labelling of Machine Translation Errors |
Frédéric Blain, Varvara Logacheva and Lucia Specia |
Polarity Lexicon Building: to what Extent Is the Manual Effort Worth? |
Iñaki San Vicente and Xabier Saralegi |
Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis |
Agnieszka Wagner, Katarzyna Klessa and Jolanta Bachan |
Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions |
Liesbeth Augustinus, Vincent Vandeghinste and Tom Vanallemeersch |
Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLP |
Cristina Mota, Paula Carvalho and Anabela Barreiro |
POS-tagging of Historical Dutch |
Dieuwke Hupkes and Rens Bod |
PotTS: The Potsdam Twitter Sentiment Corpus |
Uladzimir Sidarenka |
Predicting Author Age from Weibo Microblog Posts |
Wanru Zhang, Andrew Caines, Dimitrios Alikaniotis and Paula Buttery |
Predictive Modeling: Guessing the NLP Terms of Tomorrow |
Gil Francopoulo, Joseph Mariani and Patrick Paroubek |
PreMOn: a Lemon Extension for Exposing Predicate Models as Linked Data |
Francesco Corcoglioniti, Marco Rospocher, Alessio Palmero Aprosio and Sara Tonelli |
Privacy Issues in Online Machine Translation Services - European Perspective |
Pawel Kamocki and Jim O'Regan |
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair |
Nikola Ljubešić, Miquel Esplà-Gomis, Antonio Toral, Sergio Ortiz Rojas and Filip Klubička |
PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors |
Gözde Özbal, Carlo Strapparava and Serra Sinem Tekiroglu |
ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool |
Xiaofeng Wu, Jinhua Du, Qun Liu and Andy Way |
PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation |
Liane Guillou and Christian Hardmeier |
Providing a Catalogue of Language Resources for Commercial Users |
Bente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen and Philippe Wacker |
Publishing the Trove Newspaper Corpus |
Steve Cassidy |
Punctuation Prediction for Unsegmented Transcript Based on Word Vector |
Xiaoyin Che, Cheng Wang, Haojin Yang and Christoph Meinel |
Purely Corpus-based Automatic Conversation Authoring |
Guillaume Dubuisson Duplessis, Vincent Letard, Anne-Laure Ligozat and Sophie Rosset |
Q |
|
QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages |
Arantxa Otegi, Nora Aranberri, António Branco, Jan Hajic, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, João Silva and Steven Neale |
Quality Assessment of the Reuters Vol. 2 Multilingual Corpus |
Robin Eriksson |
Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations |
Ichiro Umata, Koki Ijuin, Mitsuru Ishida, Moe Takeuchi and Seiichi Yamamoto |
QUEMDISSE? Reported speech in Portuguese |
Cláudia Freitas, Bianca Freitas and Diana Santos |
Question-Answering with Logic Specific to Video Games |
Corentin Dumont, Ran Tian and Kentaro Inui |
R |
|
RankDCG: Rank-Ordering Evaluation Measure |
Denys Katerenchuk and Andrew Rosenberg |
Rapid Development of Morphological Analyzers for Typologically Diverse Languages |
Seth Kulick and Ann Bies |
Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0 |
Zygmunt Vetulani, Grażyna Vetulani and Bartłomiej Kochanowski |
Refurbishing a Morphological Database for German |
Petra Steiner |
Relation- and Phrase-level Linking of FrameNet with Sar-graphs |
Aleksandra Gabryszak, Sebastian Krause, Leonhard Hennig, Feiyu Xu and Hans Uszkoreit |
Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset |
Vuk Batanović, Boško Nikolić and Milan Milosavljević |
Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages |
John Sylak-Glassman, Christo Kirov and David Yarowsky |
Resources for building applications with Dependency Minimal Recursion Semantics |
Ann Copestake, Guy Emerson, Michael Wayne Goodman, Matic Horvat, Alexander Kuhnle and Ewa Muszyńska |
Review on the Existing Language Resources for Languages of France |
Thibault Grouas, Valérie Mapelli and Quentin Samier |
Revisiting Summarization Evaluation for Scientific Articles |
Arman Cohan and Nazli Goharian |
Riddle Generation using Word Associations |
Paloma Galvan, Virginia Francisco, Raquel Hervas and Gonzalo Mendez |
Rude waiter but mouthwatering pastries! An exploratory study into Dutch Aspect-Based Sentiment Analysis |
Orphee De Clercq and Veronique Hoste |
Rule-based Automatic Multi-word Term Extraction and Lemmatization |
Ranka Stankovic, Cvetana Krstev, Ivan Obradovic, Biljana Lazic and Aleksandra Trtovac |
S |
|
SatiricLR: a Language Resource of Satirical News Articles |
Alice Frain and Sander Wubben |
SCALE: A Scalable Language Engineering Toolkit |
Joris Pelemans, Lyan Verwimp, Kris Demuynck, Hugo Van hamme and Patrick Wambacq |
SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German |
Mario Sänger, Ulf Leser, Steffen Kemmerer, Peter Adolphs and Roman Klinger |
SciCorp: A Corpus of English Scientific Articles Annotated for Information Status Analysis |
Ina Roesiger |
Searching in the Penn Discourse Treebank Using the PML-Tree Query |
Jiří Mírovský, Lucie Poláková and Jan Štěpánek |
Segmenting Hashtags using Automatically Created Training Data |
Arda Celebi and Arzucan Özgür |
Selection Criteria for Low Resource Language Programs |
Christopher Cieri, Mike Maxwell, Stephanie Strassel and Jennifer Tracey |
SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores |
Nabin Maharjan, Rajendra Banjade, Nobal Bikram Niraula and Vasile Rus |
Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature |
Kata Gábor, Haifa Zargayouna, Davide Buscaldi, Isabelle Tellier and Thierry Charnois |
Semantic Layer of the Valence Dictionary of Polish Walenty |
Elżbieta Hajnicz, Anna Andrzejczuk and Tomasz Bartosiak |
Semantic Links for Portuguese |
Fabricio Chalub, Livy Real, Alexandre Rademaker and Valeria de Paiva |
Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports |
Mathieu Lafourcade and Lionel Ramadier |
Semi-automatically Alignment of Predicates between Speech and OntoNotes data |
Niraj Shrestha and Marie-Francine Moens |
Semi-automatic Parsing for Web Knowledge Extraction through Semantic Annotation |
Maria Pia di Buono |
SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking |
Marie-Jean Meurs, Hayda Almeida, Ludovic Jean-Louis and Eric Charton |
SemRelData ― Multilingual Contextual Annotation of Semantic Relations between Nominals: Dataset and Guidelines |
Darina Benikova and Chris Biemann |
Sense-annotating a Lexical Substitution Data Set with Ubyline |
Tristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho and Iryna Gurevych |
Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization |
Şaziye Betül Özateş, Arzucan Özgür and Dragomir Radev |
Sentiframes: A Resource for Verb-centered German Sentiment Inference |
Manfred Klenner and Michael Amsler |
Sentiment Analysis in Social Networks through Topic modeling |
Debashis Naskar, Sidahmed Mokaddem, Miguel Rebollo and Eva Onaindia |
Sentiment Lexicons for Arabic Social Media |
Saif Mohammad, Mohammad Salameh and Svetlana Kiritchenko |
Sieve-based Coreference Resolution in the Biomedical Domain |
Dane Bell, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega and Mihai Surdeanu |
Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons |
Antoine Bourlon, Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi |
SlangNet: A WordNet like resource for English Slang |
Shehzaad Dhuliawala, Diptesh Kanojia and Pushpak Bhattacharyya |
Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction |
Dain Kaplan, Neil Rubens, Simone Teufel and Takenobu Tokunaga |
South African Language Resources: Phrase Chunking |
Roald Eiselen |
South African National Centre for Digital Language Resources |
Justus Roux |
SpaceRef: A corpus of street-level geographic descriptions |
Jana Götze and Johan Boye |
Spanish Word Vectors from Wikipedia |
Mathias Etcheverry and Dina Wonsever |
SPA: Web-based Platform for easy Access to Speech Processing Modules |
Fernando Batista, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eugénio Ribeiro, Helena Moniz, David Martins de Matos and Ricardo Ribeiro |
Specialising Paragraph Vectors for Text Polarity Detection |
Fabio Tamburini |
Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese |
Yurie Iribe, Norihide Kitaoka and Shuhei Segawa |
Speech Synthesis of Code-Mixed Text |
Sunayana Sitaram and Alan W Black |
Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context |
Félicien Vallet, Jim Uro, Jérémy Andriamakaoly, Hakim Nabi, Mathieu Derval and Jean Carrive |
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool |
Mohamed Al-Badrashiny, Arfath Pasha, Mona Diab, Nizar Habash, Owen Rambow, Wael Salloum and Ramy Eskander |
Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events |
Stephen Wu, Chung-Il Wi, Sunghwan Sohn, Hongfang Liu and Young Juhn |
Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation |
Navid Rekabsaz, Serwah Sabetghadam, Mihai Lupu, Linda Andersson and Allan Hanbury |
Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection |
Daniel Preoţiuc-Pietro, P. K. Srijith, Mark Hepple and Trevor Cohn |
SubCo: A Learner Translation Corpus of Human and Machine Subtitles |
José Manuel Martínez Martínez and Mihaela Vela |
Subtask Mining from Search Query Logs for How-Knowledge Acceleration |
Chung-Lun Kuo and Hsin-Hsi Chen |
Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations |
Morena Danieli, Balamurali A R, Evgeny Stepanov, Benoit Favre, Frederic Bechet and Giuseppe Riccardi |
Summ-it++: an Enriched Version of the Summ-it Corpus |
Evandro Fonseca, André Antonitsch, Sandra Collovini, Daniela Amaral, Renata Vieira and Anny Figueira |
SuperCAT: The (New and Improved) Corpus Analysis Toolkit |
K. Bretonnel Cohen, William A. Baumgartner Jr. and Irina Temnikova |
Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation |
Hanae Koiso, Tomoyuki Tsuchiya, Ryoko Watanabe, Daisuke Yokomori, Masao Aizawa and Yasuharu Den |
SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners |
Thomas Francois, Elena Volodina, Ildikó Pilán and Anaïs Tack |
SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies |
Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg and Monica Sandell |
Syllable based DNN-HMM Cantonese Speech to Text System |
Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu and Vincent T.Y. Ng |
SYN2015: Representative Corpus of Contemporary Written Czech |
Michal Křen, Václav Cvrček, Tomáš Čapka, Anna Čermáková, Milena Hnátková, Lucie Chlumská, Tomáš Jelínek, Dominika Kováříková, Vladimír Petkevič, Pavel Procházka, Hana Skoumalová, Michal Škrabal, Petr Truneček, Pavel Vondřička and Adrian Jan Zasina |
Synset Ranking of Hindi WordNet |
Sudha Bhingardive, Rajita Shukla, Jaya Saraswati, Laxmi Kashyap, Dhirendra Singh and Pushpak Bhattacharya |
Syntactic Analysis of Phrasal Compounds in Corpora: a Challenge for NLP Tools |
Carola Trips |
Syntax-based Multi-system Machine Translation |
Matīss Rikters and Inguna Skadina |
T |
|
TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns |
Kathrin Eichler, Feiyu Xu, Hans Uszkoreit, Leonhard Hennig and Sebastian Krause |
TEITOK: Text-Faithful Annotated Corpora |
Maarten Janssen |
Temporal Information Annotation: Crowd vs. Experts |
Tommaso Caselli, Rachele Sprugnoli and Oana Inel |
TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation |
Adrien Bougouin, Sabine Barreaux, Laurent Romary, Florian Boudin and Beatrice Daille |
TermoPL - a Flexible Tool for Terminology Extraction |
Malgorzata Marciniak, Agnieszka Mykowiecka and Piotr Rychlik |
Text Segmentation of Digitized Clinical Texts |
Cyril Grouin |
Tēzaurs.lv: the Largest Open Lexical Database for Latvian |
Andrejs Spektors, Ilze Auziņa, Roberts Darģis, Normunds Grūzītis, Pēteris Paikens, Lauma Pretkalniņa, Laura Rituma and Baiba Saulīte |
TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics |
Andy Luecking, Armin Hoenen and Alexander Mehler |
That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models |
Diptesh Kanojia, Aditya Joshi, Pushpak Bhattacharyya and Mark James Carman |
The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods |
Behrang QasemiZadeh and Anne-Kathrin Schumann |
The ACQDIV Database: Min(d)ing the Ambient Language |
Steven Moran |
The Alaskan Athabascan Grammar Database |
Sebastian Nordhoff, Siri Tuttle and Olga Lovick |
The BAS Speech Data Repository |
Uwe Reichel, Florian Schiel, Thomas Kisler, Christoph Draxler and Nina Pörner |
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents |
Johann Poignant, Mateusz Budnik, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau, Gilles Adda, Laurent Besacier, Hazim Ekenel, Gil Francopoulo, Javier Hernando, Joseph Mariani, Ramon Morros, Georges Quénot, Sophie Rosset and Thomas Tamisier |
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People |
Michel Vacher, Saïda Bouakaz, Marc-Eric Bobillier Chaumon, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato and Benjamin Lecouteux |
The COPLE2 corpus: a learner corpus for Portuguese |
Amália Mendes, Sandra Antunes, Maarten Janssen and Anabela Gonçalves |
The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions |
Joachim Daiber and Rob van der Goot |
The DialogBank |
Harry Bunt, Volha Petukhova, Andrei Malchanau, Kars Wijnhoven and Alex Fang |
The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics |
Ryuichiro Higashinaka, Kotaro Funakoshi, Yuka Kobayashi and Michimasa Inaba |
The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data. |
Miguel Matos, Alberto Abad and António Serralheiro |
The ELRA License Wizard |
Valérie Mapelli, Vladimir Popescu, Lin Liu, Meritxell Fernández Barrera and Khalid Choukri |
The Event and Implied Situation Ontology (ESO): Application and Evaluation |
Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau and Anne-Lyse Minard |
The Gavagai Living Lexicon |
Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan and Anders Holst |
The Hebrew FrameNet Project |
Avi Hayoun and Michael Elhadad |
The hunvec framework for NN-CRF-based sequential tagging |
Katalin Pajkossy and Attila Zséder |
The IFCASL Corpus of French and German Non-native and Native Read Speech |
Juergen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius and Frank Zimmerer |
The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus |
Akira Hayakawa, Saturnino Luz, Loredana Cerrato and Nick Campbell |
The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language |
Dan Tufiș, Verginica Barbu Mititelu, Elena Irimia, Ștefan Daniel Dumitrescu and Tiberiu Boroș |
The Language Application Grid and Galaxy |
Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen and Christopher Cieri |
The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources |
Georg Rehm |
The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation |
Jorge Proença, Dirce Celorico, Sara Candeias, Carla Lopes and Fernando Perdigão |
The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts |
Amy Isard |
The Negochat Corpus of Human-agent Negotiation Dialogues |
Vasily Konovalov, Ron Artstein, Oren Melamud and Ido Dagan |
The OFAI Multi-Modal Task Description Corpus |
Stephanie Schreitter and Brigitte Krenn |
The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015 |
Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama and Hugo Zaragoza |
The on-line version of Grammatical Dictionary of Polish |
Marcin Woliński and Witold Kieraś |
The OpenCourseWare Metadiscourse (OCWMD) Corpus |
Ghada Alharbi and Thomas Hain |
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud |
John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora and Jonathan Pool |
The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors |
Tilia Ellendorff, Simon Foster and Fabio Rinaldi |
The Public License Selector:
Making Open Licensing Easier |
Pawel Kamocki, Pavel Straňák and Michal Sedlák |
The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval |
Kira Griffitt and Stephanie Strassel |
The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes |
Phil Bartie, William Mackaness, Dimitra Gkatzia and Verena Rieser |
The Royal Society Corpus: From Uncharted Data to Corpus |
Hannah Kermes, Stefania Degaetano-Ortlieb, Ashraf Khamis, Jörg Knappen and Elke Teich |
The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine |
Mariana Neves, Antonio Jimeno Yepes and Aurélie Névéol |
The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories |
Bolette Pedersen, Anna Braasch, Anders Johannsen, Héctor Martínez Alonso, Sanni Nimb, Sussi Olsen, Anders Søgaard and Nicolai Hartvig Sørensen |
The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource |
Andrej Zgank, Mirjam Sepesy Maucec and Darinka Verdonik |
The SpeDial datasets: datasets for Spoken Dialogue Systems analytics |
José Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka, Elias Iosif and Alexandros Potamianos |
The Trials and Tribulations of Predicting Post-Editing Productivity |
Lena Marg |
The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles |
Christine Meunier, Cecile Fougeron, Corinne Fredouille, Brigitte Bigi, Lise Crevier-Buchman, Elisabeth Delais-Roussarie, Laurianne Georgeton, Alain Ghio, Imed Laaridh, Thierry Legou, Claire Pillot-Loiseau and Gilles Pouchoulin |
The United Nations Parallel Corpus v1.0 |
Michał Ziemski, Marcin Junczys-Dowmunt and Bruno Pouliquen |
The Universal Dependencies Treebank of Spoken Slovenian |
Kaja Dobrovoljc and Joakim Nivre |
The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis |
Beata Megyesi, Jesper Näsman and Anne Palmér |
The Validation of MRCPD Cross-language Expansions on Imageability Ratings |
Ting Liu, Kit Cho, Tomek Strzalkowski, Samira Shaikh and Mehrdad Mirzaei |
The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database |
Emiel van Miltenburg, Benjamin Timmermans and Lora Aroyo |
TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields |
Tim vor der Brück and Alexander Mehler |
Tools and Guidelines for Principled Machine Translation Development |
Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondrej Klejch, Martin Popel and Maja Popović |
Towards a Corpus of Violence Acts in Arabic Social Media |
Ayman Alhelbawy, Poesio Massimo and Udo Kruschwitz |
Towards a Language Service Infrastructure for Mobile Environments |
Ngoc Nguyen, Donghui Lin, Takao Nakaguchi and Toru Ishida |
Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse |
Artemis Parvizi, Matt Kohl, Meritxell Gonzàlez and Roser Saurí |
Towards a Multi-dimensional Taxonomy of Stories in Dialogue |
Kathryn J. Collins and David Traum |
Towards Automatic Identification of Effective Clues for Team Word-Guessing Games |
Eli Pincus and David Traum |
Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging |
Jochen Weiner, Claudia Frankenberg, Dominic Telaar, Britta Wendelstein, Johannes Schröder and Tanja Schultz |
Towards Building Semantic Role Labeler for Indian Languages |
Maaz Anwar and Dipti Sharma |
Towards Comparability of Linguistic Graph Banks for Semantic Parsing |
Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinkova, Dan Flickinger, Jan Hajic, Angelina Ivanova and Zdenka Uresova |
Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects |
Diana Bogantes, Eric Rodríguez, Alejandro Arauco, Alejandro Rodríguez and Agata Savary |
Towards Multiple Antecedent Coreference Resolution in Specialized Discourse |
Alicia Burga, Sergio Cajal, Joan Codina-Filba and Leo Wanner |
Towards producing bilingual lexica from monolingual corpora |
Jingyi Han and Núria Bel |
Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness |
Dane Bell, Daniel Fried, Luwen Huangfu, Mihai Surdeanu and Stephen Kobourov |
Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida |
Isabell Hubert, Antti Arppe, Jordan Lachler and Eddie Antonio Santos |
Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality |
Dhouha Bouamor, Leonardo Campillos Llanos, Anne-Laure Ligozat, Sophie Rosset and Pierre Zweigenbaum |
Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it? |
Shammur Absar Chowdhury, Evgeny Stepanov and Giuseppe Riccardi |
Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests |
Takuya Matsuzaki, Akira Fujita, Naoya Todo and Noriko H. Arai |
Trends in HLT Research: A Survey of LDC's Data Scholarship Program |
Denise DiPersio and Christopher Cieri |
TTS for Low Resource Languages: A Bangla Synthesizer |
Alexander Gutkin, Linne Ha, Martin Jansche, Knot Pipatsrisawat and Richard Sproat |
Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous |
Cristina Bosco, Mirko Lai, Viviana Patti and Daniela Virone |
TweetMT: A Parallel Microblog Corpus |
Iñaki San Vicente, Iñaki Alegria, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martinez Garcia, Antonio Toral, Arkaitz Zubiaga and Nora Aranberri |
TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling |
Ben Verhoeven, Walter Daelemans and Barbara Plank |
Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages |
Muhammad Imran, Prasenjit Mitra and Carlos Castillo |
Two Architectures for Parallel Processing of Huge Amounts of Text |
Mathijs Kattenberg, Zuhaitz Beloki, Aitor Soroa, Xabier Artola, Antske Fokkens, Paul Huygen and Kees Verstoep |
Two Decades of Terminology: European Framework Programmes Titles |
Gabriella Pardelli, Sara Goggi, Silvia Giannini and Stefania Biagioni |
Two Years of Aranea: Increasing Counts and Tuning the Pipeline |
Vladimír Benko |
Typed Entity and Relation Annotation on Computer Science Papers |
Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao and Akiko Aizawa |
Typology of Adjectives Benchmark for Compositional Distributional Models |
Daria Ryzhova, Maria Kyuseva and Denis Paperno |
U |
|
Ubuntu-fr: A Large and Open Corpus for Multi-modal Analysis of Online Written Conversations |
Nicolas Hernandez, Soufian Salim and Elizaveta Loginova Clouet |
UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing |
Milan Straka, Jan Hajic and Jana Straková |
UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines |
Udo Hahn, Franz Matthies, Erik Faessler and Johannes Hellrich |
Universal Dependencies for Japanese |
Takaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori and Yuji Matsumoto |
Universal Dependencies for Norwegian |
Lilja Øvrelid and Petter Hohle |
Universal Dependencies for Persian |
Mojgan Seraji, Filip Ginter and Joakim Nivre |
Universal Dependencies v1: A Multilingual Treebank Collection |
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty and Daniel Zeman |
Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages |
Stefan Ecker, Andrea Horbach and Stefan Thater |
UPPC - Urdu Paraphrase Plagiarism Corpus |
Muhammad Sharjeel, Paul Rayson and Rao Muhammad Adeel Nawab |
Urdu Summary Corpus |
Muhammad Humayoun, Rao Muhammad Adeel Nawab, Muhammad Uzair, Saba Aslam and Omer Farzand |
Use of Domain-Specific Language Resources in Machine Translation |
Sanja Štajner, Andreia Querido, Nuno Rendeiro, João António Rodrigues and António Branco |
User, who art thou? User Profiling for Oral Corpus Platforms |
Christian Fandrych, Elena Frick, Hanna Hedeland, Anna Iliash, Daniel Jettka, Cordula Meißner, Thomas Schmidt, Franziska Wallner, Kathrin Weigert and Swantje Westpfahl |
Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems |
Nikolaos Katris, Richard Sutcliffe and Theodore Kalamboukis |
Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data |
Julian Bleicken, Thomas Hanke, Uta Salden and Sven Wagner |
Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging Accuracy |
Mohamed Outahajala and Paolo Rosso |
Using BabelNet to Improve OOV Coverage in SMT |
Jinhua Du, Andy Way and Andrzej Zydron |
Using Contextual Information for Machine Translation Evaluation |
Marina Fomicheva and Núria Bel |
Using Data Mining Techniques for Sentiment Shifter Identification |
Samira Noferesti and Mehrnoush Shamsfard |
Using lexical and Dependency Features to Disambiguate Discourse Connectives in Hindi |
Rohit Jain, Himanshu Sharma and Dipti Sharma |
Using SMT for OCR Error Correction of Historical Texts |
Haithem Afli, Zhengwei Qiu, Andy Way and Páraic Sheridan |
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation |
Jeevanthi Liyanapathirana and Andrei Popescu-Belis |
Using Word Embeddings to Translate Named Entities |
Octavia-Maria Şulea, Sergiu Nisioi and Liviu P. Dinu |
Uzbek-English and Turkish-English Morpheme Alignment Corpora |
Xuansong Li, Jennifer Tracey, Stephen Grimes and Stephanie Strassel |
W |
|
WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words |
Luisa Bentivogli, Mauro Cettolo, M. Amin Farajian and Marcello Federico |
Web Chat Conversations from Contact Centers: a Descriptive Study |
Geraldine Damnati, Aleksandra Guerraz and Delphine Charlet |
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets |
Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang |
What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis |
Francesco Barbieri, Francesco Ronzano and Horacio Saggion |
Whats the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems |
Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple and Robert Gaizauskas |
Who was Pietro Badoglio? Towards a QA system for Italian History |
Stefano Menini, Rachele Sprugnoli and Antonio Uva |
|
|