|
PAPERS: Browse articles of the conference sorted by title
A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z
A |
|
A 500 Million Word POS-Tagged Icelandic Corpus |
Thomas Eckart, Erla Hallsteinsdóttir, Sigrún Helgadóttir, Uwe Quasthoff and Dirk Goldhahn |
A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology |
Johann-Mattis List and Jelena Prokić |
A Cascade Approach for Complex-type Classification |
Lauren Romeo, Sara Mendes and Núria Bel |
A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing JapaneseWord Vectors |
Akira Utsumi |
A Collection of Scholarly Book Reviews from the Platforms of Electronic Sources in Humanities and Social Sciences OpenEdition.org |
Chahinez Benkoussas, Hussam Hamdan, Patrice Bellot, Frédéric Béchet and Elodie Faath |
A Colloquial Corpus of Japanese Sign Language: Linguistic Resources for Observing Sign Language Conversations |
Mayumi Bono, Kouhei Kikuchi, Paul Cibulka and Yutaka Osugi |
A Compact Interactive Visualization of Dependency Treebank Query Results |
Chris Culy, Marco Passarotti and Ulla König-Cardanobile |
A Comparative Evaluation Methodology for NLG in Interactive Systems |
Helen Hastie and Anja Belz |
A Comparison of MT Errors and ESL Errors |
Homa B. Hashemi and Rebecca Hwa |
A Conventional Orthography for Tunisian Arabic |
Inès Zribi, Rahma Boujelbane, Abir Masmoudi, Mariem Ellouze, Lamia Belguith and Nizar Habash |
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition |
Abir Masmoudi, Mariem Ellouze Khmekhem, Yannick Esteve, Lamia Hadrich Belguith and Nizar Habash |
A Corpus of Comparisons in Product Reviews |
Wiltrud Kessler and Jonas Kuhn |
A Corpus of European Portuguese Child and Child-directed Speech |
Ana Lúcia Santos, Michel Généreux, Aida Cardoso, Celina Agostinho and Silvana Abalada |
A Corpus of Machine Translation Errors Extracted from Translation Students Exercises |
Guillaume Wisniewski, Natalie Kübler and François Yvon |
A Corpus of Participant Roles in Contentious Discussions |
Siddharth Jain, Archna Bhatia, Angelique Rein and Eduard Hovy |
A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation |
Eunah Cho, Sarah Fünfer, Sebastian Stüker and Alex Waibel |
A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence |
Bistra Andreeva, William Barry and Jacques Koreman |
A Crowdsourcing Smartphone Application for Swiss German: Putting Language Documentation in the Hands of the Users |
Jean-Philippe Goldman, Adrian Leeman, Marie-José Kolly, Ingrid Hove, Ibrahim Almajai, Volker Dellwo and Steven Moran |
A Database for Measuring Linguistic Information Content |
Richard Sproat, Bruno Cartoni, HyunJeong Choe, David Huynh, Linne Ha, Ravindran Rajakumar and Evelyn Wenzel-Grondie |
A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification |
Kay Berkling, Johanna Fay, Masood Ghayoomi, Katrin Hein, Rémi Lavalley, Ludwig Linhuber and Sebastian Stüker |
A Database of Full Body Virtual Interactions Annotated with Expressivity Scores |
Demulier Virginie, Elisabetta Bevacqua, Florian Focone, Tom Giraud, Pamela Carreno, Brice Isableu, Sylvie Gibet, Pierre De Loor and Jean-Claude Martin |
A Decade of HLT Agency Activities in the Low Countries: from Resource Maintenance (BLARK) to Service Offerings (BLAISE) |
Peter Spyns and Remco van Veenendaal |
A Deep Context Grammatical Model For Authorship Attribution |
Simon Fuller, Phil Maguire and Philippe Moser |
A Finite-State Morphological Analyzer for a Lakota HPSG Grammar |
Christian Curtis |
A Flexible Language Learning Platform Based on Language Resources and Web Services |
Elena Volodina, Ildikó Pilán, Lars Borin and Therese Lindström Tiedemann |
A Framework for Compiling High Quality Knowledge Resources From Raw Corpora |
Gongye Jin, Daisuke Kawahara and Sadao Kurohashi |
A Framework for Public Health Surveillance |
Andrew Yates, Jon Parker, Nazli Goharian and Ophir Frieder |
A German Twitter Snapshot |
Tatjana Scheffler |
A Gold Standard Dependency Corpus for English |
Natalia Silveira, Timothy Dozat, Marie-Catherine de Marneffe, Samuel Bowman, Miriam Connor, John Bauer and Chris Manning |
A Gold Standard for CLIR evaluation in the Organic Agriculture Domain |
Alessio Bosca, Matteo Casu, Matteo Dragoni and Nikolaos Marianos |
A Graph-Based Approach for Computing Free Word Associations |
Gemma Bel Enguix, Reinhard Rapp and Michael Zock |
A Hierarchical Taxonomy for Classifying Hardness of Inference Tasks |
Martin Gleize and Brigitte Grau |
A Hindi-English Code-Switching Corpus |
Anik Dey and Pascale Fung |
A Japanese Word Dependency Corpus |
Shinsuke Mori, Hideki Ogura and Tetsuro Sasada |
A Language-independent and fully Unsupervised Approach to Lexicon Induction and Part-of-Speech Tagging for Closely Related Languages |
Yves Scherrer and Benoît Sagot |
A Language-independent Approach to Extracting Derivational Relations from an Inflectional Lexicon |
Marion Baranes and Benoît Sagot |
A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words |
Nathan Hartmann, Lucas Avanço, Pedro Balage, Magali Duran, Maria das Graças Volpe Nunes, Thiago Pardo and Sandra Aluísio |
A Large Scale Database of Strongly-related Events in Japanese |
Tomohide Shibata, Shotaro Kohama and Sadao Kurohashi |
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation |
Violeta Seretan, Pierrette Bouillon and Johanna Gerlach |
A LDA-based Topic Classification Approach from highly Imperfect Automatic Transcriptions |
Mohamed Morchid, Richard Dufour and Georges Linares |
A Meta-data Driven Platform for Semi-automatic Configuration of Ontology Mediators |
Manuel Fiorelli, Maria Teresa Pazienza and Armando Stellato |
A Method for Building Burst-Annotated Co-Occurrence Networks for Analysing Trends in Textual Data |
Yutaka Mitsuishi, Vit Novacek and Pierre-Yves Vandenbussche |
A Model for Processing Illocutionary Structures and Argumentation in Debates |
Kasia Budzynska, Mathilde Janier, Chris Reed, Patrick Saint-Dizier, Manfred Stede and Olena yakorska |
A Model to Generate Adaptive Multimodal Job Interviews with a Virtual Recruiter |
Zoraida Callejas, Brian Ravenet, Magalie Ochs and Catherine Pelachaud |
A Modular System for Rule-based Text Categorisation |
Marco Del Tredici and Malvina Nissim |
A Multi-Cultural Repository of Automatically Discovered Linguistic and Conceptual Metaphors |
Samira Shaikh, Tomek Strzalkowski, Ting Liu, George Aaron Broadwell, Boris Yamrom, Sarah Taylor, Laurie Feldman, Kit Cho, Umit Boz, Ignacio Cases, Yuliya Peshkova and Ching-Sheng Lin |
A Multi-Dialect, Multi-Genre Corpus of Informal Written Arabic |
Ryan Cotterell and Chris Callison-Burch |
A Multidialectal Parallel Corpus of Arabic |
Houda Bouamor, Nizar Habash and Kemal Oflazer |
A Multimodal Corpus of Rapid Dialogue Games |
Maike Paetzel, David Nicolas Racca and David DeVault |
A Multimodal Dataset for Deception Detection |
Veronica Perez-Rosas, Rada Mihalcea, Alexis Narvaez and Mihai Burzo |
A Multimodal Interpreter for 3D Visualization and Animation of Verbal Concepts |
Coline Claude-Lachenaud, Eric Charton, Benoit Ozell and Michel Gagnon |
A New Form of Humor ― Mapping Constraint-Based Computational Morphologies to a Finite-State Representation |
Attila Novák |
A New Framework for Sign Language Recognition Based on 3D Handshape Identification and Linguistic Modeling |
Mark Dilsizian, Polina Yanovich, Shu Wang, Carol Neidle and Dimitris Metaxas |
A Persian Treebank with Stanford Typed Dependencies |
Mojgan Seraji, Carina Jahani, Beáta Megyesi and Joakim Nivre |
A Quality-based Active Sample Selection Strategy for Statistical Machine Translation |
Varvara Logacheva and Lucia Specia |
A Rank-based Distance Measure to Detect Polysemy and to Determine Salient Vector-Space Features for German Prepositions |
Maximilian Köper and Sabine Schulte im Walde |
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization |
Kai Hong, John Conroy, Benoit Favre, Alex Kulesza, Hui Lin and Ani Nenkova |
A Set of Open Source Tools for Turkish Natural Language Processing |
Çağrı Çöltekin |
A SICK Cure for the Evaluation of Compositional Distributional Semantic Models |
Marco Marelli, Stefano Menini, Marco Baroni, Luisa Bentivogli, Raffaella bernardi and Roberto Zamparelli |
A SKOS-based Schema for TEI encoded Dictionaries at ICLTT |
Thierry Declerck, Karlheinz Mörth and Eveline Wandl-Vogt |
A Stream Computing Approach Towards Scalable NLP |
Xabier Artola, Zuhaitz Beloki and Aitor Soroa |
A Study on Expert Sourcing Enterprise Question Collection and Classification |
Yuan Luo, Thomas Boucher, Tolga Oral, David Osofsky and Sara Weber |
A System for Experiments with Dependency Parsers |
Kiril Simov, Iliana Simova, Ginka Ivanova, Maria Mateva and Petya Osenova |
A Tagged Corpus and a Tagger for Urdu |
Bushra Jawaid, Amir Kamran and Ondrej Bojar |
A Tool Suite for Creating Question Answering Benchmarks |
Axel-Cyrille Ngonga Ngomo, Norman Heino, René Speck and Prodromos Malakasiotis |
A Toolkit for Efficient Learning of Lexical Units for Speech Recognition |
Matti Varjokallio and mikko kurimo |
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness |
Archna Bhatia, Mandy Simons, Lori Levin, Yulia Tsvetkov, Chris Dyer and Jordan Bender |
A Vector Space Model for Syntactic Distances Between Dialects |
Emanuele Di Buccio, Giorgio Maria Di Nunzio and Gianmaria Silvello |
A Wikipedia-based Corpus for Contextualized Machine Translation |
Jennifer Drexler, Pushpendre Rastogi, Jacqueline Aguilar, Benjamin Van Durme and Matt Post |
Access Control by Query Rewriting: the Case of KorAP |
Piotr Banski, Nils Diewald, Michael Hanl, Marc Kupietz and Andreas Witt |
Accommodations in Tuscany as Linked Data |
Clara Bacciu, Angelica Lo Duca, Andrea Marchetti and Maurizio Tesconi |
ACTIV-ES: a Comparable, Cross-Dialect Corpus of "everyday" Spanish from Argentina, Mexico, and Spain |
Jerid Francom, Mans Hulden and Adam Ussishkin |
Adapting a Part-of-Speech Tagset to Non-Standard Text: the Case of STTS |
Heike Zinsmeister, Ulrich Heid and Kathrin Beck |
Adapting Freely Available Resources to Build an Opinion Mining Pipeline in Portuguese |
Patrik Lambert and Carlos Rodriguez-Penagos |
Adapting VerbNet to French using Existing Resources |
Quentin Pradet, Laurence Danlos and Gaël de Chalendar |
Aggregation Methods for Efficient Collocation Detection |
Anca Dinu, Liviu Dinu and Ionut Sorodoc |
Aix Map Task Corpus: the French Multimodal Corpus of Task-oriented Dialogue |
Jan Gorisch, Corine Astésano, Ellen Gurman Bard, Brigitte Bigi and Laurent Prévot |
Alert!... Calm Down, There is Nothing to Worry About. Warning and Soothing Speech Synthesis |
Milan Rusko, Sakhia Darjaa, Marian Trnka, Marian Ritomsky and Robert Sabo |
ALICO: a Multimodal Corpus for the Study of Active Listening |
Hendrik Buschmeier, Zofia Malisz, Joanna Skubisz, Marcin Wlodarczak, Ipke Wachsmuth, Stefan Kopp and Petra Wagner |
Aligning Parallel Texts with InterText |
Pavel Vondřička |
Aligning Predicate-Argument Structures for Paraphrase Fragment Extraction |
Michaela Regneri, Rui Wang and Manfred Pinkal |
All Fragments Count in Parser Evaluation |
Joost Bastings and Khalil Sima'an |
Amazigh Verb Conjugator |
Fadoua Ataa Allah and Siham Boulaknadel |
An Analysis of Ambiguity in Word Sense Annotations |
David Jurgens |
An Analysis of Older Users' Interactions with Spoken Dialogue Systems |
Jamie Bost and Johanna Moore |
An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis |
Eshrag Refaee and Verena Rieser |
An Efficient and User-friendly Tool for Machine Translation Quality Estimation |
Kashif Shah, Marco Turchi and Lucia Specia |
An Efficient Language Independent Toolkit for Complete Morphological Disambiguation |
László Laki and György Orosz |
An Effortless Way To Create Large-Scale Datasets For Famous Speakers |
François Salmon and Félicien Vallet |
An Evaluation of the Role of Statistical Measures and Frequency for MWE Identification |
Sandra Antunes and Amália Mendes |
An Exercise in Reuse of Resources: Adapting General Discourse Coreference Resolution for Detecting Lexical Chains in Patent Documentation |
Nadjet Bouayad-Agha, Alicia Burga, Gerard Casamayor, Joan Codina, Rogelio Nazar and Leo Wanner |
An Innovative World Language Centre : Challenges for the Use of Language Technology |
Auður Hauksdóttir |
An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus |
Lise Rebout and Phillippe Langlais |
An Open Source Part-of-Speech Tagger for Norwegian: Building on Existing Language Resources |
Cristina Sánchez Marco |
An Open-Source Heavily Multilingual Translation Graph Extracted from Wiktionaries and Parallel Corpora |
Valérie Hanoka and Benoît Sagot |
An Out-of-Domain Test Suite for Dependency Parsing of German |
Wolfgang Seeker and Jonas Kuhn |
ANCOR_Centre, a Large Free Spoken French Coreference Corpus: Description of the Resource and Reliability Measures |
Judith Muzerelle, Anaïs Lefeuvre, Emmanuel Schang, Jean-Yves Antoine, Aurore Pelletier, Denis Maurel, Iris Eshkol and Jeanne Villaneau |
Annotating Arguments: The NOMAD Collaborative Annotation Tool |
Georgios Petasis |
Annotating Clinical Events in Text Snippets for Phenotype Detection |
Prescott Klassen, Fei Xia, Lucy Vanderwende and Meliha Yetisgen |
Annotating Events in an Emotion Corpus |
Sophia Lee, Shoushan Li and Chu-Ren Huang |
Annotating Inter-Sentence Temporal Relations in Clinical Notes |
Jennifer D'Souza and Vincent Ng |
Annotating Question Decomposition on Complex Medical Questions |
Kirk Roberts, Kate Masterton, Marcelo Fiszman, Halil Kilicoglu and Dina Demner-Fushman |
Annotating Relation Mentions in Tabloid Press |
Hong Li, Sebastian Krause, Feiyu Xu, Hans Uszkoreit, Robert Hummel and Veselina Mironova |
Annotating Relations in Scientific Articles |
Adam Meyers, Giancarlo Lee, Angus Grieve-Smith, Yifan He and Harriet Taber |
Annotating the Focus of Negation in Japanese Text |
Suguru Matsuyoshi, Ryo Otsuki and Fumiyo Fukumoto |
Annotating the MASC Corpus with BabelNet |
Andrea Moro, Roberto Navigli, Francesco Maria Tucci and Rebecca J. Passonneau |
Annotation of Computer Science Papers for Semantic Relation Extrac-tion |
Yuka Tateisi, Yo Shidahara, Yusuke Miyao and Akiko Aizawa |
Annotation of Specialized Corpora using a Comprehensive Entity and Relation Scheme |
Louise Deleger, Anne-Laure Ligozat, Cyril Grouin, Pierre Zweigenbaum and Aurelie Neveol |
Annotation Pro + TGA: Automation of Speech Timing Analysis |
Katarzyna Klessa and Dafydd Gibbon |
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study |
Silvia Rodríguez Vázquez, Pierrette Bouillon and Anton Bolfing |
AraNLP: a Java-based Library for the Processing of Arabic Text. |
Maha Althobaiti, Udo Kruschwitz and Massimo Poesio |
ASR-based CALL Systems and Learner Speech Data: New Resources and Opportunities for Research and Development in Second Language Learning |
Catia Cucchiarini, Steve Bodnar, Bart Penning de Vries, Roeland Van Hout and Helmer Strik |
Assessment of Non-native Prosody for Spanish as L2 using Quantitative Scores and Perceptual Evaluation |
Valentín Cardeñoso-Payo, César González-Ferreras and David Escudero |
Augmenting English Adjective Senses with Supersenses |
Yulia Tsvetkov, Nathan Schneider, Dirk Hovy, Archna Bhatia, Manaal Faruqui and Chris Dyer |
AusTalk: an Audio-Visual Corpus of Australian English |
Dominique Estival, Steve Cassidy, Felicity Cox and Denis Burnham |
Author-Specific Sentiment Aggregation for Polarity Prediction of Reviews |
Subhabrata Mukherjee and Sachindra Joshi |
Automatic Acquisition of Urdu Nouns (along with Gender and Irregular Plurals)) |
Tafseer Ahmed Khan |
Automatic Annotation of Machine Translation Datasets with Binary Quality Judgements |
Marco Turchi and Matteo Negri |
Automatic Creation of WordNets from Parallel Corpora |
Antoni Oliver and Salvador Climent |
Automatic Detection of Other-Repetition Occurrences: Application to French Conversational Speech |
Brigitte Bigi, Roxane Bertrand and Mathilde Guardiola |
Automatic Error Detection Concerning the Definite and Indefinite Conjugation in the HunLearner Corpus |
Veronika Vincze, János Zsibrita, Péter Durst and Martina Katalin Szabó |
Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings |
Ting Liu, Kit Cho, G. Aaron Broadwell, Samira Shaikh, Tomek Strzalkowski, John Lien, Sarah Taylor, Laurie Feldman, Boris Yamrom, Nick Webb, Umit Boz, Ignacio Cases and Ching-Sheng Lin |
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Re-Ranking Feature |
Moritz Wittmann, Marion Weller and Sabine Schulte im Walde |
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish |
Thomas Lavergne, Gilles Adda, Martine Adda-Decker and Lori Lamel |
Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech |
Mohamed Elmahdy, Mark Hasegawa-Johnson and Eiman Mustafawi |
Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone |
Eduard Bejček, Kettnerová Václava and Marketa Lopatkova |
Automatic Methods for the Extension of a Bilingual Dictionary using Comparable Corpora |
Michael Rosner and Kurt Sultana |
Automatic Refinement of Syntactic Categories in Chinese Word Structures |
Jianqiang Ma |
Automatic Semantic Relation Extraction from Portuguese Texts |
Leonardo Sameshima Taba and Helena Caseli |
Automatically Enriching Spoken Corpora with Syntactic Information for Linguistic Studies |
Alexis Nasr, Frederic Bechet, Benoit Favre, Thierry Bazillon, Jose Deulofeu and Andre Valli |
B |
|
Basque Speecon-like and Basque SpeechDAT MDB-600: Speech Databases for the Development of ASR Technology for Basque |
Igor Odriozola, Inma Hernaez, María Inés Torres, Luis Javier Rodriguez-Fuentes, Mikel Penagarikano and Eva Navas |
Because Size Does Matter: The Hamburg Dependency Treebank |
Kilian A. Foth, Arne Köhn, Niels Beuck and Wolfgang Menzel |
Benchmarking of English-Hindi Parallel Corpora |
Jayendra Rakesh Yeka, Prasanth Kolachina and Dipti Misra Sharma |
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web |
Giuseppe Rizzo, Marieke van Erp and Raphaël Troncy |
Benchmarking Twitter Sentiment Analysis Tools |
Ahmed Abbasi, Ammar Hassan and Milan Dhar |
Bidirectionnal Converter Between Syntactic Annotations: from French Treebank Dependencies to PASSAGE Annotations, and back |
Munshi Asadullah, Patrick Paroubek and Anne Vilnat |
Bilingual dictionaries for all EU languages |
Ahmet Aker, Monica Paramita, Marcis Pinnis and Robert Gaizauskas |
Bilingual Dictionary Construction with Transliteration Filtering |
John Richardson, Toshiaki Nakazawa and Sadao Kurohashi |
Bilingual Dictionary Induction as an Optimization Problem |
Wushouer Mairidan, Toru Ishida, Donghui Lin and Katsutoshi Hirayama |
Billions of Parallel Words for Free: Building and Using the EU Bookshop Corpus |
Raivis Skadiņš, Jörg Tiedemann, Roberts Rozis and Daiga Deksne |
BiographyNet: Methodological Issues when NLP Supports Historical Research |
Antske Fokkens, Serge Ter Braake, Niels Ockeloen, Piek Vossen, Susan Legêne and Guus Schreiber |
Biomedical Entity Extraction using Machine-Learning Based Approaches |
Cyril Grouin |
Boosting Open Information Extraction with Noun-Based Relations |
Clarissa Xavier and Vera Lima |
Boosting the Creation of a Treebank |
Blanca Arias, Nuria Bel, Mercè Lorente, Montserrat Marimón, Alba Milà, Jorge Vivaldi, Muntsa Padró, Marina Fomicheva and Imanol Larrea |
Bootstrapping an Italian VerbNet: data-driven analysis of verb alternations |
Gianluca Lebani, Veronica Viola and Alessandro Lenci |
Bootstrapping Open-Source English-Bulgarian Computational Dictionary |
Krasimir Angelov |
Bootstrapping Term Extractors for Multiple Languages |
Ahmet Aker, Monica Paramita, Emma Barker and Robert Gaizauskas |
Bridging the Gap between Speech Technology and Natural Language Processing: An Evaluation Toolbox for Term Discovery Systems |
Bogdan Ludusan, Maarten Versteegh, Aren Jansen, Guillaume Gravier, Xuan-Nga Cao, Mark Johnson and Emmanuel Dupoux |
Bring vs. MTRoget: Evaluating Automatic Thesaurus Translation |
Lars Borin, Jens Allwood and Gerard de Melo |
Building a Corpus of Manually Revised Texts from Discourse Perspective |
Ryu Iida and Takenobu Tokunaga |
Building a Crisis Management Term Resource for Social Media: The Case of Floods and Protests |
Irina Temnikova, Andrea Varga and Dogan Biyikli |
Building a Database of Japanese Adjective Examples from Special Purpose Web Corpora |
Masaya Yamaguchi |
Building a Dataset for Summarization and Keyword Extraction from Emails |
Vanessa Loza, Shibamouli Lahiri, Rada Mihalcea and Po-Hsiang Lai |
Building a Dataset of Multilingual Cognates for the Romanian Lexicon |
Liviu Dinu and Alina Maria Ciobanu |
Building a Reference Lexicon for Countability in English |
Tibor Kiss, Francis Jeffry Pelletier and Tobias Stadtfeld |
Building and Modelling Multilingual Subjective Corpora |
Motaz Saad, David Langlois and Kamel Smaili |
Building Domain Specific Bilingual Dictionaries |
Lucas Hilgert, Lucelene Lopes, Artur Freitas, Renata Vieira, Denise Hogetop and Aline Vanin |
Building The Sense-Tagged Multilingual Parallel Corpus |
Shan Wang and Francis Bond |
Buy One Get One Free: Distant Annotation of Chinese Tense, Event Type and Modality |
Nianwen Xue and Yuchen Zhang |
C |
|
C-PhonoGenre: a 7-hour Corpus of 7 Speaking Styles in French: Relations between Situational Features and Prosodic Properties |
Jean-Philippe Goldman, Tea Prsir and Antoine Auchlin |
Can Crowdsourcing be used for Effective Annotation of Arabic? |
Wajdi Zaghouani and Kais Dukes |
Can Numerical Expressions Be Simpler? Implementation and Demonstration of a Numerical Simplification System for Spanish |
Susana Bautista and Horacio Saggion |
Can the Crowd be Controlled?: A Case Study on Crowd Sourcing and Automatic Validation of Completed Tasks based on User Modeling |
A.R Balamurali |
Casa De La Lhéngua: a Set of Language Resources and Natural Language Processing Tools for Mirandese |
José Pedro Ferreira, Cristiano Chesi, Daan Baldewijns, Fernando Miguel Pinto, Margarita Correia, Daniela Braga, Hyongsil Cho, Amadeu Ferreira and Miguel Dias |
caWaC - a Web Corpus of Catalan and its Application to Language Modeling and Machine Translation |
Nikola Ljubešić and Antonio Toral |
CFT13: a Resource for Research into the Post-editing Process |
Michael Carl, Mercedes Martínez García and Bartolomé Mesa-Lao |
Characterizing and Predicting Bursty Events: the Buzz Case Study on Twitter |
Mohamed Morchid, Georges Linares and Richard Dufour |
Chasing the Perfect Splitter: A Comparison of Different Compound Splitting Tools |
Carla Parra Escartín |
Choosing which to Use? A Study of Distributional Models for Nominal Lexical Semantic Classification |
Lauren Romeo, Gianluca Lebani, Núria Bel and Alessandro Lenci |
CIEMPIESS: A New Open-Sourced Mexican Spanish Radio Corpus |
Carlos Daniel Hernandez Mena and Abel Herrera Camacho |
CLARA: A New Generation of Researchers in Common Language Resources and Their Applications |
Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadina, Bolette Pedersen, Costanza Navarretta, Núria Bel, Krister Linden, Marketa Lopatkova, Jan Hajic, Gisle Andersen and Przemyslaw Lenkiewicz |
CLARIN-NL: Major Results |
Jan Odijk |
Classifying Inconsistencies in DBpedia Language Specific Chapters |
Elena Cabrio, Serena Villata and Fabien Gandon |
ClearTK 2.0: Design Patterns for Machine Learning in UIMA |
Steven Bethard, Philip Ogren and Lee Becker |
Clinical Data-Driven Probabilistic Graph Processing |
Travis Goodwin and Sanda Harabagiu |
CLIPS Stylometry Investigation (CSI) Corpus: a Dutch Corpus for the Detection of Age, Gender, Personality, Sentiment and Deception in Text |
Ben Verhoeven and Walter Daelemans |
Clustering of Multi-Word Named Entity Variants: Multilingual Evaluation |
Guillaume Jacquet, Maud Ehrmann and Ralf Steinberger |
Clustering Tweets using Wikipedia Concepts |
Guoyu Tang, Yunqing Xia, Weizhi Wang, Raymond Lau and Fang Zheng |
Co-clustering of Bilingual Datasets as a Mean for Assisting the Construction of Thematic Bilingual Comparable Corpora |
Guiyao Ke and Pierre-Francois Marteau |
Co-Training for Classification of Live or Studio Music Recordings |
Nicolas Auguin and Pascale Fung |
Collaboration in the Production of a Massively Multilingual Lexicon |
Martin Benjamin |
Collaboratively Annotating Multilingual Parallel Corpora in the Biomedical Domain―some MANTRAs |
Johannes Hellrich, Simon Clematide, Udo Hahn and Dietrich Rebholz-Schuhmann |
Collecting Natural SMS and Chat Conversations in Multiple Languages: The BOLT Phase 2 Corpus |
Zhiyi Song, Stephanie Strassel, Haejoong Lee, Kevin Walker, Jonathan Wright, Jennifer Garland, Dana Fore, Brian Gainor, Preston Cabe, Thomas Thomas, Brendan Callahan and Ann Sawyer |
Collection of a Simultaneous Translation Corpus for Comparative Analysis |
Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda and Satoshi Nakamura |
ColLex.en: Automatically Generating and Evaluating a Full-form Lexicon for English |
Tim vor der Brück, Alexander Mehler and Zahurul Islam |
Collocation or Free Combination? ― Applying Machine Translation Techniques to identify collocations in Japanese |
Lis Pereira, Elga Strafella and Yuji Matsumoto |
Combining Dependency Information and Generalization in a Pattern-based Approach to the Classification of Lexical-Semantic Relation Instances |
Silvia Necsulescu, Sara Mendes and Núria Bel |
Combining Elicited Imitation and Fluency Features for Oral Proficiency Measurement |
Deryle Lonsdale and Carl Christensen |
Comparative Analysis of Portuguese Named Entities Recognition Tools |
Daniela Amaral, Evandro Fonseca, Lucelene Lopes and Renata Vieira |
Comparative Analysis of Verbal Alignment in Human-Human and Human-Agent Interactions |
Sabrina Campano, Jessica Durand and Chloé Clavel |
Comparing Similarity Measures for Distributional Thesauri |
Muntsa Padró, Marco Idiart, Aline Villavicencio and Carlos Ramisch |
Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them |
Bruno Laranjeira, Viviane Moreira, Aline Villavicencio, Carlos Ramisch and Maria José Finatto |
Comparing Two Acquisition Systems for Automatically Building an English-Croatian Parallel Corpus from Multilingual Websites |
Miquel Esplà-Gomis, Filip Klubička, Nikola Ljubešić, Sergio Ortiz-Rojas, Vassilis Papavassiliou and Prokopis Prokopidis |
Comparison of Gender- and Speaker-adaptive Emotion Recognition |
Maxim Sidorov, Stefan Ultes and Alexander Schmitt |
Comparison of the Impact of Word Segmentation on Name Tagging for Chinese and Japanese |
Haibo Li, Masato Hagiwara, Qi Li and Heng Ji |
Compounds and Distributional Thesauri |
Olivier Ferret |
Comprehensive Annotation of Multiword Expressions in a Social Web Corpus |
Nathan Schneider, Spencer Onuffer, Nora Kazour, Emily Danchik, Michael T. Mordowanec, Henrietta Conrad and Noah A. Smith |
Computational Narratology: Extracting Tense Clusters from Narrative Texts |
Thomas Bögel, Jannik Strötgen and Michael Gertz |
Computer-aided Morphology Expansion for Old Swedish |
Yvonne Adesam, Malin Ahlberg, Peter Andersson, Gerlof Bouma, Markus Forsberg and Mans Hulden |
Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary |
Martin Jansche |
Conceptual Transfer: Using Local Classifiers for Transfer Selection |
Gregor Thurmair |
Constituency Parsing of Bulgarian: Word- vs Class-based Parsing |
Masood Ghayoomi, Kiril Simov and Petya Osenova |
Constructing a Chinese―Japanese Parallel Corpus from Wikipedia |
Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi |
Constructing a Corpus of Japanese Predicate Phrases for Synonym/Antonym Relations |
Tomoko Izumi, Tomohide Shibata, Hisako Asano, Yoshihiro Matsuo and Sadao Kurohashi |
Constructing and Exploiting an Automatically Annotated Resource of Legislative Texts |
Stefan Höfler and Kyoko Sugisaki |
Construction and Annotation of a French Folkstale Corpus |
Anne Garcia-Fernandez, Anne-Laure Ligozat and Anne Vilnat |
Construction of Diachronic Ontologies from People's Daily of Fifty Years |
Shaoda He, Xiaojun Zou, Liumingjing Xiao and Junfeng Hu |
Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank |
Masood Ghayoomi and Jonas Kuhn |
Coreference Resolution for Latvian |
Arturs Znotins and Peteris Paikens |
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis |
Carmen Garcia-Mateo, Antonio Cardenal, Xose Luis Regueira, Elisa Fernández Rei, Marta Martinez, Roberto Seara, Rocío Varela and Noemí Basanta |
CoRoLa ― The Reference Corpus of Contemporary Romanian Language |
Verginica Barbu Mititelu, Elena Irimia and Dan Tufiș |
Corpus and Evaluation of Handwriting Recognition of Historical Genealogical Records |
Patrick Schone, Heath Nielson and Mark Ward |
Corpus and Method for Identifying Citations in Non-Academic Text |
Yifan He and Adam Meyers |
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines |
Marta Sabou, Kalina Bontcheva, Leon Derczynski and Arno Scharl |
Corpus for Coreference Resolution on Scientific Papers |
Panot Chaimongkol, Akiko Aizawa and Yuka Tateisi |
Corpus of 19th-century Czech Texts: Problems and Solutions |
Karel Kučera and Martin Stluka |
Corpus-Based Computation of Reverse Associations |
Reinhard Rapp |
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie |
Rachel Bawden, Marie-Amélie Botalla, kim gerdes and Sylvain Kahane |
Correcting Errors in a New Gold Standard for Tagging Icelandic Text |
Sigrún Helgadóttir, Hrafn Loftsson and Eiríkur Rögnvaldsson |
Creating a Gold Standard Corpus for the Extraction of Chemistry-Disease Relations from Patent Texts |
Antje Schlaf, Claudia Bobach and Matthias Irmer |
Creating a Massively Parallel Bible Corpus |
Thomas Mayer and Michael Cysouw |
Creating and Using Large Monolingual Parallel Corpora for Sentential Paraphrase Generation |
Sander Wubben, Antal van den Bosch and Emiel Krahmer |
Creating Summarization Systems with SUMMA |
Horacio Saggion |
Creative Language Explorations through a high-Expressivity N-grams Query Language |
Carlo Strapparava, Lorenzo Gatti, Marco Guerini and Oliviero Stock |
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data |
Jena D. Hwang, Annie Zaenen and Martha Palmer |
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing |
Željko Agić, Daša Berović, Danijela Merkler and Marko Tadić |
Croatian Memories |
Arjan van Hessen, Franciska de Jong, Stef Scagliola and Tanja Petrovic |
CroDeriV: a New Resource for Processing Croatian Morphology |
Krešimir Šojat, Matea Srebačić, Marko Tadić and Tin Pavelić |
CROMER: a Tool for Cross-Document Event and Entity Coreference |
Christian Girardi, Manuela Speranza, Rachele Sprugnoli and Sara Tonelli |
Cross-Language Authorship Attribution |
Dasha Bogdanova and Angeliki Lazaridou |
Cross-Linguistic Annotation of Narrativity for English/French Verb Tense Disambiguation |
Cristina Grisot and Thomas Meyer |
Crowd-sourcing Evaluation of Automatically Acquired, Morphologically Related Word Groupings |
Claudia Borg and Albert Gatt |
Crowdsourcing and Annotating NER for Twitter #drift |
Hege Fromreide, Dirk Hovy and Anders Søgaard |
Crowdsourcing as a Preprocessing for Complex Semantic Annotation Tasks |
Héctor Martínez Alonso and Lauren Romeo |
Crowdsourcing for Evaluating Machine Translation Quality |
Shinsuke Goto, Donghui Lin and Toru Ishida |
Crowdsourcing for the Identification of Event Nominals: an Experiment |
Rachele Sprugnoli and Alessandro Lenci |
D |
|
Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers |
Stefania Degaetano-Ortlieb, Peter Fankhauser, Hannah Kermes, Ekaterina Lapshinova-Koltunski, Noam Ordan and Elke Teich |
DBpedia Domains: Augmenting DBpedia with Domain Information |
Gregor Titze, Volha Bryl, Cäcilia Zirn and Simone Paolo Ponzetto |
DCEP -Digital Corpus of the European Parliament |
Najeh Hajlaoui, David Kolovratnik, Jaakko Väyrynen, Ralf Steinberger and Daniel Varga |
Deep Syntax Annotation of the Sequoia French Treebank |
Marie Candito, Guy Perrier, Bruno Guillaume, Corentin Ribeyre, Karën Fort, Djamé Seddah and Eric de la Clergerie |
Definition Patterns for Predicative Terms in Specialized Lexical Resources |
Antonio San Martín and Marie-Claude L'Homme |
DeLex, a Freely-available, Large-scale and Linguistically Grounded Morphological Lexicon for German |
Benoît Sagot |
Dense Components in the Structure of WordNet |
Ahti Lohk, Kaarel Allik, Heili Orav and Leo Võhandu |
Dependency Parsing Representation Effects on the Accuracy of Semantic Applications - an Example of an Inflective Language |
Lauma Pretkalniņa, Artūrs Znotiņš, Laura Rituma and Didzis Goško |
DerivBase.hr: A High-Coverage Derivational Morphology Resource for Croatian |
Jan Šnajder |
Design and Development of an Online Computational Framework to Facilitate Language Comprehension Research on Indian Languages |
manjira sinha, Tirthankar Dasgupta and Anupam Basu |
Design and Development of an RDB Version of the Corpus of Spontaneous Japanese |
Hanae Koiso, Yasuharu Den, Ken'ya Nishikawa and Kikuo Maekawa |
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process |
Camille Fauth, Anne Bonneau, Frank Zimmerer, Juergen Trouvain, Bistra Andreeva, Vincent Colotte, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella and Bernd Möbius |
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing |
Noushin Rezapour Asheghi, Serge Sharoff and Katja Markert |
Designing the Latvian Speech Recognition Corpus |
Mārcis Pinnis, Ilze Auziņa and Kārlis Goba |
Detecting Document Structure in a Very Large Corpus of UK Financial Reports |
Mahmoud El-Haj, Paul Rayson, Steve Young and Martin Walker |
Detecting Subevent Structure for Event Coreference Resolution |
Jun Araki, Zhengzhong Liu, Eduard Hovy and Teruko Mitamura |
Developing a Framework for Describing Relations among Language Resources |
Penny Labropoulou, Christopher Cieri and Maria Gavrilidou |
Developing a French FrameNet: Methodology and First results |
Marie Candito, Pascal Amsili, Lucie Barque, Farah Benamara, Gaël de Chalendar, Marianne Djemaa, Pauline Haas, Richard Huyghe, Yvette Yannick Mathieu, Philippe Muller, Benoît Sagot and Laure Vieu |
Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development |
Mohamed Maamouri, Ann Bies, Seth Kulick, Michael Ciul, Nizar Habash and Ramy Eskander |
Developing Politeness Annotated Corpus of Hindi Blogs |
Ritesh Kumar |
Developing Text Resources for Ten South African Languages |
Roald Eiselen and Martin Puttkammer |
Development of a TV Broadcasts Speech Recognition System for Qatari Arabic |
Mohamed Elmahdy, Mark Hasegawa-Johnson and Eiman Mustafawi |
Digital Library 2.0: Source of Knowledge and Research Collaboration Platform |
Włodzimierz Gruszczyński and Maciej Ogrodniczuk |
DINASTI: Dialogues with a Negotiating Appointment Setting Interface |
Layla El Asri, Romain Laroche and Olivier Pietquin |
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing |
Ismail El Maarouf, Jane Bradbury, Vít Baisa and Patrick Hanks |
Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data |
Erik Faessler, Johannes Hellrich and Udo Hahn |
Discosuite - A Parser Test Suite for German Discontinuous Structures |
Wolfgang Maier, Miriam Kaeshammer, Peter Baumann and Sandra Kübler |
Discovering and Visualising Stories in News |
Marieke van Erp, Gleb Satyukov, Piek Vossen and Marit Nijsen |
Discovering Frames in Specialized Domains |
Marie-Claude L'Homme, Benoît Robichaud and Carlos Subirats Rüggeberg |
Discovering the Italian Literature: Interactive Access to Audio-indexed Text Resources |
Vincenzo Galatà, Alberto Benin, Piero Cosi, Giuseppe Riccardo Leone, Giulio Paci, Giacomo Sommavilla and Fabio Tesser |
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech |
George Christodoulides, Mathieu Avanzi and Jean-Philippe Goldman |
Distributed Distributional Similarities of Google Books over the Centuries |
Martin Riedl, Richard Steuer and Chris Biemann |
DiVE-Arabic: Gulf Arabic Dialogue in a Virtual Environment |
Andrew Gargett, Sam Hellmuth and Ghazi AlGethami |
Dual Subtitles as Parallel Corpora |
Shikun Zhang, Wang Ling and Chris Dyer |
DysList: An Annotated Resource of Dyslexic Errors |
Luz Rello, Ricardo Baeza-Yates and Joaquim Llisterri |
E |
|
Efficient Reuse of Structured and Unstructured Resources for Ontology Population |
Chetana Gavankar, Ashish Kulkarni and Ganesh Ramakrishnan |
El-WOZ: a Client-Server Wizard-of-Oz Interface |
Thomas Pellegrini, Vahid Hedayati and Angela Costa |
Eliciting and Annotating Uncertainty in Spoken Language |
Heather Pon-Barry, Stuart Shieber and Nicholas Longenbaugh |
ELRA's Consolidated Services for the HLT Community |
Victoria Arranz, Khalid Choukri, Valérie Mapelli and Hélène Mazo |
Emilya: Emotional Body Expression in Daily Actions Database |
Nesrine Fourati and Catherine Pelachaud |
EMOVO Corpus: an Italian Emotional Speech Database |
Giovanni Costantini, Iacopo Iaderola, Andrea Paoloni and Massimiliano Todisco |
Enabling Language Resources to Expose Translations as Linked Data on the Web |
Jorge Gracia, Elena Montiel-Ponsoda, Daniel Vila-Suero and Guadalupe Aguado-de-Cea |
Encompassing a Spectrum of LT Users in the CLARIN-DK Infrastructure |
Lina Henriksen, Dorte Haltrup Hansen, Bente Maegaard, Bolette Sandford Pedersen and Claus Povlsen |
English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling |
Sharid Loaiciga, Thomas Meyer and Andrei Popescu-Belis |
Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks |
Anthony Rousseau, Paul Deléglise and Yannick Estève |
Enriching ODIN |
Fei Xia, William Lewis, Michael Wayne Goodman, Joshua Crowgey and Emily M. Bender |
Enriching the "Senso Comune" Platform with Automatically Acquired Data |
Tommaso Caselli, Laure Vieu, Carlo Strapparava and Guido Vetere |
Enrichment of Bilingual Dictionary through News Stream Data |
Ajay Dubey, Parth Gupta, Vasudeva Varma and Paolo Rosso |
Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate |
Tobias Bocklet, Andreas Maier, Korbinian Riedhammer, Ulrich Eysholdt and Elmar Nöth |
Estimation of Speaking Style in Speech Corpora Focusing on Speech Transcriptionss |
Raymond Shen and Hideaki Kikuchi |
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition |
Mohamed Ben Jannet, Martine Adda-Decker, Olivier Galibert, Juliette Kahn and Sophie Rosset |
Etymological WordNet: Tracing the History of Words |
Gerard de Melo |
Euronews: a Multilingual Speech Corpus for ASR |
Roberto Gretter |
Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter |
Alain Couillault, Karën Fort, Gilles Adda and Hugues Mazancourt (de) |
Evaluating Improvised Hip Hop Lyrics - Challenges and Observations |
Karteek Addanki and Dekai Wu |
Evaluating Lemmatization Models for Machine-Assisted Corpus-Dictionary Linkage |
Kevin Black, Eric Ringger, Paul Felt, Kevin Seppi, Kristian Heal and Deryle Lonsdale |
Evaluating the Effects of Interactivity in a Post-Editing Workbench |
Nancy Underwood, Bartolomé Mesa-Lao, Mercedes García Martínez, Michael Carl, Vicent Alabau, Jesús González-Rubio, Luis A. Leiva, Germán Sanchis-Trilles, Daniel Ortíz-Martínez and Francisco Casacuberta |
Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory |
Clément de Groc and Xavier Tannier |
Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch |
Els Lefever, Marjan Van de Kauter and Veronique Hoste |
Evaluation of Different Strategies for Domain Adaptation in Opinion Mining |
Anne Garcia-Fernandez, Olivier Ferret and Marco Dinarelli |
Evaluation of Simple Distributional Compositional Operations on Longer Texts |
Tamara Polajnar, Laura Rimell and Stephen Clark |
Evaluation of Technology Term Recognition with Random Indexing |
Behrang Zadeh and Siegfried Handschuh |
Event Extraction Using Distant Supervision |
Kevin Reschke, Martin Jankowiak, Mihai Surdeanu, Christopher Manning and Daniel Jurafsky |
Expanding N-gram Analytics in ELAN and a Case Study for Sign Synthesis |
Rosalee Wolfe, John McDonald, Larwan Berke and Marie Stumbo |
Experiences with Parallelisation of an Existing NLP Pipeline: Tagging Hansard |
Stephen Wattam, Paul Rayson, Marc Alexander and Jean Anderson |
Experiences with the ISOcat Data Category Registry |
Daan Broeder, Ineke Schuurman and Menzo Windhouwer |
Exploiting Catenae in a Parallel Treebank Alignment |
Manuela Sanguinetti, Cristina Bosco and Loredana Cupi |
Exploiting networks in Law |
Livio Robaldo, Guido Boella, Luigi Di Caro and Andrea Violato |
Exploiting Portuguese Lexical Knowledge Bases for Answering Open Domain Cloze Questions Automatically |
Hugo Gonçalo Oliveira, Inês Coelho and Paulo Gomes |
Exploiting the Large-Scale German Broadcast Corpus to Boost the Fraunhofer IAIS Speech Recognition System |
Michael Stadtschnitzer, Jochen Schwenninger, Daniel Stein and Joachim Koehler |
Exploring and Visualizing Variation in Language Resources |
Peter Fankhauser, Jörg Knappen and Elke Teich |
Exploring Factors that Contribute to Successful Fingerspelling Comprehension |
Leah Geer and Jonathan Keane |
Exploring the Utility of Coreference Chains for Improved Identification of Personal Names |
Andrea Glaser and Jonas Kuhn |
Extending HeidelTime for Temporal Expressions Referring to Historic Dates |
Jannik Strötgen, Thomas Bögel, Julian Zell, Ayser Armiti, Tran Van Canh and Michael Gertz |
Extending Standoff Annotation |
Maik Stührenberg |
Extending the Coverage of a MWE Database for Persian CPs Exploiting Valency Alternations |
Pollet Samvelian, Pegah Faghiri and Sarra El Ayari |
Extensions of the Sign Language Recognition and Translation Corpus RWTH-PHOENIX-Weather |
Jens Forster, Christoph Schmidt, Oscar Koller, Martin Bellgardt and Hermann Ney |
Extracting a bilingual semantic grammar from FrameNet-annotated corpora |
Dana Dannells and Normunds Gruzitis |
Extracting Information for Context-aware Meeting Preparation |
Simon Scerri, Behrang Q. Zadeh, Maciej Dabrowski and Ismael Rivera |
Extracting News Web Page Creation Time with DCTFinder |
Xavier Tannier |
Extracting Semantic Relations from Portuguese Corpora using Lexical-Syntactic Patterns |
Raquel Amaro |
Extraction of Daily Changing Words for Question Answering |
Kugatsu Sadamitsu, Ryuichiro Higashinaka and Yoshihiro Matsuo |
Extrinsic Corpus Evaluation with a Collocation Dictionary Task |
Adam Kilgarriff, Pavel Rychlý, Milos Jakubicek, Vojtěch Kovář, Vit Baisa and Lucia Kocincová |
F |
|
4FX: Light Verb Constructions in a Multilingual Parallel Corpus |
Anita Rácz, István Nagy T. and Veronika Vincze |
Facing the Identification Problem in Language-Related Scientific Data Analysis. |
Joseph Mariani, Christopher Cieri, Gil Francopoulo, Patrick Paroubek and Marine Delaborde |
Finding a Tradeoff between Accuracy and Rater's Workload in Grading Clustered Short Answers |
Andrea Horbach, Alexis Palmer and Magdalena Wolska |
Finding Romanized Arabic Dialect in Code-Mixed Tweets |
Clare Voss, Stephen Tratz, Jamal Laoudi and Douglas Briesch |
Finite-State Morphological Transducers for Three Kypchak Languages |
Jonathan Washington, Ilnar Salimzyanov and Francis Tyers |
First Approach toward Semantic Role Labeling for Basque |
Haritz Salaberri, Olatz Arregi and Beñat Zapirain |
First Insight into Quality-Adaptive Dialogue |
Stefan Ultes, Hüseyin Dikme and Wolfgang Minker |
FLELex: a graded Lexical Resource for French Foreign Learners |
Thomas Francois, Nùria Gala, Patrick Watrin and Cédrick Fairon |
Flow Graph Corpus from Recipe Texts |
Shinsuke Mori, Hirokuni Maeta, Yoko Yamakata and Tetsuro Sasada |
Focusing Annotation for Semantic Role Labeling |
Daniel Peterson, Martha Palmer and Shumin Wu |
Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish |
Niklas Vanhainen and Giampiero Salvi |
Free English and Czech Telephone Speech Corpus Shared Under the CC-BY-SA 3.0 License |
Matěj Korvas, Ondřej Plátek, Ondřej Dušek, Lukáš Žilka and Filip Jurčíček |
Freepal: A Large Collection of Deep Lexico-Syntactic Patterns for Relation Extraction |
Johannes Kirschnick, Alan Akbik and Holmer Hemsen |
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime |
Véronique Moriceau and Xavier Tannier |
From Natural Language to Ontology Population in the Cultural Heritage Domain. A Computational Linguistics-based approach. |
Maria Pia di Buono and Mario Monteleone |
From Non Word to New Word: Automatically Identifying Neologisms in French Newspapers |
Ingrid Falk, Delphine Bernhard and Christophe Gérard |
From Synsets to Videos: Enriching ItalWordNet Multimodally |
Roberto Bartolini, Valeria Quochi, Irene De Felice, Irene Russo and Monica Monachini |
Fuzzy V-Measure - An Evaluation Method for Cluster Analyses of Ambiguous Data |
Jason Utt, Sylvia Springorum, Maximilian Köper and Sabine Schulte im Walde |
G |
|
Generating a Lexicon of Errors in Portuguese to Support an Error Identification System for Spanish Native Learners |
Lianet Sepúlveda Torres, Magali Sanches Duran and Sandra Aluísio |
Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain. |
Cédric Lopez, Frédérique Segond, Olivier Hondermarck, Paolo Curtoni and Luca Dini |
Generating and using Probabilistic Morphological Resources for the Biomedical Domain |
Vincent Claveau and Ewa Kijak |
Generating Polarity Lexicons with WordNet Propagation in 5 Languages |
Isa Maks, Ruben Izquierdo, Francesca Frontini, Rodrigo Agerri, Piek Vossen and Andoni Azpeitia |
GenitivDB ― a Corpus-Generated Database for German Genitive Classification |
Roman Schneider |
Genres in the Prague Discourse Treebank |
Lucie Poláková, Pavlína Jínová and Jiří Mírovský |
German Alcohol Language Corpus - the Question of Dialect |
Florian Schiel and Thomas Kisler |
Getting Reliable Annotations for Sarcasm in Online Dialogues |
Reid Swanson, Stephanie Lukin, Luke Eisenberg, Thomas Corcoran and Marilyn Walker |
Global Intelligent Content: Active Curation of Language Resources using Linked Data |
David Lewis, Rob Brennan, Leroy Finn, Dominic Jones, Alan Meehan, Declan O'sullivan, Sebastian Hellmann and Felix Sasaki |
GlobalPhone: Pronunciation Dictionaries in 20 Languages |
Tanja Schultz and Tim Schlippe |
GLÀFF, a Large Versatile French Lexicon |
Nabil Hathout, Franck Sajous and Basilio Calderone |
Gold-standard for Topic-specific Sentiment Analysis of Economic Texts |
Pyry Takala, Pekka Malo, Ankur Sinha and Oskar Ahlgren |
GraPAT: a Tool for Graph Annotations |
Jonathan Sonntag and Manfred Stede |
GRASS: the Graz corpus of Read And Spontaneous Speech |
Barbara Schuppler, Martin Hagmueller, Juan A. Morales-Cordovilla and Hannes Pessentheiner |
Guampa: a Toolkit for Collaborative Translation |
Alex Rudnick, Taylor Skidmore, Alberto Samaniego and Michael Gasser |
H |
|
HamleDT 2.0: Thirty Dependency Treebanks Stanfordized |
Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman and Zdeněk Žabokrtský |
Harmonization of German Lexical Resources for Opinion Mining |
Thierry Declerck and Hans-Ulrich Krieger |
Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government |
Fabrizio Gotti, Phillippe Langlais and Atefeh Farzindar |
HESITA(te) in Portuguese |
Sara Candeias, Dirce Celorico, Jorge Proença, Arlindo Veiga, Carla Lopes and Fernando Perdigão |
Heuristic Hyper-minimization of Finite State Lexicons |
Senka Drobac, Krister Lindén, Tommi Pirinen and Miikka Silfverberg |
HFST-SweNER ― A New NER Resource for Swedish |
Dimitrios Kokkinakis, Jyrki Niemi, Sam Hardwick, Krister Lindén and Lars Borin |
HiEve: A Corpus for Extracting Event Hierarchies from News Stories |
Goran Glavaš, Jan Šnajder, Marie-Francine Moens and Parisa Kordjamshidi |
High Quality Word Lists as a Resource for Multiple Purposes |
Uwe Quasthoff, Dirk Goldhahn, Thomas Eckart, Erla Hallsteinsdóttir and Sabine Fiedler |
HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation |
Ondrej Bojar, Vojtěch Diatka, Pavel Rychlý, Pavel Stranak, Vit Suchomel, Aleš Tamchyna and Daniel Zeman |
Hindi to English Machine Translation: Using Effective Selection in Multi-Model SMT |
Kunal Sachdeva, Rishabh Srivastava, Sambhav Jain and Dipti Sharma |
Hope and Fear: How Opinions Influence Factuality |
Chantal van Son, Marieke van Erp, Antske Fokkens and Piek Vossen |
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings |
Paul Buitelaar, Georgeta Bordea and Barry Coughlan |
How Could Veins Speed Up the Process of Discourse Parsing |
Elena Mitocariu, Daniel Anechitei and Dan Cristea |
How to Construct a Multi-Lingual Domain Ontology |
Nitsan Chrizman and Alon Itai |
How to Tell a Schneemann from a Milchmann: An Annotation Scheme for Compound-Internal Relations |
Corina Dima, Verena Henrich, Erhard Hinrichs and Christina Hoppermann |
How to Use Less Features and Reach Better Performance in Author Gender Identification |
Juan Soler and Leo Wanner |
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators? |
Daniel Luzzati, Cyril Grouin, Ioana Vasilescu, Martine Adda-Decker, Eric Bilinski, Nathalie Camelin, Juliette Kahn, Carole Lailler, Lori Lamel and Sophie Rosset |
HuRIC: a Human Robot Interaction Corpus |
Emanuele Bastianelli, Giuseppe Castellucci, Danilo Croce, Luca Iocchi, Roberto Basili and Daniele Nardi |
I |
|
Icelandic Quirks: Testing Linguistic Theories and Language Technology |
Rodrigo Boos, Kassius Prestes and Aline Villavicencio |
Identification of Multiword Expressions in the brWaC |
Peter Anick, Marc Verhagen and James Pustejovsky |
Identifying Idioms in Chinese Translations |
Wan Yu Ho, Christine Kng, Shan Wang and Francis Bond |
ILLINOISCLOUDNLP: Text Analytics Services in the Cloud |
Hao Wu, Zhiye Fei, Aaron Dai, Mark Sammons, Dan Roth and Stephen Mayhew |
Image Annotation with ISO-Space: Distinguishing Content from Structure |
James Pustejovsky and Zachary Yocum |
Improvements to Dependency Parsing Using Automatic Simplification of Data |
Tomáš Jelínek |
Improving Entity Linking using Surface Form Refinement |
Eric Charton, Marie-Jean Meurs, Ludovic Jean-Louis and Michel Gagnon |
Improving Evaluation of English-Czech MT through Paraphrasing |
Petra Barancikova, Rudolf Rosa and Ales Tamchyna |
Improving Open Relation Extraction via Sentence Re-Structuring |
Jordan Schmidek and Denilson Barbosa |
Improving the Exploitation of Linguistic Annotations in ELAN |
Onno Crasborn and Han Sloetjes |
Incorporating Alternate Translations into English Translation Treebank |
Ann Bies, Justin Mott, Seth Kulick, Jennifer Garland and Colin Warner |
Information Extraction from German Patient Records via Hybrid Parsing and Relation Extraction Strategies |
Hans-Ulrich Krieger, Christian Spurk, Hans Uszkoreit, Feiyu Xu, Yi Zhang, Frank Müller and Thomas Tolxdorff |
Innovations in Parallel Corpus Search Tools |
Martin Volk, Johannes Graën and Elena Callegaro |
Integration of Workflow and Pipeline for Language Service Composition |
Trang Mai Xuan, Yohei Murakami, Donghui Lin and Toru Ishida |
Interoperability and Customisation of Annotation Schemata in Argo |
Rafal Rak, Jacob Carter, Andrew Rowley, Riza Theresa Batista-Navarro and Sophia Ananiadou |
Interoperability of Dialogue Corpora through ISO 24617-2-based Querying |
Volha Petukhova, Andrei Malchanau and Harry Bunt |
'interHist' - an Interactive Visual Interface for Corpus Exploration |
Verena Lyding, Lionel Nicolas and Egon Stemle |
Introducing a Framework for the Evaluation of Music Detection Tools |
Paula Lopez-Otero, Laura Docio-Fernandez and Carmen Garcia-Mateo |
Introducing a Web Application for Labeling, Visualizing Speech and Correcting Derived Speech Signals |
Raphael Winkelmann and Georg Raess |
Investigating the Image of Entities in Social Media: Dataset Design and First Results |
Julien Velcin, Young-Min Kim, Caroline Brun, Jean-Yves Dormagen, Eric SanJuan, Leila Khouas, Anne Peradotto, Stéphane Bonnevay, Claude Roux, Julien Boyadjian, Alejandro Molina and Marie Neihouser |
ISLEX ― a Multilingual Web Dictionary |
Þórdís Úlfarsdóttir |
IXA pipeline: Efficient and Ready to Use Multilingual NLP tools |
Rodrigo Agerri, Josu Bermudez and German Rigau |
L |
|
Language CoLLAGE: Grammatical Description with the LinGO Grammar Matrix |
Emily M. Bender |
Language Editing Dataset of Academic Texts |
Vidas Daudaravicius |
Language Processing Infrastructure in the XLike Project |
Lluís Padró, Zeljko Agic, Xavier Carreras, Blaz Fortuna, Esteban García-Cuesta, Zhixing Li, Tadej Stajner and Marko Tadić |
Language Resource Addition: Dictionary or Corpus? |
Shinsuke Mori and Graham Neubig |
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction |
Sebastian Krause, Hong Li, Feiyu Xu, Hans Uszkoreit, Robert Hummel and Luise Spielhagen |
Language Resources for French in the Biomedical Domain |
Aurelie Neveol, Julien Grosjean, Stéfan Darmoni and Pierre Zweigenbaum |
Languagesindanger.eu - Including Multimedia Language Resources to disseminate Knowledge and Create Educational Material on less-Resourced Languages |
Dagmar Jung, Katarzyna Klessa, Zsuzsa Duray, Beatrix Oszkó, Mária Sipos, Sándor Szeverényi, Zsuzsa Várnai, Trilsbeek Paul and Tamás Váradi |
Large Scale Arabic Error Annotation: Guidelines and Framework |
Wajdi Zaghouani, Behrang Mohit, Nizar Habash, Ossama Obeid, Nadi Tomeh, Alla Rozovskaya, Noura Farra, Sarah Alkuhlani and Kemal Oflazer |
Large SMT Data-sets Extracted from Wikipedia |
Dan Tufiș, Stefan Daniel Dumitrescu and Dan Stefanescu |
Latent Semantic Analysis Models on Wikipedia and TASA |
Dan Stefanescu, Rajendra Banjade and Vasile Rus |
Learning from Domain Complexity |
Robert Remus and Dominique Ziegelmayer |
Legal Aspects of Text Mining |
Maarten Truyens and Patrick Van Eecke |
Less is More? Towards a Reduced Inventory of Categories for Training a Parser for the Italian Stanford Dependencies |
Maria Simi, Cristina Bosco and Simonetta Montemagni |
Lexical Substitution Dataset for German |
Kostadin Cholakov, Chris Biemann, Judith Eckle-Kohler and Iryna Gurevych |
LexTec - a Rich Language Resource for Technical Domains in Portuguese |
Palmira Marrafa, Raquel Amaro and Sara Mendes |
LexTerm Manager: Design for an Integrated Lexicography and Terminology System |
Joshua Elliot, Logan Kearsley, Jason Housley and Alan Melby |
Linguistic Evaluation of Support Verb Constructions by OpenLogos and Google Translate |
Anabela Barreiro, Johanna Monti, Brigitte Orliac, Susanne Preuß, Kutz Arrieta, Wang Ling, Fernando Batista and Isabel Trancoso |
Linguistic Landscaping of South Asia using Digital Language Resources: Genetic vs. Areal Linguistics |
Lars Borin, Anju Saxena, Taraka Rama and Bernard Comrie |
Linguistic Resources and Cats: How to Use ISOcat, RELcat and SCHEMAcat |
Menzo Windhouwer and Ineke Schuurman |
Linked Open Data and Web Corpus Data for noun compound bracketing |
Pierre André Ménard and Caroline Barriere |
LinkedHealthAnswers: Towards Linked Data-driven Question Answering for the Health Care Domain |
Artem Ostankov, Florian Röhrbein and Ulli Waltinger |
Linking Pictographs to Synsets: Sclera2Cornetto |
Vincent Vandeghinste and Ineke Schuurman |
Locating Requests among Open Source Software Communication Messages |
Ioannis Korkontzelos and Sophia Ananiadou |
LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization |
Annemarie Friedrich, Marina Valeeva and Alexis Palmer |
M |
|
#mygoal: Finding Motivations on Twitter |
Marc Tomlinson, David Bracewell, Wayne Krug and David Hinote |
Machine Translation for Subtitling: A Large-Scale Evaluation |
Thierry Etchegoyhen, Lindsay Bywood, Mark Fishel, Panayota Georgakopoulou, Jie Jiang, Gerard van Loenhout, Arantza del Pozo, Mirjam Sepesy Maucec, Anja Turner and Martin Volk |
Machine Translationness: Machine-likeness in Machine Translation Evaluation |
Joaquim Moré and Salvador Climent |
Macrosyntactic Segmenters of a French Spoken Corpus |
Ilaine Wang, Sylvain Kahane and Isabelle Tellier |
MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic |
Arfath Pasha, Mohamed Al-Badrashiny, Mona Diab, Ahmed El Kholy, Ramy Eskander, Nizar Habash, Manoj Pooleery, Owen Rambow and Ryan Roth |
Manual Analysis of Structurally Informed Reordering in German-English Machine Translation |
Teresa Herrmann, Jan Niehues and Alex Waibel |
Mapping Between English Strings and Reentrant Semantic Graphs |
Fabienne Braune, Daniel Bauer and Kevin Knight |
Mapping CPA Patterns onto OntoNotes Senses |
Octavian Popescu, Martha Palmer and Patrick Hanks |
Mapping Diatopic and Diachronic Variation in Spoken Czech: the Ortofon and Dialekt Corpora |
Marie Kopřivová, Hana Goláňová, Petra Klimešová and David Lukeš |
Mapping the Lexique des Verbes du français (Lexicon of French Verbs) to a NLP Lexicon using Examples |
Bruno Guillaume, Karën Fort, Guy Perrier and Paul Bédaride |
Mapping WordNet Domains, WordNet Topics and Wikipedia Categories to Generate Multilingual Domain Specific Resources |
Spandana Gella, Carlo Strapparava and Vivi Nastase |
MAT: a Tool for L2 Pronunciation Errors Annotation |
Renlong Ai and Marcela Charfuelan |
Measuring Readability of Polish Texts: Baseline Experiments |
Bartosz Broda, Bartłomiej Nitoń, Włodzimierz Gruszczyński and Maciej Ogrodniczuk |
Measuring the Impact of Spelling Errors on the Quality of Machine Translation |
Irina Galinskaya, Valentin Gusev, Elena Mescheryakova and Mariya Shmatova |
Media Monitoring and Information Extraction for the Highly Inflected Agglutinative Language Hungarian |
Júlia Pajzs, Ralf Steinberger, Maud Ehrmann, Mohamed Ebrahim, Leonida Della Rocca, Stefano Bucci, Eszter Simon and Tamás Váradi |
Meta-Classifiers Easily Improve Commercial Sentiment Detection Tools |
Mark Cieliebak, Oliver Dürr and Fatih Uzdilli |
META-SHARE: One Year After |
Stelios Piperidis, Harris Papageorgiou, Christian Spurk, Georg Rehm, Khalid Choukri, Olivier Hamon, Nicoletta Calzolari, Riccardo Del Gratta, Bernardo Magnini and Christian Girardi |
Metadata as Linked Open Data: mapping disparate XML metadata registries into one RDF/OWL registry. |
Marta Villegas, Maite Melero and Núria Bel |
Mining a Multimodal Corpus for Non-Verbal Behavior Sequences Conveying Attitudes |
Mathieu Chollet, Magalie Ochs and Catherine Pelachaud |
Mining Online Discussion Forums for Metaphors |
Andrew Gargett and John Barnden |
Missed Opportunities in Translation Memory Matching |
Friedel Wolff, Laurette Pretorius and Paul Buitelaar |
ML-Optimization of Ported Constraint Grammars |
Eckhard Bick |
Modeling and Evaluating Dialog Success in the LAST MINUTE Corpus |
Dietmar Rösner, Rafael Friesen, Stephan Günther and Rico Andrich |
Modeling Language Proficiency Using Implicit Feedback |
Chris Hokamp, Rada Mihalcea and Peter Schuelke |
Modeling, Managing, Exposing, and Linking Ontologies with a Wiki-based Tool |
Mauro Dragoni, Alessio Bosca, Matteo Casu and Andi Rexha |
Modelling Irony in Twitter: Feature Analysis and Evaluation |
Francesco Barbieri and Horacio Saggion |
Modern Chinese Helps Archaic Chinese Processing: Finding and Exploiting the Shared Properties |
Yan Song and Fei Xia |
Momresp: A Bayesian Model for Multi-Annotator Document Labeling |
Paul Felt, Robbie Haertel, Eric Ringger and Kevin Seppi |
Morfeusz Reloaded |
Marcin Woliński |
Morpho-Syntactic Study of Errors from Speech Recognition System |
Maria Goryainova, Cyril Grouin, Sophie Rosset and Ioana Vasilescu |
Morphological Parsing of Swahili using Crowdsourced Lexical Resources |
Patrick Littell, Kaitlyn Price and Lori Levin |
MotàMot Project: Conversion of a French-Khmer Published Dictionary for Building a Multilingual Lexical System |
Mathieu Mangeot |
MTWatch: A Tool for the Analysis of Noisy Parallel Data |
Sandipan Dandapat and Declan Groves |
MUHIT: A Multilingual Harmonized Dictionary |
Sameh Alansary |
Multilingual Corpora with Coreferential Annotation of Person Entities |
Marcos Garcia and Pablo Gamallo |
Multilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses |
Tatiana Erekhinskaya, Meghana Satpute and Dan Moldovan |
Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain |
Zdenka Uresova, Jan Hajic, Pavel Pecina and Ondrej Dusek |
Multimodal Corpora for Silent Speech Interaction |
João Freitas, António Teixeira and Miguel Dias |
Multimodal Dialogue Segmentation with Gesture Post-Processing |
Kodai Takahashi and Masashi Inoue |
Multiple Choice Question Corpus Analysis for Distractor Characterization |
Van-Minh Pho, Thibault André, Anne-Laure Ligozat, Brigitte Grau, Gabriel Illouz and Thomas Francois |
MultiVal - Towards a Multilingual Valence Lexicon |
Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu and Montserrat Marimon |
Multiword Expressions in Machine Translation |
Valia Kordoni and Iliana Simova |
Mörkum Njálu. An Annotated Corpus to Analyse and Explain Grammatical Divergences Between 14th-century Manuscripts of Njál's Saga. |
Ludger Zeevaert |
N |
|
N-gram Counts and Language Models from the Common Crawl |
Christian Buck, Kenneth Heafield and Bas van Ooyen |
Named Entity Corpus Construction using Wikipedia and DBpedia Ontology |
Younggyun Hahm, Jungyeul Park, Kyungtae Lim, Youngsik Kim, Dosam Hwang and Key-Sun Choi |
Named Entity Recognition on Turkish Tweets |
Dilek Kucuk, Guillaume Jacquet and Ralf Steinberger |
Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers |
Joachim Bingel and Thomas Haider |
Narrowing the Gap Between Termbases and Corpora in Commercial Environments |
Kara Warburton |
NASTIA: Negotiating Appointment Setting Interface |
Layla El Asri, Rémi Lemonnier, Romain Laroche, Olivier Pietquin and Hatim Khouzaimi |
Native Language Identification Using Large, Longitudinal Data |
Xiao Jiang, Yufan Guo, Jeroen Geertzen, Dora Alexopoulou, Lin Sun and Anna Korhonen |
New Bilingual Speech Databases for Audio Diarization |
David Tavarez, Eva Navas, Daniel Erro, Ibon Saratxaga and Inma Hernaez |
New Directions for Language Resource Development and Distribution |
Christopher Cieri, Denise DiPersio, Mark Liberman, Andrea Mazzucchi, Stephanie Strassel and Jonathan Wright |
New Functions for a Multipurpose Multimodal Tool for Phonetic and Linguistic Analysis of Very Large Speech Corpora |
Philippe Martin |
New Spanish Speech Corpus Database for the Analysis of People Suffering from Parkinson's Disease |
Juan Rafael Orozco-Arroyave, Julián David Arias-Londoño, Jesús Francisco Vargas-Bonilla, María Claudia Gonzalez-Rátiva and Elmar Nöth |
NewsReader: Recording History from Daily News Streams |
Piek Vossen, German Rigau, Luciano Serafini, Pim Stouten, Francis Irving and Willem Van Hage |
NIF4OGGD - NLP Interchange Format for Open German Governmental Data |
Mohamed Sherif, Sandro Coelho, Ricardo Usbeck, Sebastian Hellmann, Jens Lehmann, Martin Brümmer and Andreas Both |
NOMAD: Linguistic Resources and Tools Aimed at Policy Formulation and Validation |
George Kiomourtzis, George Giannakopoulos, Georgios Petasis, Pythagoras Karampiperis and Vangelis Karkaletsis |
NomLex-PT: A Lexicon of Portuguese Nominalizations |
Valeria de Paiva, Livy Real, Alexandre Rademaker and Gerard de Melo |
NoSta-D Named Entity Annotation for German: Guidelines and Dataset |
Darina Benikova, Chris Biemann and Marc Reznicek |
Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech |
Nianwen Xue, Ondrej Bojar, Jan Hajic, Martha Palmer, Zdenka Uresova and Xiuhong Zhang |
N³ - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format |
Michael Röder, Ricardo Usbeck, Sebastian Hellmann, Daniel Gerber and Andreas Both |
O |
|
Off-Road LAF: Encoding and Processing Annotations in NLP Workflows |
Emanuele Lapponi, Erik Velldal, Stephan Oepen and Rune Lain Knudsen |
On Complex Word Alignment Configurations |
Miriam Kaeshammer and Anika Westburg |
On Paraphrase Identification Corpora |
Vasile Rus, Rajendra Banjade and Mihai Lintean |
On Stopwords, Filtering and Data Sparsity for Sentiment Analysis of Twitter |
Hassan Saif, Miriam Fernandez, Yulan He and Harith Alani |
On the Annotation of TMX Translation Memories for Advanced Leveraging in Computer-aided Translation |
Mikel Forcada |
On the Importance of Text Analysis for Stock Price Prediction |
Heeyoung Lee, Mihai Surdeanu, Bill MacCartney and Dan Jurafsky |
On the Origin of Errors: a Fine-Grained Analysis of MT and PE Errors and their Relationship |
Joke Daems, Lieve Macken and Sonia Vandepitte |
On the Reliability and Inter-Annotator Agreement of Human Semantic MT Evaluation via HMEANT |
Chi-kiu Lo and Dekai Wu |
On the Romance Languages Mutual Intelligibility |
Liviu Dinu and Alina Maria Ciobanu |
On the Use of a Fuzzy Classifier to Speed Up the Sp ToBI Labeling of the Glissando Spanish Corpus |
David Escudero, Aguilar-Cuevas Lourdes, González-Ferreras César, Gutiérrez-González Yurena and Valentín Cardeñoso-Payo |
Online Experiments with the Percy Software Framework - Experiences and some Early Results |
Christoph Draxler |
Online Optimisation of Log-linear Weights in Interactive Machine Translation |
Mara Chinea-Rios, Germán Sanchis Trilles, Daniel Ortiz-Martínez and Francisco Casacuberta |
Open Philology at the University of Leipzig |
Frederik Baumgardt, Giuseppe Celano, Gregory R. Crane, Stella Dee, Maryam Foradi, Emily Franzini, Greta Franzini, Monica Lent, Maria Moritz and Simona Stoyanova |
Open-domain Interaction and Online Content in the Sami Language |
Kristiina Jokinen |
OpenLogos Semantico-Syntactic Knowledge-Rich Bilingual Dictionaries |
Anabela Barreiro, Fernando Batista, Ricardo Ribeiro, Helena Moniz and Isabel Trancoso |
Optimizing a Distributional Semantic Model for the Prediction of German Particle Verb Compositionality |
Stefan Bott and Sabine Schulte im Walde |
Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline |
Matthew Shardlow |
Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving |
Akira Fujita, Akihiro Kameda, Ai Kawazoe and Yusuke Miyao |
P |
|
PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and CEllphone |
Christian Haenig, Andreas Niekler and Carsten Wuensch |
PanLex: Building a Resource for Panlingual Lexical Translation |
David Kamholz, Jonathan Pool and Susan Colowick |
ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT |
Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann and Bonnie Webber |
Parsing Chinese Synthetic Words with a Character-based Dependency Model |
Fei Cheng, Kevin Duh and Yuji Matsumoto |
Parsing Heterogeneous Corpora with a Rich Dependency Grammar |
Achim Stein |
Phone Boundary Annotation in Conversational Speech |
Yi-Fen Liu, Shu-Chuan Tseng and J.-S Roger Jang |
Phoneme Set Design Using English Speech Database by Japanese for Dialogue-Based English CALL Systems |
Xiaoyun Wang, Jinsong Zhang, Masafumi Nishida and Seiichi Yamamoto |
Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling |
Pablo Ruiz, Aitor Álvarez and Haritz Arzelus |
Pivot-based Multilingual Dictionary Building using Wiktionary |
Judit Ács |
Polish Coreference Corpus in Numbers |
Maciej Ogrodniczuk, Mateusz Kopeć and Agata Savary |
PoliTa: a Multitagger for Polish |
Łukasz Kobyliński |
Polysemy Index for Nouns: an Experiment on Italian using the PAROLE SIMPLE CLIPS Lexical Database |
Francesca Frontini, Valeria Quochi, Sebastian Padó, Monica Monachini and Jason Utt |
Potsdam Commentary Corpus 2.0: Annotation for Discourse Research |
Manfred Stede and Arne Neumann |
Praaline: Integrating Tools for Speech Corpus Research |
George Christodoulides |
Pre-ordering of Phrase-based Machine Translation Input in Translation Workflow |
Alexandru Ceausu and Sabine Hunsicker |
Predicate Matrix: extending SemLink through WordNet mappings |
Maddalen Lopez de Lacalle, Egoitz Laparra and German Rigau |
Presenting a System of Human-Machine Interaction for Performing Map Tasks. |
Gabriele Pallotti, Francesca Frontini, Fabio Affè, Monica Monachini and Stefania Ferrari |
Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese |
Miguel B. Almeida, Mariana S. C. Almeida, André F. T. Martins, Helena Figueira, Pedro Mendes and Cláudia Pinto |
Production of Phrase Tables in 11 European Languages using an Improved Sub-sentential Aligner |
Juan Luo and Yves Lepage |
Projection-based Annotation of a Polish Dependency Treebank |
Alina Wróblewska and Adam Przepiórkowski |
Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose |
Mathieu Lafourcade and Karën Fort |
PropBank: Semantics of New Predicate Types |
Claire Bonial, Julia Bonn, Kathryn Conger, Jena D. Hwang and Martha Palmer |
Prosodic, Syntactic, Semantic Guidelines for Topic Structures Across Domains and Corpora |
Ana Isabel Mata, Helena Moniz, Telmo Móia, Anabela Gonçalves, Fátima Silva, Fernando Batista, Inês Duarte, Fátima Oliveira and Isabel Falé |
Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser |
Cheikh M. Bamba Dione |
R |
|
Ranking Job Offers for Candidates: Learning Hidden Knowledge from Big Data |
Marc Poch, Núria Bel, Sergio Espeja and Felipe Navio |
Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian |
Anton Karl Ingason, Hrafn Loftsson, Eiríkur Rögnvaldsson, Einar Freyr Sigurðsson and Joel C. Wallenberg |
Re-using an Argument Corpus to Aid in the Curation of Social Media Collections |
Clare Llewellyn, Claire Grover, Jon Oberlander and Ewan Klein |
Recent Developments in DeReKo |
Marc Kupietz and Harald Lüngen |
Recognising Suicidal Messages in Dutch Social Media |
Bart Desmet and Véronique Hoste |
Reconstructing the Semantic Landscape of Natural Language Processing |
Elisa Omodei, Jean-Philippe Cointet and Thierry Poibeau |
RECSA: Resource for Evaluating Cross-lingual Semantic Annotation |
Achim Rettinger, Lei Zhang, Daša Berović, Danijela Merkler, Matea Srebačić and Marko Tadić |
Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis |
Joseph Mariani, Gil Francopoulo, Patrick Paroubek and Olivier Hamon |
REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations |
Peter Exner and Pierre Nugues |
Relating Frames and Constructions in Japanese FrameNet |
Kyoko Ohara |
Relation Inference in Lexical Networks ... with Refinements |
Manel Zarrouk and Mathieu Lafourcade |
RELISH LMF: Unlocking the Full Power of the Lexical Markup Framework |
Menzo Windhouwer, Justin Petro and Shakila Shayan |
Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0 |
Maud Ehrmann, Francesco Cecconi, Daniele Vannella, John Philip McCrae, Philipp Cimiano and Roberto Navigli |
Representing Multimodal Linguistic Annotated Data |
Brigitte Bigi, Tatsuya Watanabe and Laurent Prévot |
Resource Creation and Evaluation for Multilingual Sentiment Analysis in Social Media Texts |
Alexandra Balahur, Marco Turchi, Ralf Steinberger, Jose Manuel Perea-Ortega, Guillaume Jacquet, Dilek Kucuk, Vanni Zavarella and Adil El Ghali |
Resources for the Detection of Conventionalized Metaphors in Four Languages |
Lori Levin, Teruko Mitamura, Brian MacWhinney, Davida Fromm, Jaime Carbonell, Weston Feely, Robert Frederking, Anatole Gershman and Carlos Ramirez |
Resources in Conflict: A Bilingual Valency Lexicon vs. a Bilingual Treebank vs. a Linguistic Theory |
Jana Sindlerova, Zdenka Uresova and Eva Fucikova |
RESTful Annotation and Efficient Collaboration |
Jonathan Wright |
Reusing Swedish FrameNet for Training Semantic Roles |
Ildikó Pilán and Elena Volodina |
Revising the Annotation of a Broadcast News Corpus: a Linguistic Approach |
Vera Cabarrão, Helena Moniz, Fernando Batista, Ricardo Ribeiro, Nuno Mamede, Hugo Meinedo, Isabel Trancoso, Ana Isabel Mata and David Martins de Matos |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French |
Anne Lacheret, Sylvain Kahane, Julie Beliao, Anne Dister, Kim Gerdes, Jean-Philippe Goldman, Nicolas Obin, Paola Pietrandrea and Atanas Tchobanov |
ROOTS: a Toolkit for Easy, Fast and Consistent Processing of Large Sequential Annotated Data Collections |
Jonathan Chevelu, Gwénolé Lecorvé and Damien Lolive |
RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus |
Tiberiu Boroș, Adriana Stan, Oliver Watts and Stefan Daniel Dumitrescu |
Rule-based Reordering Space in Statistical Machine Translation |
Nicolas Pécheux, Alexander Allauzen and François Yvon |
Ruled-based, Interlingual Motivated Mapping of plWordNet onto SUMO Ontology |
Paweł Kędzia and Maciej Piasecki |
S |
|
S-pot - a Benchmark in Spotting Signs Within Continuous Signing |
Ville Viitaniemi, Tommi Jantunen, Leena Savolainen, Matti Karppa and Jorma Laaksonen |
SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis |
Muhammad Abdul-Mageed and Mona Diab |
SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling |
Arantza del Pozo, Carlo Aliprandi, Aitor Álvarez, Carlos Mendes, Joao P. Neto, Sérgio Paulo, Nicola Piccinini and Matteo Raffaelli |
Segmentation Evaluation Metrics, a Comparison Grounded on Prosodic and Discourse Units |
Klim Peshkov and Laurent Prévot |
Self-training a Constituency Parser using n-gram Trees |
Arda Celebi and Arzucan Özgür |
Semantic Approaches to Software Component Retrieval with English Queries |
Huijing Deng and Grzegorz Chrupała |
Semantic Clustering of Pivot Paraphrases |
Marianna Apidianaki, Emilia Verzeni and Diana McCarthy |
Semantic Search in Documents Enriched by LOD-based Annotations |
Pavel Smrz and Jan Kouril |
Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data |
Milen Kouylekov and Stephan Oepen |
Semi-Automatic Annotation of the UCU Accents Speech Corpus |
Rosemary Orr, Marijn Huijbregts, Roeland van Beek, Lisa Teunissen, Kate Backhouse and David van Leeuwen |
Semi-Compositional Method for Synonym Extraction of Multi-Word Terms |
Béatrice Daille and Amir Hazem |
Semi-Supervised Methods for Expanding Psycholinguistics Norms by Integrating Distributional Similarity with the Structure of WordNet |
Michael Mohler, Marc Tomlinson, David Bracewell and Bryan Rink |
Sentence Rephrasing for Parsing Sentences with OOV Words |
Hen-Hsen Huang, Huan-Yuan Chen, Chang-Sheng Yu, Hsin-Hsi Chen, Po-Ching Lee and Chun-Hsun Chen |
SenTube: A Corpus for Sentiment Analysis on YouTube Social Media |
Olga Uryupina, Barbara Plank, Aliaksei Severyn, Agata Rotondi and Alessandro Moschitti |
Sharing Cultural Heritage: the Clavius on the Web Project |
Matteo Abrate, Angelo Mario Del Grosso, Emiliano Giovannetti, Angelica Lo Duca, Damiana Luzzi, Lorenzo Mancini, Andrea Marchetti, Irene Pedretti and Silvia Piccini |
Sharing Resources Between Free/Open-Source Rule-based Machine Translation Systems: Grammatical Framework and Apertium |
Grégoire Détrez, Víctor M. Sánchez-Cartagena and Aarne Ranta |
Shata-Anuvadak: Tackling Multiway Translation of Indian Languages |
Anoop Kunchukuttan, Abhijit Mishra, Rajen Chatterjee, Ritesh Shah and Pushpak Bhattacharyya |
Simple Effective Microblog Named Entity Recognition: Arabic as an Example |
Kareem Darwish and Wei Gao |
Single Classifier Approach for Verb Sense Disambiguation based on Generalized Features |
Daisuke Kawahara and Martha Palmer |
Single-Person and Multi-Party 3D Visualizations for Nonverbal Communication Analysis |
Michael kipp, Levin Freiherr von Hollen, Michael Christopher Hrstka and Franziska Zamponi |
SinoCoreferencer: An End-to-End Chinese Event Coreference Resolver |
Chen Chen and Vincent Ng |
SLMotion - an Extensible Sign Language Oriented Video Analysis Tool |
Matti Karppa, Ville Viitaniemi, Marcos Luzardo, Jorma Laaksonen and Tommi Jantunen |
sloWCrowd: a Crowdsourcing Tool for Lexicographic Tasks |
Darja Fišer, Aleš Tavčar and Tomaž Erjavec |
Smile and Laughter in Human-Machine Interaction: a Study of Engagement |
Mariette Soury and Laurence Devillers |
Sockpuppet Detection in Wikipedia: A Corpus of Real-World Deceptive Writing for Linking Identities |
Thamar Solorio, Ragib Hasan and Mainul Mizan |
Speech Recognition Web Services for Dutch |
Joris Pelemans, Kris Demuynck, Hugo Van hamme and Patrick Wambacq |
Speech-Based Emotion Recognition: Feature Selection by Self-Adaptive Multi-Criteria Genetic Algorithm |
Maxim Sidorov, Christina Brester, Wolfgang Minker and Eugene Semenkin |
Sprinter: Language Technologies for Interactive and Multimedia Language Learning |
Renlong Ai, Marcela Charfuelan, Walter Kasper, Tina Klüwer, Hans Uszkoreit, Feiyu Xu, Sandra Gasber and Philip Gienandt |
Standardisation and Interoperation of Morphosyntactic and Syntactic Annotation Tools for Spanish and their Annotations |
Antonio Pareja-Lora, Guillermo Cárcamo-Escorza and Alicia Ballesteros-Calvo |
Statistical Analysis of Multilingual Text Corpus and Development of Language Models |
Shyam Sundar Agrawal,Mandal Abhimanue, Shweta Bansal and Minakshi Mahajan |
Student Achievement and French Sentence Repetition Test Scores |
Deryle Lonsdale and Benjamin Millard |
Sublanguage Corpus Analysis Toolkit: a Tool for Assessing the Representativeness and Sublanguage Characteristics of Corpora |
Irina Temnikova, William A. Baumgartner Jr., Negacy D. Hailu, Ivelina Nikolova, Tony McEnery, Adam Kilgarriff, Galia Angelova and K. Bretonnel Cohen |
Summarizing News Clusters on the Basis of Thematic Chains |
Natalia Loukachevitch and Aleksey Alekseev |
Supervised Within-Document Event Coreference using Information Propagation |
Zhengzhong Liu, Jun Araki, Eduard Hovy and Teruko Mitamura |
SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer |
Timur Gilmanov, Olga Scrivner and Sandra Kübler |
SwissAdmin: a Multilingual Tagged Parallel Corpus of Press Releases |
Yves Scherrer, Luka Nerima, Lorenza Russo, Maria Ivanova and Eric Wehrli |
Synergy of Nederlab and @PhilosTEI: Diachronic and Multilingual Text-induced Corpus Clean-up |
Martin Reynaert |
Szeged Corpus 2.5: Morphological Modifications in a Manually POS-tagged Hungarian Corpus |
Veronika Vincze, Viktor Varga, Katalin Ilona Simkó, János Zsibrita, Ágoston Nagy, Richárd Farkas and János Csirik |
T |
|
3D Face Tracking and Multi-Scale, Spatio-temporal Analysis of Linguistically Significant Facial Expressions and Head Positions in ASL |
Bo Liu, Jingjing Liu, Xiang Yu, Dimitris Metaxas and Carol Neidle |
T-PAS; A resource of Typed Predicate Argument Structures for linguistic analysis and semantic processing |
Elisabetta Jezek, Bernardo Magnini, Anna Feltracco, Alessia Bianchini and Octavian Popescu |
T2K^2: a System for Automatically Extracting and Organizing Knowledge from Texts |
Felice Dell'Orletta, Giulia Venturi, Andrea Cimino and Simonetta Montemagni |
Taalportaal: an Online Grammar of Dutch and Frisian |
Frank Landsbergen, Carole Tiberius and Roderik Dernison |
TagNText: a Parallel Corpus for the Induction of Resource-specific non-Taxonomical Relations from Tagged Images |
Theodosia Togia and Ann Copestake |
TaLAPi ― A Thai Linguistically Annotated Corpus for Language Processing |
AiTi Aw, Sharifah Mahani Aljunied, Nattadaporn Lertcheva and Sasiwimon Kalunsima |
TALC-Sef a Manually-revised POS-Tagged Literary Corpus in Serbian, English and French |
Antonio Balvet, Dejan Stosic and Aleksandra Miletic |
Teenage and Adult Speech in School Context: Building and Processing a Corpus of European Portuguese |
Ana Isabel Mata, Helena Moniz, Fernando Batista and Julia Hirschberg |
Terminology Localization Guidelines for the National Scenario |
Juris Borzovs, Ilze Ilziņa, Iveta Keiša, Mārcis Pinnis and Andrejs Vasiļjevs |
Terminology Resources and Terminology Work Benefit from Cloud Services |
Tatiana Gornostay and Andrejs Vasiļjevs |
TermWise: A CAT-tool with Context-Sensitive Terminological Support. |
Kris Heylen, Stephen Bond, Dirk De Hertog De Hertog, Ivan Vulić and Hendrik Kockaert |
TexAFon 2.0: a Text Processing Tool for the Generation of Expressive Speech in TTS Applications |
Juan-María Garrido, Yesika Laplaza, Benjamin Kolz and Miquel Cornudella |
Text Readability and Word Distribution in Japanese |
Satoshi Sato |
Textual Emigration Analysis (TEA) |
Andre Blessing and Jonas Kuhn |
Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon |
Mona Diab, Mohamed AlBadrashiny, Maryam Aminian, Mohammed Attia, Heba Elfardy, Nizar Habash, Abdelati Hawwari, Wael Salloum, Pradeep Dasigi and Ramy Eskander |
The Alveo Virtual Laboratory: A Web based Repository API |
Steve Cassidy, Dominique Estival, Timothy Jones, Denis Burnham and Jared Burghold |
The AMARA Corpus: Building Parallel Language Resources for the Educational Domain |
Ahmed Abdelali, Francisco Guzman, Hassan Sajjad and Stephan Vogel |
The American Local News Corpus |
Ann Irvine, Joshua Langfus and Chris Callison-Burch |
The AV-LASYN Database: a Synchronous Corpus of Audio and 3D Facial Marker Data for Audio-Visual Laughter Synthesis |
Huseyin Cakmak, Jerome Urbain, Thierry Dutoit and Joelle Tilmanne |
The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars |
Erhard Hinrichs and Steven Krauwer |
The CLE Urdu POS Tagset |
Saba Urooj, Sarmad Hussain, Asad Mustafa, Rahila Parveen, Farah Adeeba, Tafseer Ahmed Khan, Miriam Butt and Annette Hautli |
The CMD Cloud |
Matej Durco and Menzo Windhouwer |
The CMU METAL Farsi NLP Approach |
Weston Feely, Mehdi Manshadi, Robert Frederking and Lori Levin |
The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank |
Lanjun Zhou, Binyang Li, Zhongyu Wei and Kam-Fai Wong |
The D-ANS corpus: the Dublin-Autonomous Nervous System corpus of biosignal and multimodal recordings of conversational speech |
Shannon Hennig, Ryad Chellali and Nick Campbell |
The Dangerous Myth of the Star System |
André Bittar, dini luca, Sigrid Maurel and Mathieu Ruhlmann |
The DARE Corpus: A Resource for Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems |
Nobal Niraula, Vasile Rus, Rajendra Banjade, Dan Stefanescu, William Baggett and Brent Morgan |
The Database for Spoken German ― DGD2 |
Thomas Schmidt |
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues |
Volha Petukhova, Martin Gropp, Dietrich Klakow, Gregor Eigner, Mario Topf, Stefan Srb, Petr Motlicek, Blaise Potard, John Dines, Olivier Deroo, Ronny Egeler, Uwe Meinz, Steffen Liersch and Anna Schmidt |
The Development of Dutch and Afrikaans Language Resources for Compound Boundary Analysis. |
Menno van Zaanen, Gerhard Van Huyssteen, Suzanne Aussems, Chris Emmery and Roald Eiselen |
The Development of the Multilingual LUNA Corpus for Spoken Language System Porting |
Evgeny Stepanov, Giuseppe Riccardi and Ali Orkan Bayer |
The DIRHA simulated corpus |
Luca Cristoforetti, Mirco Ravanelli, Maurizio Omologo, Alessandro Sosi, Alberto Abad, Martin Hagmueller and Petros Maragos |
The Distress Analysis Interview Corpus of Human and Computer Interviews |
Jonathan Gratch, Ron Artstein, Gale Lucas, Giota stratou, Stefan Scherer, Angela Nazarian, Rachel Wood, Jill Boberg, David DeVault, Stacy Marsella, David Traum, Albert "Skip" Rizzo and Louis-Philippe Morency |
The Dutch LESLLA Corpus |
Eric Sanders, Ineke van de Craats and Vanja de Lint |
The DWAN Framework: Application of a Web Annotation Framework for the General Humanities to the Domain of Language Resources |
Przemyslaw Lenkiewicz, Olha Shkaravska, Twan Goosen, Daan Broeder, Menzo Windhouwer, Stephanie Roth and Olof Olsson |
The EASR Corpora of European Portuguese, French, Hungarian and Polish Elderly Speech |
Annika Hämäläinen, Jairo Avelar, Silvia Rodrigues, Miguel Sales Dias, Artur Kolesiński, Tibor Fegyó, Géza Németh, Petra Csobánka, Karine Lan and David Hewson |
The eIdentity Text Exploration Workbench |
Fritz Kliche, Andre Blessing, Dr. Ulrich Heid and Jonathan Sonntag |
The Ellogon Pattern Engine: Context-free Grammars over Annotations |
Georgios Petasis |
The ETAPE Speech Processing Evaluation |
Olivier Galibert, Jeremy Leixa, Gilles Adda, Khalid Choukri and Guillaume Gravier |
The Evolving Infrastructure for Language Resources and the Role for Data Scientists |
Nelleke Oostdijk and Henk van den Heuvel |
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution |
Anders Björkelund, Kerstin Eckart, Arndt Riester, Nadja Schauffler and Katrin Schweitzer |
The Gulf of Guinea Creole Corpora |
Tjerk Hagemeijer, Michel Généreux, Iris Hendrickx, Amália Mendes, Abigail Tiny and Armando Zamora |
The Halliday Centre Tagger: An Online Platform for Semi-automatic Text Annotation and Analysis |
Billy T.M. Wong, Ian C. Chow, Jonathan J. Webster and Hengbin Yan |
The Hungarian Gigaword Corpus |
Csaba Oravecz, Tamás Váradi and Bálint Sass |
The IMAGACT Visual Ontology. an Extendable Multilingual Infrastructure for the Representation of Lexical Encoding of Action |
Massimo Moneglia, Susan Brown, Francesca Frontini, Gloria Gagliardi, Fahad Khan, Monica Monachini and Alessandro Panunzi |
The Impact of Cohesion Errors in Extraction Based Summaries |
Evelina Rennes and Arne Jonsson |
The Interplay Between Lexical and Syntactic Resources in Incremental Parsebanking |
Victoria Rosén, Petter Haugereid, Martha Thunes, Gyri S. Losnegaard and Helge Dyvik |
The IULA Spanish LSP Treebank |
Montserrat Marimon, Núria Bel, Beatriz Fisas, Blanca Arias, Silvia Vázquez, Jorge Vivaldi, Carlos Morell and Mercè Lorente |
The KiezDeutsch Korpus (KiDKo) Release 1.0 |
Ines Rehbein, Sören Schalowski and Heike Wiese |
The Language Application Grid |
Nancy Ide, James Pustejovsky, Christopher Cieri, Eric Nyberg, Di Wang, Keith Suderman, Marc Verhagen and Jonathan Wright |
The Liability of Service Providers in e-Research Infrastructures: Killing the Messenger? |
Pawel Kamocki |
The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction |
Gaël de Chalendar |
The LRE Map disclosed |
Riccardo Del Gratta, Gabriella Pardelli and Sara Goggi |
The Making of Ancient Greek WordNet |
Yuri Bizzoni, Federico Boschetti, Harry Diakoff, Riccardo Del Gratta, Monica Monachini and Gregory Crane |
The MERLIN corpus: Learner Language and the CEFR |
Adriane Boyd, Jirka Hana, Lionel Nicolas, Detmar Meurers, Katrin Wisniewski, Andrea Abel, Karin Schöne, Barbora Štindlová and Chiara Vettori |
The Meta-knowledge of Causality in Biomedical Scientific Discourse |
Claudiu Mihăilă and Sophia Ananiadou |
The MMASCS Multi-Modal Annotated Synchronous Corpus of Audio, Video, Facial Motion and Tongue Motion Data of Normal, Fast and Slow Speech |
Dietmar Schabus, Michael Pucher and Phil Hoole |
The Multilingual Paraphrase Database |
Juri Ganitkevitch and Chris Callison-Burch |
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production |
Björn Schuller, Felix Friedmann and Florian Eyben |
The N2 Corpus: a Semantically Annotated Collection of Islamist Extremist Stories |
Mark Finlayson, Jeffry Halverson and Steven Corman |
The NewSoMe Corpus: a Unifying Opinion Annotation Framework Across Genres and in Multiple Languages |
Roser Saurí, Judith Domingo and Toni Badia |
The Nijmegen Corpus of Casual Czech |
Mirjam Ernestus, Lucie Kočková-Amortová and Petr Pollak |
The Norwegian Dependency Treebank |
Per Erik Solberg, Arne Skjærholt, Lilja Øvrelid, Kristin Hagen and Janne Bondi Johannessen |
The Polish Summaries Corpus |
Maciej Ogrodniczuk and Mateusz Kopeć |
The Pragmatic Annotation of a Corpus of Academic Lectures |
Sian Alsop and Hilary Nesi |
The Procedure of Lexico-Semantic Annotation of Składnica Treebank |
Elżbieta Hajnicz |
The RATS Collection: Supporting HLT Research with Degraded Audio Data |
David Graff, Kevin Walker, Stephanie Strassel, Xiaoyi Ma, Karen Jones and Ann Sawyer |
The Research and Teaching Corpus of Spoken German ― FOLK |
Thomas Schmidt |
The SETimes.HR Linguistically Annotated Corpus of Croatian |
Željko Agić and Nikola Ljubešić |
The Slovak Categorized News Corpus |
Daniel Hladek, Jan Stas and Jozef Juhar |
The Slovene BNSI Broadcast News Database and Reference Speech Corpus GOS: Towards the Uniform Guidelines for Future Work |
Andrej Zgank, Ana Zwitter Vitez and Darinka Verdonik |
The SSPNet-Mobile Corpus: Social Signal Processing Over Mobile Phones. |
Anna Polychroniou, Hugues Salamin and Alessandro Vinciarelli |
The Strategic Impact of META-NET on the Regional, National and International Level |
Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audroné Bielevičiené, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen Garcia-Mateo, Josef van Genabith, Jan Hajic, Inma Hernaez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Linden, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asuncion Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pezik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Michael Rosner, Bolette Pedersen, Inguna Skadina, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiș, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider and Jolanta Zabarskaite |
The Sweet-Home Speech and Multimodal Corpus for Home Automation Interaction |
Michel Vacher, Benjamin Lecouteux, Pedro Chahuara, François Portet, Brigitte Meillon and Nicolas Bonnefond |
The SYN-series Corpora of Written Czech |
Milena Hnátková, Michal Křen, Pavel Procházka and Hana Skoumalová |
The taraXÜ Corpus of Human-Annotated Machine Translations |
Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Maja Popović, Cindy Tscherwinka, David Vilar and Hans Uszkoreit |
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue |
Maria Koutsombogera, Samer Al Moubayed, Bajibabu Bollepalli, Ahmed Hussen Abdelaziz, Martin Johansson, José David Aguas Lopes, Jekaterina Novikova, Catharine Oertel, Kalin Stefanov and Gül Varol |
The USAGE Review Corpus for Fine Grained Multi Lingual Opinion Analysis |
Roman Klinger and Philipp Cimiano |
The Use of a FileMaker Pro Database in Evaluating Sign Language Notation Systems |
Julie Hochgesang |
The WaveSurfer Automatic Speech Recognition Plugin |
Giampiero Salvi and Niklas Vanhainen |
The Weltmodell: A Data-Driven Commonsense Knowledge Base |
Alan Akbik and Thilo Michael |
Thematic Cohesion: Measuring Terms Discriminatory Power Toward Themes |
Clément de Groc, Xavier Tannier and Claude de Loupy |
Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D |
Scott Martens and Marco Passarotti |
Three Dimensions of the so-called "Interoperability" of Annotation Schemes |
Eva Hajičová |
TLAXCALA: a Multilingual Corpus of Independent News |
Antonio Toral |
TMO ― The Federated Ontology of the TrendMiner Project |
Hans-Ulrich Krieger and Thierry Declerck |
To Pay or to Get Paid: Enriching a Valency Lexicon with Diatheses |
Anna Vernerová, Václava Kettnerová and Marketa Lopatkova |
Tools for Arabic Natural Language Processing: a Case Study in Qalqalah Prosody |
Claire Brierley, Majdi Sawalha and Eric Atwell |
Toward a Unifying Model for Opinion, Sentiment and Emotion Information Extraction |
Amel Fraisse and Patrick Paroubek |
Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar |
Dan Flickinger, Emily M. Bender and Stephan Oepen |
Towards an Environment for the Production and the Validation of Lexical Semantic Resources |
Mikaël Morardo and Eric De La Clergerie |
Towards an Integration of Syntactic and Temporal Annotations in Estonian |
Siim Orasmaa |
Towards Automatic Detection of Narrative Structure |
Jessica Ouyang and Kathy McKeown |
Towards Automatic Quality Assessment of Component Metadata |
Thorsten Trippel, Daan Broeder, Matej Durco and Oddrun Ohren |
Towards Automatic Transformation Between Different Transcription Conventions: Prediction of Intonation Markers from Linguistic and Acoustic Features |
Yuichi Ishimoto, Tomoyuki Tsuchiya, Hanae Koiso and Yasuharu Den |
Towards Building a Kashmiri Treebank: Setting up the Annotation Pipeline |
Riyaz Ahmad Bhat, Shahid Musjtaq Bhat and Dipti Misra Sharma |
Towards Electronic SMS Dictionary Construction: An Alignment-based Approach |
Cédric Lopez, Reda Bestandji, Mathieu Roche and Rachel Panckhurst |
Towards Interoperable Discourse Annotation. Discourse Features in the Ontologies of Linguistic Annotation |
Christian Chiarcos |
Towards Linked Hypernyms Dataset 2.0: Complementing DBpedia with Hypernym Discovery |
Tomáš Kliegr and Ondřej Zamazal |
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System |
Sakriani Sakti, Keigo Kubo, Sho Matsumiya, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Fumihiro Adachi and Ryosuke Isotani |
Towards Shared Datasets for Normalization Research |
Orphee De Clercq, Sarah Schulz, Bart Desmet and Veronique Hoste |
Transfer Learning of Feedback Head Expressions in Danish and Polish Comparable Multimodal Corpora |
Costanza Navarretta and Magdalena Lis |
Translation Errors from English to Portuguese: an Annotated Corpus |
Angela Costa, Tiago Luís and Luísa Coheur |
Transliteration and Alignment of Parallel Texts from Cyrillic to Latin |
Petic Mircea and Daniela Gîfu |
Treelet Probabilities for HPSG Parsing and Error Correction |
Angelina Ivanova and Gertjan van Noord |
TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation |
Matus Pleva and Jozef Juhar |
Turkish Resources for Visual Word Recognition |
Begum Erten, Cem Bozsahin and Deniz Zeyrek |
Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing |
Ozlem Cetinoglu |
TVD: A Reproducible and Multiply Aligned TV Series Dataset |
Anindya Roy, Camille Guinaudeau, Herve Bredin and Claude Barras |
TweetCaT: a Tool for Building Twitter Corpora of Smaller Languages |
Nikola Ljubešić, Darja Fišer and Tomaž Erjavec |
TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization |
Iñaki Alegria, Nora Aranberri, Pere Comas, Victor Fresno, Pablo Gamallo, Lluís Padró, Iñaki San Vicente, Jordi Turmo and Arkaitz Zubiaga |
Twente Debate Corpus ― A Multimodal Corpus for Head Movement Analysis |
Bayu Rahayudi, Ronald Poppe and Dirk Heylen |
Two Approaches to Metaphor Detection |
Brian MacWhinney and Davida Fromm |
Two-Step Machine Translation with Lattices |
Bushra Jawaid and Ondrej Bojar |
U |
|
UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation |
Liang Tian, Derek F. Wong, Lidia S. Chao, Paulo Quaresma, Francisco Oliveira and Lu Yi |
Universal Stanford Dependencies: a Cross-Linguistic Typology |
Marie-Catherine de Marneffe, Timothy Dozat, Natalia Silveira, Katri Haverinen, Filip Ginter, Joakim Nivre and Christopher D. Manning |
UnixMan Corpus: A Resource for Language Learning in the Unix Domain |
Kyle Richardson and Jonas Kuhn |
Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora using WebMAUS |
Jan Strunk, Florian Schiel and Frank Seifart |
Use of Unsupervised Word Classes for Entity Recognition: Application to the Detection of Disorders in Clinical Reports |
Maria Evangelia Chatzimina, Cyril Grouin and Pierre Zweigenbaum |
Using a Machine Learning Model to Assess the Complexity of Stress Systems |
Liviu Dinu, Alina Maria Ciobanu, Ioana Chitoran and Vlad Niculae |
Using a Serious Game to Collect a Child Learner Speech Corpus |
Claudia Baur, Manny Rayner and Nikos Tsourakis |
Using a Sledgehammer to Crack a Nut? Lexical Diversity and Event Coreference Resolution |
Agata Cybulska and Piek Vossen |
Using Audio Books for Training a Text-to-Speech System |
Aimilios Chalamandaris, Pirros Tsiakoulis, Sotiris Karabetsos and Spyros Raptis |
Using C5.0 and Exhaustive Search for Boosting Frame-Semantic Parsing Accuracy |
Guntis Barzdins, Didzis Gosko, Laura Rituma and Peteris Paikens |
Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction |
Tilia Ellendorff, Fabio Rinaldi and Simon Clematide |
Using Resource-Rich Languages to Improve Morphological Analysis of Under-Resourced Languages |
Peter Baumann and Janet Pierrehumbert |
Using Stem-Templates to Improve Arabic POS and Gender/Number Tagging |
Kareem Darwish, Ahmed Abdelali and Hamdy Mubarak |
Using TEI, CMDI and ISOcat in CLARIN-DK |
Dorte Haltrup Hansen, Lene Offersgaard and Sussi Olsen |
Using Transfer Learning to Assist Exploratory Corpus Annotation |
Paul Felt, Eric Ringger, Kevin Seppi and Kristian Heal |
Using Word Familiarities and Word Associations to Measure Corpus Representativeness |
Reinhard Rapp |
Utilizing Constituent Structure for Compound Analysis |
Kristín Bjarnadóttir and Jón Daðason |
V |
|
Valency and Word Order in Czech ― A Corpus Probe |
Katerina Rysova and Jiří Mírovský |
Validation Issues induced by an Automatic Pre-Annotation Mechanism in the Building of Non-projective Dependency Treebanks |
Ophélie Lacroix and Denis Béchet |
VarClass: An Open-source Language Identification Tool for Language Varieties |
Marcos Zampieri and Binyam Gebre |
Variations on Quantitative Comparability Measures and their Evaluations on Synthetic French-English Comparable Corpora |
Guiyao Ke, Pierre-Francois Marteau and Gildas Menier |
Verbs of Saying with a Textual Connecting Function in the Prague Discourse Treebank |
Magdalena Rysova |
VERTa: Facing a Multilingual Experience of a Linguistically-based MT Evaluation |
Elisabet Comelles, Jordi Atserias, Victoria Arranz, Irene Castellon and Jordi Sesé |
Visualization of Language Relations and Families: MultiTree |
Damir Cavar and Malgorzata Cavar |
VOAR: A Visual and Integrated Ontology Alignment Environment |
Bernardo Severo, Cassia Trojahn and Renata Vieira |
Vocabulary-Based Language Similarity using Web Corpora |
Dirk Goldhahn and Uwe Quasthoff |
VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments |
Ana Aguiar, Mariana Kaiseler, Hugo Meinedo, Pedro Almeida, Mariana Cunha and Jorge Silva |
VOLIP: a Corpus of Spoken Italian and a Virtuous Example of Reuse of Linguistic Resources |
Iolanda Alfano, Francesco Cutugno, Aurelio De Rosa, Claudio Iacobini, Renata Savy and Miriam Voghera |
Votter Corpus: A Corpus of Social Polling Language |
Nathan Green and Septina Dian Larasati |
Vulnerability in Acquisition, Language Impairments in Dutch: Creating a VALID Data Archive |
Jetske Klatter, Roeland Van Hout, Henk van den Heuvel, Paula Fikkert, Anne Baker, Jan De Jong, Frank Wijnen, Eric Sanders and Paul Trilsbeek |
W |
|
Walenty: Towards a Comprehensive Valence Dictionary of Polish |
Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, Marcin Woliński, Filip Skwarski and Marek Świdziński |
Web-imageability of the Behavioral Features of Basic-level Concepts |
Yoshihiko Hayashi |
When POS Data Sets Don't Add Up: Combatting Sample Bias |
Dirk Hovy, Barbara Plank and Anders Søgaard |
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control |
Mitesh M. Khapra, Ananthakrishnan Ramanathan, Anoop Kunchukuttan, Karthik Visweswariah and Pushpak Bhattacharyya |
Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis. |
Diana Maynard and Mark Greenwood |
Why Chinese Web-as-Corpus is Wacky? Or: How Big Data is Killing Chinese Corpus Linguistics |
Shu-Kai Hsieh |
Word Alignment-Based Reordering of Source Chunks in PB-SMT |
Santanu Pal, Sudip Kumar Naskar and Sivaji Bandyopadhyay |
Word Semantic Similarity for Morphologically Rich Languages |
Kalliopi Zervanou, Elias Iosif and Alexandros Potamianos |
Word-Formation Network for Czech |
Magda Sevcikova and Zdenek Zabokrtsky |
WordNet―Wikipedia―Wiktionary: Construction of a Three-way Alignment |
Tristan Miller and Iryna Gurevych |
|
|