PAPERS: Browse articles of the conference sorted by title

3 - A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - R - S - T - U - V - W - Y

3  
3rd party observer gaze as a continuous measure of dialogue flow Jens Edlund, Simon Alexandersson, Jonas Beskow, Lisa Gustavsson, Mattias Heldner, Anna Hjalmarsson, Petter Kallionen and Ellen Marklund

 

A  
A Basic Language Resource Kit for Persian Mojgan Seraji, Beáta Megyesi and Joakim Nivre
Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies Thierry Declerck, Karlheinz Mörth and Piroska Lendvai
A Classification of Adjectives for Polarity Lexicons Enhancement Silvia Vázquez and Núria Bel
A Comparative Evaluation of Word Sense Disambiguation Algorithms for German Verena Henrich and Erhard Hinrichs
A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation Anil Kumar Singh
A contrastive review of paraphrase acquisition techniques Houda Bouamor, Aurélien Max, Gabriel Illouz and Anne Vilnat
A Corpus-based Study of the German Recipient Passive Patrick Ziering, Sina Zarrieß and Jonas Kuhn
A Corpus for a Gesture-Controlled Mobile Spoken Dialogue System Nikos Tsourakis and Manny Rayner
A Corpus for Research on Deliberation and Debate Marilyn Walker, Jean Fox Tree, Pranav Anand, Rob Abbott and Joseph King
A corpus of general and specific sentences from news Annie Louis and Ani Nenkova
A Corpus of Scientific Biomedical Texts Spanning over 168 Years Annotated for Uncertainty Ramona Bongelli, Carla Canestrari, Ilaria Riccioni, Andrzej Zuczkowski, Cinzia Buldorini, Ricardo Pietrobon, Alberto Lavelli and Bernardo Magnini
A Corpus of Spontaneous Multi-party Conversation in Bosnian Serbo-Croatian and British English Emina Kurtic, Bill Wells, Guy J. Brown, Timothy Kempton and Ahmet Aker
Acquisition of Syntactic Simplification Rules for French Violeta Seretan
A Cross-Lingual Dictionary for English Wikipedia Concepts Valentin I. Spitkovsky and Angel X. Chang
A Curated Database for Linguistic Research: The Test Case of Cimbrian Varieties Maristella Agosti, Birgit Alber, Giorgio Maria Di Nunzio, Marco Dussin, Stefan Rabanus and Alessandra Tomaselli
Adapting and evaluating a generic term extraction tool Anita Gojun, Ulrich Heid, Bernd Weißbach, Carola Loth and Insa Mingers
Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora Amir Hazem and Emmanuel Morin
Adaptive Speech Understanding for Intuitive Model-based Spoken Dialogues Tobias Heinroth, Maximilian Grotz, Florian Nothdurft and Wolfgang Minker
A data and analysis resource for an experiment in text mining a collection of micro-blogs on a political topic. William Black, Rob Procter, Steven Gray and Sophia Ananiadou
A Database of Attribution Relations Silvia Pareti
A database of semantic clusters of verb usages Silvie Cinková, Martin Holub, Adam Rambousek and Lenka Smejkalová
Adding Morpho-semantic Relations to the Romanian Wordnet Verginica Barbu Mititelu
Addressing polysemy in bilingual lexicon extraction from comparable corpora Darja Fišer, Nikola Ljubešić and Ozren Kubelka
A disambiguation resource extracted from Wikipedia for semantic annotation Eric Charton and Michel Gagnon
A Distributed Resource Repository for Cloud-Based Machine Translation Jörg Tiedemann, Dorte Haltrup Hansen, Lene Offersgaard, Sussi Olsen and Matthias Zumpe
A Fast, Memory Efficient, Scalable and Multilingual Dictionary Retriever Paulo Fernandes, Lucelene Lopes, Carlos A. Prolo, Afonso Sales and Renata Vieira
Affective Common Sense Knowledge Acquisition for Sentiment Analysis Erik Cambria, Yunqing Xia and Amir Hussain
A finite-state morphological transducer for Kyrgyz Jonathan Washington, Mirlan Ipasov and Francis Tyers
A Framework for Evaluating Text Correction Robert Dale and George Narroway
A Framework for Spelling Correction in Persian Language Using Noisy Channel Model Mohammad Hoseyn Sheykholeslam, Behrouz Minaei-Bidgoli and Hossein Juzi
A French Fairy Tale Corpus syntactically and semantically annotated Ismaïl El Maarouf and Jeanne Villaneau
A Galician Syntactic Corpus with Application to Intonation Modeling Montserrat Arza, José M. García-Miguel, Francisco Campillo and Miguel Cuevas - Alonso
A generic formalism to represent linguistic corpora in RDF and OWL/DL Christian Chiarcos
A Gold Standard for Relation Extraction in the Food Domain Michael Wiegand, Benjamin Roth, Eva Lasarcyk, Stephanie Köser and Dietrich Klakow
A good space: Lexical predictors in word space evaluation Christian Smith, Henrik Danielsson and Arne Jönsson
A Grammar-informed Corpus-based Sentence Database for Linguistic and Computational Studies Hongzhi Xu, Helen Kaiyun Chen, Chu-Ren Huang, Qin Lu, Dingxu Shi and Tin-Shing Chiu
A Graphical Citation Browser for the ACL Anthology Benjamin Weitz and Ulrich Schäfer
A GUI to Detect and Correct Errors in Hindi Dependency Treebank Rahul Agarwal, Bharat Ram Ambati and Anil Kumar Singh
A hierarchical approach with feature selection for emotion recognition from speech Panagiotis Giannoulis and Gerasimos Potamianos
A High-Quality Web Corpus of Czech Johanka Spoustová and Miroslav Spousta
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora Mahdi Khademian, Kaveh Taghipour, Saab Mansour and Shahram Khadivi
A large scale annotated child language construction database Aline Villavicencio, Beracah Yankama, Marco Idiart and Robert Berwick
Aleda, a free large-scale entity database for French Benoît Sagot and Rosa Stern
A light way to collect comparable corpora from the Web Ahmet Aker, Evangelos Kanoulas and Robert Gaizauskas
Alignment-based reordering for SMT Maria Holmqvist, Sara Stymne, Lars Ahrenberg and Magnus Merkel
Alternative Lexicalizations of Discourse Connectives in Czech Magdalena Rysova
A Mandarin-English Code-Switching Corpus Ying Li, Yue Yu and Pascale Fung
A Metadata Editor to Support the Description of Linguistic Resources Emanuel Dima, Christina Hoppermann, Erhard Hinrichs, Thorsten Trippel and Claus Zinn
A methodology for the extraction of information about the usage of formulaic expressions in scientific texts Hannah Kermes
A Morphological Analyzer For Wolof Using Finite-State Techniques Cheikh M. Bamba Dione
A Multilingual Natural Stress Emotion Database Xin Zuo, Tian Li and Pascale Fung
An Adaptive Framework for Named Entity Combination Bogdan Sacaleanu and Günter Neumann
ANALEC: a New Tool for the Dynamic Annotation of Textual Data Frederic Landragin, Thierry Poibeau and Bernard Victorri
Analyzing and Aligning German compound nouns Marion Weller and Ulrich Heid
Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign Karën Fort, Claire François, Olivier Galibert and Maha Ghribi
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output Daniele Pighin, Lluís Màrquez and Jonathan May
An Analytical Model of Language Resource Sustainability Khalid Choukri and Victoria Arranz
An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style Marilyn Walker, Grace Lin and Jennifer Sawyer
An Annotation Scheme for Quantifier Scope Disambiguation Mehdi Manshadi, James Allen and Mary Swift
An audiovisual political speech analysis incorporating eye-tracking and perception data Stefan Scherer, Georg Layher, John Kane, Heiko Neumann and Nick Campbell
An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus Stergos Afantenos, Nicholas Asher, Farah Benamara, Myriam Bras, Cecile Fabre, Mai Ho-Dac, Anne Le Draoulec, Philippe Muller, Marie-Paul Pery-Woodley, Laurent Prevot, Josette Rebeyrolles, Ludovic Tanguy, Marianne Vergez-Couret and Laure Vieu
An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora K Saravanan, Monojit Choudhury, Raghavendra Udupa and A Kumaran
An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT Ângela Costa, Tiago Luís, Joana Ribeiro, Ana Cristina Mendes and Luísa Coheur
An Evaluation of the Effect of Automatic Preprocessing on Syntactic Parsing for Biomedical Relation Extraction Md. Faisal Mahbub Chowdhury and Alberto Lavelli
A new dynamic approach for lexical networks evaluation Alain Joubert and Mathieu Lafourcade
A New Method for Evaluating Automatically Learned Terminological Taxonomies Paola Velardi, Roberto Navigli, Stefano Faralli and Juana Maria Ruiz-Martinez
A new semantically annotated corpus with syntactic-semantic and cross-lingual senses Myriam Rakho, Éric Laporte and Matthieu Constant
A New Twitter Verb Lexicon for Natural Language Processing Jennifer Williams and Graham Katz
An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use Mohammad Fazleh Elahi and Paola Monachesi
An implementation of a Latvian resource grammar in Grammatical Framework Peteris Paikens and Normunds Gruzitis
AnIta: a powerful morphological analyser for Italian Fabio Tamburini and Matias Melandri
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) James Clarke, Vivek Srikumar, Mark Sammons and Dan Roth
Annotated Bibliographical Reference Corpora in Digital Humanities Young-Min Kim, Patrice Bellot, Elodie Faath and Marin Dacos
Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner Tsuyoshi Okita
Annotating a corpus of human interaction with prosodic profiles ― focusing on Mandarin repair/disfluency Helen Kaiyun Chen
Annotating Agreement and Disagreement in Threaded Discussion Jacob Andreas, Sara Rosenthal and Kathleen McKeown
Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic Emad Mohamed, Behrang Mohit and Kemal Oflazer
Annotating dropped pronouns in Chinese newswire text Elizabeth Baran, Yaqin Yang and Nianwen Xue
Annotating Errors in a Hungarian Learner Corpus Markus Dickinson and Scott Ledbetter
Annotating Factive Verbs Alvin Grissom II and Yusuke Miyao
Annotating Football Matches: Influence of the Source Medium on Manual Annotation Karën Fort and Vincent Claveau
Annotating Near-Identity from Coreference Disagreements Marta Recasens, M. Antònia Martí and Constantin Orasan
Annotating Opinions in German Political News Hong Li, Xiwen Cheng, Kristina Adson, Tal Kirshboim and Feiyu Xu
Annotating progressive aspect constructions in the spoken section of the British National Corpus Andrew Caines and Paula Buttery
Annotating Qualia Relations in Italian and French Complex Nominals Pierrette Bouillon, Elisabetta Jezek, Chiara Melloni and Aurélie Picton
Annotating Spatial Containment Relations Between Events Kirk Roberts, Travis Goodwin and Sanda M. Harabagiu
Annotating Story Timelines as Temporal Dependency Structures Steven Bethard, Oleksandr Kolomiyets and Marie-Francine Moens
Annotation Facilities for the Reliable Analysis of Human Motion Michael Kipp
Annotation of anaphoric relations and topic continuity in Japanese conversation Natsuko Nakagawa and Yasuharu Den
Annotation of response tokens and their triggering expressions in Japanese multi-party conversations Yasuharu Den, Hanae Koiso, Katsuya Takanashi and Nao Yoshida
Annotations for Power Relations on Email Threads Vinodkumar Prabhakaran, Huzaifa Neralwala, Owen Rambow and Mona Diab
Annotation Trees: LDC's customizable, extensible, scalable, annotation infrastructure Jonathan Wright, Kira Griffitt, Joe Ellis, Stephanie Strassel and Brendan Callahan
Announcing Prague Czech-English Dependency Treebank 2.0 Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová and Zdeněk Žabokrtský
An ontological approach to model and query multimodal concurrent linguistic annotations Julien Seinturier, Elisabeth Murisasco, Emmanuel Bruno and Philippe Blache
An Open Source Persian Computational Grammar Shafqat Mumtaz Virk and Elnaz Abolahrar
An Oral History Annotation Tool for INTER-VIEWs Henk van den Heuvel, Eric Sanders, Robin Rutten, Stef Scagliola and Paula Witkamp
A Parallel Corpus of Music and Lyrics Annotated with Emotions Carlo Strapparava, Rada Mihalcea and Alberto Battocchi
A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System Alexander Schmitt, Stefan Ultes and Wolfgang Minker
A Phonemic Corpus of Polish Child-Directed Speech Luc Boruta and Justyna Jastrzebska
A platform-independent user-friendly dictionary from Italian to LIS Umar Shoaib, Nadeem Ahmad, Paolo Prinetto and Gabriele Tiotto
A Portuguese-Spanish Corpus Annotated for Subject Realization and Referentiality Luz Rello and Iria Gayo
Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation Maria Teresa Pazienza, Noemi Scarpato and Armando Stellato
Applying cross-lingual WSD to wordnet development Marianna Apidianaki and Benoît Sagot
Applying Random Indexing to Structured Data to Find Contextually Similar Words Danica Damljanovic, Udo Kruschwitz, M-Dyaa Albakour, Johann Petrak and Mihai Lupu
A PropBank for Portuguese: the CINTIL-PropBank António Branco, Catarina Carvalheiro, Sílvia Pereira, Sara Silveira, João Silva, Sérgio Castro and João Graça
A proposal for improving WordNet Domains Aitor Gonzalez-Agirre, Mauro Castillo and German Rigau
Arabic-Segmentation Combination Strategies for Statistical Machine Translation Saab Mansour and Hermann Ney
Arabic Word Generation and Modelling for Spell Checking Khaled Shaalan, Mohammed Attia, Pavel Pecina, Younes Samih and Josef van Genabith
A Reference Dependency Bank for Analyzing Complex Predicates Tafseer Ahmed, Miriam Butt, Annette Hautli and Sebastian Sulger
A Repository for the Sustainable Management of Research Data Emanuel Dima, Verena Henrich, Erhard Hinrichs, Marie Hinrichs, Christina Hoppermann, Thorsten Trippel, Thomas Zastrow and Claus Zinn
A Repository of Data and Evaluation Resources for Natural Language Generation Anja Belz and Albert Gatt
A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures Sarah Bourse and Patrick Saint-Dizier
A Resource-light Approach to Phrase Extraction for English and German Documents from the Patent Domain and User Generated Content Julia Maria Schulz, Daniela Becks, Christa Womser-Hacker and Thomas Mandl
A review corpus annotated for negation, speculation and their scope Natalia Konstantinova, Sheila C.M. de Sousa, Noa P. Cruz, Manuel J. Maña, Maite Taboada and Ruslan Mitkov
A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation Eleftherios Avramidis, Marta R. Costa-Jussà, Christian Federmann, Josef van Genabith, Maite Melero and Pavel Pecina
A Rough Set Formalization of Quantitative Evaluation with Ambiguity Patrick Paroubek and Xavier Tannier
A Rule-based Morphological Analyzer for Murrinh-Patha Melanie Seiss
A Scalable Architecture For Web Deployment of Spoken Dialogue Systems Matthew Fuchs, Nikos Tsourakis and Manny Rayner
A Search Tool for FrameNet Constructicon Hiroaki Sato
Aspects of a Legal Framework for Language Resource Management Aditi Sharma Grover, Annamart Nieman, Gerhard Van Huyssteen and Justus Roux
A Speech and Gesture Spatial Corpus in Assisted Living Dimitra Anastasiou
Assessing Crowdsourcing Quality through Objective Tasks Ahmet Aker, Mahmoud El-Haj, M-Dyaa Albakour and Udo Kruschwitz
Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System Claire Jaja, Douglas Briesch, Jamal Laoudi and Clare Voss
Assessing the Comparability of News Texts Emma Barker and Robert Gaizauskas
Assigning Connotation Values to Events Tommaso Caselli, Irene Russo and Francesco Rubino
Association Norms of German Noun Compounds Sabine Schulte im Walde, Susanne Borgwaldt and Ronny Jauch
Associative and Semantic Features Extracted From Web-Harvested Corpora Elias Iosif, Maria Giannoudaki, Eric Fosler-Lussier and Alexandros Potamianos
A Study of Word-Classing for MT Reordering Ananthakrishnan Ramanathan and Karthik Visweswariah
A Survey of Text Mining Architectures and the UIMA Standard Mathias Bank and Martin Schierle
ATLIS: Identifying Locational Information in Text Automatically John Vogel, Marc Verhagen and James Pustejovsky
A Tool/Database Interface for Multi-Level Analyses Kurt Eberle, Kerstin Eckart, Ulrich Heid and Boris Haselbach
A tool for enhanced search of multilingual digital libraries of e-journals Ranka Stanković, Cvetana Krstev, Ivan Obradović, Aleksandra Trtovac and Miloš Utvić
A Tool for Extracting Conversational Implicatures Marta Tatu and Dan Moldovan
A topologic view of Topic and Focus marking in Italian Gloria Gagliardi, Edoardo Lombardi Vallauri and Fabio Tamburini
A treebank-based study on the influence of Italian word order on parsing performance Anita Alicante, Cristina Bosco, Anna Corazza and Alberto Lavelli
A Treebank-driven Creation of an OntoValence Verb lexicon for Bulgarian Petya Osenova, Kiril Simov, Laska Laskova and Stanislava Kancheva
A tree is a Baum is an árbol is a sach'a: Creating a trilingual treebank Annette Rios and Anne Göhring
A Universal Part-of-Speech Tagset Slav Petrov, Dipanjan Das and Ryan McDonald
Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing Ziqi Zhang, Philip Webster, Victoria Uren, Andrea Varga and Fabio Ciravegna
Automatically Generated Online Dictionaries Enikő Héja and Dávid Takács
Automatic Annotation and Manual Evaluation of the Diachronic German Corpus TüBa-D/DC Erhard Hinrichs and Thomas Zastrow
Automatic annotation of head velocity and acceleration in Anvil Bart Jongejan
Automatic classification of German """"an"""" particle verbs Sylvia Springorum, Sabine Schulte im Walde and Antje Roßdeutscher
Automatic Extraction and Evaluation of Arabic LFG Resources Mohammed Attia, Khaled Shaalan, Lamia Tounsi and Josef van Genabith
Automatic lexical semantic classification of nouns Núria Bel, Lauren Romeo and Muntsa Padró
Automatic MT Error Analysis: Hjerson Helping Addicter Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović and Daniel Zeman
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel Daniel Stein and Bela Usabaev
Automatic Term Recognition Needs Multiple Evidence Natalia Loukachevitch
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama and Akiko Aizawa
Automatic Translation of Scientific Documents in the HAL Archive Lambert Patrik, Holger Schwenk and Frédéric Blain
Automatic word alignment tools to scale production of manually aligned parallel texts Stephen Grimes, Katherine Peterson and Xuansong Li
AVATecH ― automated annotation through audio and video analysis Przemyslaw Lenkiewicz, Binyam Gebrekidan Gebre, Oliver Schreer, Stefano Masneri, Daniel Schneider and Sebastian Tschöpel
A voting scheme to detect semantic underspecification Héctor Martínez Alonso, Núria Bel and Bolette Sandford Pedersen
AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis Muhammad Abdul-Mageed and Mona Diab

 

B  
Balanced data repository of spontaneous spoken Czech Lucie Válková, Martina Waclawičová and Michal Křen
Beyond SoNaR: towards the facilitation of large corpus building efforts Martin Reynaert, Ineke Schuurman, Veronique Hoste, Nelleke Oostdijk and Maarten van Gompel
BiBiKit - A Bilingual Bimodal Reading and Writing Tool for Sign Language Users Nedelina Ivanova and Olle Eriksen
Biomedical Chinese-English CLIR Using an Extended CMeSH Resource to Expand Queries Xinkai Wang, Paul Thompson, Jun'ichi Tsujii and Sophia Ananiadou
BLEU Evaluation of Machine-Translated English-Croatian Legislation Sanja Seljan, Marija Brkić and Tomislav Vičić
Body-conductive acoustic sensors in human-robot communication Panikos Heracleous, Carlos Ishi, Takahiro Miyashita and Norihiro Hagita
Boosting statistical tagger accuracy with simple rule-based grammars Mans Hulden and Jerid Francom
Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations Kata Gábor, Marianna Apidianaki, Benoît Sagot and Éric Villemonte de la Clergerie
Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank Christian Scheible and Hinrich Schütze
Brand Pitt: A Corpus to Explore the Art of Naming Gozde Ozbal, Carlo Strapparava and Marco Guerini
BUCEADOR, a multi-language search engine for digital libraries Jordi Adell, Antonio Bonafonte, Antonio Cardenal, Marta R. Costa-Jussà, José A. R. Fonollosa, Asunción Moreno, Eva Navas and Eduardo R. Banga
Buildind a Resource of Patterns Using Semantic Types Octavian Popescu
Building a 70 billion word corpus of English from ClueWeb Jan Pomikálek, Miloš Jakubíček and Pavel Rychlý
Building a Basque-Chinese Dictionary by Using English as Pivot Xabier Saralegi, Iker Manterola and Iñaki San Vicente
Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions Maria Aloni, Andreas van Cranenburgh, Raquel Fernandez and Marta Sznajder
Building a database of French frozen adverbial phrases Aude Grezka and Céline Poudat
Building a fine-grained subjectivity lexicon from a web corpus Isa Maks and Piek Vossen
Building a learner corpus Jirka Hana, Alexandr Rosen, Barbora Štindlová and Petr Jäger
Building a multilingual parallel corpus for human users Alexandr Rosen and Martin Vavřín
Building a Multimodal Laughter Database for Emotion Recognition Merlin Teodosia Suarez, Jocelynn Cu and Madelene Sta. Maria
Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents Lina M. Rojas-Barahona, Alejandra Lorenzo and Claire Gardent
Building and Exploring Semantic Equivalences Resources Gracinda Carvalho, David Martins de Matos and Vitor Rocio
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis Dietmar Schabus, Michael Pucher and Gregor Hofer
Building Japanese Predicate-argument Structure Corpus using Lexical Conceptual Structure Yuichiroh Matsubayashi, Yusuke Miyao and Akiko Aizawa
Building Large Corpora from the Web Using a New Efficient Tool Chain Roland Schäfer and Felix Bildhauer
Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages Dirk Goldhahn, Thomas Eckart and Uwe Quasthoff
Building Synthetic Voices in the META-NET Framework Emília Garcia Casademont, Antonio Bonafonte and Asunción Moreno
Building Text-to-Speech Systems for Resource Poor Languages Nur-Hana Samsudin and Mark Lee
Building Text-To-Speech Voices in the Cloud Alistair Conkie, Thomas Okken, Yeon-Jun Kim and Giuseppe Di Fabbrizio
Bulgarian X-language Parallel Corpus Svetla Koeva, Ivelina Stoyanova, Rositsa Dekova, Borislav Rizov and Angel Genov

 

C  
CALBC: Releasing the Final Corpora Şenay Kafkas, Ian Lewin, David Milward, Erik van Mulligen, Jan Kors, Udo Hahn and Dietrich Rebholz-Schuhmann
Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine? Marianna J. Martindale
Capturing syntactico-semantic regularities among terms: An application of the FrameNet methodology to terminology Marie-Claude L'Homme and Janine Pimentel
CAT: the CELCT Annotation Tool Valentina Bartalesi Lenzi, Giovanni Moretti and Rachele Sprugnoli
Causal analysis of task completion errors in spoken music retrieval interactions Sunao Hara, Norihide Kitaoka and Kazuya Takeda
Centroids: Gold standards with distributional variation Ian Lewin, Şenay Kafkas and Dietrich Rebholz-Schuhmann
Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi Ritesh Kumar
Challenges in the Knowledge Base Population Slot Filling Task Bonan Min and Ralph Grishman
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi
Chinese Whispers: Cooperative Paraphrase Acquisition Matteo Negri, Yashar Mehdad, Alessandro Marchetti, Danilo Giampiccolo and Luisa Bentivogli
Citing on-line Language Resources Daan Broeder, Dieter van Uytvanck and Gunter Senft
Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types Yoshihiko Hayashi and Chiharu Narawa
Clause-based Discourse Segmentation of Arabic Texts Iskandar Keskes, Farah Benamara and Lamia Hadrich Belguith
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations Irina Temnikova, Constantin Orasan and Ruslan Mitkov
Cleaning noisy wordnets Benoît Sagot and Darja Fišer
CLIMB grammars: three projects using metagrammar engineering Antske Fokkens, Tania Avgustinova and Yi Zhang
Cloud Logic Programming for Integrating Language Technology Resources Markus Forsberg and Torbjörn Lager
CLTC: A Chinese-English Cross-lingual Topic Corpus Yunqing Xia, Guoyu Tang, Peng Jin and Xia Yang
CoALT: A Software for Comparing Automatic Labelling Tools Dominique Fohr and Odile Mella
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench Rafal Rak, Andrew Rowley and Sophia Ananiadou
Collaborative semantic editing of linked data lexica John McCrae, Elena Montiel-Ponsoda and Philipp Cimiano
Collecting and Analysing Chats and Tweets in SoNaR Eric Sanders
Collecting and Using Comparable Corpora for Statistical Machine Translation Inguna Skadiņa, Ahmet Aker, Nikos Mastropavlos, Fangzhong Su, Dan Tufiș, Mateja Verlic, Andrejs Vasiļjevs, Bogdan Babych, Paul Clough, Robert Gaizauskas, Nikos Glaros, Monica Lestari Paramita and Mārcis Pinnis
Collecting humorous expressions from a community-based question-answering-service corpus Masashi Inoue and Toshiki Akagi
Collection of a corpus of Dutch SMS Maaske Treurniet, Orphée De Clercq, Henk van den Heuvel and Nelleke Oostdijk
Collection of a Large Database of French-English SMT Output Corrections Marion Potet, Emmanuelle Esperança-Rodier, Laurent Besacier and Hervé Blanchon
Combining Formal Concept Analysis and semantic information for building ontological structures from texts : an exploratory study Silvia Moraes and Vera Lima
Combining Language Resources Into A Grammar-Driven Swedish Parser Malin Ahlberg and Ramona Enache
Comparing computer vision analysis of signed language video with motion capture recordings Matti Karppa, Tommi Jantunen, Ville Viitaniemi, Jorma Laaksonen, Birgitta Burger and Danny De Weerdt
Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora Nelly Barbot, Olivier Boeffard and Arnaud Delhay
Comparison between two models of language for the automatic phonetic labeling of an undocumented language of the South-Asia: the case of Mo Piu Geneviève Caelen-Haumont and Sethserey Sam
ConanDoyle-neg: Annotation of negation cues and their scope in Conan Doyle stories Roser Morante and Walter Daelemans
Concept-based Selectional Preferences and Distributional Representations from Wikipedia Articles Alex Judea, Vivi Nastase and Michael Strube
Constraint Based Description of Polish Multiword Expressions Roman Kurc, Maciej Piasecki and Bartosz Broda
Constructing a Class-Based Lexical Dictionary using Interactive Topic Models Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura and Yoshihiro Matsuo
Constructing a Question Corpus for Textual Semantic Relations Rui Wang and Shuguang Li
Constructing Large Proposition Databases Peter Exner and Pierre Nugues
Construction of the Turkish National Corpus (TNC) Yeşim Aksan, Mustafa Aksan, Ahmet Koltuksuz, Taner Sezer, Ümit Mersinli, Umut Ufuk Demirhan, Hakan Yılmazer, Gülsüm Atasoy, Seda Öz, İpek Yıldız and Özlem Kurtoğlu
Constructive Interaction for Talking about Interesting Topics Kristiina Jokinen and Graham Wilcock
Conventional Orthography for Dialectal Arabic Nizar Habash, Mona Diab and Owen Rambow
Coreference in Spoken vs. Written Texts: a Corpus-based Analysis Marilisa Amoia, Kerstin Kunz and Ekaterina Lapshinova-Koltunski
Corpus Annotation as a Scientific Task Donia Scott, Rossano Barone and Rob Koeling
Corpus-based Referring Expressions Generation Hilder Pereira, Eder Novais, Andre Mariotti and Ivandre Paraboni
Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations Somayeh Bagherbeygi and Mehrnoush Shamsfard
Corpus of Children Voices for Mid-level Markers and Affect Bursts Analysis Marie Tahon, Agnes Delaborde and Laurence Devillers
Corpus+WordNet thesaurus generation for ontology enriching Fernando Castilho, Roger Granada, Breno Meneghetti, Leonardo Carvalho and Renata Vieira
Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles Monica Lestari Paramita, Paul Clough, Ahmet Aker and Robert Gaizauskas
Cost and Benefit of Using WordNet Senses for Sentiment Analysis Balamuraliar, Aditya Joshi and Pushpak Bhattacharyya
Creating a Coreference Resolution System for Polish Mateusz Kopeć and Maciej Ogrodniczuk
Creating a Data Collection for Evaluating Rich Speech Retrieval Maria Eskevich, Gareth J.F. Jones, Martha Larson and Roeland Ordelman
Creating and Curating a Cross-Language Person-Entity Linking Collection Dawn Lawrie, James Mayfield, Paul McNamee and Douglas Oard
Creating HAVIC: Heterogeneous Audio Visual Internet Collection Stephanie Strassel, Amanda Morris, Jonathan Fiscus, Christopher Caruso, Haejoong Lee, Paul Over, James Fiumara, Barbara Shaw, Brian Antonishek and Martial Michel
Creation and use of Language Resources in a Question-Answering eHealth System Ulrich Andersen, Anna Braasch, Lina Henriksen, Csaba Huszka, Anders Johannsen, Lars Kayser, Bente Maegaard, Ole Norgaard, Stefan Schulz and Jürgen Wedekind
Creation of a bottom-up corpus-based ontology for Italian Linguistics Elisa Bianchi, Mirko Tavosanis and Emiliano Giovannetti
Creation of an Open Shared Language Resource Repository in the Nordic and Baltic Countries Andrejs Vasiljevs, Markus Forsberg, Tatiana Gornostay, Dorte Haltrup Hansen, Kristín Jóhannsdóttir, Gunn Lyse, Krister Lindén, Lene Offersgaard, Sussi Olsen, Bolette Pedersen, Eiríkur Rögnvaldsson, Inguna Skadiņa, Koenraad De Smedt, Ville Oksanen and Roberts Rozis
Croatian Dependency Treebank: Recent Development and Initial Experiments Dasa Berovic, Zeljko Agic and Marko Tadić
Cross-lingual studies of ASR errors: paradigms for perceptual evaluations Ioana Vasilescu, Martine Adda-Decker and Lori Lamel
Customizable SCF Acquisition in Italian Tommaso Caselli, Francesco Rubino, Francesca Frontini, Irene Russo and Valeria Quochi
Customization of the Europarl Corpus for Translation Studies Zahurul Islam and Alexander Mehler

 

D  
Dbnary: Wiktionary as a LMF based Multilingual RDF network Gilles Sérasset
DBpedia: A Multilingual Cross-domain Knowledge Base Pablo Mendes, Max Jakob and Christian Bizer
Dealing with unknown words in statistical machine translation João Silva, Luísa Coheur, Ângela Costa and Isabel Trancoso
DECODA: a call-centre human-human spoken conversation corpus Frederic Bechet, Benjamin Maza, Nicolas Bigouroux, Thierry Bazillon, Marc El-Beze, Renato De Mori and Eric Arbillot
DeCour: a corpus of DEceptive statements in Italian COURts Tommaso Fornaciari and Massimo Poesio
DEGELS1: A comparable corpus of French Sign Language and co-speech gestures Annelies Braffort and Leïla Boutora
Dependency parsing for interaction detection in pharmacogenomics Gerold Schneider, Fabio Rinaldi and Simon Clematide
Design and compilation of a specialized Spanish-German parallel corpus Carla Parra Escartín
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task Tomoyosi Akiba, Hiromitsu Nishizaki, Kiyoaki Aikawa, Tatsuya Kawahara and Tomoko Matsui
Designing a search interface for a Spanish learner spoken corpus: the end-user's evaluation Leonardo Campillos Llanos
Designing French Tale Corpora for Entertaining Text To Speech Synthesis David Doukhan, Sophie Rosset, Albert Rilliard, Christophe d'Alessandro and Martine Adda-Decker
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation Takafumi Suzuki, Yusuke Abe, Itsuki Toyota, Takehito Utsuro, Suguru Matsuyoshi and Masatoshi Tsuchiya
Detecting Reduplication in Videos of American Sign Language Zoya Gavrilov, Stan Sclaroff, Carol Neidle and Sven Dickinson
Detection of Peculiar Word Sense by Distance Metric Learning with Labeled Examples Minoru Sasaki and Hiroyuki Shinnou
Developing a large semantically annotated corpus Valerio Basile, Johan Bos, Kilian Evang and Noortje Venhuizen
Developing and evaluating an emergency scenario dialogue corpus Jolanta Bachan
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects David Graff and Mohamed Maamouri
Developing Partially-Transcribed Speech Corpus from Edited Transcriptions Kengo Ohta, Masatoshi Tsuchiya and Seiichi Nakagawa
Development and Application of a Cross-language Document Comparability Metric Fangzhong Su and Bogdan Babych
Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information Chi-Hsin Yu, Yi-jie Tang and Hsin-Hsi Chen
Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment Shyam Agrawal, Shweta Sinha, Pooja Singh and Jesper Olson
DGT-TM: A freely available Translation Memory in 22 languages Ralf Steinberger, Andreas Eisele, Szymon Klocek, Spyridon Pilos and Patrick Schlüter
Diachronic Changes in Text Complexity in 20th Century English Language: An NLP Approach Sanja Štajner and Ruslan Mitkov
Dictionary Look-up with Katakana Variant Recognition Satoshi Sato
Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns Andrei Popescu-Belis, Thomas Meyer, Jeevanthi Liyanapathirana, Bruno Cartoni and Sandrine Zufferey
Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation Els Lefever, Veronique Hoste and Martine De Cock
DISLOG: A logic-based language for processing discourse structures Patrick Saint-Dizier
Distractorless Authorship Verification John Noecker Jr and Michael Ryan
Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource Hideki Shima and Teruko Mitamura
Document Attrition in Web Corpora: an Exploration Stephen Wattam, Paul Rayson and Damon Berridge
Domain-specific vs. Uniform Modeling for Coreference Resolution Olga Uryupina and Massimo Poesio
DramaBank: Annotating Agency in Narrative Discourse David Elson
DSim, a Danish Parallel Corpus for Text Simplification Sigrid Klerke and Anders Søgaard
DutchSemCor: Targeting the ideal sense-tagged corpus Piek Vossen, Attila Görög, Rubén Izquierdo and Antal Van den Bosch
Dynamic web service deployment in a cloud environment Marc Kemps-Snijders, Matthijs Brouwer, Jan Pieter Kunst and Tom Visser
Dysarthric Speech Database for Development of QoLT Software Technology Dae-Lim Choi, Bong-Wan Kim, Yeon-Whoa Kim, Yong-Ju Lee, Yongnam Um and Minhwa Chung

 

E  
Effects of Document Clustering in Modeling Wikipedia-style Term Descriptions Atsushi Fujii, Yuya Fujii and Takenobu Tokunaga
Efficient Dependency Graph Matching with the IMS Open Corpus Workbench Thomas Proisl and Peter Uhrig
Effort of Genre Variation and Prediction of System Performance Dong Wang and Fei Xia
ELAN development, keeping pace with communities' needs Han Sloetjes and Aarthy Somasundaram
ELRA in the heart of a cooperative HLT world Valérie Mapelli, Victoria Arranz, Matthieu Carré, Hélène Mazo, Djamel Mostefa and Khalid Choukri
EmpaTweet: Annotating and Detecting Emotions on Twitter Kirk Roberts, Michael A. Roach, Joseph Johnson, Josh Guthrie and Sanda M. Harabagiu
Empirical Comparisons of MASC Word Sense Annotations Gerard de Melo, Collin F. Baker, Nancy Ide, Rebecca J. Passonneau and Christiane Fellbaum
Empty Argument Insertion in the Hindi PropBank Ashwini Vaidya, Jinho D. Choi, Martha Palmer and Bhuvana Narasimhan
English to Indonesian Transliteration to Support English Pronunciation Practice Amalia Zahra and Julie Carson-Berndsen
Enriching the ISST-TANL Corpus with Semantic Frames Alessandro Lenci, Simonetta Montemagni, Giulia Venturi and Maria Grazia Cutrullà
Error profiling for evaluation of machine-translated text: a Polish-English case study Sandra Weiss and Lars Ahrenberg
EVALIEX ― A Proposal for an Extended Evaluation Methodology for Information Extraction Systems Christina Feilmayr, Birgit Pröll and Elisabeth Linsmayr
Evaluating and improving syntactic lexica by plugging them within a parser Elsa Tolone, Benoît Sagot and Éric Villemonte de la Clergerie
Evaluating Appropriateness Of System Responses In A Spoken CALL Game Manny Rayner, Pierrette Bouillon and Johanna Gerlach
Evaluating automatic cross-domain Dutch semantic role annotation Orphée De Clercq, Veronique Hoste and Paola Monachesi
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases Eva Szekely, Joao Paulo Cabral, Mohamed Abou-Zleikha, Peter Cahill and Julie Carson-Berndsen
Evaluating Hebbian Self-Organizing Memories for Lexical Representation and Access Claudia Marzi, Marcello Ferro, Claudia Caudai and Vito Pirrelli
Evaluating Machine Reading Systems through Comprehension Tests Anselmo Peñas, Eduard Hovy, Pamela Forner, Álvaro Rodrigo, Richard Sutcliffe, Corina Forascu and Caroline Sporleder
Evaluating Multi-focus Natural Language Queries over Data Services Silvia Quarteroni, Vincenzo Guerrisi and Pietro La Torre
Evaluating Query Languages for a Corpus Processing System Elena Frick, Carsten Schnober and Piotr Bański
Evaluating the Impact of External Lexical Resources into a CRF-based Multiword Segmenter and Part-of-Speech Tagger Matthieu Constant and Isabelle Tellier
Evaluating the Impact of Phrase Recognition on Concept Tagging Pablo Mendes, Joachim Daiber, Rohana Rajapakse, Felix Sasaki and Christian Bizer
Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System Alexandra Roshchina, John Cardiff and Paolo Rosso
Evaluation of a Complex Information Extraction Application in Specific Domain Romaric Besançon, Olivier Ferret and Ludovic Jean-Louis
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian Mladen Karan, Jan Šnajder and Bojana Dalbelo Bašić
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank Sudheer Kolachina, Rashmi Prasad, Dipti Misra Sharma and Aravind Joshi
Evaluation of Online Dialogue Policy Learning Techniques Alexandros Papangelis, Vangelis Karkaletsis and Fillia Makedon
Evaluation of the KomParse Conversational Non-Player Characters in a Commercial Virtual World Tina Kluewer, Feiyu Xu, Peter Adophs and Hans Uszkoreit
Evaluation of Unsupervised Information Extraction Wei Wang, Romaric Besançon, Olivier Ferret and Brigitte Grau
Event Nominals: Annotation Guidelines and a Manually Annotated Corpus in French Béatrice Arnulphy, Xavier Tannier and Anne Vilnat
Evolution of Event Designation in Media: Preliminary Study Xavier Tannier, Véronique Moriceau, Béatrice Arnulphy and Ruixin He
Example-Based Treebank Querying Liesbeth Augustinus, Vincent Vandeghinste and Frank Van Eynde
EXMARaLDA and the FOLK tools ― two toolsets for transcribing and annotating spoken language Thomas Schmidt
Expanding Arabic Treebank to Speech: Results from Broadcast News Mohamed Maamouri, Ann Bies and Seth Kulick
Expanding Parallel Resources for Medium-Density Languages for Free Georgi Iliev and Angel Genov
Experiences in Resource Generation for Machine Translation through Crowdsourcing Anoop Kunchukuttan, Shourya Roy, Pratik Patel, Kushal Ladha, Somya Gupta, Mitesh M. Khapra and Pushpak Bhattacharyya
Expertise Mining for Enterprise Content Management Georgeta Bordea, Sabrina Kirrane, Paul Buitelaar and Bianca Pereira
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign Olivier Galibert, Sophie Rosset, Cyril Grouin, Pierre Zweigenbaum and Ludovic Quintard
Extending a wordnet framework for simplicity and scalability Pedro Fialho, Sérgio Curto, Ana Cristina Mendes and Luísa Coheur
Extending the adverbial coverage of a French morphological lexicon Elsa Tolone, Stavroula Voyatzi, Claude Martineau and Matthieu Constant
Extending the EmotiNet Knowledge Base to Improve the Automatic Detection of Implicitly Expressed Emotions from Text Alexandra Balahur and Jesús M. Hermida
Extending the MPC corpus to Chinese and Urdu - A Multiparty Multi-Lingual Chat Corpus for Modeling Social Phenomena in Language Ting Liu, Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley, Sarah Taylor, Umit Boz, Xiaoai Ren and Jingsi Wu
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies Bruno Cartoni and Thomas Meyer
Extraction of unmarked quotations in Newspapers Stéphanie Weiser and Patrick Watrin
Eye Tracking as a Tool for Machine Translation Error Analysis Sara Stymne, Henrik Danielsson, Sofia Bremin, Hongzhan Hu, Johanna Karlsson, Anna Prytz Lillkull and Martin Wester

 

F  
Fast Labeling and Transcription with the Speechalyzer Toolkit Felix Burkhardt
Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski and Elke Teich
Federated Search: Towards a Common Search Infrastructure Herman Stehouwer, Matej Durco, Eric Auer and Daan Broeder
Feedback in Nordic First-Encounters: a Comparative Study Costanza Navarretta, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen and Patrizia Paggio
Fine-grained German Sentiment Analysis on Social Media Saeedeh Momtazi
First Results in a Study Evaluating Pre-annotation and Correction Propagation for Machine-Assisted Syriac Morphological Analysis Paul Felt, Eric Ringger, Kevin Seppi, Kristian Heal, Robbie Haertel and Deryle Lonsdale
First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin Marco Passarotti and Francesco Mambrini
Fivehundredmillionandone Tokens. Loading the AAC Container with Text Resources for Text Studies. Hanno Biber and Evelyn Breiteneder
Foundations of a Multilayer Annotation Framework for Twitter Communications During Crisis Events William J. Corvey, Sudha Verma, Sarah Vieweg, Martha Palmer and James H. Martin
FreeLing 3.0: Towards Wider Multilinguality Lluís Padró and Evgeny Stanilovsky
Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese Juan Pablo Martínez Cortés, Jim O'Regan and Francis Tyers
French and German Corpora for Audience-based Text Type Classification Amalia Todirascu, Sebastian Pado, Jennifer Krisch, Max Kisselew and Ulrich Heid
From Grammar Rule Extraction to Treebanking: A Bootstrapping Approach Masood Ghayoomi
From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information Lieve Macken, Veronique Hoste, Marielle Leijten and Luuk Van Waes
From medical language processing to BioNLP domain Gabriella Pardelli, Manuela Sassi, Sara Goggi and Stefania Biagioni
Further Developments in Treebank Error Detection Using Derivation Trees Seth Kulick, Ann Bies and Justin Mott

 

G  
GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German Silke Scheible, Richard J. Whitt, Martin Durrell and Paul Bennett
Generation of Verbal Stems in Derivationally Rich Language Krešimir Šojat, Nives Mikelić Preradović and Marko Tadić
German and English Treebanks and Lexica for Tree-Adjoining Grammars Miriam Kaeshammer and Vera Demberg
German """"nach""""-Particle Verbs in Semantic Theory and Corpus Data Boris Haselbach, Wolfgang Seeker and Kerstin Eckart
German Verb Patterns and Their Implementation in an Electronic Dictionary Marc Luder
GerNED: A German Corpus for Named Entity Disambiguation Danuta Ploch, Leonhard Hennig, Angelina Duka, Ernesto William De Luca and Sahin Albayrak
Getting more data -- Schoolkids as annotators Jirka Hana and Barbora Hladka
Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages Sebastian Nordhoff and Harald Hammarström
Grammatical Error Annotation for Korean Learners of Spoken English Hongsuck Seo, Kyusong Lee, Gary Geunbae Lee, Soo-Ok Kweon and Hae-Ri Kim

 

H  
HamleDT: To Parse or Not to Parse? Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský and Jan Hajič
Highlighting relevant concepts from Topic Signatures Montse Cuadros, Lluís Padró and German Rigau
Hindi Subjective Lexicon: A Lexical Resource for Hindi Adjective Polarity Classification Akshat Bakliwal, Piyush Arora and Vasudeva Varma
Holaaa!! writin like u talk is kewl but kinda hard 4 NLP Maite Melero, Marta R. Costa-Jussà, Judith Domingo, Montse Marquina and Martí Quixal
HunOr: A Hungarian―Russian Parallel Corpus Martina Katalin Szabó, Veronika Vincze and István Nagy T.

 

I  
IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus Septina Dian Larasati
Identification of Manner in Bio-Events Raheel Nawaz, Paul Thompson and Sophia Ananiadou
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum
Identifying equivalents of specialized verbs in a bilingual comparable corpus of judgments: A frame-based methodology Janine Pimentel
Identifying Nuggets of Information in GALE Distillation Evaluation Olga Babko-Malaya, Greg Milette, Michael Schneider and Sarah Scogin
Identifying Word Translations from Comparable Documents Without a Seed Lexicon Reinhard Rapp, Serge Sharoff and Bogdan Babych
Improving corpus annotation productivity: a method and experiment with interactive tagging Atro Voutilainen
Improving K-Nearest Neighbor Efficacy for Farsi Text Classification Mohammad Hossein Elahimanesh, Behrouz Minaei and Hossein Malekinezhad
Improving the Recall of a Discourse Parser by Constraint-based Postprocessing Sucheta Ghosh, Richard Johansson, Giuseppe Riccardi and Sara Tonelli
Incorporating an Error Corpus into a Spellchecker for Maltese Michael Rosner, Albert Gatt, Andrew Attard and Jan Joachimsen
Inforex -- a web-based tool for text corpus management and semantic annotation Michał Marcińczuk, Jan Kocoń and Bartosz Broda
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser Francesco Rubino, Francesca Frontini and Valeria Quochi
Intelligibility assessment in forensic applications Giovanni Costantini, Andrea Paoloni and Massimiliano Todisco
International Multicultural Name Matching Competition: Design, Execution, Results, and Lessons Learned Keith J. Miller, Elizabeth Schroeder Richerson, Sarah McLeod, James Finley and Aaron Schein
Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component Lucie Poláková, Pavlína Jínová and Jiří Mírovský
In the same boat and other idiomatic seafaring expressions Rita Marinelli and Laura Cignoni
Introducing the Reference Corpus of Contemporary Portuguese Online Michel Généreux, Iris Hendrickx and Amália Mendes
Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish Elena Volodina and Sofie Johansson Kokkinakis
Investigating Engagement - intercultural and technological aspects of the collection, analysis, and use of the Estonian Multiparty Conversational video data Kristiina Jokinen and Silvi Tenjes
Investigating Verbal Intelligence Using the TF-IDF Approach Kseniya Zablotskaya, Fernando Fernández Martínez and Wolfgang Minker
Involving Language Professionals in the Evaluation of Machine Translation Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka and David Vilar
Irish Treebanking and Parsing: A Preliminary Evaluation Teresa Lynn, Ozlem Cetinoglu, Jennifer Foster, Elaine Uí Dhonnchadha, Mark Dras and Josef van Genabith
Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing Elena Filatova
Irregularity Detection in Categorized Document Corpora Borut Sluban, Senja Pollak, Roel Coesemans and Nada Lavrac
Is it Useful to Support Users with Lexical Resources? A User Study. Ernesto William De Luca
ISO 24617-2: A semantically-based standard for dialogue annotation Harry Bunt, Jan Alexandersson, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Volha Petukhova, Andrei Popescu-Belis and David Traum
Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective. Lorenza Russo, Sharid Loáiciga and Asheesh Gulati
Item Development and Scoring for Japanese Oral Proficiency Testing Hitokazu Matsushita and Deryle Lonsdale
Iterative Refinement and Quality Checking of Annotation Guidelines ― How to Deal Effectively with Semantically Sloppy Named Entity Types, such as Pathological Phenomena Udo Hahn, Elena Beisswanger, Ekaterina Buyko, Erik Faessler, Jenny Traumüller, Susann Schröder and Kerstin Hornbostel
Iula2Standoff: a tool for creating standoff documents for the IULACT Carlos Morell, Jorge Vivaldi and Núria Bel

 

J  
Joint Grammar and Treebank Development for Mandarin Chinese with HPSG Yi Zhang, Rui Wang and Yu Chen
Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier Souhir Gahbiche-Braham, Hélène Bonneau-Maynard, Thomas Lavergne and François Yvon
JRC Eurovoc Indexer JEX - A freely available multi-label categorisation tool Ralf Steinberger, Mohamed Ebrahim and Marco Turchi

 

K  
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments Luis Javier Rodriguez-Fuentes, Mikel Penagarikano, Amparo Varona, Mireia Diez and German Bordel
Kitten: a tool for normalizing HTML and extracting its textual content Mathieu-Henri Falco, Véronique Moriceau and Anne Vilnat
Knowledge-Rich Context Extraction and Ranking with KnowPipe Anne-Kathrin Schumann
Korean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability Hyejin Hong, Sunhee Kim and Minhwa Chung
Korp ― the corpus infrastructure of Spräkbanken Lars Borin, Markus Forsberg and Johan Roxendal
KPWr: Towards a Free Corpus of Polish Bartosz Broda, Michał Marcińczuk, Marek Maziarz, Adam Radziszewski and Adam Wardyński

 

L  
LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis Kais Dukes and Eric Atwell
Language Richness of the Web Martin Majliš and Zdeněk Žabokrtský
Large aligned treebanks for syntax-based machine translation Gideon Kotzé, Vincent Vandeghinste, Scott Martens and Jörg Tiedemann
Large Scale Lexical Analysis Gregor Thurmair, Vera Aleksic and Christoph Schwarz
Large Scale Semantic Annotation, Indexing and Search at The National Archives Diana Maynard and Mark A. Greenwood
LAST MINUTE: a Multimodal Corpus of Speech-based User-Companion Interactions Dietmar Rösner, Jörg Frommer, Rafael Friesen, Matthias Haase, Julia Lange and Mirko Otto
Latvian and Lithuanian Named Entity Recognition with TildeNER Mārcis Pinnis
LDC Forced Aligner Xiaoyi Ma
LDC Language Resource Database: Building a Bibliographic Database Eleftheria Ahtaridis, Christopher Cieri and Denise DiPersio
Learning Categories and their Instances by Contextual Features Antje Schlaf and Robert Remus
Learning Sentiment Lexicons in Spanish Veronica Perez-Rosas, Carmen Banea and Rada Mihalcea
Legal electronic dictionary for Czech František Cvrček, Karel Pala and Pavel Rychlý
Lemmatising Serbian as Category Tagging with Bidirectional Sequence Classification Andrea Gesmundo and Tanja Samardzic
Le Petit Prince in UNL Ronaldo Martins
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, Nathalie Camelin, Benoit Favre, Bassam Jabaian and Lina M. Rojas-Barahona
Leveraging the Wisdom of the Crowds for the Acquisition of Multilingual Language Resources Arno Scharl, Marta Sabou, Stefan Gindl, Walter Rafelsberger and Albert Weichselbraun
LexIt: A Computational Resource on Italian Argument Structure Alessandro Lenci, Gabriella Lapesa and Giulia Bonansinga
LG-Eval: A Toolkit for Creating Online Language Evaluation Experiments Eric Kow and Anja Belz
LIE: Leadership, Influence and Expertise Roberta Catizone, Louise Guthrie, Arthur Thomas and Yorick Wilks
Light Verb Constructions in the SzegedParalellFX English--Hungarian Parallel Corpus Veronika Vincze
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. Alessio Bosca, Luca Dini, Milen Kouylekov and Marco Trevisan
Linguistic Analysis Processing Line for Bulgarian Aleksandar Savkov, Laska Laskova, Stanislava Kancheva, Petya Osenova and Kiril Simov
Linguistic knowledge for specialized text production Miriam Buendía-Castro and Beatriz Sánchez-Cárdenas
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual Xuansong Li, Stephanie Strassel, Heng Ji, Kira Griffitt and Joe Ellis
Linguistic Resources for Handwriting Recognition and Translation Evaluation Zhiyi Song, Safa Ismael, Stephen Grimes, David Doermann and Stephanie Strassel
Logical metonymies and qualia structures: an annotated database of logical metonymies for German Alessandra Zarcone and Stefan Rued
Logic Based Methods for Terminological Assessment Benoît Robichaud

 

M  
Making Ellipses Explicit in Dependency Conversion for a German Treebank Wolfgang Seeker and Jonas Kuhn
MaltOptimizer: A System for MaltParser Optimization Miguel Ballesteros and Joakim Nivre
Mapping WordNet synsets to Wikipedia articles Samuel Fernando and Mark Stevenson
Mapping WordNet to the Kyoto ontology Egoitz Laparra, German Rigau and Piek Vossen
Massively Increasing TIMEX3 Resources: A Transduction Approach Leon Derczynski, Hector Llorens and Estela Saquete
Matching Cultural Heritage items to Wikipedia Eneko Agirre, Ander Barrena, Oier Lopez de Lacalle, Aitor Soroa, Samuel Fernando and Mark Stevenson
Measuring Interlanguage: Native Language Identification with L1-influence Metrics Julian Brooke and Graeme Hirst
Measuring the compositionality of NV expressions in Basque by means of distributional similarity techniques Antton Gurrutxaga and Iñaki Alegria
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms Ryan Georgi, Fei Xia and William Lewis
Medical Term Extraction in an Arabic Medical Corpus Doaa Samy, Antonio Moreno-Sandoval, Conchi Bueno-Díaz, Marta Garrote-Salazar and José M. Guirao
META-SHARE v2: An Open Network of Repositories for Language Resources including Data and Tools Christian Federmann, Ioanna Giannopoulou, Christian Girardi, Olivier Hamon, Dimitris Mavroeidis, Salvatore Minutoli and Marc Schröder
Method for Collection of Acted Speech Using Various Situation Scripts Takahiro Miyajima, Hideaki Kikuchi, Katsuhiko Shirai and Shigeki Okawa
METU Turkish Discourse Bank Browser Utku Şirin, Ruket Çakıcı and Deniz Zeyrek
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics Kanika Gupta, Monojit Choudhury and Kalika Bali
Mining Sentiment Words from Microblogs for Predicting Writer-Reader Emotion Transition Yi-jie Tang and Hsin-Hsi Chen
MISTRAL+: A Melody Intonation Speaker Tonal Range semi-automatic Analysis using variable Levels Benoît Weber, Geneviève Caelen-Haumont, Binh Hai Pham and Do-Dat Tran
MLSA ― A Multi-layered Reference Corpus for German Sentiment Analysis Simon Clematide, Stefan Gindl, Manfred Klenner, Stefanos Petrakis, Robert Remus, Josef Ruppenhofer, Ulli Waltinger and Michael Wiegand
Modality in Text: a Proposal for Corpus Annotation Iris Hendrickx, Amália Mendes and Silvia Mencarelli
Morphosyntactic Analysis of the CHILDES and TalkBank Corpora Brian MacWhinney
Multi-Layer Discourse Annotation of a Dutch Text Corpus Gisela Redeker, Ildikó Berzlánovich, Nynke van der Vliet, Gosse Bouma and Markus Egg
Multilingual Central Repository version 3.0 Aitor Gonzalez-Agirre, Egoitz Laparra and German Rigau
Multimedia database of the cultural heritage of the Balkans Ivana Tanasijević, Biljana Sikimić and Gordana Pavlović-Lažetić
Multimodal Behaviour and Feedback in Different Types of Interaction Costanza Navarretta and Patrizia Paggio
Multimodal Corpus of Multi-party Conversations in Second Language Shota Yamasaki, Hirohisa Furukawa, Masafumi Nishida, Kristiina Jokinen and Seiichi Yamamoto
MULTIPHONIA: a MULTImodal database of PHONetics teaching methods in classroom InterActions. Charlotte Alazard, Corine Astésano and Michel Billières
MultiUN v2: UN Documents with Multilingual Alignments Yu Chen and Andreas Eisele

 

N  
NeoTag: a POS Tagger for Grammatical Neologism Detection Maarten Janssen
New language resources for the Pashto language Djamel Mostefa, Khalid Choukri, Sylvie Brunessaux, Karim Boudahmane
NgramQuery - Smart Information Extraction from Google N-gram using External Resources Martin Aleksandrov and Carlo Strapparava
NKI-CCRT Corpus - Speech Intelligibility Before and After Advanced Head and Neck Cancer Treated with Concomitant Chemoradiotherapy R.P. Clapham, L. van der Molen, R.J.JH. van Son, M. van den Brekel and F.J.M. Hilgers
NLP Challenges for Eunomos a Tool to Build and Manage Legal Knowledge Guido Boella, Luigi di Caro, Llio Humphreys, Livio Robaldo and Leon van der Torre
NTUSocialRec: An Evaluation Dataset Constructed from Microblogs for Recommendation Applications in Social Networks Chieh-Jen Wang, Shuk-Man Cheng, Lung-Hao Lee, Hsin-Hsi Chen, Wen-shen Liu, Pei-Wen Huang and Shih-Peng Lin

 

O  
On the practice of error analysis for machine translation evaluation Sara Stymne and Lars Ahrenberg
On the Way to a Legal Sharing of Web Applications in NLP Victoria Arranz and Olivier Hamon
Ontologies of Linguistic Annotation: Survey and perspectives Christian Chiarcos
Ontoterminology: How to unify terminology and ontology into a single paradigm Christophe Roche
On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market David Lewis, Alexander O'Connor, Andrzej Zydroń, Gerd Sjögren and Rahzeb Choudhury
Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing Claire Brierley, Majdi Sawalha and Eric Atwell
Orthographic Transcription: which enrichment is required for phonetization? Brigitte Bigi, Pauline Péri and Roxane Bertrand

 

P  
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web Iñaki San Vicente and Iker Manterola
Págico: Evaluating Wikipedia-based information retrieval in Portuguese Cristina Mota, Alberto Simões, Cláudia Freitas, Luís Costa and Diana Santos
PAMOCAT: Automatic retrieval of specified postures Bernhard Brüning, Christian Schnier, Karola Pitsch and Sven Wasmuth
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies and Nianwen Xue
Parallel Data, Tools and Interfaces in OPUS Jörg Tiedemann
Parsing Any Domain English text to CoNLL dependencies Sudheer Kolachina and Prasanth Kolachina
PEARL: ProjEction of Annotations Rule Language, a Language for Projecting (UIMA) Annotations over RDF Knowledge Bases Maria Teresa Pazienza, Armando Stellato and Andrea Turbati
Pedagogical stances and their multimodal signals. Isabella Poggi, Francesca D'Errico and Giovanna Leone
PET: a Tool for Post-editing and Assessing Machine Translation Wilker Aziz, Sheila Castilho and Lucia Specia
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora Radu Ion
Polaris: Lymba's Semantic Parser Dan Moldovan and Eduardo Blanco
PoliMorf: a (not so) new open morphological dictionary for Polish Marcin Woliński, Marcin Miłkowski, Maciej Ogrodniczuk and Adam Przepiórkowski
Polish Multimodal Corpus ― a collection of referential gestures Magdalena Lis
Portuguese Text Generation from Large Corpora Eder Novais, Ivandre Paraboni and Douglas Silva
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems Kallirroi Georgila, Alan Black, Kenji Sagae and David Traum
Pragmatic identification of the witness sets Livio Robaldo and Jakub Szymanik
Prague Dependency Style Treebank for Tamil Loganathan Ramasamy and Zdeněk Žabokrtský
Predicting Phrase Breaks in Classical and Modern Standard Arabic Text Majdi Sawalha, Claire Brierley and Eric Atwell
Prediction of Non-Linguistic Information of Spontaneous Speech from the Prosodic Annotation: Evaluation of the X-JToBI system Kikuo Maekawa
Project FLY: a multidisciplinary project within Linguistics Mariana Gomes, Ana Guilherme, Leonor Tavares and Rita Marquilhas
Propbank-Br: a Brazilian Treebank annotated with semantic role labels Magali Sanches Duran and Sandra Maria Aluísio
Proper Language Resource Centers Willem Elbers, Daan Broeder and Dieter van Uytvanck
Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication Antonio Origlia and Iolanda Alfano
Pursing power in Arabic on-line discussion forums Marc Tomlinson, David Bracewell, Mary Draper, Zewar Almissour, Ying Shi and Jeremy Bensley

 

Q  
Quantising Opinions for Political Tweets Analysis Yulan He, Hassan Saif, Zhongyu Wei and Kam-Fai Wong
QurAna: Corpus of the Quran annotated with Pronominal Anaphora Abdul-Baquee Sharaf and Eric Atwell
QurSim: A corpus for evaluation of relatedness in short texts Abdul-Baquee Sharaf and Eric Atwell

 

R  
Rapid creation of large-scale corpora and frequency dictionaries Attila Zséder, Gábor Recski, Dániel Varga and András Kornai
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz Joao Paulo Cabral, Mark Kane, Zeeshan Ahmed, Mohamed Abou-Zleikha, Eva Szekely, Amalia Zahra, Kalu Ogbureke, Peter Cahill, Julie Carson-Berndsen and Stephan Schlogl
Recent Developments in CLARIN-NL Jan Odijk
Reclassifying subcategorization frames for experimental analysis and stimulus generation Paula Buttery and Andrew Caines
Recognition of Nonmanual Markers in American Sign Language (ASL) Using Non-Parametric Adaptive 2D-3D Face Tracking Dimitris Metaxas, Bo Liu, Fei Yang, Peng Yang, Nicholas Michael and Carol Neidle
Recognition of Polish Derivational Relations Based on Supervised Learning Scheme Maciej Piasecki, Radoslaw Ramocki and Marek Maziarz
Reconstructing the Diachronic Morphology of Romanian from Dictionary Citations Dan Cristea, Radu Simionescu and Gabriela Haja
Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores Kseniya Zablotskaya, Umair Rahim, Fernando Fernández Martínez and Wolfgang Minker
RELcat: a Relation Registry for ISOcat data categories Menzo Windhouwer
Rembrandt - a named-entity recognition framework Nuno Cardoso

 

R  
Re-ordering Source Sentences for SMT Amit Sangodkar and Om Damani
”Rendering Endangered Lexicons Interoperable through Standards Harmonization”: the RELISH project Helen Aristar-Dry, Sebastian Drude, Menzo Windhouwer, Jost Gippert and Irina Nevskaya
Representation of linguistic and domain knowledge for second language learning in virtual worlds Alexandre Denis, Ingrid Falk, Claire Gardent and Laura Perez-Beltrachini
Representing General Relational Knowledge in ConceptNet 5 Robert Speer and Catherine Havasi
Representing the Translation Relation in a Bilingual Wordnet Jyrki Niemi and Krister Lindén
Resource Evaluation for Usable Speech Interfaces: Utilizing Human-Human Dialogue Pepi Stavropoulou, Dimitris Spiliotopoulos and Georgios Kouroupetroglou
Resource production of written forms of Sign Languages by a user-centered editor, SWift (SignWriting improved fast transcriber) Fabrizio Borgia, Claudia S. Bianchini, Patrice Dalle and Maria De Marsico
Revealing Contentious Concepts Across Social Groups Ching-Sheng Lin, Zumrut Akcam, Samira Shaikh, Sharon Small, Ken Stahl, Tomek Strzalkowski and Nick Webb
Rhetorical Move Detection in English Abstracts: Multi-label Sentence Classifiers and their Annotated Corpora Carmen Dayrell, Arnaldo Candido Jr., Gabriel Lima, Danilo Machado Jr., Ann Copestake, Valéria Feltrim, Stella Tagnin and Sandra Aluisio
RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building Alessandro Panunzi, Marco Fabbri, Massimo Moneglia, Lorenzo Gregori and Samuele Paladini
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring Flore Barcellini, Camille Albert, Corinne Grosse and Patrick Saint-Dizier
Robust clause boundary identification for corpus annotation Heiki-Jaan Kaalep and Kadri Muischnek
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information Corina Forascu and Dan Tufiș
ROMBAC: The Romanian Balanced Annotated Corpus Radu Ion, Elena Irimia, Dan Ștefănescu and Dan Tufiș
Rule-Based Detection of Clausal Coordinate Ellipsis Kristiina Muhonen and Tanja Purtonen
Rule-based Entity Recognition and Coverage of SNOMED CT in Swedish Clinical Text Maria Skeppstedt, Maria Kvist and Hercules Dalianis
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus Jens Forster, Christoph Schmidt, Thomas Hoyoux, Oscar Koller, Uwe Zelle, Justus Piater and Hermann Ney

 

S  
Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation Monica Gavrila, Walther v. Hahn and Cristina Vertan
Semantic annotation of French corpora: animacy and verb semantic classes Juliette Thuilier and Laurence Danlos
Semantic Annotations in Japanese FrameNet: Comparing Frames in Japanese and English Kyoko Ohara
Semantic metadata mapping in practice: the Virtual Language Observatory Dieter van Uytvanck, Herman Stehouwer and Lari Lampen
Semantic Relations Established by Specialized Processes Expressed by Nouns and Verbs: Identification in a Corpus by means of Syntactico-semantic Annotation Nava Maroto, Marie-Claude L'Homme and Amparo Alcina
Semantic Role Labeling with the Swedish FrameNet Richard Johansson, Karin Friberg Heppin and Dimitrios Kokkinakis
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs Matilde Gonzalez, Michael Filhol and Christophe Collet
Semi-Supervised Technical Term Tagging With Minimal User Feedback Behrang QasemiZadeh, Paul Buitelaar, Tianqi Chen and Georgeta Bordea
SemScribe: Natural Language Generation for Medical Reports Sebastian Varges, Heike Bieler, Manfred Stede, Lukas C. Faulstich, Kristin Irsig and Malik Atalla
SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks Elias Iosif and Alexandros Potamianos
Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies Thomas Ulrich Christiansen and Peter Juel Henrichsen
SentiSense: An easily scalable concept-based affective lexicon for sentiment analysis Jorge Carrillo de Albornoz, Laura Plaza and Pablo Gervás
Service Composition Scenarios for Task-Oriented Translation Chunqi Shi, Donghui Lin and Toru Ishida
Similarity Ranking as Attribute for Machine Learning Approach to Authorship Identification Jan Rygl and Aleš Horák
Simplified guidelines for the creation of Large Scale Dialectal Arabic Annotations Heba Elfardy and Mona Diab
SMALLWorlds -- Multilingual Content-Controlled Monologues Peter Juel Henrichsen and Marcus Uneson
Smooth Sailing for STEVIN Peter Spyns and Elisabeth D'Halleweyn
Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation Víctor M. Sánchez-Cartagena, Miquel Esplà-Gomis and Juan Antonio Pérez-Ortiz
Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3 Atro Voutilainen, Kristiina Muhonen, Tanja Purtonen and Krister Lindén
Speech and Language Resources for LVCSR of Russian Sergey Zablotskiy, Alexander Shvets, Maxim Sidorov, Eugene Semenkin and Wolfgang Minker
Spell Checking for Chinese Shaohua Yang, Hai Zhao, Xiaolin Wang and Bao-liang Lu
Spell Checking in Spanish: The Case of Diacritic Accents Jordi Atserias, Maria Fuentes, Rogelio Nazar and Irene Renau
Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese Antonio Moreno-Sandoval, Leonardo Campillos Llanos, Yang Dong, Emi Takamori, José M. Guirao, Paula Gozalo, Chieko Kimura, Kengo Matsui and Marta Garrote-Salazar
SPPAS: a tool for the phonetic segmentation of speech Brigitte Bigi
Standardizing a Component Metadata Infrastructure Daan Broeder, Dieter van Uytvanck, Maria Gavrilidou, Thorsten Trippel and Menzo Windhouwer
Statistical Evaluation of Pronunciation Encoding Iris Merkus and Florian Schiel
Statistical Machine Translation without Source-side Parallel Corpus Using Word Lattice and Phrase Extension Takanori Kusumoto and Tomoyosi Akiba
Statistical Section Segmentation in Free-Text Clinical Records Michael Tepper, Daniel Capurro, Fei Xia, Lucy Vanderwende and Meliha Yetisgen-Yildiz
Strategies to Improve a Speaker Diarisation Tool David Tavarez, Eva Navas, Daniel Erro and Ibon Saratxaga
Structural alignment of plain text books André Santos, José João Almeida and Nuno Carvalho
Suffix Trees as Language Models Casey Redd Kennington, Martin Kay and Annemarie Friedrich
SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles Volha Petukhova, Rodrigo Agerri, Mark Fishel, Sergio Penkale, Arantza del Pozo, Mirjam Sepesy Maucec, Andy Way, Panayota Georgakopoulou and Martin Volk
Summarizing a multimodal set of documents in a Smart Room Maria Fuentes, Horacio Rodríguez and Jordi Turmo
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization Luís Marujo, Anatole Gershman, Jaime Carbonell, Robert Frederking and João P. Neto
SUTAV: A Turkish Audio-Visual Database Ibrahim Saygin Topkaya and Hakan Erdogan
SUTime: A library for recognizing and normalizing time expressions Angel X. Chang and Christopher Manning
Syntactic annotation of spontaneous speech: application to call-center conversation data Thierry Bazillon, Melanie Deplano, Frederic Bechet, Alexis Nasr and Benoit Favre

 

T  
Tackling interoperability issues within UIMA workflows Nicolas Hernandez
Tajik-Farsi Persian Transliteration Using Statistical Machine Translation Chris Irwin Davis
Task-Driven Linguistic Analysis based on an Underspecified Features Representation Stasinos Konstantopoulos, Valia Kordoni, Nicola Cancedda, Vangelis Karkaletsis, Dietrich Klakow and Jean-Michel Renders
TED-LIUM: an Automatic Speech Recognition dedicated corpus Anthony Rousseau, Paul Deléglise and Yannick Estève
Temporal Annotation: A Proposal for Guidelines and an Experiment with Inter-annotator Agreement André Bittar, Caroline Hagège, Véronique Moriceau, Xavier Tannier and Charles Teissèdre
Temporal Tagging on Different Domains: Challenges, Strategies, and Gold Standards Jannik Strötgen and Michael Gertz
Terra: a Collection of Translation Error-Annotated Corpora Mark Fishel, Ondřej Bojar and Maja Popović
Texto4Science: a Quebec French Database of Annotated Short Text Messages Philippe Langlais, Patrick Drouin, Amélie Paulus, Eugénie Rompré Brodeur and Florent Cottin
Text Simplification Tools for Spanish Stefan Bott, Horacio Saggion and Simon Mille
Textual Characteristics for Language Engineering Mathias Bank, Robert Remus and Martin Schierle
The acquisition and dialog act labeling of the EDECAN-SPORTS corpus Lluis-F. Hurtado, Fernando Garcia, Emilio Sanchis and Encarna Segarra
The annotation of the C-ORAL-BRASIL oral through the implementation of the Palavras Parser Eckhard Bick, Heliana Mello, Alessandro Panunzi and Tommaso Raso
The Australian National Corpus: National Infrastructure for Language Resources Steve Cassidy, Michael Haugh, Pam Peters and Mark Fallu
The BladeMistress Corpus: From Talk to Action in Virtual Worlds Anton Leuski, Carsten Eickhoff, James Ganis and Victor Lavrenko
The coding and annotation of multimodal dialogue acts Volha Petukhova and Harry Bunt
The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language José Pedro Ferreira, Maarten Janssen, Gladis Barcellos de Oliveira, Margarita Correia and Gilvan Müller de Oliveira
The CONCISUS Corpus of Event Summaries Horacio Saggion and Sandra Szasz
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese Tommaso Raso, Heliana Mello and Maryualê Malvessi Mittmann
The Dependency-Parsed FrameNet Corpus Daniel Bauer, Hagen Fürstenau and Owen Rambow
The DISCO ASR-based CALL system: practicing L2 oral skills and beyond Helmer Strik, Jozef Colpaert, Joost Van Doremalen and Catia Cucchiarini
The ETAPE corpus for the evaluation of speech-based TV content processing in the French language Guillaume Gravier, Gilles Adda, Niklas Paulsson, Matthieu Carré, Aude Giraudel and Olivier Galibert
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output Daniele Pighin, Lluís Màrquez and Lluís Formiga
The FLaReNet Strategic Language Resource Agenda Claudia Soria, Núria Bel, Khalid Choukri, Joseph Mariani, Monica Monachini, Jan Odijk, Stelios Piperidis, Valeria Quochi and Nicoletta Calzolari
The goo300k corpus of historical Slovene Tomaž Erjavec
The Herme Database of Spontaneous Multimodal Human-Robot Dialogues Jing Guang Han, Emer Gilmartin, Celine DeLooze, Brian Vaughan and Nick Campbell
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech Juan María Garrido, Yesika Laplaza, Montse Marquina, Andrea Pearman, José Gregorio Escalada, Miguel Ángel Rodríguez and Ana Armenta
The Icelandic Parsed Historical Corpus (IcePaHC) Eiríkur Rögnvaldsson, Anton Karl Ingason, Einar Freyr Sigurðsson and Joel Wallenberg
The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation Massimo Moneglia, Monica Monachini, Omar Calabrese, Alessandro Panunzi, Francesca Frontini, Gloria Gagliardi and Irene Russo
The Impact of Automatic Morphological Analysis & Disambiguation on Dependency Parsing of Turkish Gülşen Eryiğit
The Influence of Corpus Quality on Statistical Measurements on Language Resources Thomas Eckart, Uwe Quasthoff and Dirk Goldhahn
The IULA Treebank Montserrat Marimon, Beatríz Fisas, Núria Bel, Marta Villegas, Jorge Vivaldi, Sergi Torner, Mercè Lorente, Silvia Vázquez and Marta Villegas
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation Marcello Federico, Sebastian Stüker, Luisa Bentivogli, Michael Paul, Mauro Cettolo, Teresa Herrmann, Jan Niehues and Giovanni Moretti
The Joy of Parallelism with CzEng 1.0 Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel and Aleš Tamchyna
The KIT Lecture Corpus for Speech Translation Sebastian Stüker, Florian Kraft, Christian Mohr, Teresa Herrmann, Eunah Cho and Alex Waibel
The KnowledgeStore: an Entity-Based Storage System Roldano Cattoni, Francesco Corcoglioniti, Christian Girardi, Bernardo Magnini, Luciano Serafini and Roberto Zanoli
The Language Archive ― a new hub for language resources Sebastian Drude, Daan Broeder, Paul Trilsbeek and Peter Wittenburg
The Language Library: supporting community effort for collective resource production Riccardo Del Gratta, Francesca Frontini, Francesco Rubino, Irene Russo and Nicoletta Calzolari
The LRE Map. Harmonising Community Descriptions of Resources Nicoletta Calzolari, Riccardo Del Gratta, Gil Francopoulo, Joseph Mariani, Francesco Rubino, Irene Russo and Claudia Soria
The MASC Word Sense Corpus Rebecca J. Passonneau, Collin F. Baker, Christiane Fellbaum and Nancy Ide
The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions Stelios Piperidis
The META-SHARE Metadata Schema for the Description of Language Resources Maria Gavrilidou, Penny Labropoulou, Elina Desipri, Stelios Piperidis, Haris Papageorgiou, Monica Monachini, Francesca Frontini, Thierry Declerck, Gil Francopoulo, Victoria Arranz and Valérie Mapelli
The Minho Quotation Resource Brett Drury and José João Almeida
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation Christian Federmann, Eleftherios Avramidis, Marta R. Costa-Jussà, Josef van Genabith, Maite Melero and Pavel Pecina
The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo and Walter Daelemans
The New IDS Corpus Analysis Platform: Challenges and Prospects Piotr Bański, Peter M. Fischer, Elena Frick, Erik Ketzan, Marc Kupietz, Carsten Schnober, Oliver Schonefeld and Andreas Witt
The Nordic Dialect Corpus Janne Bondi Johannessen, Joel Priestley, Kristin Hagen, Anders Nøklestad and André Lynum
The open lexical infrastructure of Spräkbanken Lars Borin, Markus Forsberg, Leif-Jöran Olsson and Jonatan Uppström
The Open Linguistics Working Group Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff, Steven Moran, Richard Littauer, Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek and Christian M. Meyer
The Parallel-TUT: a multilingual and multiformat treebank Cristina Bosco, Manuela Sanguinetti and Leonardo Lesmo
The Polish Sejm Corpus Maciej Ogrodniczuk
The Political Speech Corpus of Bulgarian Petya Osenova and Kiril Simov
The Quaero Evaluation Initiative on Term Extraction Thibault Mondary, Adeline Nazarenko, Haïfa Zargayouna and Sabine Barreaux
The REPERE Corpus : a multimodal corpus for person recognition Aude Giraudel, Matthieu Carré, Valérie Mapelli, Juliette Kahn, Olivier Galibert and Ludovic Quintard
The REX corpora: A collection of multimodal corpora of referring expressions in collaborative problem solving dialogues Takenobu Tokunaga, Ryu Iida, Asuka Terai and Naoko Kuriyama
The Rocky Road towards a Swedish FrameNet - Creating SweFN Karin Friberg Heppin and Maria Toporowska Gronostaj
The Role of Model Testing in Standards Development: The Case of ISO-Space James Pustejovsky and Jessica Moszkowicz
The Romanian Neuter Examined Through A Two-Gender N-Gram Classification System Liviu P. Dinu, Vlad Niculae and Octavia-Maria Şulea
The SERENOA Project: Multidimensional Context-Aware Adaptation of Service Front-Ends Javier Caminero, Mari Carmen Rodríguez, Jean Vanderdonckt, Fabio Paternò, Joerg Rett, Dave Raggett, Jean-Loup Comeliau and Ignacio Marín
The SYNC3 Collaborative Annotation Tool Georgios Petasis
The TARSQI Toolkit Marc Verhagen and James Pustejovsky
The Trilingual ALLEGRA Corpus: Presentation and Possible Use for Lexicon Induction Yves Scherrer and Bruno Cartoni
The Twins Corpus of Museum Visitor Questions Priti Aggarwal, Ron Artstein, Jillian Gerten, Athanasios Katsamanis, Shrikanth Narayanan, Angela Nazarian and David Traum
The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English Stefanie Dipper, Melanie Seiss and Heike Zinsmeister
The WeSearch Corpus, Treebank, and Treecache -- A Comprehensive Sample of User-Generated Content Jonathon Read, Dan Flickinger, Rebecca Dridan, Stephan Oepen and Lilja Øvrelid
This also affects the context - Errors in extraction based summaries Thomas Kaspersson, Christian Smith, Henrik Danielsson and Arne Jönsson
TimeBankPT: A TimeML Annotated Corpus of Portuguese Francisco Costa and António Branco
TIMEN: An Open Temporal Expression Normalisation Resource Hector Llorens, Leon Derczynski, Robert Gaizauskas and Estela Saquete
Tools for plWordNet Development. Presentation and Perspectives Bartosz Broda, Marek Maziarz and Maciej Piasecki
Towards a comprehensive open repository of Polish language resources Maciej Ogrodniczuk, Piotr Pęzik and Adam Przepiórkowski
Towards a methodology for automatic identification of hypernyms in the definitions of large-scale dictionary Inga Gheorghita and Jean-Marie Pierrel
Towards an LFG parser for Polish: An exercise in parasitic grammar development Agnieszka Patejuk and Adam Przepiórkowski
Towards a richer wordnet representation of properties Sanni Nimb and Bolette Sandford Pedersen
Towards a User-Friendly Platform for Building Language Resources based on Web Services Marc Poch, Antonio Toral, Olivier Hamon, Valeria Quochi and Núria Bel
Towards Automatic Gesture Stroke Detection Binyam Gebrekidan Gebre, Peter Wittenburg and Przemyslaw Lenkiewicz
Towards automation in using multi-modal language resources: compatibility and interoperability for multi-modal features in Kachako Yoshinobu Kano
Towards Emotion and Affect Detection in the Multimodal LAST MINUTE Corpus Jörg Frommer, Bernd Michaelis, Dietmar Rösner, Andreas Wendemuth, Rafael Friesen, Matthias Haase, Manuela Kunze, Rico Andrich, Julia Lange, Axel Panning and Ingo Siegert
Towards Fully Automatic Annotation of Audio Books for TTS Olivier Boeffard, Laure Charonnat, Sébastien Le Maguer and Damien Lolive
Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research Michael Carl
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese Patricia Gonçalves, Rita Santos and António Branco
Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results Marco Dinarelli and Sophie Rosset
Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution Chris Biemann
Turkish Paraphrase Corpus Seniz Demir, Ilknur Durgar El-Kahlout, Erdem Unal and Hamza Kaya
Twenty Years of Language Resource Development and Distribution: A Progress Report on LDC Activities Christopher Cieri, Marian Reed, Denise DiPersio and Mark Liberman
Two Database Resources for Processing Social Media English Text Eleanor Clark and Kenji Araki
Two Phase Evaluation for Selecting Machine Translation Services Chunqi Shi, Donghui Lin, Masahiko Shimada and Toru Ishida
Typing Race Games as a Method to Create Spelling Error Corpora Paul Rodrigues and C. Anton Rytting

 

U  
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus Djamé Seddah, Marie Candito, Benoit Crabbé and Enrique Henestroza Anguiano
UBY-LMF -- A Uniform Model for Standardizing Heterogeneous Lexical-Semantic Resources in ISO-LMF Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek and Christian M. Meyer
ULex: new data models and a mobile environment for corpus enrichment. Dafydd Gibbon
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese Toshinobu Ogiso, Mamoru Komachi, Yasuharu Den and Yuji Matsumoto
Unsupervised acquisition of concatenative morphology Lionel Nicolas, Jacques Farré and Cécile Darme
Unsupervised document zone identification using probabilistic graphical models Andrea Varga, Daniel Preotiuc-Pietro and Fabio Ciravegna
Unsupervised Word Sense Disambiguation with Multilingual Representations Erwin Fernandez-Ordonez, Rada Mihalcea and Samer Hassan
Using a Goodness Measurement for Domain Adaptation: A Case Study on Chinese Word Segmentation Yan Song and Fei Xia
Using an ASR database to design a pronunciation evaluation system in Basque Igor Odriozola, Eva Navas, Inma Hernáez, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez and Daniel Erro
Using DiAML and ANVIL for multimodal dialogue annotations Harry Bunt, Michael Kipp and Volha Petukhova
Using Language Resources in Humanities research Marta Villegas, Núria Bel, Carlos Gonzalo, Amparo Moreno and Nuria Simelio
Using multimodal resources for explanation approaches in intelligent systems Florian Nothdurft and Wolfgang Minker
Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions Shiva Taslimipoor, Afsaneh Fazly and Ali Hamzeh
Using semi-experts to derive judgments on word sense alignment: a pilot study Soojeong Eom, Markus Dickinson and Graham Katz
Using the International Standard Language Resource Number: Practical and Technical Aspects Khalid Choukri, Victoria Arranz, Olivier Hamon and Jungyeul Park
Using Verb Subcategorization for Word Sense Disambiguation Will Roberts and Valia Kordoni
Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks Jorge Vivaldi, Luis Adrián Cabrera-Diego, Gerardo Sierra and María Pozzi

 

V  
Versatile Speech Databases for High Quality Synthesis for Basque Iñaki Sainz, Daniel Erro, Eva Navas, Inma Hernáez, Jon Sánchez, Ibon Saratxaga and Igor Odriozola
VERTa: Linguistic features in MT evaluation Elisabet Comelles, Jordi Atserias, Victoria Arranz and Irene Castellón
Visualizing Sentiment Analysis on a User Forum Rasmus Sundberg, Anders Eriksson, Johan Bini and Pierre Nugues
Visualizing word senses in WordNet Atlas Matteo Abrate and Clara Bacciu
“Vreselijk mooi!” (terribly beautiful): A Subjectivity Lexicon for Dutch Adjectives. Tom De Smedt and Walter Daelemans

 

W  
WebAnnotator, an Annotation Tool for Web Pages Xavier Tannier
Web Service integration platform for Polish linguistic resources Maciej Ogrodniczuk and Michał Lenart
Word Alignment for English-Turkish Language Pair Mehmet Talha Çakmak, Süleyman Acar and Gülşen Eryiğit
Wordnet Based Lexicon Grammar for Polish Zygmunt Vetulani
Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources Valérie Hanoka and Benoît Sagot
Word Sense Inventories by Non-Experts. Anna Rumshisky, Nick Botchan, Sophie Kushkuley and James Pustejovsky
Word Sketches for Turkish Bharat Ram Ambati, Siva Reddy and Adam Kilgarriff
W-PhAMT: A web tool for phonetic multilevel timeline visualization Francesco Cutugno, Vincenza Anna Leano and Antonio Origlia

 

Y  
YADAC: Yet another Dialectal Arabic Corpus Rania Al-Sabbagh and Roxana Girju
Yes we can!? Annotating English modal verbs Josef Ruppenhofer and Ines Rehbein
“You Seem Aggressive!” Monitoring Anger in a Practical Application Felix Burkhardt

Powered by ELDA © 2012 ELDA/ELRA