LREC 2016 Proceedings

INTRODUCTORY MESSAGES:

Nicoletta Calzolari - Introduction of the Conference Chair & Message from ELRA President
Khalid Choukri - Message from ELRA Secretary General and ELDA Managing Director
Marko Grobelnik - Message of the Chair of the Local Organizing Committee

INVITED TALK:

Simon Krek - Slovene and the South Slavic Language Family - the Obligation of Expressing Duality and Other Stories from the South

KEYNOTES SPEECHES:

Roberto Pieraccini - The challenge of conversational machines: from MUSA to Jibo
Ryan McDonald - The Language Resource Spectrum: A Perspective from Google

SESSIONS: Browse articles of the conference sorted by session number

Day 1, Oral Sessions:

	Session O1 - Machine Translation and Evaluation (1)	Chairperson: Bente Maegaard
11:35-11:55	Rosa Gaudio, Aljoscha Burchardt and António Branco	Evaluating Machine Translation in a Usage Scenario
11:55-12:15	Jinhua Du, Andy Way and Andrzej Zydron	Using BabelNet to Improve OOV Coverage in SMT
12:15-12:35	Valia Kordoni, Antal van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx, Matthias Huck and Andy Way	Enhancing Access to Online Education: Quality Machine Translation of MOOC Content
12:35-12:55	Lena Marg	The Trials and Tribulations of Predicting Post-Editing Productivity
12:55-13:15	Maja Popović and Mihael Arcan	PE2rr Corpus: Manual Error Annotation of Automatically Pre-annotated MT Post-edits

	Session O2 - Sentiment Analysis and Emotion Recognition	Chairperson: Núria Bel
11:35-11:55	Saif Mohammad, Mohammad Salameh and Svetlana Kiritchenko	Sentiment Lexicons for Arabic Social Media
11:55-12:15	Giuseppe Castellucci, Danilo Croce and Roberto Basili	A Language Independent Method for Generating Large Scale Polarity Lexicons
12:15-12:35	Debashis Naskar, Sidahmed Mokaddem, Miguel Rebollo and Eva Onaindia	Sentiment Analysis in Social Networks through Topic modeling
12:35-12:55	Aitor García Pablos, Montse Cuadros and German Rigau	A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings
12:55-13:15	Maxim Sidorov, Alexander Schmitt, Eugene Semenkin and Wolfgang Minker	Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?

	Session O3 - Corpora for Language Analysis	Chairperson: Stelios Piperidis
11:35-11:55	Hiroya Takamura, Ryo Nagata and Yoshifumi Kawasaki	Discriminative Analysis of Linguistic Features for Typological Study
11:55-12:15	Dieuwke Hupkes and Rens Bod	POS-tagging of Historical Dutch
12:15-12:35	Maria Rauschenberger, Luz Rello, Silke Füchsel and Jörg Thomaschewski	A Language Resource of German Errors Written by Children with Dyslexia
12:35-12:55	Alessia Barbagli, Pietro Lucisano, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi	CItA: an L1 Italian Learners Corpus to Study the Development of Writing Competence
12:55-13:15	Zhiwei Yu, David Mareček, Zdeněk Žabokrtský and Daniel Zeman	If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers

	Session O4 - Spoken Corpus Dialogue	Chairperson: Asuncion Moreno
11:35-11:55	José Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka, Elias Iosif and Alexandros Potamianos	The SpeDial datasets: datasets for Spoken Dialogue Systems analytics
11:55-12:15	Dilafruz Amanova, Volha Petukhova and Dietrich Klakow	Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification
12:15-12:35	Kathryn J. Collins and David Traum	Towards a Multi-dimensional Taxonomy of Stories in Dialogue
12:35-12:55	Sina Zarrieß, Julian Hough, Casey Kennington, Ramesh Manuvinakurike, David DeVault, Raquel Fernandez and David Schlangen	PentoRef: A Corpus of Spoken References in Task-oriented Dialogues
12:55-13:15	Shammur Absar Chowdhury, Evgeny Stepanov and Giuseppe Riccardi	Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?

	Session O5 - LR Infrastructures and Architectures	Chairperson: Franciska de Jong
14:45-15:05	Adam Funk, Robert Gaizauskas and Benoit Favre	A Document Repository for Social Media and Speech Conversations
15:05-15:25	Artemis Parvizi, Matt Kohl, Meritxell Gonzàlez and Roser Saurí	Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse
15:25-15:45	Bente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen and Philippe Wacker	Providing a Catalogue of Language Resources for Commercial Users
15:45-16:05	Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen and Christopher Cieri	The Language Application Grid and Galaxy
16:05-16:25	Khalid Choukri, Valérie Mapelli, Hélène Mazo and Vladimir Popescu	ELRA Activities and Services

	Session O6 - Multimodality	Chairperson: Kristiina Jokinen
14:45-15:05	Costanza Navarretta	Mirroring Facial Expressions and Emotions in Dyadic Conversations
15:05-15:25	Dragomir Radev, Amanda Stent, Joel Tetreault, Aasish Pappu, Aikaterini Iliakopoulou, Agustin Chanfreau, Paloma de Juan, Jordi Vallmitjana, Alejandro Jaimes, Rahul Jha and Robert Mankoff	Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest
15:25-15:45	Victoria Yaneva, Irina Temnikova and Ruslan Mitkov	A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults
15:45-16:05	Mathieu Chollet, Torsten Wörtwein, Louis-Philippe Morency and Stefan Scherer	A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety
16:05-16:25	Dario Bertero and Pascale Fung	Deep Learning of Audio and Language Features for Humor Prediction

	Session O7 - Multiword Expressions	Chairperson: Aline Villavicencio
14:45-15:05	Ayman Alghamdi, Eric Atwell and Claire Brierley	An Empirical Study of Arabic Formulaic Sequence Extraction Methods
15:05-15:25	Ranka Stankovic, Cvetana Krstev, Ivan Obradovic, Biljana Lazic and Aleksandra Trtovac	Rule-based Automatic Multi-word Term Extraction and Lemmatization
15:25-15:45	Václava Kettnerová and Eduard Bejček	Distribution of Valency Complements in Czech Complex Predicates: Between Verb and Noun
15:45-16:05	Chaya Liebeskind and Yaakov HaCohen-Kerner	A Lexical Resource of Hebrew Verb-Noun Multi-Word Expressions
16:05-16:25	Guillaume Jacquet, Maud Ehrmann, Ralf Steinberger and Jaakko Väyrynen	Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms

	Session O8 - Named Entity Recognition	Chairperson: Yuji Matsumoto
14:45-15:05	Marie-Jean Meurs, Hayda Almeida, Ludovic Jean-Louis and Eric Charton	SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking
15:05-15:25	Filip Ilievski, Giuseppe Rizzo, Marieke van Erp, Julien Plu and Raphael Troncy	Context-enhanced Adaptive Entity Linking
15:25-15:45	Eda Okur, Hakan Demir and Arzucan Özgür	Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings
15:45-16:05	Maria Pershina, Yifan He and Ralph Grishman	Entity Linking with a Paraphrase Flavor
16:05-16:25	Tian Tian, Marco Dinarelli, Isabelle Tellier and Pedro Dias Cardoso	Domain Adaptation for Named Entity Recognition Using CRFs

	Session O9 - Linked Data	Chairperson: John McCrae
16:45-17:05	Marta Villegas, Maite Melero, Núria Bel and Jorge Gracia	Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
17:05-17:25	Francesco Corcoglioniti, Marco Rospocher, Alessio Palmero Aprosio and Sara Tonelli	PreMOn: a Lemon Extension for Exposing Predicate Models as Linked Data
17:25-17:45	Fabricio Chalub, Livy Real, Alexandre Rademaker and Valeria de Paiva	Semantic Links for Portuguese
17:45-18:05	Bettina Klimek, Natanael Arndt, Sebastian Krause and Timotheus Arndt	Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme Inventory

	Session O10 - Multilingual Corpora	Chairperson: Hitoshi Isahara
16:45-17:05	Prokopis Prokopidis, Vassilis Papavassiliou and Stelios Piperidis	Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories
17:05-17:25	Xuansong Li, Martha Palmer, Nianwen Xue, Lance Ramshaw, Mohamed Maamouri, Ann Bies, Kathryn Conger, Stephen Grimes and Stephanie Strassel	Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
17:25-17:45	Ivan Habernal, Omnia Zayed and Iryna Gurevych	C4Corpus: Multilingual Web-size Corpus with Free License
17:45-18:05	Pierre Lison and Jörg Tiedemann	OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles

	Session O11 - Lexicons	Chairperson: Bolette Pedersen
16:45-17:05	Giulia Rambelli, Gianluca Lebani, Laurent Prévot and Alessandro Lenci	LexFr: Adapting the LexIt Framework to Build a Corpus-based French Subcategorization Lexicon
17:05-17:25	Iñaki San Vicente and Xabier Saralegi	Polarity Lexicon Building: to what Extent Is the Manual Effort Worth?
17:25-17:45	Ouafae Nahli, Francesca Frontini, Monica Monachini, Fahad Khan, Arsalan Zarghili and Mustapha Khalfi	Al Qamus al Muhit, a Medieval Arabic Lexicon in LMF
17:45-18:05	Ricardo Baeza-Yates, Luz Rello and Julia Dembowski	CASSAurus: A Resource of Simpler Spanish Synonyms

	Session O12 - OCR for Historical Text	Chairperson: Thierry Declerck
16:45-17:05	Kimmo Kettunen and Tuula Pääkkönen	Measuring Lexical Quality of a Historical Finnish Newspaper Collection ― Analysis of Garbled OCR Data with Basic Language Technology Tools and Means
17:05-17:25	Haithem Afli, Zhengwei Qiu, Andy Way and Páraic Sheridan	Using SMT for OCR Error Correction of Historical Texts
17:25-17:45	Martin Reynaert	OCR Post-Correction Evaluation of Early Dutch Books Online - Revisited
17:45-18:05	Simon Clematide, Lenz Furrer and Martin Volk	Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus

	Session O13 - Large Projects and Infrastructures	Chairperson: Walter Daelemans
18:10-18:30	Cristina Mota, Paula Carvalho and Anabela Barreiro	Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLP
18:30-18:50	Roberts Rozis, Andrejs Vasiļjevs and Raivis Skadiņš	Collecting Language Resources for the Latvian e-Government Machine Translation Platform
18:50-19:10	Hennie Brugman, Martin Reynaert, Nicoline van der Sijs, René van Stipriaan, Erik Tjong Kim Sang and Antal van den Bosch	Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora

	Session O14 - Document Classification and Text Categorisation	Chairperson: Robert Frederking
18:10-18:30	Juan Soler and Leo Wanner	A Semi-Supervised Approach for Gender Identification
18:30-18:50	Yannis Korkontzelos, Beverley Thomas, Makoto Miwa and Sophia Ananiadou	Ensemble Classification of Grants using LDA-based Features
18:50-19:10	Diyi Yang, Aaron Halfaker, Robert Kraut and Eduard Hovy	Edit Categories and Editor Role Identification in Wikipedia

	Session O15 - Morphology (1)	Chairperson: Tamás Váradi
18:10-18:30	Faisal Al-Shargi, Aidan Kaplan, Ramy Eskander, Nizar Habash and Owen Rambow	Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
18:30-18:50	Zdeněk Žabokrtský, Magda Sevcikova, Milan Straka, Jonáš Vidra and Adéla Limburská	Merging Data Resources for Inflectional and Derivational Morphology in Czech
18:50-19:10	Attila Novák, Borbála Siklósi and Csaba Oravecz	A New Integrated Open-source Morphological Analyzer for Hungarian

	Session O16 - Phonetics and Prosody	Chairperson: Dafydd Gibbon
18:10-18:30	Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur and John Godfrey	New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification
18:30-18:50	Eduardo Coutinho, Florian Hönig, Yue Zhang, Simone Hantke, Anton Batliner, Elmar Nöth and Björn Schuller	Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
18:50-19:10	Juergen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius and Frank Zimmerer	The IFCASL Corpus of French and German Non-native and Native Read Speech

Day 1, Poster Sessions:

	Session P01 - Anaphora and Coreference	Chair: Steve Cassidy
11:35-13:15	Abbas Ghaddar and Phillippe Langlais	WikiCoref: An English Coreference-annotated Corpus of Wikipedia Articles
11:35-13:15	Dominik Schlechtweg	Exploitation of Co-reference in Distributional Semantics
11:35-13:15	Evandro Fonseca, Renata Vieira and Aline Vanin	Adapting an Entity Centric Model for Portuguese Coreference Resolution
11:35-13:15	Ina Roesiger and Jonas Kuhn	IMS HotCoref DE: A Data-driven Co-reference Resolver for German
11:35-13:15	Vandan Mujadia, Palash Gupta and Dipti Misra Sharma	Coreference Annotation Scheme and Relation Types for Hindi
11:35-13:15	Anna Nedoluzhko, Michal Novák, Silvie Cinkova, Marie Mikulová and Jiří Mírovský	Coreference in Prague Czech-English Dependency Treebank
11:35-13:15	Dane Bell, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega and Mihai Surdeanu	Sieve-based Coreference Resolution in the Biomedical Domain
11:35-13:15	Hardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper and Derek Ruths	Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel

	Session P02 - Computer Aided Language Learning	Chair: Stephanie Strassel
11:35-13:15	Marie Garnier and Patrick Saint-Dizier	Error Typology and Remediation Strategies for Requirements Written in English by Non-Native Speakers
11:35-13:15	Lena Keiper, Andrea Horbach and Stefan Thater	Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario
11:35-13:15	Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg and Monica Sandell	SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies
11:35-13:15	Thomas Francois, Elena Volodina, Ildikó Pilán and Anaïs Tack	SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners
11:35-13:15	Yow-Ting Shiue and Hsin-Hsi Chen	Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language
11:35-13:15	Meishan Zhang, Jie Yang, Zhiyang Teng and Yue Zhang	LibN3L:A Lightweight Package for Neural NLP
11:35-13:15	Anaïs Tack, Thomas Francois, Anne-Laure Ligozat and Cédrick Fairon	Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
11:35-13:15	Claudia Baur, Johanna Gerlach, Manny Rayner, Martin Russell and Helmer Strik	A Shared Task for Spoken CALL?
11:35-13:15	AlBara Khalifa, Tsuneo Kato and Seiichi Yamamoto	Joining-in-type Humanoid Robot Assisted Language Learning System

	Session P03 - Evaluation Methodologies (1)	Chair: Ann Bies
11:35-13:15	Mahmoud El-Haj and Paul Rayson	OSMAN ― A Novel Arabic Readability Metric
11:35-13:15	Edouard Geoffrois	Evaluating Interactive System Adaptation
11:35-13:15	Leon Derczynski	Complementarity, F-score, and NLP Evaluation
11:35-13:15	Mauro Dragoni, Andrea Tettamanzi and Célia da Costa Pereira	DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining
11:35-13:15	Richard Fothergill, Paul Cook and Timothy Baldwin	Evaluating a Topic Modelling Approach to Measuring Corpus Similarity
11:35-13:15	Christian Fandrych, Elena Frick, Hanna Hedeland, Anna Iliash, Daniel Jettka, Cordula Meißner, Thomas Schmidt, Franziska Wallner, Kathrin Weigert and Swantje Westpfahl	User, who art thou? User Profiling for Oral Corpus Platforms
11:35-13:15	Angela Costa, Rui Correia and Luisa Coheur	Building a Corpus of Errors and Quality in Machine Translation: Experiments on Error Impact
11:35-13:15	Victoria Yaneva, Irina Temnikova and Ruslan Mitkov	Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities
11:35-13:15	Sahar Ghannay, Benoit Favre, Yannick Estève and Nathalie Camelin	Word Embedding Evaluation and Combination
11:35-13:15	Johann Poignant, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau and Thomas Tamisier	Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
11:35-13:15	Sheila Castilho and Sharon O'Brien	Evaluating the Impact of Light Post-Editing on Usability

	Session P04 - Information Extraction and Retrieval (1)	Chair: Diana Maynard
11:35-13:15	Elizabeth Salesky, Jessica Ray and Wade Shen	Operational Assessment of Keyword Search on Oral History
11:35-13:15	Marco A. Valenzuela-Escárcega, Gus Hahn-Powell and Mihai Surdeanu	Odin's Runes: A Rule Language for Information Extraction
11:35-13:15	Els Lefever and Véronique Hoste	A Classification-based Approach to Economic Event Detection in Dutch News Text
11:35-13:15	Gil Francopoulo, Joseph Mariani and Patrick Paroubek	Predictive Modeling: Guessing the NLP Terms of Tomorrow
11:35-13:15	Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan and Anders Holst	The Gavagai Living Lexicon
11:35-13:15	Hamdy Mubarak and Ahmed Abdelali	Arabic to English Person Name Transliteration using Twitter
11:35-13:15	Young-Seob Jeong, Won-Tae Joo, Hyun-Woo Do, Chae-Gyun Lim, Key-Sun Choi and Ho-Jin Choi	Korean TimeML and Korean TimeBank
11:35-13:15	Julian Seitner, Christian Bizer, Kai Eckert, Stefano Faralli, Robert Meusel, Heiko Paulheim and Simone Paolo Ponzetto	A Large DataBase of Hypernymy Relations Extracted from the Web.
11:35-13:15	Nikolaos Katris, Richard Sutcliffe and Theodore Kalamboukis	Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems
11:35-13:15	Gabriella Pardelli, Sara Goggi, Silvia Giannini and Stefania Biagioni	Two Decades of Terminology: European Framework Programmes Titles
11:35-13:15	Wim Peters and Adam Wyner	Legal Text Interpretation: Identifying Hohfeldian Relations from Text
11:35-13:15	Ryuichi Tachibana and Mamoru Komachi	Analysis of English Spelling Errors in a Word-Typing Game
11:35-13:15	Vojtěch Kovář, Monika Močiariková and Pavel Rychlý	Finding Definitions in Large Corpora with Sketch Engine
11:35-13:15	Teresa Rodriguez-Ferreira, Adrian Rabadan, Raquel Hervas and Alberto Diaz	Improving Information Extraction from Wikipedia Texts using Basic English
11:35-13:15	Tommaso Caselli, Giovanni Moretti, Rachele Sprugnoli, Sara Tonelli, Damien Lanfrey and Donatella Solda Kutzmann	NLP and Public Engagement: The Case of the Italian School Reform
11:35-13:15	Xabier Saralegi, Eneko Agirre and Iñaki Alegria	Evaluating Translation Quality and CLIR Performance of Query Sessions
11:35-13:15	Dieu-Thu Le and Uwe Quasthoff	Construction and Analysis of a Large Vietnamese Text Corpus
11:35-13:15	Kartik Asooja, Georgeta Bordea, Gabriela Vulcu and Paul Buitelaar	Forecasting Emerging Trends from Scientific Literature
11:35-13:15	Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight and Daniel Marcu	Extracting Structured Scholarly Information from the Machine Translation Literature
11:35-13:15	Stephen Wu, Chung-Il Wi, Sunghwan Sohn, Hongfang Liu and Young Juhn	Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
11:35-13:15	Stefano Menini, Rachele Sprugnoli and Antonio Uva	“Who was Pietro Badoglio?” Towards a QA system for Italian History

	Session P05 - Machine Translation (1)	Chair: Martin Volk
14:45-16:25	Mihael Arcan, Caoilfhionn Lane, Eoin Ó Droighneáin and Paul Buitelaar	IRIS: English-Irish Machine Translation System
14:45-16:25	George Tambouratzis and Vasiliki Pouli	Linguistically Inspired Language Model Augmentation for MT
14:45-16:25	Gavin Abercrombie	A Rule-based Shallow-transfer Machine Translation System for Scots and English
14:45-16:25	Matīss Rikters and Inguna Skadina	Syntax-based Multi-system Machine Translation
14:45-16:25	Sanja Štajner, Andreia Querido, Nuno Rendeiro, João António Rodrigues and António Branco	Use of Domain-Specific Language Resources in Machine Translation
14:45-16:25	Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela and Josef van Genabith	CATaLog Online: Porting a Post-editing Tool to the Web
14:45-16:25	Akira Hayakawa, Saturnino Luz, Loredana Cerrato and Nick Campbell	The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus
14:45-16:25	Kugatsu Sadamitsu, Itsumi Saito, Taichi Katayama, Hisako Asano and Yoshihiro Matsuo	Name Translation based on Fine-grained Named Entity Recognition in a Single Language
14:45-16:25	Sreelekha S and Pushpak Bhattacharyya	Lexical Resources to Enrich English Malayalam Machine Translation
14:45-16:25	Yong Xu and François Yvon	Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts
14:45-16:25	Liane Guillou and Christian Hardmeier	PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation
14:45-16:25	Chenhui Chu and Sadao Kurohashi	Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation

	Session P06 - Parsing	Chair: Giuseppe Attardi
14:45-16:25	Joachim Daiber and Rob van der Goot	The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions
14:45-16:25	Xiaoyin Che, Cheng Wang, Haojin Yang and Christoph Meinel	Punctuation Prediction for Unsegmented Transcript Based on Word Vector
14:45-16:25	Hao Zhou, Yue Zhang, Shujian Huang, Xin-Yu Dai and Jiajun Chen	Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing
14:45-16:25	Atsushi Ushiku, Tetsuro Sasada and Shinsuke Mori	Language Resource Addition Strategies for Raw Text Parsing
14:45-16:25	Yuval Marton and Kristina Toutanova	E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses
14:45-16:25	Liesbeth Augustinus, Peter Dirix, Daniel Van Niekerk, Ineke Schuurman, Vincent Vandeghinste, Frank Van Eynde and Gerhard Van Huyssteen	AfriBooms: An Online Treebank for Afrikaans
14:45-16:25	Edoardo Maria Ponti and Marco Passarotti	Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin
14:45-16:25	Roald Eiselen	South African Language Resources: Phrase Chunking
14:45-16:25	Jindřich Libovický	Neural Scoring Function for MST Parser
14:45-16:25	Inari Listenmaa and Koen Claessen	Analysing Constraint Grammars with a SAT-solver
14:45-16:25	Achim Stein	Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View
14:45-16:25	Maria Pia di Buono	Semi-automatic Parsing for Web Knowledge Extraction through Semantic Annotation

	Session P07 - Speech Corpora and Databases (1)	Chair: Carmen García Mateo
14:45-16:25	Jochen Weiner, Claudia Frankenberg, Dominic Telaar, Britta Wendelstein, Johannes Schröder and Tanja Schultz	Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging
14:45-16:25	Moez Ajili, Jean-françois Bonastre, Juliette Kahn, Solange Rossato and Guillaume Bernard	FABIOLE, a Speech Database for Forensic Speaker Comparison
14:45-16:25	Nawar Halabi and Mike Wald	Phonetic Inventory for an Arabic Speech Corpus
14:45-16:25	Yun-Nung Chen and Dilek Hakkani-Tur	AIMU: Actionable Items for Meeting Understanding
14:45-16:25	Felix Burkhardt and Uwe D. Reichel	A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance
14:45-16:25	Patricia Braunger, Hansjörg Hofmann, Steffen Werner and Maria Schmidt	A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems
14:45-16:25	Xabier Sarasola, Eva Navas, David Tavarez, Daniel Erro, Ibon Saratxaga and Inma Hernaez	A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza
14:45-16:25	Hannes Pessentheiner, Thomas Pichler and Martin Hagmüller	AMISCO: The Austrian German Multi-Sensor Corpus
14:45-16:25	Philipp Aichinger, Immer Roesner, Matthias Leonhard, Doris-Maria Denk-Linnert, Wolfgang Bigenzahn and Berit Schneider-Stickler	A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices
14:45-16:25	Neli Hateva, Petar Mitankin and Stoyan Mihov	BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology
14:45-16:25	Mārcis Pinnis, Askars Salimbajevs and Ilze Auzina	Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian
14:45-16:25	Jorge Proença, Dirce Celorico, Sara Candeias, Carla Lopes and Fernando Perdigão	The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation
14:45-16:25	Uwe Reichel, Florian Schiel, Thomas Kisler, Christoph Draxler and Nina Pörner	The BAS Speech Data Repository
14:45-16:25	Emre Yilmaz, Mario Ganzeboom, Lilian Beijer, Catia Cucchiarini and Helmer Strik	A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research

	Session P08 - Summarisation	Chair: Gerard de Melo
14:45-16:25	Muhammad Humayoun, Rao Muhammad Adeel Nawab, Muhammad Uzair, Saba Aslam and Omer Farzand	Urdu Summary Corpus
14:45-16:25	Fajri Koto	A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization
14:45-16:25	Arman Cohan and Nazli Goharian	Revisiting Summarization Evaluation for Scientific Articles
14:45-16:25	Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama and Hugo Zaragoza	The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015

	Session P09 - Word Sense Disambiguation (1)	Chair: Luca Dini
14:45-16:25	Luigi Di Caro and Guido Boella	Automatic Enrichment of WordNet with Common-Sense Knowledge
14:45-16:25	Vít Baisa, Silvie Cinkova, Ema Krejčová and Anna Vernerová	VPS-GradeUp: Graded Decisions on Usage Patterns
14:45-16:25	Tristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho and Iryna Gurevych	Sense-annotating a Lexical Substitution Data Set with Ubyline
14:45-16:25	Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner and Manfred Pinkal	A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
14:45-16:25	Bolette Pedersen, Anna Braasch, Anders Johannsen, Héctor Martínez Alonso, Sanni Nimb, Sussi Olsen, Anders Søgaard and Nicolai Hartvig Sørensen	The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories
14:45-16:25	Silvie Cinkova, Ema Krejčová, Anna Vernerová and Vít Baisa	Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
14:45-16:25	Yanan Lu, Yue Zhang and Donghong Ji	Multi-prototype Chinese Character Embedding
14:45-16:25	Angel Chang, Valentin I. Spitkovsky, Christopher D. Manning and Eneko Agirre	A comparison of Named-Entity Disambiguation and Word Sense Disambiguation

	Session P10 - Discourse (1)	Chair: Elena Cabrio
16:45-18:05	Patrick Saint-Dizier	Argument Mining: the Bottleneck of Knowledge and Language Resources
16:45-18:05	Ekaterina Lapshinova-Koltunski, Kerstin Anna Kunz and Anna Nedoluzhko	From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse
16:45-18:05	Henk van den Heuvel and Nelleke Oostdijk	Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans
16:45-18:05	Yang Liu, Jiajun Zhang, Chengqing Zong, Yating Yang and Xi Zhou	A Bilingual Discourse Corpus and Its Applications
16:45-18:05	Tatjana Scheffler and Manfred Stede	Adding Semantic Relations to a Large-Coverage Connective Lexicon of German
16:45-18:05	Mathilde Janier and Chris Reed	Corpus Resources for Dispute Mediation Discourse
16:45-18:05	Carlos Valmaseda, Juan Martinez-Romo and Lourdes Araujo	A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers
16:45-18:05	Stephanie Lukin, Kevin Bowden, Casey Barackman and Marilyn Walker	PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs
16:45-18:05	Huan-Yuan Chen, Wan-Shan Liao, Hen-Hsen Huang and Hsin-Hsi Chen	Fine-Grained Chinese Discourse Relation Labelling
16:45-18:05	Ines Rehbein, Merel Scholman and Vera Demberg	Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks
16:45-18:05	Carole Lailler, Anaïs Landeau, Frédéric Béchet, Yannick Estève and Paul Deléglise	Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks
16:45-18:05	Manfred Stede, Stergos Afantenos, Andreas Peldszus, Nicholas Asher and Jérémy Perret	Parallel Discourse Annotations on a Corpus of Short Texts
16:45-18:05	John Lee and Chak Yan Yeung	An Annotated Corpus of Direct Speech

	Session P11 - Morphology (1)	Chair: Éric de la Clergerie
16:45-18:05	Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria and Mans Hulden	Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and Slovene
16:45-18:05	Kareem Darwish and Hamdy Mubarak	Farasa: A New Fast and Accurate Arabic Word Segmenter
16:45-18:05	Eckhard Bick	A Morphological Lexicon of Esperanto with Morpheme Frequencies
16:45-18:05	Wuying Liu and Lin Wang	How does Dictionary Size Influence Performance of Vietnamese Word Segmentation?
16:45-18:05	Nabil Hathout and Fiammetta Namer	Giving Lexical Resources a Second Life: Démonette, a Multi-sourced Morpho-semantic Network for French
16:45-18:05	Carola Trips	Syntactic Analysis of Phrasal Compounds in Corpora: a Challenge for NLP Tools
16:45-18:05	Salam Khalifa, Houda Bouamor and Nizar Habash	DALILA: The Dialectal Arabic Linguistic Learning Assistant
16:45-18:05	Petra Steiner	Refurbishing a Morphological Database for German

	Session P12 - Sentiment Analysis and Opinion Mining (1)	Chair: German Rigau
16:45-18:05	Cédric Lopez, Frederique Segond and Christiane Fellbaum	Encoding Adjective Scales for Fine-grained Resources
16:45-18:05	Mario Sänger, Ulf Leser, Steffen Kemmerer, Peter Adolphs and Roman Klinger	SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German
16:45-18:05	Marianna Apidianaki, Xavier Tannier and Cécile Richart	Datasets for Aspect-Based Sentiment Analysis in French
16:45-18:05	Samira Shaikh, Kit Cho, Tomek Strzalkowski, Laurie Feldman, John Lien, Ting Liu and George Aaron Broadwell	ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages
16:45-18:05	Uladzimir Sidarenka	PotTS: The Potsdam Twitter Sentiment Corpus
16:45-18:05	Diana Maynard and Kalina Bontcheva	Challenges of Evaluating Sentiment Analysis Tools on Social Media
16:45-18:05	Jasy Suet Yan Liew, Howard R. Turtle and Elizabeth D. Liddy	EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis
16:45-18:05	Svetlana Kiritchenko and Saif Mohammad	Happy Accident: A Sentiment Composition Lexicon for Opposing Polarity Phrases
16:45-18:05	Alexandra Balahur and Hristo Tanev	Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties
16:45-18:05	Natalia Loukachevitch and Anatolii Levchik	Creating a General Russian Sentiment Lexicon
16:45-18:05	Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo and Piek Vossen	GRaSP: A Multilayered Annotation Scheme for Perspectives
16:45-18:05	Wejdene Khiari, Mathieu Roche and Asma Bouhafs Hafsia	Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMS
16:45-18:05	Fabio Tamburini	Specialising Paragraph Vectors for Text Polarity Detection
16:45-18:05	Grégoire Jadi, Vincent Claveau, Béatrice Daille and Laura Monceaux	Evaluating Lexical Similarity to build Sentiment Similarity

	Session P13 - Semantics (1)	Chair: Christian Chiarcos
16:45-18:05	Maximilian Köper, Melanie Zaiß, Qi Han, Steffen Koch and Sabine Schulte im Walde	Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification
16:45-18:05	Nabin Maharjan, Rajendra Banjade, Nobal Bikram Niraula and Vasile Rus	SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores
16:45-18:05	Ingrid Falk and Fabienne Martin	Aspectual Flexibility Increases with Agentivity and Concreteness\\ A Computational Classification Experiment on Polysemous Verbs
16:45-18:05	Silvio Cordeiro, Carlos Ramisch and Aline Villavicencio	mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing
16:45-18:05	Elias Iosif, Spiros Georgiladakis and Alexandros Potamianos	Cognitively Motivated Distributional Representations of Meaning
16:45-18:05	Yoshihiko Hayashi and Wentao Luo	Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual Settings
16:45-18:05	Ann Copestake, Guy Emerson, Michael Wayne Goodman, Matic Horvat, Alexander Kuhnle and Ewa Muszyńska	Resources for building applications with Dependency Minimal Recursion Semantics
16:45-18:05	Chung-Lun Kuo and Hsin-Hsi Chen	Subtask Mining from Search Query Logs for How-Knowledge Acceleration
16:45-18:05	Daria Ryzhova, Maria Kyuseva and Denis Paperno	Typology of Adjectives Benchmark for Compositional Distributional Models
16:45-18:05	Tom Bosc, Elena Cabrio and Serena Villata	DART: a Dataset of Arguments and their Relations on Twitter

	Session P14 - Lexical Databases	Chair: Amália Mendes
18:10-19:10	Patrick Saint-Dizier	LELIO: An Auto-Adaptative System to Acquire Domain Lexical Knowledge in Technical Texts
18:10-19:10	Yugo Murawaki and Shinsuke Mori	Wikification for Scriptio Continua
18:10-19:10	Bartłomiej Nitoń, Tomasz Bartosiak and Elżbieta Hajnicz	Accessing and Elaborating Walenty - a Valence Dictionary of Polish - via Internet Browser
18:10-19:10	Ana Lúcia Santos, Maria João Freitas and Aida Cardoso	CEPLEXicon ― A Lexicon of Child European Portuguese
18:10-19:10	Gregory Grefenstette	Extracting Weighted Language Lexicons from Wikipedia
18:10-19:10	Nabil Hathout and Franck Sajous	Wiktionnaire's Wikicode GLAWIfied: a Workable French Machine-Readable Dictionary

	Session P15 - Multimodality	Chair: Carlo Strapparava
18:10-19:10	Laura Hollink, Adriatik Bedjeti, Martin van Harmelen and Desmond Elliott	A Corpus of Images and Text in Online News
18:10-19:10	Necati Cihan Camgöz, Ahmet Alp Kındıroğlu, Serpil Karabüklü, Meltem Kelepir, Ayşe Sumru Özsoy and Lale Akarun	BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
18:10-19:10	Michel Vacher, Saïda Bouakaz, Marc-Eric Bobillier Chaumon, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato and Benjamin Lecouteux	The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People
18:10-19:10	Niraj Shrestha and Marie-Francine Moens	Semi-automatically Alignment of Predicates between Speech and OntoNotes data
18:10-19:10	María del Carmen Cabeza-Pereiro, José Mª García-Miguel, Carmen García Mateo and José Luis Alba Castro	CORILSE: a Spanish Sign Language Repository for Linguistic Analysis
18:10-19:10	Stephanie Schreitter and Brigitte Krenn	The OFAI Multi-Modal Task Description Corpus
18:10-19:10	Shinsuke Mori, John Richardson, Atsushi Ushiku, Tetsuro Sasada, Hirotaka Kameko and Yoshimasa Tsuruoka	A Japanese Chess Commentary Corpus
18:10-19:10	Johann Poignant, Mateusz Budnik, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau, Gilles Adda, Laurent Besacier, Hazim Ekenel, Gil Francopoulo, Javier Hernando, Joseph Mariani, Ramon Morros, Georges Quénot, Sophie Rosset and Thomas Tamisier	The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
18:10-19:10	Andy Luecking, Alexander Mehler, Désirée Walther, Marcel Mauri and Dennis Kurfürst	Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus
18:10-19:10	Kai Frederic Engelmann, Patrick Holthaus, Britta Wrede and Sebastian Wrede	An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes
18:10-19:10	Alex Becker, Fabio Kepler and Sara Candeias	A Web Tool for Building Parallel Corpora of Spoken and Sign Languages

	Session P16 - Ontologies	Chair: Elena Montiel Ponsoda
18:10-19:10	Sharmin Muzaffar, Pitambar Behera and Girish Jha	Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform
18:10-19:10	Liumingjing Xiao, Chong Ruan, An Yang, Junhao Zhang and Junfeng Hu	Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia
18:10-19:10	Janne M Johannessen, Arash Saidi and Kristin Hagen	Constructing a Norwegian Academic Wordlist
18:10-19:10	Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau and Anne-Lyse Minard	The Event and Implied Situation Ontology (ESO): Application and Evaluation
18:10-19:10	Maria Sukhareva and Christian Chiarcos	Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low German
18:10-19:10	Maxence Girard-Rivier, Romain Magnani, Veronique Auberge, Yuko Sasa, Liliya Tsvetanova, Frederic Aman and Clarisse Bayol	Ecological Gestures for HRI: the GEE Corpus
18:10-19:10	Rogelio Nazar and Irene Renau	A Taxonomy of Spanish Nouns, a Statistical Algorithm to Generate it and its Implementation in Open Source Code

	Session P17 - Part of Speech Tagging (1)	Chair: Krister Linden
18:10-19:10	Swantje Westpfahl and Thomas Schmidt	FOLK-Gold ― A Gold Standard for Part-of-Speech-Tagging of Spoken German
18:10-19:10	Fahad Albogamy and Allan Ramsay	Fast and Robust POS tagger for Arabic Tweets Using Agreement-based Bootstrapping
18:10-19:10	Steffen Eger, Rüdiger Gleim and Alexander Mehler	Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art
18:10-19:10	Tim vor der Brück and Alexander Mehler	TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random Fields
18:10-19:10	Lauriane Aufrant, Guillaume Wisniewski and François Yvon	Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian
18:10-19:10	Nikola Ljubešić and Tomaž Erjavec	Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene

	Session P18 - Treebanks (1)	Chair: Béatrice Daille
18:10-19:10	Quy Nguyen, Yusuke Miyao, Ha Le and Ngan Nguyen	Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
18:10-19:10	Kanta Suzuki, Yoshihide Kato and Shigeki Matsubara	Correcting Errors in a Treebank Based on Tree Mining
18:10-19:10	Philippe Blache, Gregoire de Montcheuil, Laurent Prévot and Stéphane Rauzy	4Couv: A New Treebank for French
18:10-19:10	Rita de Carvalho, Andreia Querido, Marisa Campos, Rita Valadas Pereira, João Silva and António Branco	CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese
18:10-19:10	Kadri Muischnek, Kaili Müürisep and Tiina Puolakainen	Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies
18:10-19:10	Kaja Dobrovoljc and Joakim Nivre	The Universal Dependencies Treebank of Spoken Slovenian
18:10-19:10	Ye Kyaw Thu, Win Pa Pa, Masao Utiyama, Andrew Finch and Eiichiro Sumita	Introducing the Asian Language Treebank (ALT)
18:10-19:10	Lilja Øvrelid and Petter Hohle	Universal Dependencies for Norwegian

Day 2, Oral Sessions:

	Session O17 - Language Resource Policies	Chairperson: Edouard Geoffrois
9:45-10:05	Georg Rehm, Jan Hajic, Josef van Genabith and Andrejs Vasiļjevs	Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
10:05-10:25	Karën Fort and Alain Couillault	Yes, We Care! Results of the Ethics and Natural Language Processing Surveys
10:25-10:45	David Lewis, Kaniz Fatema, Alfredo Maldonado, Brian Walshe and Arturo Calvo	Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation Projects
10:45-11:05	Valérie Mapelli, Vladimir Popescu, Lin Liu and Khalid Choukri	Language Resource Citation: the ISLRN Dissemination and Further Developments
11:05-11:25	Denise DiPersio and Christopher Cieri	Trends in HLT Research: A Survey of LDC's Data Scholarship Program

	Session O18 - Tweet Corpora and Analysis	Chairperson: Bernardo Magnini
9:45-10:05	Cristina Bosco, Mirko Lai, Viviana Patti and Daniela Virone	Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTous
10:05-10:25	Ayman Alhelbawy, Poesio Massimo and Udo Kruschwitz	Towards a Corpus of Violence Acts in Arabic Social Media
10:25-10:45	Ben Verhoeven, Walter Daelemans and Barbara Plank	TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
10:45-11:05	Muhammad Imran, Prasenjit Mitra and Carlos Castillo	Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages
11:05-11:25	Rafiya Begum, Kalika Bali, Monojit Choudhury, Koustav Rudra and Niloy Ganguly	Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments

	Session O19 - Dependency Treebanks	Chairperson: Simonetta Montemagni
9:45-10:05	Takaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori and Yuji Matsumoto	Universal Dependencies for Japanese
10:05-10:25	Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty and Daniel Zeman	Universal Dependencies v1: A Multilingual Treebank Collection
10:25-10:45	Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto	Construction of an English Dependency Corpus incorporating Compound Function Words
10:45-11:05	Maria Simi and Giuseppe Attardi	Adapting the TANL tool suite to Universal Dependencies
11:05-11:25	Tak-sum Wong and John Lee	A Dependency Treebank of the Chinese Buddhist Canon

	Session O20 - Word Sense Disambiguation	Chairperson: Nancy Ide
9:45-10:05	Juan Antonio Lossio-Ventura, Clement Jonquet, Mathieu Roche and Maguelonne Teisseire	Automatic Biomedical Term Polysemy Detection
10:05-10:25	Domagoj Alagić and Jan Šnajder	Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation
10:25-10:45	Marten Postma, Ruben Izquierdo, Eneko Agirre, German Rigau and Piek Vossen	Addressing the MFS Bias in WSD systems
10:45-11:05	José Camacho-Collados, Claudio Delli Bovi, Alessandro Raganato and Roberto Navigli	A Large-Scale Multilingual Disambiguation of Glosses
11:05-11:25	Stefan Ecker, Andrea Horbach and Stefan Thater	Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages

	Session O21 - Social Media	Chairperson: Piek Vossen
11:45-12:05	Geraldine Damnati, Aleksandra Guerraz and Delphine Charlet	Web Chat Conversations from Contact Centers: a Descriptive Study
12:05-12:25	François Morlane-Hondère, Cyril Grouin and Pierre Zweigenbaum	Identification of Drug-Related Medical Conditions in Social Media
12:25-12:45	Manuel Burghardt, Daniel Granvogl and Christian Wolff	Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing
12:45-13:05	Vinodkumar Prabhakaran and Owen Rambow	A Corpus of Wikipedia Discussions: Over the Years, with Topic, Power and Gender Labels

	Session O22 - Anaphora and Coreference	Chairperson: Eva Hajičová
11:45-12:05	Jon Chamberlain, Massimo Poesio and Udo Kruschwitz	Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.
12:05-12:25	Evandro Fonseca, André Antonitsch, Sandra Collovini, Daniela Amaral, Renata Vieira and Anny Figueira	Summ-it++: an Enriched Version of the Summ-it Corpus
12:25-12:45	Alicia Burga, Sergio Cajal, Joan Codina-Filba and Leo Wanner	Towards Multiple Antecedent Coreference Resolution in Specialized Discourse
12:45-13:05	Olga Uryupina, Ron Artstein, Antonella Bristot, Federica Cavicchio, Kepa Rodriguez and Massimo Poesio	ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions

	Session O23 - Machine Learning and Information Extraction	Chairperson: Feiyu Xu
11:45-12:05	Eric Yeh, John Niekrasz, Dayne Freitag and Richard Rohwer	An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text
12:05-12:25	Rosa Tsegaye Aga, Christian Wartena, Lucas Drumond and Lars Schmidt-Thieme	Learning Thesaurus Relations from Distributional Features
12:25-12:45	Dina Wonsever, Aiala Rosá and Marisa Malcuori	Factuality Annotation and Learning in Spanish Texts
12:45-13:05	Frederico Tommasi Caroli, André Freitas, João Carlos Pereira da Silva and Siegfried Handschuh	NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network Models

	Session O24 - Speech Corpus for Health	Chairperson: Eleni Efthimiou
11:45-12:05	Daniela Beltrami, Laura Calzà, Gloria Gagliardi, Enrico Ghidoni, Norina Marcello, Rema Rossini Favretti and Fabio Tamburini	Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions
12:05-12:25	Mario Corrales-Astorgano, David Escudero-Mancebo, Yurena Gutiérrez-González, Valle Flores-Lucas, César González-Ferreras and Valentín Cardeñoso-Payo	On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities
12:25-12:45	Julia Parish-Morris, Christopher Cieri, Mark Liberman, Leila Bateman, Emily Ferguson and Robert T. Schultz	Building Language Resources for Exploring Autism Spectrum Disorders
12:45-13:05	Naim Terbeh and Mounir Zrigui	Vocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech

	Session O25 - Sentiment Analysis	Chairperson: Frédérique Segond
14:55-15:15	Vuk Batanović, Boško Nikolić and Milan Milosavljević	Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review Dataset
15:15-15:35	Shih-Ming Wang and Lun-Wei Ku	ANTUSD: A Large Chinese Sentiment Dictionary
15:35-15:55	Md Shad Akhtar, Asif Ekbal and Pushpak Bhattacharyya	Aspect based Sentiment Analysis in Hindi: Resource Creation and Evaluation
15:55-16:15	Wafia Adouane and Richard Johansson	Gulf Arabic Linguistic Resource Building for Sentiment Analysis
16:15-16:35	Samira Noferesti and Mehrnoush Shamsfard	Using Data Mining Techniques for Sentiment Shifter Identification

	Session O26 - Discourse and Dialogue	Chairperson: Mark Liberman
14:55-15:15	Nicholas Asher, Julie Hunter, Mathieu Morey, Benamara Farah and Stergos Afantenos	Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
15:15-15:35	Guillaume Dubuisson Duplessis, Vincent Letard, Anne-Laure Ligozat and Sophie Rosset	Purely Corpus-based Automatic Conversation Authoring
15:35-15:55	Masashi Inoue and Hiroshi Ueno	Dialogue System Characterisation by Back-channelling Patterns Extracted from Dialogue Corpus
15:55-16:15	Eli Pincus and David Traum	Towards Automatic Identification of Effective Clues for Team Word-Guessing Games
16:15-16:35	Longyue Wang, Xiaojun Zhang, Zhaopeng Tu, Andy Way and Qun Liu	Automatic Construction of Discourse Corpora for Dialogue Translation

	Session O27 - Machine Translation and Evaluation (2)	Chairperson: Nizar Habash
14:55-15:15	Marina Fomicheva and Núria Bel	Using Contextual Information for Machine Translation Evaluation
15:15-15:35	João António Rodrigues, Nuno Rendeiro, Andreia Querido, Sanja Štajner and António Branco	Bootstrapping a Hybrid MT System to a New Language Pair
15:35-15:55	Márton Makrai	Filtering Wiktionary Triangles by Linear Mbetween Distributed Word Models
15:55-16:15	Takuya Matsuzaki, Akira Fujita, Naoya Todo and Noriko H. Arai	Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency Tests
16:15-16:35	Steven Neale, Luís Gomes, Eneko Agirre, Oier Lopez de Lacalle and António Branco	Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models

	Session O28 - Corpus Querying and Crawling	Chairperson: Tomaž Erjavec
14:55-15:15	K. Bretonnel Cohen, William A. Baumgartner Jr. and Irina Temnikova	SuperCAT: The (New and Improved) Corpus Analysis Toolkit
15:15-15:35	Szymon Roziewski and Wojciech Stokowiec	LanguageCrawl: A Generic Tool for Building Language Models Upon Common-Crawl
15:35-15:55	Thomas Eckart, Christoph Kuras and Uwe Quasthoff	Features for Generic Corpus Querying
15:55-16:15	Vít Baisa, Jan Michelfeit, Marek Medveď and Milos Jakubicek	European Union Language Resources in Sketch Engine
16:15-16:35	Piotr Banski, Elena Frick and Andreas Witt	Corpus Query Lingua Franca (CQLF)

	Session O29 - Panel on International Initiatives from Public Agencies	Chairperson: Khalid Choukri
16:55-18:15

	Session O30 - Multimodality, Multimedia and Evaluation	Chairperson: Nick Campbell
16:55-17:15	Aitor Alvarez, Marina Balenciaga, Arantza del Pozo, Haritz Arzelus, Anna Matamala and Carlos-D. Martínez-Hinarejos	Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles
17:15-17:35	Desmond Elliott and Martijn Kleppe	1 Million Captioned Dutch Newspaper Images
17:35-17:55	Josiah Wang and Robert Gaizauskas	Cross-validating Image Description Datasets and Evaluation Metrics
17:55-18:15	Polina Yanovich, Carol Neidle and Dimitris Metaxas	Detection of Major ASL Sign Types in Continuous Signing For ASL Recognition

	Session O31 - Summarisation and Simplification	Chairperson: Udo Kruschwitz
16:55-17:15	Gustavo Paetzold and Lucia Specia	Benchmarking Lexical Simplification Systems
17:15-17:35	Beatriz Fisas, Francesco Ronzano and Horacio Saggion	A Multi-Layered Annotated Corpus of Scientific Papers
17:35-17:55	Yashar Mehdad, Amanda Stent, Kapil Thadani, Dragomir Radev, Youssef Billawala and Karolina Buchner	Extractive Summarization under Strict Length Constraints
17:55-18:15	Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple and Robert Gaizauskas	What’s the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems

	Session O32 - Morphology (2)	Chairperson: Marko Tadić
16:55-17:15	Javad Nouri and Roman Yangarber	A Novel Evaluation Method for Morphological Segmentation
17:15-17:35	Amir Hazem and Béatrice Daille	Bilingual Lexicon Extraction at the Morpheme Level Using Distributional Analysis
17:35-17:55	John Sylak-Glassman, Christo Kirov and David Yarowsky	Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages
17:55-18:15	Christo Kirov, John Sylak-Glassman, Roger Que and David Yarowsky	Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms

	Session O33 - Textual Entailment	Chairperson: Lucia Specia
18:20-18:40	Kathrin Eichler, Feiyu Xu, Hans Uszkoreit, Leonhard Hennig and Sebastian Krause	TEG-REP: A corpus of Textual Entailment Graphs based on Relation Extraction Patterns
18:40-19:00	Biralatei Fawei, Adam Wyner and Jeff Pan	Passing a USA National Bar Exam: a First Corpus for Experimentation
19:00-19:20	Ngoc Phuoc An Vo and Octavian Popescu	Corpora for Learning the Mutual Relationship between Semantic Relatedness and Textual Entailment

	Session O34 - Document Classification, Text categorisation and Topic Detection	Chairperson: Iryna Gurevych
18:20-18:40	Adriana Ferrugento, Hugo Gonçalo Oliveira, Ana Alves and Filipe Rodrigues	Can Topic Modelling benefit from Word Sense Information?
18:40-19:00	Prasha Shrestha, Nicolas Rey-Villamizar, Farig Sadeque, Ted Pedersen, Steven Bethard and Thamar Solorio	Age and Gender Prediction on Health Forum Data
19:00-19:20	Sergiu Nisioi	Comparing Speech and Text Classification on ICNALE

	Session O35 - Detecting Information in Medical Domain	Chairperson: Dimitrios Kokkinakis
18:20-18:40	Elena Arsevska, Mathieu Roche, Sylvain Falala, Renaud Lancelot, David Chavernac, Pascal Hendrikx and Barbara Dufour	Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
18:40-19:00	Stephen Wu, Tamara Timmons, Amy Yates, Meikun Wang, Steven Bedrick, William Hersh and Hongfang Liu	On Developing Resources for Patient-level Information Retrieval
19:00-19:20	Prescott Klassen, Fei Xia and Meliha Yetisgen	Annotating and Detecting Medical Events in Clinical Notes

	Session O36 - Speech Synthesis	Chairperson: Diana Santos
18:20-18:40	Sunayana Sitaram and Alan W Black	Speech Synthesis of Code-Mixed Text
18:40-19:00	Neasa Ní Chiaráin and Ailbhe Ní Chasaide	Chatbot Technology with Synthetic Voices in the Acquisition of an Endangered Language: Motivation, Development and Evaluation of a Platform for Irish
19:00-19:20	Nick Campbell	CHATR the Corpus; a 20-year-old archive of Concatenative Speech Synthesis

Day 2, Poster Sessions:

	Session P19 - Discourse (2)	Chair: Olga Uryupina
9:45-11:25	Manfred Stede and Sara Mamprin	Information structure in the Potsdam Commentary Corpus: Topics
9:45-11:25	Jonathon Read, Erik Velldal, Marc Cavazza and Gersende Georg	A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations
9:45-11:25	Amy Isard	The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts
9:45-11:25	Daniel Duma, Maria Liakata, Amanda Clare, James Ravenscroft and Ewan Klein	Applying Core Scientific Concepts to Context-Based Citation Recommendation
9:45-11:25	Ina Roesiger	SciCorp: A Corpus of English Scientific Articles Annotated for Information Status Analysis
9:45-11:25	Rohit Jain, Himanshu Sharma and Dipti Sharma	Using lexical and Dependency Features to Disambiguate Discourse Connectives in Hindi
9:45-11:25	Marta Andersson, Adnan Ozturel and Silvia Pareti	Annotating Topic Development in Information Seeking Queries
9:45-11:25	Jiří Mírovský, Lucie Poláková and Jan Štěpánek	Searching in the Penn Discourse Treebank Using the PML-Tree Query
9:45-11:25	Ghada Alharbi and Thomas Hain	The OpenCourseWare Metadiscourse (OCWMD) Corpus
9:45-11:25	Nicolas Hernandez, Soufian Salim and Elizaveta Loginova Clouet	Ubuntu-fr: A Large and Open Corpus for Multi-modal Analysis of Online Written Conversations
9:45-11:25	Julian Hough, Ye Tian, Laura de Ruiter, Simon Betz, Spyros Kousidis, David Schlangen and Jonathan Ginzburg	DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter

	Session P20 - Document Classification and Text Categorisation (1)	Chair: Fabio Tamburini
9:45-11:25	Guntis Barzdins, Steve Renals and Didzis Gosko	Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
9:45-11:25	Cynthia Van Hee, Els Lefever and Veronique Hoste	Exploring the Realization of Irony in Twitter Data
9:45-11:25	Cyril Goutte, Serge Léger, Shervin Malmasi and Marcos Zampieri	Discriminating Similar Languages: Evaluations and Explorations
9:45-11:25	Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell and Ayman Alghamdi	Compilation of an Arabic Children’s Corpus
9:45-11:25	Robin Eriksson	Quality Assessment of the Reuters Vol. 2 Multilingual Corpus
9:45-11:25	Mahmoud El-Haj, Paul Rayson, Steve Young, Andrew Moore, Martin Walker, Thomas Schleicher and Vasiliki Athanasakou	Learning Tone and Attribution for Financial Text Mining
9:45-11:25	Roman Sergienko, Muhammad Shan and Wolfgang Minker	A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances
9:45-11:25	Muhammad Sharjeel, Paul Rayson and Rao Muhammad Adeel Nawab	UPPC - Urdu Paraphrase Plagiarism Corpus
9:45-11:25	Yannis Korkontzelos, Paul Thompson and Sophia Ananiadou	Identifying Content Types of Messages Related to Open Source Software Projects
9:45-11:25	Minglei Li, Yunfei Long, Lu Qin and Wenjie Li	Emotion Corpus Construction Based on Selection from Hashtags

	Session P21 - Evaluation Methodologies (2)	Chair: António Branco
9:45-11:25	Björn Gambäck and Amitava Das	Comparing the Level of Code-Switching in Corpora
9:45-11:25	Markus Müller, Sarah Fünfer, Sebastian Stüker and Alex Waibel	Evaluation of the KIT Lecture Translation System
9:45-11:25	Behrang QasemiZadeh and Anne-Kathrin Schumann	The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods
9:45-11:25	Wajdi Zaghouani, Nizar Habash, Ossama Obeid, Behrang Mohit, Houda Bouamor and Kemal Oflazer	Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation
9:45-11:25	Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondrej Klejch, Martin Popel and Maja Popović	Tools and Guidelines for Principled Machine Translation Development
9:45-11:25	Olivier Galibert, Mohamed Ameur Ben Jannet, Juliette Kahn and Sophie Rosset	Generating Task-Pertinent sorted Error Lists for Speech Recognition

	Session P22 - Information Extraction and Retrieval (2)	Chair: Robert Gaizauskas
9:45-11:25	Gil Francopoulo, Joseph Mariani and Patrick Paroubek	A Study of Reuse and Plagiarism in LREC papers
9:45-11:25	Debasis Ganguly, Iacer Calixto and Gareth Jones	Developing a Dataset for Evaluating Approaches for Document Expansion with Images
9:45-11:25	Pablo Ruiz, Clément Plancq and Thierry Poibeau	More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing
9:45-11:25	Sandra Collovini, Gabriel Machado and Renata Vieira	A Sequence Model Approach to Relation Extraction in Portuguese
9:45-11:25	Daniel Hládek, Ján Staš and Jozef Juhár	Evaluation Set for Slovak News Information Retrieval
9:45-11:25	Takakazu Imada, Yusuke Inoue, Lei Chen, Syunya Doi, Tian Nie, Chen Zhao, Takehito Utsuro and Yasuhide Kawada	Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests
9:45-11:25	Adrien Bougouin, Sabine Barreaux, Laurent Romary, Florian Boudin and Beatrice Daille	TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
9:45-11:25	Hannah Kermes, Stefania Degaetano-Ortlieb, Ashraf Khamis, Jörg Knappen and Elke Teich	The Royal Society Corpus: From Uncharted Data to Corpus
9:45-11:25	Lorraine Goeuriot, Liadh Kelly, Guido Zuccon and Joao Palotti	Building Evaluation Datasets for Consumer-Oriented Information Retrieval
9:45-11:25	Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret and Romaric Besançon	A Dataset for Open Event Extraction in English

	Session P23 - Prosody and Phonology	Chair: Björn Schuller
9:45-11:25	Daniil Kocharov	Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech
9:45-11:25	Tatiana Kachkovskaia, Daniil Kocharov, Pavel Skrelin and Nina Volskaya	CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous Speech
9:45-11:25	Dan Dediu and Scott Moisik	Defining and Counting Phonological Classes in Cross-linguistic Segment Databases
9:45-11:25	Marta Martinez, Rocio Varela, Carmen García Mateo, Elisa Fernandez Rei and Adela Martinez Calvo	Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic Analysis
9:45-11:25	Naziba Mostafa, Yan Wan, Unnayan Amitabh and Pascale Fung	A Machine Learning based Music Retrieval and Recommendation System

	Session P24 - Speech Processing (1)	Chair: Andrew Caines
9:45-11:25	Frédéric Aman, Michel Vacher, François Portet, William Duclot and Benjamin Lecouteux	CirdoX: an on/off-line multisource speech and sound analysis software
9:45-11:25	Matthias Sperber, Graham Neubig, Satoshi Nakamura and Alex Waibel	Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces
9:45-11:25	Mauro Nicolao, Heidi Christensen, Stuart Cunningham, Phil Green and Thomas Hain	A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
9:45-11:25	Imed Laaridh, Corinne Fredouille and Christine Meunier	Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech
9:45-11:25	Alexander Gutkin, Linne Ha, Martin Jansche, Knot Pipatsrisawat and Richard Sproat	TTS for Low Resource Languages: A Bangla Synthesizer
9:45-11:25	Félicien Vallet, Jim Uro, Jérémy Andriamakaoly, Hakim Nabi, Mathieu Derval and Jean Carrive	Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context

	Session P25 - Crowdsourcing	Chair: Monica Monachini
11:45-13:05	Armin Hoenen	Wikipedia Titles As Noun Tag Predictors
11:45-13:05	Jun Harashima	Japanese Word―Color Associations with and without Contexts
11:45-13:05	Emiel van Miltenburg, Benjamin Timmermans and Lora Aroyo	The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database
11:45-13:05	Maria Sukhareva, Judith Eckle-Kohler, Ivan Habernal and Iryna Gurevych	Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
11:45-13:05	Anna Feltracco, Simone Magnolini, Elisabetta Jezek and Bernardo Magnini	Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing
11:45-13:05	Andrew Caines, Christian Bentz, Calbert Graham, Tim Polzehl and Paula Buttery	Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
11:45-13:05	Phil Bartie, William Mackaness, Dimitra Gkatzia and Verena Rieser	The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes
11:45-13:05	Simone Hantke, Erik Marchi and Björn Schuller	Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification

	Session P26 - Emotion Recognition/Generation	Chair: Saif Mohammad
11:45-13:05	Yoshiko Arimoto and Kazuo Okanoya	Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication Corpus
11:45-13:05	Brigitte Bigi and Roxane Bertrand	Laughter in French Spontaneous Conversational Dialogs
11:45-13:05	Kevin El Haddad, Huseyin Cakmak, Stéphane Dupont and Thierry Dutoit	AVAB-DBS: an Audio-Visual Affect Bursts Database for Synthesis
11:45-13:05	Nurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura and Kazuhiro Nakadai	Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition
11:45-13:05	Lucia C. Passaro and Alessandro Lenci	Evaluating Context Selection Strategies to Build Emotive Vector Space Models

	Session P27 - Machine Translation (2)	Chair: Aljoscha Burchardt
11:45-13:05	Antoine Bourlon, Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi	Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons
11:45-13:05	Diptesh Kanojia, Aditya Joshi, Pushpak Bhattacharyya and Mark James Carman	That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models
11:45-13:05	Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi and Hitoshi Isahara	ASPEC: Asian Scientific Paper Excerpt Corpus
11:45-13:05	Gorka Labaka, Iñaki Alegria and Kepa Sarasola	Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation
11:45-13:05	Xiaofeng Wu, Jinhua Du, Qun Liu and Andy Way	ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool
11:45-13:05	Jingyi Han and Núria Bel	Towards producing bilingual lexica from monolingual corpora
11:45-13:05	Luís Gomes and Gabriel Pereira Lopes	First Steps Towards Coverage-Based Sentence Alignment
11:45-13:05	Jeevanthi Liyanapathirana and Andrei Popescu-Belis	Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation
11:45-13:05	Frédéric Blain, Varvara Logacheva and Lucia Specia	Phrase Level Segmentation and Labelling of Machine Translation Errors
11:45-13:05	José Manuel Martínez Martínez and Mihaela Vela	SubCo: A Learner Translation Corpus of Human and Machine Subtitles

	Session P28 - Multiword Expressions	Chair: Irina Temnikova
11:45-13:05	Diana Bogantes, Eric Rodríguez, Alejandro Arauco, Alejandro Rodríguez and Agata Savary	Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects
11:45-13:05	Ziqi Zhang, Jie Gao and Fabio Ciravegna	JATE 2.0: Java Automatic Term Extraction with Apache Solr
11:45-13:05	Francesca Strik Lievers and Chu-Ren Huang	A lexicon of perception for the identification of synaesthetic metaphors in corpora
11:45-13:05	Malgorzata Marciniak, Agnieszka Mykowiecka and Piotr Rychlik	TermoPL - a Flexible Tool for Terminology Extraction
11:45-13:05	Sabine Schulte im Walde, Anna Hätty, Stefan Bott and Nana Khvtisavrishvili	GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds
11:45-13:05	Carlos Ramisch, Alexis Nasr, André Valli and José Deulofeu	DeQue: A Lexicon of Complex Prepositions and Conjunctions in French
11:45-13:05	Gyri Smørdal Losnegaard, Federico Sangati, Carla Parra Escartín, Agata Savary, Sascha Bargmann and Johanna Monti	PARSEME Survey on MWE Resources
11:45-13:05	Rodrigo Wilkens, Marco Idiart and Aline Villavicencio	Multiword Expressions in Child Language
11:45-13:05	Dhouha Bouamor, Leonardo Campillos Llanos, Anne-Laure Ligozat, Sophie Rosset and Pierre Zweigenbaum	Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
11:45-13:05	Sara Rodríguez-Fernández, Roberto Carlini, Luis Espinosa Anke and Leo Wanner	Example-based Acquisition of Fine-grained Collocation Resources
11:45-13:05	Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary and Petya Osenova	MWEs in Treebanks: From Survey to Guidelines
11:45-13:05	Dhirendra Singh, Sudha Bhingardive and Pushpak Bhattacharya	Multiword Expressions Dataset for Indian Languages

	Session P29 - Treebanks (2)	Chair: Claire Bonial
11:45-13:05	Philippe Blache, Stéphane Rauzy and Grégoire Montcheuil	MarsaGram: an excursion in the forests of parsing trees
11:45-13:05	Alexa Little and Stephen Tratz	EasyTree: A Graphical Tool for Dependency Tree Annotation
11:45-13:05	Edmundo Pavel Soriano Morales, Julien Ah-Pine and Sabine Loudcher	Hypergraph Modelization of a Syntactically Annotated English Wikipedia Dump
11:45-13:05	Yannick Versley and Julius Steen	Detecting Annotation Scheme Variation in Out-of-Domain Treebanks
11:45-13:05	Mojgan Seraji, Filip Ginter and Joakim Nivre	Universal Dependencies for Persian
11:45-13:05	Djamé Seddah and Marie Candito	Hard Time Parsing Questions: Building a QuestionBank for French
11:45-13:05	Sebastian Schuster and Christopher D. Manning	Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks
11:45-13:05	Maaz Anwar, Riyaz Ahmad Bhat, Dipti Sharma, Ashwini Vaidya, Martha Palmer and Tafseer Ahmed Khan	A Proposition Bank of Urdu
11:45-13:05	Vincent Kríž, Barbora Hladka and Zdenka Uresova	Czech Legal Text Treebank 1.0

	Session P30 - Linked Data	Chair: Felix Sasaki
14:55-16:35	Johann-Mattis List, Michael Cysouw and Robert Forkel	Concepticon: A Resource for the Linking of Concept Lists
14:55-16:35	Ingrid Falk and Achim Stein	LVF-lemon ― Towards a Linked Data Representation of “Les Verbes français”
14:55-16:35	Paloma Galvan, Virginia Francisco, Raquel Hervas and Gonzalo Mendez	Riddle Generation using Word Associations
14:55-16:35	Ewa Rudnicka, Wojciech Witkowski and Katarzyna Podlaska	Challenges of Adjective Mapping between plWordNet and Princeton WordNet
14:55-16:35	Aleksandra Gabryszak, Sebastian Krause, Leonhard Hennig, Feiyu Xu and Hans Uszkoreit	Relation- and Phrase-level Linking of FrameNet with Sar-graphs
14:55-16:35	Balázs Indig, Márton Miháltz and András Simonyi	Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information Transfer
14:55-16:35	Ravindra Harige and Paul Buitelaar	Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text
14:55-16:35	John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora and Jonathan Pool	The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud
14:55-16:35	Tatiana Lesnikova, Jérôme David and Jérôme Euzenat	Cross-lingual RDF Thesauri Interlinking

	Session P31 - LR Infrastructures and Architectures (1)	Chair: Yohei Murakami
14:55-16:35	Georg Rehm	The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources
14:55-16:35	Jun Harashima, Michiaki Ariga, Kenta Murata and Masayuki Ioki	A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research
14:55-16:35	Siim Orasmaa, Timo Petmanson, Alexander Tkachenko, Sven Laur and Heiki-Jaan Kaalep	EstNLTK - NLP Toolkit for Estonian
14:55-16:35	Justus Roux	South African National Centre for Digital Language Resources
14:55-16:35	Verena Lyding and Karin Schöne	Design and Development of the MERLIN Learner Corpus Platform
14:55-16:35	Menzo Windhouwer, Marc Kemps-Snijders, Paul Trilsbeek, André Moreira, Bas Van der Veen, Guilherme Silva and Daniel Von Reihn	FLAT: Constructing a CLARIN Compatible Home for Language Resources
14:55-16:35	Jan Odijk	CLARIAH in the Netherlands
14:55-16:35	Claus Zinn, Thorsten Trippel, Steve Kaminski and Emanuel Dima	Crosswalking from CMDI to Dublin Core and MARC 21
14:55-16:35	Denise DiPersio, Christopher Cieri and Daniel Jaquette	Data Management Plans and Data Centers
14:55-16:35	Udo Hahn, Franz Matthies, Erik Faessler and Johannes Hellrich	UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines
14:55-16:35	Lene Offersgaard and Dorte Haltrup Hansen	Facilitating Metadata Interoperability in CLARIN-DK
14:55-16:35	Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen and Christopher Cieri	The Language Application Grid and Galaxy

	Session P32 - Large Projects and Infrastructures (1)	Chair: Zygmunt Vetulani
14:55-16:35	Dan Tufiș, Verginica Barbu Mititelu, Elena Irimia, Ștefan Daniel Dumitrescu and Tiberiu Boroș	The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language
14:55-16:35	Michal Křen, Václav Cvrček, Tomáš Čapka, Anna Čermáková, Milena Hnátková, Lucie Chlumská, Tomáš Jelínek, Dominika Kováříková, Vladimír Petkevič, Pavel Procházka, Hana Skoumalová, Michal Škrabal, Petr Truneček, Pavel Vondřička and Adrian Jan Zasina	SYN2015: Representative Corpus of Contemporary Written Czech
14:55-16:35	Riccardo Del Gratta, Francesca Frontini, Monica Monachini, Gabriella Pardelli, Irene Russo, Roberto Bartolini, Fahad Khan, Claudia Soria and Nicoletta Calzolari	LREC as a Graph: People and Resources in a Network
14:55-16:35	Pawel Kamocki, Pavel Straňák and Michal Sedlák	The Public License Selector:  Making Open Licensing Easier
14:55-16:35	Daiva Vitkutė-Adžgauskienė, Andrius Utka, Darius Amilevičius and Tomas Krilavičius	NLP Infrastructure for the Lithuanian Language
14:55-16:35	Ulrike Krieg-Holz, Christian Schuschnig, Franz Matthies, Benjamin Redling and Udo Hahn	CodE Alltag: A German-Language E-Mail Corpus

	Session P33 - Morphology (2)	Chair: Felice dell'Orletta
14:55-16:35	Seth Kulick and Ann Bies	Rapid Development of Morphological Analyzers for Typologically Diverse Languages
14:55-16:35	Abhisek Chakrabarty, Akshay Chaturvedi and Utpal Garain	A Neural Lemmatizer for Bengali
14:55-16:35	Francis Tyers, Aziyana Bayyr-ool, Aelita Salchak and Jonathan Washington	A Finite-state Morphological Analyser for Tuvan
14:55-16:35	Andrejs Spektors, Ilze Auziņa, Roberts Darģis, Normunds Grūzītis, Pēteris Paikens, Lauma Pretkalniņa, Laura Rituma and Baiba Saulīte	Tēzaurs.lv: the Largest Open Lexical Database for Latvian
14:55-16:35	Raveesh Motlani, Francis Tyers and Dipti Sharma	A Finite-State Morphological Analyser for Sindhi
14:55-16:35	Markus Forsberg and Mans Hulden	Deriving Morphological Analyzers from Example Inflections
14:55-16:35	Daniel Smith and Mans Hulden	Morphological Analysis of Sahidic Coptic for Automatic Glossing
14:55-16:35	Marcin Woliński and Witold Kieraś	The on-line version of Grammatical Dictionary of Polish

	Session P34 - Semantic Lexicons	Chair: Kiril Simov
14:55-16:35	Maximilian Köper and Sabine Schulte im Walde	Automatically Generated Affective Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German Lemmas
14:55-16:35	Marco Passarotti, Berta González Saavedra and Christophe Onambele	Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin
14:55-16:35	Yoshihiko Hayashi	A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic Resources
14:55-16:35	Scott Piao, Paul Rayson, Dawn Archer, Francesca Bianchi, Carmen Dayrell, Mahmoud El-Haj, Ricardo-María Jiménez, Dawn Knight, Michal Křen, Laura Löfberg, Rao Muhammad Adeel Nawab, Jawad Shafi, Phoey Lee Teh and Olga Mudraya	Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages
14:55-16:35	Gábor Recski	Building Concept Graphs from Monolingual Dictionary Entries
14:55-16:35	Elżbieta Hajnicz, Anna Andrzejczuk and Tomasz Bartosiak	Semantic Layer of the Valence Dictionary of Polish Walenty
14:55-16:35	Lucia Busso and Alessandro Lenci	Italian VerbNet: A Construction-based Approach to Italian Verb Classification
14:55-16:35	Natalia Grabar and Thierry Hamon	A Large Rated Lexicon with French Medical Words
14:55-16:35	Alexander Panchenko	Best of Both Worlds: Making Word Sense Embeddings Interpretable
14:55-16:35	Leonardo Zilio, Maria José Bocorny Finatto and Aline Villavicencio	VerbLexPor: a lexical resource with semantic roles for Portuguese
14:55-16:35	Maddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe and German Rigau	A Multilingual Predicate Matrix
14:55-16:35	Bryan Wilkinson and Oates Tim	A Gold Standard for Scalar Adjectives
14:55-16:35	Ivan Sekulić and Jan Šnajder	VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian
14:55-16:35	Alberto Simões, Xavier Gómez Guinovart and José João Almeida	Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary

	Session P35 - Grammar and Syntax	Chair: Maria Simi
16:55-18:15	Tibor Kiss, Francis Jeffry Pelletier, Halima Husic, Roman Nino Simunic and Johanna Marie Poppek	A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon
16:55-18:15	Andras Kornai, Dávid Márk Nemeskey and Gábor Recski	Detecting Optional Arguments of Verbs
16:55-18:15	Lennart Kloppenburg and Malvina Nissim	Leveraging Native Data to Correct Preposition Errors in Learners' Dutch
16:55-18:15	Angelo Basile and Federico Sangati	D(H)ante: A New Set of Tools for XIII Century Italian
16:55-18:15	Fabio Celli, Giuseppe Riccardi and Firoj Alam	Multilevel Annotation of Agreement and Disagreement in Italian News Blogs
16:55-18:15	Şaziye Betül Özateş, Arzucan Özgür and Dragomir Radev	Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization
16:55-18:15	Sharid Loáiciga and Kristina Gulordava	Discontinuous Verb Phrases in Parsing and Machine Translation of English and German
16:55-18:15	Jennifer Krisch, Melanie Dick, Ronny Jauch and Ulrich Heid	A Lexical Resource for the Identification of “Weak Words” in German Specification Documents
16:55-18:15	Zygmunt Vetulani, Grażyna Vetulani and Bartłomiej Kochanowski	Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0
16:55-18:15	Cerstin Mahlow	C-WEP―Rich Annotated Collection of Writing Errors by Professionals
16:55-18:15	Natalia Klyueva and Pavel Straňák	Improving corpus search via parsing

	Session P36 - Sentiment Analysis and Opinion Mining (2)	Chair: Manfred Stede
16:55-18:15	Elisavet Palogiannidi, Polychronis Koutsakis, Elias Iosif and Alexandros Potamianos	Affective Lexicon Creation for the Greek Language
16:55-18:15	Martina Katalin Szabó, Veronika Vincze, Katalin Ilona Simkó, Viktor Varga and Viktor Hangya	A Hungarian Sentiment Corpus Manually Annotated at Aspect Level
16:55-18:15	Josef Ruppenhofer and Jasper Brandes	Effect Functors for Opinion Inference
16:55-18:15	Manfred Klenner and Michael Amsler	Sentiframes: A Resource for Verb-centered German Sentiment Inference
16:55-18:15	Marco Stranisci, Cristina Bosco, Delia Irazú Hernández Farías and Viviana Patti	Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola
16:55-18:15	Samhaa R. El-Beltagy	NileULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard Arabic
16:55-18:15	Aleksander Wawer	OPFI: A Tool for Opinion Finding in Polish
16:55-18:15	Orphee De Clercq and Veronique Hoste	Rude waiter but mouthwatering pastries! An exploratory study into Dutch Aspect-Based Sentiment Analysis

	Session P37 - Parallel and Comparable Corpora	Chair: Jörg Tiedemann
16:55-18:15	Huaxing Shi, Tiejun Zhao and Keh-Yih Su	Building A Case-based Semantic English-Chinese Parallel Treebank
16:55-18:15	Xuansong Li, Jennifer Tracey, Stephen Grimes and Stephanie Strassel	Uzbek-English and Turkish-English Morpheme Alignment Corpora
16:55-18:15	Chenhui Chu, Raj Dabre and Sadao Kurohashi	Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
16:55-18:15	Iñaki San Vicente, Iñaki Alegria, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martinez Garcia, Antonio Toral, Arkaitz Zubiaga and Nora Aranberri	TweetMT: A Parallel Microblog Corpus
16:55-18:15	Mariana Neves, Antonio Jimeno Yepes and Aurélie Névéol	The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine
16:55-18:15	Nikola Ljubešić, Miquel Esplà-Gomis, Antonio Toral, Sergio Ortiz Rojas and Filip Klubička	Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair

	Session P38 - Social Media	Chair: Fei Xia
16:55-18:15	Dane Bell, Daniel Fried, Luwen Huangfu, Mihai Surdeanu and Stephen Kobourov	Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
16:55-18:15	Bridget Sommerdijk, Eric Sanders and Antal van den Bosch	Can Tweets Predict TV Ratings?
16:55-18:15	SoHyun Park, Afsaneh Fazly, Annie Lee, Brandon Seibel, Wenjie Zi and Paul Cook	Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus
16:55-18:15	Shigeyuki Sakaki, Francine Chen, Mandy Korpusik and Yan-Ying Chen	Corpus for Customer Purchase Behavior Prediction in Social Media
16:55-18:15	Arda Celebi and Arzucan Özgür	Segmenting Hashtags using Automatically Created Training Data
16:55-18:15	Dirk Hovy and Anders Johannsen	Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics
16:55-18:15	Wanru Zhang, Andrew Caines, Dimitrios Alikaniotis and Paula Buttery	Predicting Author Age from Weibo Microblog Posts
16:55-18:15	Andrew Yates, Alek Kolcz, Nazli Goharian and Ophir Frieder	Effects of Sampling on Twitter Trend Detection
16:55-18:15	Nicolas Foucault and Antoine Courtin	Automatic Classification of Tweets for Analyzing Communication Behavior of Museums

	Session P39 - Word Sense Disambiguation (2)	Chair: Elisabetta Jezek
16:55-18:15	Marko Bekavac and Jan Šnajder	Graph-Based Induction of Word Senses in Croatian
16:55-18:15	Richard Johansson, Yvonne Adesam, Gerlof Bouma and Karin Hedberg	A Multi-domain Corpus of Swedish Word Sense Annotation
16:55-18:15	Arantxa Otegi, Nora Aranberri, António Branco, Jan Hajic, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, João Silva and Steven Neale	QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages
16:55-18:15	Éva Mújdricza-Maydt, Silvana Hartmann, Iryna Gurevych and Anette Frank	Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data
16:55-18:15	Sudha Bhingardive, Rajita Shukla, Jaya Saraswati, Laxmi Kashyap, Dhirendra Singh and Pushpak Bhattacharya	Synset Ranking of Hindi WordNet
16:55-18:15	Andrey Kutuzov and Elizaveta Kuzmenko	Neural Embedding Language Models in Semantic Clustering of Web Search Results

	Session P40 - Dialogue (1)	Chair: Jens Edlund
18:20-19:20	Ming Sun, Yun-Nung Chen, Zhenhao Hua, Yulian Tamres-Rudnicky, Arnab Dash and Alexander Rudnicky	AppDialogue: Multi-App Dialogues for Intelligent Assistants
18:20-19:20	Volha Petukhova, Christopher Stevens, Harmen de Weerd, Niels Taatgen, Fokie Cnossen and Andrei Malchanau	Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus
18:20-19:20	Vasily Konovalov, Ron Artstein, Oren Melamud and Ido Dagan	The Negochat Corpus of Human-agent Negotiation Dialogues
18:20-19:20	Ryuichiro Higashinaka, Kotaro Funakoshi, Yuka Kobayashi and Michimasa Inaba	The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics
18:20-19:20	Harry Bunt, Volha Petukhova, Andrei Malchanau, Kars Wijnhoven and Alex Fang	The DialogBank
18:20-19:20	Kris Liu, Jean Fox Tree and Marilyn Walker	Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential Communication
18:20-19:20	Leonardo Campillos Llanos, Dhouha Bouamor, Pierre Zweigenbaum and Sophie Rosset	Managing Linguistic and Terminological Variation in a Medical Dialogue System
18:20-19:20	Ajda Gokcen, Evan Jaffe, Johnsey Erdmann, Michael White and Douglas Danforth	A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
18:20-19:20	Laurent Prévot, Jan Gorisch and Roxane Bertrand	A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations

	Session P41 - Language Learning	Chair: Costanza Navarretta
18:20-19:20	Eric Sanders, Pepi Burgos, Catia Cucchiarini and Roeland van Hout	Palabras: Crowdsourcing Transcriptions of L2 Speech
18:20-19:20	Beata Megyesi, Jesper Näsman and Anne Palmér	The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis
18:20-19:20	Kay Berkling	Corpus for Children’s Writing with Enhanced Output for Specific Spelling Patterns (2nd and 3rd Grade)
18:20-19:20	Amália Mendes, Sandra Antunes, Maarten Janssen and Anabela Gonçalves	The COPLE2 corpus: a learner corpus for Portuguese
18:20-19:20	Jane Wottawa and Martine Adda-Decker	French Learners Audio Corpus of German Speech (FLACGS)
18:20-19:20	Vanja Štefanec, Nikola Ljubešić and Jelena Kuvač Kraljević	Croatian Error-Annotated Corpus of Non-Professional Written Language

	Session P42 - Less-Resourced Languages	Chair: Laurette Pretorius
18:20-19:20	Isabell Hubert, Antti Arppe, Jordan Lachler and Eddie Antonio Santos	Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida
18:20-19:20	Dafydd Gibbon	Legacy language atlas data mining: mapping Kru languages
18:20-19:20	Kazushi Ohya	Data Formats and Management Strategies from the Perspective of Language Resource Producers ― Personal Diachronic and Social Synchronic Data Sharing ―
18:20-19:20	Henk van den Heuvel, Eric Sanders and Nicoline van der Sijs	Curation of Dutch Regional Dictionaries
18:20-19:20	Claudia Soria, Irene Russo, Valeria Quochi, Davyth Hicks, Antton Gurrutxaga, Anneli Sarhimaa and Matti Tuomisto	Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project
18:20-19:20	Delyth Prys, Gruffudd Prys and Dewi Bryn Jones	Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar Checker
18:20-19:20	Martijn Wieling, Eva Sassolini, Sebastiana Cucurullo and Simonetta Montemagni	ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas
18:20-19:20	Stephanie Strassel and Jennifer Tracey	LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages
18:20-19:20	Alina Maria Ciobanu and Liviu P. Dinu	A Computational Perspective on the Romanian Dialects
18:20-19:20	Sebastian Nordhoff, Siri Tuttle and Olga Lovick	The Alaskan Athabascan Grammar Database
18:20-19:20	Arbi Haza Nasution, Yohei Murakami and Toru Ishida	Constraint-Based Bilingual Lexicon Induction for Closely Related Languages

	Session P43 - Named Entity Recognition	Chair: Sara Tonelli
18:20-19:20	Lubomir Otrusina and Pavel Smrz	WTF-LOD - A New Resource for Large-Scale NER Evaluation
18:20-19:20	Julian Bleicken, Thomas Hanke, Uta Salden and Sven Wagner	Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data
18:20-19:20	Milan Dojchinovski, Dinesh Reddy, Tomáš Kliegr, Tomas Vitvar and Harald Sack	Crowdsourced Corpus with Entity Salience Annotations
18:20-19:20	Sergio Oramas, Luis Espinosa Anke, Mohamed Sordo, Horacio Saggion and Xavier Serra	ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain
18:20-19:20	Patrick Littell, David R. Mortensen, Kartik Goyal, Chris Dyer and Lori Levin	Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik
18:20-19:20	Halil Kilicoglu, Asma Ben Abacha, Yassine Mrabet, Kirk Roberts, Laritza Rodriguez, Sonya Shooshan and Dina Demner-Fushman	Annotating Named Entities in Consumer Health Questions
18:20-19:20	Adrian Brasoveanu, Lyndon J.B. Nixon, Albert Weichselbraun and Arno Scharl	A Regional News Corpora for Contextualized Entity Discovery and Linking
18:20-19:20	Martin Brümmer, Milan Dojchinovski and Sebastian Hellmann	DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus
18:20-19:20	Roald Eiselen	Government Domain Named Entity Recognition for South African Languages
18:20-19:20	Maud Ehrmann, Damien Nouvel and Sophie Rosset	Named Entity Resources - Overview and Outlook
18:20-19:20	Marcos Garcia	Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level
18:20-19:20	Octavia-Maria Şulea, Sergiu Nisioi and Liviu P. Dinu	Using Word Embeddings to Translate Named Entities

Day 3, Oral Sessions:

	Session O37 - Robots and Conversational Agents Interaction	Chairperson: Claude Barras
9:45-10:05	Patrick Holthaus, Christian Leichsenring, Jasmin Bernotat, Viktor Richter, Marian Pohling, Birte Carlmeyer, Norman Köster, Sebastian Meyer zu Borgsen, René Zorn, Birte Schiffhauer, Kai Frederic Engelmann, Florian Lier, Simon Schulz, Philipp Cimiano, Friederike Eyssel, Thomas Hermann, Franz Kummert, David Schlangen, Sven Wachsmuth, Petra Wagner, Britta Wrede and Sebastian Wrede	How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment
10:05-10:25	Zhichao Hu, Michelle Dick, Chung-Ning Chang, Kevin Bowden, Michael Neff, Jean Fox Tree and Marilyn Walker	A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives
10:25-10:45	Stavroula―Evita Fotinea, Eleni Efthimiou, Maria Koutsombogera, Athanasia-Lida Dimou, Theodore Goulas and Kyriaki Vasilaki	Multimodal Resources for Human-Robot Communication Modelling
10:45-11:05	Jackson Tolins, Kris Liu, Michael Neff, Marilyn Walker and Jean Fox Tree	A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character
11:05-11:25	Jackson Tolins, Kris Liu, Yingying Wang, Jean Fox Tree, Marilyn Walker and Michael Neff	A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs

	Session O38 - Crowdsourcing	Chairperson: Andrejs Vasiļjevs
9:45-10:05	Bettina Lanser, Christina Unger and Philipp Cimiano	Crowdsourcing Ontology Lexicons
10:05-10:25	Ashutosh Modi, Tatjana Anikina, Simon Ostermann and Manfred Pinkal	InScript: Narrative texts annotated with script information
10:25-10:45	Lilian D. A. Wanzare, Alessandra Zarcone, Stefan Thater and Manfred Pinkal	A Crowdsourced Database of Event Sequence Descriptions for the Acquisition of High-quality Script Knowledge
10:45-11:05	Tommaso Caselli, Rachele Sprugnoli and Oana Inel	Temporal Information Annotation: Crowd vs. Experts
11:05-11:25	Franco Salvetti, John B. Lowe and James H. Martin	A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)

	Session O39 - Corpora for Machine Translation	Chairperson: Christopher Cieri
9:45-10:05	Jörg Tiedemann	Finding Alternative Translations in a Large Corpus of Movie Subtitle
10:05-10:25	Thierry Etchegoyhen, Andoni Azpeitia and Naiara Pérez	Exploiting a Large Strongly Comparable Corpus
10:25-10:45	Michał Ziemski, Marcin Junczys-Dowmunt and Bruno Pouliquen	The United Nations Parallel Corpus v1.0
10:45-11:05	Luisa Bentivogli, Mauro Cettolo, M. Amin Farajian and Marcello Federico	WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words
11:05-11:25	Aleš Tamchyna and Petra Barancikova	Manual and Automatic Paraphrases for MT Evaluation

	Session O40 - Treebanks and Syntactic and Semantic Analysis	Chairperson: Joakim Nivre
9:45-10:05	Liesbeth Augustinus, Vincent Vandeghinste and Tom Vanallemeersch	Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions
10:05-10:25	Helge Dyvik, Paul Meurer, Victoria Rosén, Koenraad De Smedt, Petter Haugereid, Gyri Smørdal Losnegaard, Gunn Inger Lyse and Martha Thunes	NorGramBank: A ‘Deep’ Treebank for Norwegian
10:25-10:45	Corentin Ribeyre, Eric Villemonte de la Clergerie and Djamé Seddah	Accurate Deep Syntactic Parsing of Graphs: The Case of French
10:45-11:05	Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim and Mona Diab	Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic
11:05-11:25

	Session O41 - Discourse	Chairperson: Justus Roux
11:45-12:05	Barbara Konat, John Lawrence, Joonsuk Park, Katarzyna Budzynska and Chris Reed	A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive Issues
12:05-12:25	Rui Correia, Nuno Mamede, Jorge Baptista and Maxine Eskenazi	metaTED: a Corpus of Metadiscourse for Spoken Language
12:25-12:45	Silvia Pareti	PARC 3.0: A Corpus of Attribution Relations
12:45-13:05	Junyi Jessy Li, Bridget O'Daniel, Yi Wu, Wenli Zhao and Ani Nenkova	Improving the Annotation of Sentence Specificity
13:05-13:25	Kordula De Kuthy, Ramon Ziai and Detmar Meurers	Focus Annotation of Task-based Data: A Comparison of Expert and Crowd-Sourced Annotation in a Reading Comprehension Corpus

	Session O42 - Twitter Related Analysis	Chairperson: Xavier Tannier
11:45-12:05	Beatrice Alex, Clare Llewellyn, Claire Grover, Jon Oberlander and Richard Tobin	Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile Locations
12:05-12:25	Saif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiaodan Zhu and Colin Cherry	A Dataset for Detecting Stance in Tweets
12:25-12:45	Luca Dini and André Bittar	Emotion Analysis on Twitter: The Hidden Challenge
12:45-13:05	Oana Inel, Tommaso Caselli and Lora Aroyo	Crowdsourcing Salient Information from News and Tweets
13:05-13:25	Francesco Barbieri, Francesco Ronzano and Horacio Saggion	What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis

	Session O43 - Semantics	Chairperson: James Pustejovsky
11:45-12:05	Elias Iosif and Alexandros Potamianos	Crossmodal Network-Based Distributional Semantic Models
12:05-12:25	Claire Bonial and Martha Palmer	Comprehensive and Consistent PropBank Light Verb Annotation
12:25-12:45	Nora Hollenstein, Nathan Schneider and Bonnie Webber	Inconsistency Detection in Semantic Annotation
12:45-13:05	Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinkova, Dan Flickinger, Jan Hajic, Angelina Ivanova and Zdenka Uresova	Towards Comparability of Linguistic Graph Banks for Semantic Parsing
13:05-13:25	Jing Lu and Vincent Ng	Event Coreference Resolution with Multi-Pass Sieves

	Session O44 - Speech Resources	Chairperson: Sophie Rosset
11:45-12:05	Malgorzata Cavar, Damir Cavar and Hilaria Cruz	Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR
12:05-12:25	Miguel Matos, Alberto Abad and António Serralheiro	The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.
12:25-12:45	Hiroki Mori, Atsushi Nagaoka and Yoshiko Arimoto	Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus Commonization
12:45-13:05	Michael Carl, Akiko Aizawa and Masaru Yamada	English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual Setting
13:05-13:25	Karl Neergaard, Hongzhi Xu and Chu-Ren Huang	Database of Mandarin Neighborhood Statistics

	Session O45 - Lexicons: Wordnet and Framenet	Chairperson: Dan Tufiș
14:55-15:15	Luís Morgado da Costa and Francis Bond	Wow! What a Useful Extension! Introducing Non-Referential Concepts to Wordnet
15:15-15:35	Shehzaad Dhuliawala, Diptesh Kanojia and Pushpak Bhattacharyya	SlangNet: A WordNet like resource for English Slang
15:35-15:55	Hugo Gonçalo Oliveira and Fábio Santos	Discovering Fuzzy Synsets from the Redundancy in Different Lexical-Semantic Resources
15:55-16:15	Avi Hayoun and Michael Elhadad	The Hebrew FrameNet Project

	Session O46 - Digital Humanities	Chairperson: Andreas Witt
14:55-15:15	Clemens Neudecker	An Open Corpus for Named Entity Recognition in Historic Newspapers
15:15-15:35	Béatrice Daille, Evelyne Jacquey, Gaël Lejeune, Luis Felipe Melo and Yannick Toussaint	Ambiguity Diagnosis for Terms in Digital Humanities
15:35-15:55	Borja Navarro, María Ribes-Lafoz and Noelia Sánchez	Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation
15:55-16:15	Susanne Haaf	Corpus Analysis based on Structural Phenomena in Texts: Exploiting TEI Encoding for Linguistic Research

	Session O47 - Text Mining and Information Extraction	Chairperson: Gregory Grefenstette
14:55-15:15	Marieke van Erp, Pablo Mendes, Heiko Paulheim, Filip Ilievski, Julien Plu, Giuseppe Rizzo and Joerg Waitelonis	Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job
15:15-15:35	Daniel Preoţiuc-Pietro, P. K. Srijith, Mark Hepple and Trevor Cohn	Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection
15:35-15:55	Luis Gerardo Mojica de la Vega and Vincent Ng	Markov Logic Networks for Text Mining: A Qualitative and Empirical Comparison with Integer Linear Programming
15:55-16:15	Ayman Al Zaatari, Rim El Ballouli, Shady ELbassouni, Wassim El-Hajj, Hazem Hajj, Khaled Shaban, Nizar Habash and Emad Yahya	Arabic Corpora for Credibility Analysis

	Session O48 - Corpus Creation and Analysis	Chairperson: Paul Rayson
14:55-15:15	Dain Kaplan, Neil Rubens, Simone Teufel and Takenobu Tokunaga	Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus Construction
15:15-15:35	Cláudia Freitas, Bianca Freitas and Diana Santos	QUEMDISSE? Reported speech in Portuguese
15:35-15:55	Anne-Lyse Minard, Manuela Speranza, Ruben Urizar, Begoña Altuna, Marieke van Erp, Anneleen Schoen and Chantal van Son	MEANTIME, the NewsReader Multilingual Event and Time Corpus
15:55-16:15	Steven Moran	The ACQDIV Database: Min(d)ing the Ambient Language

Day 3, Poster Sessions

	Session P44 - Corpus Creation and Querying (1)	Chair: Cristina Bosco
9:45-11:25	Anne-Kathrin Schumann and Stefan Fischer	Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts
9:45-11:25	Nils Diewald, Michael Hanl, Eliza Margaretha, Joachim Bingel, Marc Kupietz, Piotr Banski and Andreas Witt	KorAP Architecture ― Diving in the Deep Sea of Corpus Data
9:45-11:25	Cyril Grouin	Text Segmentation of Digitized Clinical Texts
9:45-11:25	Elif Ahsen Acar, Deniz Zeyrek, Murathan Kurfalı and Cem Bozşahin	A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability
9:45-11:25	Steffen Remus and Chris Biemann	Domain-Specific Corpus Expansion with Focused Webcrawling
9:45-11:25	Nikola Ljubešić, Tomaž Erjavec and Darja Fišer	Corpus-Based Diacritic Restoration for South Slavic Languages
9:45-11:25	Daniel Couto-Vale, Stella Neumann and Paula Niemietz	Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs
9:45-11:25	Elena Manishina, Bassam Jabaian, Stéphane Huet and Fabrice Lefevre	Automatic Corpus Extension for Data-driven Natural Language Generation
9:45-11:25	Amal Htait, Sebastien Fournier and Patrice Bellot	Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers
9:45-11:25	Wajdi Zaghouani, Houda Bouamor, Abdelati Hawwari, Mona Diab, Ossama Obeid, Mahmoud Ghoneim, Sawsan Alqahtani and Kemal Oflazer	Guidelines and Framework for a Large Scale Arabic Diacritized Corpus

	Session P45 - Evaluation Methodologies (3)	Chair: Marta Villegas
9:45-11:25	Irina Temnikova, Wajdi Zaghouani, Stephan Vogel and Nizar Habash	Applying the Cognitive Machine Translation Evaluation Approach to Arabic
9:45-11:25	Carolina Scarton and Lucia Specia	A Reading Comprehension Corpus for Machine Translation Evaluation
9:45-11:25	Rodrigo Wilkens, Leonardo Zilio, Eduardo Ferreira and Aline Villavicencio	B2SG: a TOEFL-like Task for Portuguese
9:45-11:25	Yu Yuan, Serge Sharoff and Bogdan Babych	MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment
9:45-11:25	Varvara Logacheva, Chris Hokamp and Lucia Specia	MARMOT: A Toolkit for Translation Quality Estimation at the Word Level
9:45-11:25	Denys Katerenchuk and Andrew Rosenberg	RankDCG: Rank-Ordering Evaluation Measure
9:45-11:25	Mathias Etcheverry and Dina Wonsever	Spanish Word Vectors from Wikipedia

	Session P46 - Information Extraction and Retrieval (3)	Chair: Aurelie Neveol
9:45-11:25	Muhammad Humayoun and Hwanjo Yu	Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization
9:45-11:25	Kata Gábor, Haifa Zargayouna, Davide Buscaldi, Isabelle Tellier and Thierry Charnois	Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature
9:45-11:25	Leon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood and Manuel Jung	GATE-Time: Extraction of Temporal Expressions and Events
9:45-11:25	Vincent Claveau and Ewa Kijak	Distributional Thesauri for Information Retrieval and vice versa
9:45-11:25	Justin Mott, Ann Bies, Zhiyi Song and Stephanie Strassel	Parallel Chinese-English Entities, Relations and Events Corpora
9:45-11:25	Tilia Ellendorff, Simon Foster and Fabio Rinaldi	The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors
9:45-11:25	Dean Fulgoni, Jordan Carpenter, Lyle Ungar and Daniel Preoţiuc-Pietro	An Empirical Exploration of Moral Foundations Theory in Partisan News Sources
9:45-11:25	Carmen Banea, Xi Chen and Rada Mihalcea	Building a Dataset for Possessions Identification in Text
9:45-11:25	Kira Griffitt and Stephanie Strassel	The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval
9:45-11:25	Ting Liu, Kit Cho, Tomek Strzalkowski, Samira Shaikh and Mehrdad Mirzaei	The Validation of MRCPD Cross-language Expansions on Imageability Ratings
9:45-11:25	Dipawesh Pawar, Mohammed Hasanuzzaman and Asif Ekbal	Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi

	Session P47 - Semantic Corpora	Chair: Eneko Agirre
9:45-11:25	Natalia Grabar and Iris Eshkol-Taravela	Detection of Reformulations in Spoken French
9:45-11:25	Rajendra Banjade and Vasile Rus	DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in Context
9:45-11:25	Kirk Roberts and Dina Demner-Fushman	Annotating Logical Forms for EHR Questions
9:45-11:25	Steven Bethard and Jonathan Parker	A Semantically Compositional Annotation Scheme for Time Normalization
9:45-11:25	Gözde Özbal, Carlo Strapparava and Serra Sinem Tekiroglu	PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
9:45-11:25	Marianne Djemaa, Marie Candito, Philippe Muller and Laure Vieu	Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology
9:45-11:25	Anaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol and Delphine Battistelli	Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
9:45-11:25	Laure Vieu, Philippe Muller, Marie Candito and Marianne Djemaa	A General Framework for the Annotation of Causality Based on FrameNet
9:45-11:25	Alakananda Vempala and Eduardo Blanco	Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles
9:45-11:25	Jana Götze and Johan Boye	SpaceRef: A corpus of street-level geographic descriptions
9:45-11:25	Azadeh Mirzaei and Amirsaeid Moloodi	Persian Proposition Bank
9:45-11:25	Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao and Akiko Aizawa	Typed Entity and Relation Annotation on Computer Science Papers
9:45-11:25	Volker Gast, Lennart Bierkandt, Stephan Druskat and Christoph Rzymski	Enriching TimeBank: Towards a more precise annotation of temporal relations in a text

	Session P48 - Speech Processing (2)	Chair: Denise DiPersio
9:45-11:25	Imran Sheikh, Irina Illina and Dominique Fohr	How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News
9:45-11:25	Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin LU, Minglei Li, Dan Xiong, Roy Shing Yu and Vincent T.Y. Ng	Syllable based DNN-HMM Cantonese Speech to Text System
9:45-11:25	Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese and Uriel Pascal Elingui	Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
9:45-11:25	Joris Pelemans, Lyan Verwimp, Kris Demuynck, Hugo Van hamme and Patrick Wambacq	SCALE: A Scalable Language Engineering Toolkit
9:45-11:25	Sandrine Brognaux, Thomas Francois and Marco Saerens	Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis
9:45-11:25	Thomas Kisler, Uwe Reichel, Florian Schiel, Christoph Draxler, Bernhard Jackl and Nina Pörner	BAS Speech Science Web Services - an Update of Current Developments
9:45-11:25	Fernando Batista, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eugénio Ribeiro, Helena Moniz, David Martins de Matos and Ricardo Ribeiro	SPA: Web-based Platform for easy Access to Speech Processing Modules
9:45-11:25	Roberto Seara, Marta Martinez, Rocio Varela, Carmen García Mateo, Elisa Fernandez Rei and Xose Luis Regueira	Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech

	Session P49 - Corpus Creation and Querying (2)	Chair: Menzo Windhouwer
11:45-13:25	Maarten Janssen	TEITOK: Text-Faithful Annotated Corpora
11:45-13:25	Mathias Schenner and Sebastian Nordhoff	Extracting Interlinear Glossed Text from LaTeX Documents
11:45-13:25	Talvany Carlotto, Zuhaitz Beloki, Xabier Artola and Aitor Soroa	Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface
11:45-13:25	Mohamed Al-Badrashiny, Arfath Pasha, Mona Diab, Nizar Habash, Owen Rambow, Wael Salloum and Ramy Eskander	SPLIT: Smart Preprocessing (Quasi) Language Independent Tool
11:45-13:25	Tanja Samardzic, Yves Scherrer and Elvira Glaser	ArchiMob - A Corpus of Spoken Swiss German
11:45-13:25	Timo Homburg and Christian Chiarcos	Word Segmentation for Akkadian Cuneiform
11:45-13:25	Cyril Grouin	Controlled Propagation of Concept Annotations in Textual Corpora
11:45-13:25	Koiti Hasida	Graphical Annotation for Syntax-Semantics Mapping
11:45-13:25	Mark Sammons, Christos Christodoulopoulos, Parisa Kordjamshidi, Daniel Khashabi, Vivek Srikumar and Dan Roth	EDISON: Feature Extraction for NLP, Simplified

	Session P50 - Document Classification and Text Categorisation (2)	Chair: Thierry Hamon
11:45-13:25	Nora Al-Twairesh, Abeer Al-Dayel, Hend Al-Khalifa, Maha Al-Yahya, Sinaa Alageel, Nora Abanmy and Nouf Al-Shenaifi	MADAD: A Readability Annotation Tool for Arabic Text
11:45-13:25	Marcos Zampieri, Shervin Malmasi and Mark Dras	Modeling Language Change in Historical Corpora: The Case of Portuguese
11:45-13:25	Filip Graliński, Łukasz Borchmann and Piotr Wierzchoń	“He Said She Said” ― a Male/Female Corpus of Polish
11:45-13:25	Karin Sim Smith, Wilker Aziz and Lucia Specia	Cohere: A Toolkit for Local Coherence
11:45-13:25	James Ravenscroft, Anika Oellrich, Shyamasree Saha and Maria Liakata	Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
11:45-13:25	Udochukwu Orizu and Yulan He	Detecting Expressions of Blame or Praise in Text
11:45-13:25	Stephan Tulkens, Chris Emmery and Walter Daelemans	Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource
11:45-13:25	Andre Quispersaravia and Walter Perez and Marco Sobrevilla and Fernando Alva-Manchengo	Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
11:45-13:25	Alice Frain and Sander Wubben	SatiricLR: a Language Resource of Satirical News Articles

	Session P51 - Multilingual Corpora	Chair: Penny Labropoulou
11:45-13:25	Marcus Klang and Pierre Nugues	WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet Format
11:45-13:25	David Vilares, Miguel A. Alonso and Carlos Gómez-Rodríguez	EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis
11:45-13:25	Darina Benikova and Chris Biemann	SemRelData ― Multilingual Contextual Annotation of Semantic Relations between Nominals: Dataset and Guidelines
11:45-13:25	Jérémy Ferrero, Frédéric Agnès, Laurent Besacier and Didier Schwab	A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection
11:45-13:25	Younes Samih and Wolfgang Maier	An Arabic-Moroccan Darija Code-Switched Corpus
11:45-13:25	Navid Rekabsaz, Serwah Sabetghadam, Mihai Lupu, Linda Andersson and Allan Hanbury	Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation
11:45-13:25	Milan Dojchinovski, Felix Sasaki, Tatjana Gornostaja, Sebastian Hellmann, Erik Mannens, Frank Salliau, Michele Osella, Phil Ritchie, Giannis Stoitsis, Kevin Koidl, Markus Ackermann and Nilesh Chakraborty	FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies
11:45-13:25	Amir Hazem and Emmanuel Morin	Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models
11:45-13:25	Alexandre Berard, Christophe Servan, Olivier Pietquin and Laurent Besacier	MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
11:45-13:25	Murad Abouammoh, Kashif Shah and Ahmet Aker	Creation of comparable corpora for English-{Urdu, Arabic, Persian}
11:45-13:25	Sergiu Nisioi, Ella Rabinovich, Liviu P. Dinu and Shuly Wintner	A Corpus of Native, Non-native and Translated Texts
11:45-13:25	Andrea Fischer, Klara Jagrova, Irina Stenger, Tania Avgustinova, Dietrich Klakow and Roland Marti	Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility
11:45-13:25	Ximena Gutierrez-Vasques, Gerardo Sierra and Isaac Hernandez Pompa	Axolotl: a Web Accessible Parallel Corpus for Spanish-Nahuatl
11:45-13:25	Özlem Çetinoğlu	A Turkish-German Code-Switching Corpus
11:45-13:25	Michael Mohler, Mary Brunson, Bryan Rink and Marc Tomlinson	Introducing the LCC Metaphor Datasets
11:45-13:25	Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani and Mohamed Al-Badrashiny	Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data
11:45-13:25	Laurence Meurant, Maxime Gobert and Anthony Cleve	Modelling a Parallel Corpus of French and French Belgian Sign Language
11:45-13:25	Ines Cebović and Marko Tadić	Building the Macedonian-Croatian Parallel Corpus
11:45-13:25	Vladimír Benko	Two Years of Aranea: Increasing Counts and Tuning the Pipeline
11:45-13:25	Ichiro Umata, Koki Ijuin, Mitsuru Ishida, Moe Takeuchi and Seiichi Yamamoto	Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations
11:45-13:25	Karen Jones, Stephanie Strassel, Kevin Walker, David Graff and Jonathan Wright	Multi-language Speech Collection for NIST LRE

	Session P52 - Part of Speech Tagging (2)	Chair: Piotr Banski
11:45-13:25	Torsten Zesch and Tobias Horsmann	FlexTag: A Highly Flexible PoS Tagging Framework
11:45-13:25	Nikola Ljubešić, Filip Klubička, Željko Agić and Ivo-Pavao Jazbec	New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian
11:45-13:25	Andy Luecking, Armin Hoenen and Alexander Mehler	TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics
11:45-13:25	Katalin Pajkossy and Attila Zséder	The hunvec framework for NN-CRF-based sequential tagging
11:45-13:25	Salam Khalifa, Nizar Habash, Dana Abdulrahim and Sara Hassan	A Large Scale Corpus of Gulf Arabic
11:45-13:25	Milan Straka, Jan Hajic and Jana Straková	UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
11:45-13:25	Nizar Habash, Anas Shahrour and Muhamed Al-Khalil	Exploiting Arabic Diacritization for High Quality Automatic Annotation
11:45-13:25	Besim Kabashi and Thomas Proisl	A Proposal for a Part-of-Speech Tagset for the Albanian Language
11:45-13:25	Mohamed Outahajala and Paolo Rosso	Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging Accuracy
11:45-13:25	Sarah Schulz and Jonas Kuhn	Learning from Within? Comparing PoS Tagging Approaches for Historical Text

	Session P53 - Dialogue (2)	Chair: Thorsten Trippel
14:55-16:15	Morena Danieli, Balamurali A R, Evgeny Stepanov, Benoit Favre, Frederic Bechet and Giuseppe Riccardi	Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations
14:55-16:15	Hanae Koiso, Tomoyuki Tsuchiya, Ryoko Watanabe, Daisuke Yokomori, Masao Aizawa and Yasuharu Den	Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation
14:55-16:15	Kalin Stefanov and Jonas Beskow	A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction
14:55-16:15	Rob Abbott, Brian Ecker, Pranav Anand and Marilyn Walker	Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it
14:55-16:15	Emer Gilmartin and Nick Campbell	Capturing Chat: Annotation and Tools for Multiparty Casual Conversation.

	Session P54 - LR Infrastructures and Architectures (2)	Chair: Koiti Hasida
14:55-16:15	Pawel Kamocki and Jim O'Regan	Privacy Issues in Online Machine Translation Services - European Perspective
14:55-16:15	Christian Chiarcos, Christian Fäth, Heike Renner-Westermann, Frank Abromeit and Vanya Dimitrova	Lin\|gu\|is\|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data
14:55-16:15	Ngoc Nguyen, Donghui Lin, Takao Nakaguchi and Toru Ishida	Towards a Language Service Infrastructure for Mobile Environments
14:55-16:15	Maristella Agosti, Emanuele Di Buccio, Giorgio Maria Di Nunzio, Cecilia Poletto and Esther Rinke	Designing A Long Lasting Linguistic Project: The Case Study of ASIt
14:55-16:15	Damir Cavar, Malgorzata Cavar and Lwin Moe	Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)
14:55-16:15	Stephan Druskat, Volker Gast, Thomas Krause and Florian Zipser	corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora
14:55-16:15	Roland Schäfer	CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws
14:55-16:15	Ioannis Manousos Katakis, Georgios Petasis and Vangelis Karkaletsis	CLARIN-EL Web-based Annotation Tool
14:55-16:15	Mathijs Kattenberg, Zuhaitz Beloki, Aitor Soroa, Xabier Artola, Antske Fokkens, Paul Huygen and Kees Verstoep	Two Architectures for Parallel Processing of Huge Amounts of Text
14:55-16:15	Steve Cassidy	Publishing the Trove Newspaper Corpus
14:55-16:15	Vladimir Popescu, Lin Liu, Riccardo Del Gratta, Khalid Choukri and Nicoletta Calzolari	New Developments in the LRE Map

	Session P55 - Large Projects and Infrastructures (2)	Chair: Dieter Van Uytvanck
14:55-16:15	Jens Edlund and Joakim Gustafson	Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives
14:55-16:15	Valérie Mapelli, Vladimir Popescu, Lin Liu, Meritxell Fernández Barrera and Khalid Choukri	The ELRA License Wizard
14:55-16:15	Thibault Grouas, Valérie Mapelli and Quentin Samier	Review on the Existing Language Resources for Languages of France
14:55-16:15	Christopher Cieri, Mike Maxwell, Stephanie Strassel and Jennifer Tracey	Selection Criteria for Low Resource Language Programs
14:55-16:15	Meritxell Fernández Barrera, Vladimir Popescu, Antonio Toral, Federico Gaspari and Khalid Choukri	Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities

	Session P56 - Semantics (2)	Chair: Yoshihiko Hayashi
14:55-16:15	Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang	Nine Features in a Random Forest to Learn Taxonomical Semantic Relations
14:55-16:15	Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang	What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
14:55-16:15	Marco Del Tredici and Nuria Bel	Assessing the Potential of Metaphoricity of verbs using corpus data
14:55-16:15	Mathieu Lafourcade and Lionel Ramadier	Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports
14:55-16:15	Liu Hongchao, Karl Neergaard, Enrico Santus and Chu-Ren Huang	EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs
14:55-16:15	Maaz Anwar and Dipti Sharma	Towards Building Semantic Role Labeler for Indian Languages
14:55-16:15	Tanja Samardzic and Maja Miličević	A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora
14:55-16:15	Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva and Thierry Declerck	Monolingual Social Media Datasets for Detecting Contradiction and Entailment
14:55-16:15	James Pustejovsky and Nikhil Krishnaswamy	VoxML: A Visualization Modeling Language
14:55-16:15	Takehiro Teraoka	Metonymy Analysis Using Associative Relations between Words
14:55-16:15	Travis Goodwin and Sanda Harabagiu	Embedding Open-domain Common-sense Knowledge from Text
14:55-16:15	Eneldo Loza Mencía, Gerard de Melo and Jinseok Nam	Medical Concept Embeddings via Labeled Background Corpora
14:55-16:15	Corentin Dumont, Ran Tian and Kentaro Inui	Question-Answering with Logic Specific to Video Games

	Session P57 - Speech Corpora and Databases (2)	Chair: Satoshi Nakamura
14:55-16:15	Arne Köhn, Florian Stegen and Timo Baumann	Mining the Spoken Wikipedia for Speech Data and Beyond
14:55-16:15	Robert Herms, Laura Seelig, Stefanie Münch and Maximilian Eibl	A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation
14:55-16:15	Koichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama and Hiroshi G. Okuno	Parallel Speech Corpora of Japanese Dialects
14:55-16:15	Christine Meunier, Cecile Fougeron, Corinne Fredouille, Brigitte Bigi, Lise Crevier-Buchman, Elisabeth Delais-Roussarie, Laurianne Georgeton, Alain Ghio, Imed Laaridh, Thierry Legou, Claire Pillot-Loiseau and Gilles Pouchoulin	The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles
14:55-16:15	Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits Van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel and David van Leeuwen	A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research
14:55-16:15	Andrej Zgank, Mirjam Sepesy Maucec and Darinka Verdonik	The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource
14:55-16:15	Yurie Iribe, Norihide Kitaoka and Shuhei Segawa	Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese
14:55-16:15	Agnieszka Wagner, Katarzyna Klessa and Jolanta Bachan	Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis
14:55-16:15	Peter Viszlay, Ján Staš, Tomáš Koctúr, Martin Lojka and Jozef Juhár	An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation
14:55-16:15	Malgorzata Cavar, Damir Cavar, Dov-Ber Kerler and Anya Quilitzsch	Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project