|
INTRODUCTORY MESSAGES:
INVITED TALK:
KEYNOTES SPEECHES:
SESSIONS: Browse articles of the conference sorted by session number
Day 1, Oral Sessions:
|
Session O4 - Spoken Corpus Dialogue |
Chairperson: Asuncion Moreno |
11:35-11:55 |
José Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka, Elias Iosif and Alexandros Potamianos |
The SpeDial datasets: datasets for Spoken Dialogue Systems analytics |
11:55-12:15 |
Dilafruz Amanova, Volha Petukhova and Dietrich Klakow |
Creating Annotated Dialogue Resources: Cross-domain Dialogue Act Classification |
12:15-12:35 |
Kathryn J. Collins and David Traum |
Towards a Multi-dimensional Taxonomy of Stories in Dialogue |
12:35-12:55 |
Sina Zarrieß, Julian Hough, Casey Kennington, Ramesh Manuvinakurike, David DeVault, Raquel Fernandez and David Schlangen |
PentoRef: A Corpus of Spoken References in Task-oriented Dialogues |
12:55-13:15 |
Shammur Absar Chowdhury, Evgeny Stepanov and Giuseppe Riccardi |
Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it? |
|
Session O5 - LR Infrastructures and Architectures |
Chairperson: Franciska de Jong |
14:45-15:05 |
Adam Funk, Robert Gaizauskas and Benoit Favre |
A Document Repository for Social Media and Speech Conversations |
15:05-15:25 |
Artemis Parvizi, Matt Kohl, Meritxell Gonzàlez and Roser Saurí |
Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge Reuse |
15:25-15:45 |
Bente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen and Philippe Wacker |
Providing a Catalogue of Language Resources for Commercial Users |
15:45-16:05 |
Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen and Christopher Cieri |
The Language Application Grid and Galaxy |
16:05-16:25 |
Khalid Choukri, Valérie Mapelli, Hélène Mazo and Vladimir Popescu |
ELRA Activities and Services |
|
Session O6 - Multimodality |
Chairperson: Kristiina Jokinen |
14:45-15:05 |
Costanza Navarretta |
Mirroring Facial Expressions and Emotions in Dyadic Conversations |
15:05-15:25 |
Dragomir Radev, Amanda Stent, Joel Tetreault, Aasish Pappu, Aikaterini Iliakopoulou, Agustin Chanfreau, Paloma de Juan, Jordi Vallmitjana, Alejandro Jaimes, Rahul Jha and Robert Mankoff |
Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest |
15:25-15:45 |
Victoria Yaneva, Irina Temnikova and Ruslan Mitkov |
A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults |
15:45-16:05 |
Mathieu Chollet, Torsten Wörtwein, Louis-Philippe Morency and Stefan Scherer |
A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety |
16:05-16:25 |
Dario Bertero and Pascale Fung |
Deep Learning of Audio and Language Features for Humor Prediction |
|
Session O8 - Named Entity Recognition |
Chairperson: Yuji Matsumoto |
14:45-15:05 |
Marie-Jean Meurs, Hayda Almeida, Ludovic Jean-Louis and Eric Charton |
SemLinker, a Modular and Open Source Framework for Named Entity Discovery and Linking |
15:05-15:25 |
Filip Ilievski, Giuseppe Rizzo, Marieke van Erp, Julien Plu and Raphael Troncy |
Context-enhanced Adaptive Entity Linking |
15:25-15:45 |
Eda Okur, Hakan Demir and Arzucan Özgür |
Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings |
15:45-16:05 |
Maria Pershina, Yifan He and Ralph Grishman |
Entity Linking with a Paraphrase Flavor |
16:05-16:25 |
Tian Tian, Marco Dinarelli, Isabelle Tellier and Pedro Dias Cardoso |
Domain Adaptation for Named Entity Recognition Using CRFs |
|
Session O10 - Multilingual Corpora |
Chairperson: Hitoshi Isahara |
16:45-17:05 |
Prokopis Prokopidis, Vassilis Papavassiliou and Stelios Piperidis |
Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories |
17:05-17:25 |
Xuansong Li, Martha Palmer, Nianwen Xue, Lance Ramshaw, Mohamed Maamouri, Ann Bies, Kathryn Conger, Stephen Grimes and Stephanie Strassel |
Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus |
17:25-17:45 |
Ivan Habernal, Omnia Zayed and Iryna Gurevych |
C4Corpus: Multilingual Web-size Corpus with Free License |
17:45-18:05 |
Pierre Lison and Jörg Tiedemann |
OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles |
|
Session O16 - Phonetics and Prosody |
Chairperson: Dafydd Gibbon |
18:10-18:30 |
Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur and John Godfrey |
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification |
18:30-18:50 |
Eduardo Coutinho, Florian Hönig, Yue Zhang, Simone Hantke, Anton Batliner, Elmar Nöth and Björn Schuller |
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets |
18:50-19:10 |
Juergen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius and Frank Zimmerer |
The IFCASL Corpus of French and German Non-native and Native Read Speech |
Day 1, Poster Sessions:
|
Session P01 - Anaphora and Coreference |
Chair: Steve Cassidy |
11:35-13:15 |
Abbas Ghaddar and Phillippe Langlais |
WikiCoref: An English Coreference-annotated Corpus of Wikipedia Articles |
11:35-13:15 |
Dominik Schlechtweg |
Exploitation of Co-reference in Distributional Semantics |
11:35-13:15 |
Evandro Fonseca, Renata Vieira and Aline Vanin |
Adapting an Entity Centric Model for Portuguese Coreference Resolution |
11:35-13:15 |
Ina Roesiger and Jonas Kuhn |
IMS HotCoref DE: A Data-driven Co-reference Resolver for German |
11:35-13:15 |
Vandan Mujadia, Palash Gupta and Dipti Misra Sharma |
Coreference Annotation Scheme and Relation Types for Hindi |
11:35-13:15 |
Anna Nedoluzhko, Michal Novák, Silvie Cinkova, Marie Mikulová and Jiří Mírovský |
Coreference in Prague Czech-English Dependency Treebank |
11:35-13:15 |
Dane Bell, Gus Hahn-Powell, Marco A. Valenzuela-Escárcega and Mihai Surdeanu |
Sieve-based Coreference Resolution in the Biomedical Domain |
11:35-13:15 |
Hardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper and Derek Ruths |
Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel |
|
Session P02 - Computer Aided Language Learning |
Chair: Stephanie Strassel |
11:35-13:15 |
Marie Garnier and Patrick Saint-Dizier |
Error Typology and Remediation Strategies for Requirements Written in English by Non-Native Speakers |
11:35-13:15 |
Lena Keiper, Andrea Horbach and Stefan Thater |
Improving POS Tagging of German Learner Language in a Reading Comprehension Scenario |
11:35-13:15 |
Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg and Monica Sandell |
SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies |
11:35-13:15 |
Thomas Francois, Elena Volodina, Ildikó Pilán and Anaïs Tack |
SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners |
11:35-13:15 |
Yow-Ting Shiue and Hsin-Hsi Chen |
Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign Language |
11:35-13:15 |
Meishan Zhang, Jie Yang, Zhiyang Teng and Yue Zhang |
LibN3L:A Lightweight Package for Neural NLP |
11:35-13:15 |
Anaïs Tack, Thomas Francois, Anne-Laure Ligozat and Cédrick Fairon |
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource |
11:35-13:15 |
Claudia Baur, Johanna Gerlach, Manny Rayner, Martin Russell and Helmer Strik |
A Shared Task for Spoken CALL? |
11:35-13:15 |
AlBara Khalifa, Tsuneo Kato and Seiichi Yamamoto |
Joining-in-type Humanoid Robot Assisted Language Learning System |
|
Session P03 - Evaluation Methodologies (1) |
Chair: Ann Bies |
11:35-13:15 |
Mahmoud El-Haj and Paul Rayson |
OSMAN ― A Novel Arabic Readability Metric |
11:35-13:15 |
Edouard Geoffrois |
Evaluating Interactive System Adaptation |
11:35-13:15 |
Leon Derczynski |
Complementarity, F-score, and NLP Evaluation |
11:35-13:15 |
Mauro Dragoni, Andrea Tettamanzi and Célia da Costa Pereira |
DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion Mining |
11:35-13:15 |
Richard Fothergill, Paul Cook and Timothy Baldwin |
Evaluating a Topic Modelling Approach to Measuring Corpus Similarity |
11:35-13:15 |
Christian Fandrych, Elena Frick, Hanna Hedeland, Anna Iliash, Daniel Jettka, Cordula Meißner, Thomas Schmidt, Franziska Wallner, Kathrin Weigert and Swantje Westpfahl |
User, who art thou? User Profiling for Oral Corpus Platforms |
11:35-13:15 |
Angela Costa, Rui Correia and Luisa Coheur |
Building a Corpus of Errors and Quality in Machine Translation: Experiments on Error Impact |
11:35-13:15 |
Victoria Yaneva, Irina Temnikova and Ruslan Mitkov |
Evaluating the Readability of Text Simplification Output for Readers with Cognitive Disabilities |
11:35-13:15 |
Sahar Ghannay, Benoit Favre, Yannick Estève and Nathalie Camelin |
Word Embedding Evaluation and Combination |
11:35-13:15 |
Johann Poignant, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau and Thomas Tamisier |
Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015 |
11:35-13:15 |
Sheila Castilho and Sharon O'Brien |
Evaluating the Impact of Light Post-Editing on Usability |
|
Session P04 - Information Extraction and Retrieval (1) |
Chair: Diana Maynard |
11:35-13:15 |
Elizabeth Salesky, Jessica Ray and Wade Shen |
Operational Assessment of Keyword Search on Oral History |
11:35-13:15 |
Marco A. Valenzuela-Escárcega, Gus Hahn-Powell and Mihai Surdeanu |
Odin's Runes: A Rule Language for Information Extraction |
11:35-13:15 |
Els Lefever and Véronique Hoste |
A Classification-based Approach to Economic Event Detection in Dutch News Text |
11:35-13:15 |
Gil Francopoulo, Joseph Mariani and Patrick Paroubek |
Predictive Modeling: Guessing the NLP Terms of Tomorrow |
11:35-13:15 |
Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan and Anders Holst |
The Gavagai Living Lexicon |
11:35-13:15 |
Hamdy Mubarak and Ahmed Abdelali |
Arabic to English Person Name Transliteration using Twitter |
11:35-13:15 |
Young-Seob Jeong, Won-Tae Joo, Hyun-Woo Do, Chae-Gyun Lim, Key-Sun Choi and Ho-Jin Choi |
Korean TimeML and Korean TimeBank |
11:35-13:15 |
Julian Seitner, Christian Bizer, Kai Eckert, Stefano Faralli, Robert Meusel, Heiko Paulheim and Simone Paolo Ponzetto |
A Large DataBase of Hypernymy Relations Extracted from the Web. |
11:35-13:15 |
Nikolaos Katris, Richard Sutcliffe and Theodore Kalamboukis |
Using a Cross-Language Information Retrieval System based on OHSUMED to Evaluate the Moses and KantanMT Statistical Machine Translation Systems |
11:35-13:15 |
Gabriella Pardelli, Sara Goggi, Silvia Giannini and Stefania Biagioni |
Two Decades of Terminology: European Framework Programmes Titles |
11:35-13:15 |
Wim Peters and Adam Wyner |
Legal Text Interpretation: Identifying Hohfeldian Relations from Text |
11:35-13:15 |
Ryuichi Tachibana and Mamoru Komachi |
Analysis of English Spelling Errors in a Word-Typing Game |
11:35-13:15 |
Vojtěch Kovář, Monika Močiariková and Pavel Rychlý |
Finding Definitions in Large Corpora with Sketch Engine |
11:35-13:15 |
Teresa Rodriguez-Ferreira, Adrian Rabadan, Raquel Hervas and Alberto Diaz |
Improving Information Extraction from Wikipedia Texts using Basic English |
11:35-13:15 |
Tommaso Caselli, Giovanni Moretti, Rachele Sprugnoli, Sara Tonelli, Damien Lanfrey and Donatella Solda Kutzmann |
NLP and Public Engagement: The Case of the Italian School Reform |
11:35-13:15 |
Xabier Saralegi, Eneko Agirre and Iñaki Alegria |
Evaluating Translation Quality and CLIR Performance of Query Sessions |
11:35-13:15 |
Dieu-Thu Le and Uwe Quasthoff |
Construction and Analysis of a Large Vietnamese Text Corpus |
11:35-13:15 |
Kartik Asooja, Georgeta Bordea, Gabriela Vulcu and Paul Buitelaar |
Forecasting Emerging Trends from Scientific Literature |
11:35-13:15 |
Eunsol Choi, Matic Horvat, Jonathan May, Kevin Knight and Daniel Marcu |
Extracting Structured Scholarly Information from the Machine Translation Literature |
11:35-13:15 |
Stephen Wu, Chung-Il Wi, Sunghwan Sohn, Hongfang Liu and Young Juhn |
Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events |
11:35-13:15 |
Stefano Menini, Rachele Sprugnoli and Antonio Uva |
Who was Pietro Badoglio? Towards a QA system for Italian History |
|
Session P05 - Machine Translation (1) |
Chair: Martin Volk |
14:45-16:25 |
Mihael Arcan, Caoilfhionn Lane, Eoin Ó Droighneáin and Paul Buitelaar |
IRIS: English-Irish Machine Translation System |
14:45-16:25 |
George Tambouratzis and Vasiliki Pouli |
Linguistically Inspired Language Model Augmentation for MT |
14:45-16:25 |
Gavin Abercrombie |
A Rule-based Shallow-transfer Machine Translation System for Scots and English |
14:45-16:25 |
Matīss Rikters and Inguna Skadina |
Syntax-based Multi-system Machine Translation |
14:45-16:25 |
Sanja Štajner, Andreia Querido, Nuno Rendeiro, João António Rodrigues and António Branco |
Use of Domain-Specific Language Resources in Machine Translation |
14:45-16:25 |
Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela and Josef van Genabith |
CATaLog Online: Porting a Post-editing Tool to the Web |
14:45-16:25 |
Akira Hayakawa, Saturnino Luz, Loredana Cerrato and Nick Campbell |
The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus |
14:45-16:25 |
Kugatsu Sadamitsu, Itsumi Saito, Taichi Katayama, Hisako Asano and Yoshihiro Matsuo |
Name Translation based on Fine-grained Named Entity Recognition in a Single Language |
14:45-16:25 |
Sreelekha S and Pushpak Bhattacharyya |
Lexical Resources to Enrich English Malayalam Machine Translation |
14:45-16:25 |
Yong Xu and François Yvon |
Novel elicitation and annotation schemes for sentential and sub-sentential alignments of bitexts |
14:45-16:25 |
Liane Guillou and Christian Hardmeier |
PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation |
14:45-16:25 |
Chenhui Chu and Sadao Kurohashi |
Paraphrasing Out-of-Vocabulary Words with Word Embeddings and Semantic Lexicons for Low Resource Statistical Machine Translation |
|
Session P06 - Parsing |
Chair: Giuseppe Attardi |
14:45-16:25 |
Joachim Daiber and Rob van der Goot |
The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input Conditions |
14:45-16:25 |
Xiaoyin Che, Cheng Wang, Haojin Yang and Christoph Meinel |
Punctuation Prediction for Unsegmented Transcript Based on Word Vector |
14:45-16:25 |
Hao Zhou, Yue Zhang, Shujian Huang, Xin-Yu Dai and Jiajun Chen |
Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing |
14:45-16:25 |
Atsushi Ushiku, Tetsuro Sasada and Shinsuke Mori |
Language Resource Addition Strategies for Raw Text Parsing |
14:45-16:25 |
Yuval Marton and Kristina Toutanova |
E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic Parses |
14:45-16:25 |
Liesbeth Augustinus, Peter Dirix, Daniel Van Niekerk, Ineke Schuurman, Vincent Vandeghinste, Frank Van Eynde and Gerhard Van Huyssteen |
AfriBooms: An Online Treebank for Afrikaans |
14:45-16:25 |
Edoardo Maria Ponti and Marco Passarotti |
Differentia compositionem facit. A Slower-Paced and Reliable Parser for Latin |
14:45-16:25 |
Roald Eiselen |
South African Language Resources: Phrase Chunking |
14:45-16:25 |
Jindřich Libovický |
Neural Scoring Function for MST Parser |
14:45-16:25 |
Inari Listenmaa and Koen Claessen |
Analysing Constraint Grammars with a SAT-solver |
14:45-16:25 |
Achim Stein |
Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View |
14:45-16:25 |
Maria Pia di Buono |
Semi-automatic Parsing for Web Knowledge Extraction through Semantic Annotation |
|
Session P07 - Speech Corpora and Databases (1) |
Chair: Carmen García Mateo |
14:45-16:25 |
Jochen Weiner, Claudia Frankenberg, Dominic Telaar, Britta Wendelstein, Johannes Schröder and Tanja Schultz |
Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging |
14:45-16:25 |
Moez Ajili, Jean-françois Bonastre, Juliette Kahn, Solange Rossato and Guillaume Bernard |
FABIOLE, a Speech Database for Forensic Speaker Comparison |
14:45-16:25 |
Nawar Halabi and Mike Wald |
Phonetic Inventory for an Arabic Speech Corpus |
14:45-16:25 |
Yun-Nung Chen and Dilek Hakkani-Tur |
AIMU: Actionable Items for Meeting Understanding |
14:45-16:25 |
Felix Burkhardt and Uwe D. Reichel |
A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance |
14:45-16:25 |
Patricia Braunger, Hansjörg Hofmann, Steffen Werner and Maria Schmidt |
A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems |
14:45-16:25 |
Xabier Sarasola, Eva Navas, David Tavarez, Daniel Erro, Ibon Saratxaga and Inma Hernaez |
A Singing Voice Database in Basque for Statistical Singing Synthesis of Bertsolaritza |
14:45-16:25 |
Hannes Pessentheiner, Thomas Pichler and Martin Hagmüller |
AMISCO: The Austrian German Multi-Sensor Corpus |
14:45-16:25 |
Philipp Aichinger, Immer Roesner, Matthias Leonhard, Doris-Maria Denk-Linnert, Wolfgang Bigenzahn and Berit Schneider-Stickler |
A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological Voices |
14:45-16:25 |
Neli Hateva, Petar Mitankin and Stoyan Mihov |
BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology |
14:45-16:25 |
Mārcis Pinnis, Askars Salimbajevs and Ilze Auzina |
Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian |
14:45-16:25 |
Jorge Proença, Dirce Celorico, Sara Candeias, Carla Lopes and Fernando Perdigão |
The LetsRead Corpus of Portuguese Children Reading Aloud for Performance Evaluation |
14:45-16:25 |
Uwe Reichel, Florian Schiel, Thomas Kisler, Christoph Draxler and Nina Pörner |
The BAS Speech Data Repository |
14:45-16:25 |
Emre Yilmaz, Mario Ganzeboom, Lilian Beijer, Catia Cucchiarini and Helmer Strik |
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research |
|
Session P09 - Word Sense Disambiguation (1) |
Chair: Luca Dini |
14:45-16:25 |
Luigi Di Caro and Guido Boella |
Automatic Enrichment of WordNet with Common-Sense Knowledge |
14:45-16:25 |
Vít Baisa, Silvie Cinkova, Ema Krejčová and Anna Vernerová |
VPS-GradeUp: Graded Decisions on Usage Patterns |
14:45-16:25 |
Tristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho and Iryna Gurevych |
Sense-annotating a Lexical Substitution Data Set with Ubyline |
14:45-16:25 |
Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner and Manfred Pinkal |
A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds |
14:45-16:25 |
Bolette Pedersen, Anna Braasch, Anders Johannsen, Héctor Martínez Alonso, Sanni Nimb, Sussi Olsen, Anders Søgaard and Nicolai Hartvig Sørensen |
The SemDaX Corpus ― Sense Annotations with Scalable Sense Inventories |
14:45-16:25 |
Silvie Cinkova, Ema Krejčová, Anna Vernerová and Vít Baisa |
Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study |
14:45-16:25 |
Yanan Lu, Yue Zhang and Donghong Ji |
Multi-prototype Chinese Character Embedding |
14:45-16:25 |
Angel Chang, Valentin I. Spitkovsky, Christopher D. Manning and Eneko Agirre |
A comparison of Named-Entity Disambiguation and Word Sense Disambiguation |
|
Session P10 - Discourse (1) |
Chair: Elena Cabrio |
16:45-18:05 |
Patrick Saint-Dizier |
Argument Mining: the Bottleneck of Knowledge and Language Resources |
16:45-18:05 |
Ekaterina Lapshinova-Koltunski, Kerstin Anna Kunz and Anna Nedoluzhko |
From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of Discourse |
16:45-18:05 |
Henk van den Heuvel and Nelleke Oostdijk |
Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans |
16:45-18:05 |
Yang Liu, Jiajun Zhang, Chengqing Zong, Yating Yang and Xi Zhou |
A Bilingual Discourse Corpus and Its Applications |
16:45-18:05 |
Tatjana Scheffler and Manfred Stede |
Adding Semantic Relations to a Large-Coverage Connective Lexicon of German |
16:45-18:05 |
Mathilde Janier and Chris Reed |
Corpus Resources for Dispute Mediation Discourse |
16:45-18:05 |
Carlos Valmaseda, Juan Martinez-Romo and Lourdes Araujo |
A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers |
16:45-18:05 |
Stephanie Lukin, Kevin Bowden, Casey Barackman and Marilyn Walker |
PersonaBank: A Corpus of Personal Narratives and Their Story Intention Graphs |
16:45-18:05 |
Huan-Yuan Chen, Wan-Shan Liao, Hen-Hsen Huang and Hsin-Hsi Chen |
Fine-Grained Chinese Discourse Relation Labelling |
16:45-18:05 |
Ines Rehbein, Merel Scholman and Vera Demberg |
Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR Frameworks |
16:45-18:05 |
Carole Lailler, Anaïs Landeau, Frédéric Béchet, Yannick Estève and Paul Deléglise |
Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks |
16:45-18:05 |
Manfred Stede, Stergos Afantenos, Andreas Peldszus, Nicholas Asher and Jérémy Perret |
Parallel Discourse Annotations on a Corpus of Short Texts |
16:45-18:05 |
John Lee and Chak Yan Yeung |
An Annotated Corpus of Direct Speech |
|
Session P12 - Sentiment Analysis and Opinion Mining (1) |
Chair: German Rigau |
16:45-18:05 |
Cédric Lopez, Frederique Segond and Christiane Fellbaum |
Encoding Adjective Scales for Fine-grained Resources |
16:45-18:05 |
Mario Sänger, Ulf Leser, Steffen Kemmerer, Peter Adolphs and Roman Klinger |
SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in German |
16:45-18:05 |
Marianna Apidianaki, Xavier Tannier and Cécile Richart |
Datasets for Aspect-Based Sentiment Analysis in French |
16:45-18:05 |
Samira Shaikh, Kit Cho, Tomek Strzalkowski, Laurie Feldman, John Lien, Ting Liu and George Aaron Broadwell |
ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple Languages |
16:45-18:05 |
Uladzimir Sidarenka |
PotTS: The Potsdam Twitter Sentiment Corpus |
16:45-18:05 |
Diana Maynard and Kalina Bontcheva |
Challenges of Evaluating Sentiment Analysis Tools on Social Media |
16:45-18:05 |
Jasy Suet Yan Liew, Howard R. Turtle and Elizabeth D. Liddy |
EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis |
16:45-18:05 |
Svetlana Kiritchenko and Saif Mohammad |
Happy Accident: A Sentiment Composition Lexicon for Opposing Polarity Phrases |
16:45-18:05 |
Alexandra Balahur and Hristo Tanev |
Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept Properties |
16:45-18:05 |
Natalia Loukachevitch and Anatolii Levchik |
Creating a General Russian Sentiment Lexicon |
16:45-18:05 |
Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo and Piek Vossen |
GRaSP: A Multilayered Annotation Scheme for Perspectives |
16:45-18:05 |
Wejdene Khiari, Mathieu Roche and Asma Bouhafs Hafsia |
Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMS |
16:45-18:05 |
Fabio Tamburini |
Specialising Paragraph Vectors for Text Polarity Detection |
16:45-18:05 |
Grégoire Jadi, Vincent Claveau, Béatrice Daille and Laura Monceaux |
Evaluating Lexical Similarity to build Sentiment Similarity |
|
Session P13 - Semantics (1) |
Chair: Christian Chiarcos |
16:45-18:05 |
Maximilian Köper, Melanie Zaiß, Qi Han, Steffen Koch and Sabine Schulte im Walde |
Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification |
16:45-18:05 |
Nabin Maharjan, Rajendra Banjade, Nobal Bikram Niraula and Vasile Rus |
SemAligner: A Method and Tool for Aligning Chunks with Semantic Relation Types and Semantic Similarity Scores |
16:45-18:05 |
Ingrid Falk and Fabienne Martin |
Aspectual Flexibility Increases with Agentivity and Concreteness\\ A Computational Classification Experiment on Polysemous Verbs |
16:45-18:05 |
Silvio Cordeiro, Carlos Ramisch and Aline Villavicencio |
mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing |
16:45-18:05 |
Elias Iosif, Spiros Georgiladakis and Alexandros Potamianos |
Cognitively Motivated Distributional Representations of Meaning |
16:45-18:05 |
Yoshihiko Hayashi and Wentao Luo |
Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual Settings |
16:45-18:05 |
Ann Copestake, Guy Emerson, Michael Wayne Goodman, Matic Horvat, Alexander Kuhnle and Ewa Muszyńska |
Resources for building applications with Dependency Minimal Recursion Semantics |
16:45-18:05 |
Chung-Lun Kuo and Hsin-Hsi Chen |
Subtask Mining from Search Query Logs for How-Knowledge Acceleration |
16:45-18:05 |
Daria Ryzhova, Maria Kyuseva and Denis Paperno |
Typology of Adjectives Benchmark for Compositional Distributional Models |
16:45-18:05 |
Tom Bosc, Elena Cabrio and Serena Villata |
DART: a Dataset of Arguments and their Relations on Twitter |
|
Session P15 - Multimodality |
Chair: Carlo Strapparava |
18:10-19:10 |
Laura Hollink, Adriatik Bedjeti, Martin van Harmelen and Desmond Elliott |
A Corpus of Images and Text in Online News |
18:10-19:10 |
Necati Cihan Camgöz, Ahmet Alp Kındıroğlu, Serpil Karabüklü, Meltem Kelepir, Ayşe Sumru Özsoy and Lale Akarun |
BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains |
18:10-19:10 |
Michel Vacher, Saïda Bouakaz, Marc-Eric Bobillier Chaumon, Frédéric Aman, R. A. Khan, Slima Bekkadja, François Portet, Erwan Guillou, Solange Rossato and Benjamin Lecouteux |
The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People |
18:10-19:10 |
Niraj Shrestha and Marie-Francine Moens |
Semi-automatically Alignment of Predicates between Speech and OntoNotes data |
18:10-19:10 |
María del Carmen Cabeza-Pereiro, José Mª García-Miguel, Carmen García Mateo and José Luis Alba Castro |
CORILSE: a Spanish Sign Language Repository for Linguistic Analysis |
18:10-19:10 |
Stephanie Schreitter and Brigitte Krenn |
The OFAI Multi-Modal Task Description Corpus |
18:10-19:10 |
Shinsuke Mori, John Richardson, Atsushi Ushiku, Tetsuro Sasada, Hirotaka Kameko and Yoshimasa Tsuruoka |
A Japanese Chess Commentary Corpus |
18:10-19:10 |
Johann Poignant, Mateusz Budnik, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau, Gilles Adda, Laurent Besacier, Hazim Ekenel, Gil Francopoulo, Javier Hernando, Joseph Mariani, Ramon Morros, Georges Quénot, Sophie Rosset and Thomas Tamisier |
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents |
18:10-19:10 |
Andy Luecking, Alexander Mehler, Désirée Walther, Marcel Mauri and Dennis Kurfürst |
Finding Recurrent Features of Image Schema Gestures: the FIGURE corpus |
18:10-19:10 |
Kai Frederic Engelmann, Patrick Holthaus, Britta Wrede and Sebastian Wrede |
An Interaction-Centric Dataset for Learning Automation Rules in Smart Homes |
18:10-19:10 |
Alex Becker, Fabio Kepler and Sara Candeias |
A Web Tool for Building Parallel Corpora of Spoken and Sign Languages |
|
Session P16 - Ontologies |
Chair: Elena Montiel Ponsoda |
18:10-19:10 |
Sharmin Muzaffar, Pitambar Behera and Girish Jha |
Issues and Challenges in Annotating Urdu Action Verbs on the IMAGACT4ALL Platform |
18:10-19:10 |
Liumingjing Xiao, Chong Ruan, An Yang, Junhao Zhang and Junfeng Hu |
Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpedia |
18:10-19:10 |
Janne M Johannessen, Arash Saidi and Kristin Hagen |
Constructing a Norwegian Academic Wordlist |
18:10-19:10 |
Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau and Anne-Lyse Minard |
The Event and Implied Situation Ontology (ESO): Application and Evaluation |
18:10-19:10 |
Maria Sukhareva and Christian Chiarcos |
Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low German |
18:10-19:10 |
Maxence Girard-Rivier, Romain Magnani, Veronique Auberge, Yuko Sasa, Liliya Tsvetanova, Frederic Aman and Clarisse Bayol |
Ecological Gestures for HRI: the GEE Corpus |
18:10-19:10 |
Rogelio Nazar and Irene Renau |
A Taxonomy of Spanish Nouns, a Statistical Algorithm to Generate it and its Implementation in Open Source Code |
|
Session P18 - Treebanks (1) |
Chair: Béatrice Daille |
18:10-19:10 |
Quy Nguyen, Yusuke Miyao, Ha Le and Ngan Nguyen |
Challenges and Solutions for Consistent Annotation of Vietnamese Treebank |
18:10-19:10 |
Kanta Suzuki, Yoshihide Kato and Shigeki Matsubara |
Correcting Errors in a Treebank Based on Tree Mining |
18:10-19:10 |
Philippe Blache, Gregoire de Montcheuil, Laurent Prévot and Stéphane Rauzy |
4Couv: A New Treebank for French |
18:10-19:10 |
Rita de Carvalho, Andreia Querido, Marisa Campos, Rita Valadas Pereira, João Silva and António Branco |
CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese |
18:10-19:10 |
Kadri Muischnek, Kaili Müürisep and Tiina Puolakainen |
Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies |
18:10-19:10 |
Kaja Dobrovoljc and Joakim Nivre |
The Universal Dependencies Treebank of Spoken Slovenian |
18:10-19:10 |
Ye Kyaw Thu, Win Pa Pa, Masao Utiyama, Andrew Finch and Eiichiro Sumita |
Introducing the Asian Language Treebank (ALT) |
18:10-19:10 |
Lilja Øvrelid and Petter Hohle |
Universal Dependencies for Norwegian |
Day 2, Oral Sessions:
|
Session O19 - Dependency Treebanks |
Chairperson: Simonetta Montemagni |
9:45-10:05 |
Takaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori and Yuji Matsumoto |
Universal Dependencies for Japanese |
10:05-10:25 |
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty and Daniel Zeman |
Universal Dependencies v1: A Multilingual Treebank Collection |
10:25-10:45 |
Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto |
Construction of an English Dependency Corpus incorporating Compound Function Words |
10:45-11:05 |
Maria Simi and Giuseppe Attardi |
Adapting the TANL tool suite to Universal Dependencies |
11:05-11:25 |
Tak-sum Wong and John Lee |
A Dependency Treebank of the Chinese Buddhist Canon |
|
Session O22 - Anaphora and Coreference |
Chairperson: Eva Hajičová |
11:45-12:05 |
Jon Chamberlain, Massimo Poesio and Udo Kruschwitz |
Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference. |
12:05-12:25 |
Evandro Fonseca, André Antonitsch, Sandra Collovini, Daniela Amaral, Renata Vieira and Anny Figueira |
Summ-it++: an Enriched Version of the Summ-it Corpus |
12:25-12:45 |
Alicia Burga, Sergio Cajal, Joan Codina-Filba and Leo Wanner |
Towards Multiple Antecedent Coreference Resolution in Specialized Discourse |
12:45-13:05 |
Olga Uryupina, Ron Artstein, Antonella Bristot, Federica Cavicchio, Kepa Rodriguez and Massimo Poesio |
ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions |
|
Session O24 - Speech Corpus for Health |
Chairperson: Eleni Efthimiou |
11:45-12:05 |
Daniela Beltrami, Laura Calzà, Gloria Gagliardi, Enrico Ghidoni, Norina Marcello, Rema Rossini Favretti and Fabio Tamburini |
Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productions |
12:05-12:25 |
Mario Corrales-Astorgano, David Escudero-Mancebo, Yurena Gutiérrez-González, Valle Flores-Lucas, César González-Ferreras and Valentín Cardeñoso-Payo |
On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual Disabilities |
12:25-12:45 |
Julia Parish-Morris, Christopher Cieri, Mark Liberman, Leila Bateman, Emily Ferguson and Robert T. Schultz |
Building Language Resources for Exploring Autism Spectrum Disorders |
12:45-13:05 |
Naim Terbeh and Mounir Zrigui |
Vocal Pathologies Detection and Mispronounced Phonemes Identification: Case of Arabic Continuous Speech |
|
Session O29 - Panel on International Initiatives from Public Agencies |
Chairperson: Khalid Choukri |
16:55-18:15 |
|
|
|
Session O31 - Summarisation and Simplification |
Chairperson: Udo Kruschwitz |
16:55-17:15 |
Gustavo Paetzold and Lucia Specia |
Benchmarking Lexical Simplification Systems |
17:15-17:35 |
Beatriz Fisas, Francesco Ronzano and Horacio Saggion |
A Multi-Layered Annotated Corpus of Scientific Papers |
17:35-17:55 |
Yashar Mehdad, Amanda Stent, Kapil Thadani, Dragomir Radev, Youssef Billawala and Karolina Buchner |
Extractive Summarization under Strict Length Constraints |
17:55-18:15 |
Emma Barker, Monica Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple and Robert Gaizauskas |
Whats the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems |
|
Session O35 - Detecting Information in Medical Domain |
Chairperson: Dimitrios Kokkinakis |
18:20-18:40 |
Elena Arsevska, Mathieu Roche, Sylvain Falala, Renaud Lancelot, David Chavernac, Pascal Hendrikx and Barbara Dufour |
Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge |
18:40-19:00 |
Stephen Wu, Tamara Timmons, Amy Yates, Meikun Wang, Steven Bedrick, William Hersh and Hongfang Liu |
On Developing Resources for Patient-level Information Retrieval |
19:00-19:20 |
Prescott Klassen, Fei Xia and Meliha Yetisgen |
Annotating and Detecting Medical Events in Clinical Notes |
Day 2, Poster Sessions:
|
Session P19 - Discourse (2) |
Chair: Olga Uryupina |
9:45-11:25 |
Manfred Stede and Sara Mamprin |
Information structure in the Potsdam Commentary Corpus: Topics |
9:45-11:25 |
Jonathon Read, Erik Velldal, Marc Cavazza and Gersende Georg |
A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations |
9:45-11:25 |
Amy Isard |
The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts |
9:45-11:25 |
Daniel Duma, Maria Liakata, Amanda Clare, James Ravenscroft and Ewan Klein |
Applying Core Scientific Concepts to Context-Based Citation Recommendation |
9:45-11:25 |
Ina Roesiger |
SciCorp: A Corpus of English Scientific Articles Annotated for Information Status Analysis |
9:45-11:25 |
Rohit Jain, Himanshu Sharma and Dipti Sharma |
Using lexical and Dependency Features to Disambiguate Discourse Connectives in Hindi |
9:45-11:25 |
Marta Andersson, Adnan Ozturel and Silvia Pareti |
Annotating Topic Development in Information Seeking Queries |
9:45-11:25 |
Jiří Mírovský, Lucie Poláková and Jan Štěpánek |
Searching in the Penn Discourse Treebank Using the PML-Tree Query |
9:45-11:25 |
Ghada Alharbi and Thomas Hain |
The OpenCourseWare Metadiscourse (OCWMD) Corpus |
9:45-11:25 |
Nicolas Hernandez, Soufian Salim and Elizaveta Loginova Clouet |
Ubuntu-fr: A Large and Open Corpus for Multi-modal Analysis of Online Written Conversations |
9:45-11:25 |
Julian Hough, Ye Tian, Laura de Ruiter, Simon Betz, Spyros Kousidis, David Schlangen and Jonathan Ginzburg |
DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and Laughter |
|
Session P20 - Document Classification and Text Categorisation (1) |
Chair: Fabio Tamburini |
9:45-11:25 |
Guntis Barzdins, Steve Renals and Didzis Gosko |
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project |
9:45-11:25 |
Cynthia Van Hee, Els Lefever and Veronique Hoste |
Exploring the Realization of Irony in Twitter Data |
9:45-11:25 |
Cyril Goutte, Serge Léger, Shervin Malmasi and Marcos Zampieri |
Discriminating Similar Languages: Evaluations and Explorations |
9:45-11:25 |
Latifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell and Ayman Alghamdi |
Compilation of an Arabic Childrens Corpus |
9:45-11:25 |
Robin Eriksson |
Quality Assessment of the Reuters Vol. 2 Multilingual Corpus |
9:45-11:25 |
Mahmoud El-Haj, Paul Rayson, Steve Young, Andrew Moore, Martin Walker, Thomas Schleicher and Vasiliki Athanasakou |
Learning Tone and Attribution for Financial Text Mining |
9:45-11:25 |
Roman Sergienko, Muhammad Shan and Wolfgang Minker |
A Comparative Study of Text Preprocessing Approaches for Topic Detection of User Utterances |
9:45-11:25 |
Muhammad Sharjeel, Paul Rayson and Rao Muhammad Adeel Nawab |
UPPC - Urdu Paraphrase Plagiarism Corpus |
9:45-11:25 |
Yannis Korkontzelos, Paul Thompson and Sophia Ananiadou |
Identifying Content Types of Messages Related to Open Source Software Projects |
9:45-11:25 |
Minglei Li, Yunfei Long, Lu Qin and Wenjie Li |
Emotion Corpus Construction Based on Selection from Hashtags |
|
Session P21 - Evaluation Methodologies (2) |
Chair: António Branco |
9:45-11:25 |
Björn Gambäck and Amitava Das |
Comparing the Level of Code-Switching in Corpora |
9:45-11:25 |
Markus Müller, Sarah Fünfer, Sebastian Stüker and Alex Waibel |
Evaluation of the KIT Lecture Translation System |
9:45-11:25 |
Behrang QasemiZadeh and Anne-Kathrin Schumann |
The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods |
9:45-11:25 |
Wajdi Zaghouani, Nizar Habash, Ossama Obeid, Behrang Mohit, Houda Bouamor and Kemal Oflazer |
Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and Annotation |
9:45-11:25 |
Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondrej Klejch, Martin Popel and Maja Popović |
Tools and Guidelines for Principled Machine Translation Development |
9:45-11:25 |
Olivier Galibert, Mohamed Ameur Ben Jannet, Juliette Kahn and Sophie Rosset |
Generating Task-Pertinent sorted Error Lists for Speech Recognition |
|
Session P22 - Information Extraction and Retrieval (2) |
Chair: Robert Gaizauskas |
9:45-11:25 |
Gil Francopoulo, Joseph Mariani and Patrick Paroubek |
A Study of Reuse and Plagiarism in LREC papers |
9:45-11:25 |
Debasis Ganguly, Iacer Calixto and Gareth Jones |
Developing a Dataset for Evaluating Approaches for Document Expansion with Images |
9:45-11:25 |
Pablo Ruiz, Clément Plancq and Thierry Poibeau |
More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic Parsing |
9:45-11:25 |
Sandra Collovini, Gabriel Machado and Renata Vieira |
A Sequence Model Approach to Relation Extraction in Portuguese |
9:45-11:25 |
Daniel Hládek, Ján Staš and Jozef Juhár |
Evaluation Set for Slovak News Information Retrieval |
9:45-11:25 |
Takakazu Imada, Yusuke Inoue, Lei Chen, Syunya Doi, Tian Nie, Chen Zhao, Takehito Utsuro and Yasuhide Kawada |
Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine Suggests |
9:45-11:25 |
Adrien Bougouin, Sabine Barreaux, Laurent Romary, Florian Boudin and Beatrice Daille |
TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation |
9:45-11:25 |
Hannah Kermes, Stefania Degaetano-Ortlieb, Ashraf Khamis, Jörg Knappen and Elke Teich |
The Royal Society Corpus: From Uncharted Data to Corpus |
9:45-11:25 |
Lorraine Goeuriot, Liadh Kelly, Guido Zuccon and Joao Palotti |
Building Evaluation Datasets for Consumer-Oriented Information Retrieval |
9:45-11:25 |
Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret and Romaric Besançon |
A Dataset for Open Event Extraction in English |
|
Session P24 - Speech Processing (1) |
Chair: Andrew Caines |
9:45-11:25 |
Frédéric Aman, Michel Vacher, François Portet, William Duclot and Benjamin Lecouteux |
CirdoX: an on/off-line multisource speech and sound analysis software |
9:45-11:25 |
Matthias Sperber, Graham Neubig, Satoshi Nakamura and Alex Waibel |
Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces |
9:45-11:25 |
Mauro Nicolao, Heidi Christensen, Stuart Cunningham, Phil Green and Thomas Hain |
A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus |
9:45-11:25 |
Imed Laaridh, Corinne Fredouille and Christine Meunier |
Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech |
9:45-11:25 |
Alexander Gutkin, Linne Ha, Martin Jansche, Knot Pipatsrisawat and Richard Sproat |
TTS for Low Resource Languages: A Bangla Synthesizer |
9:45-11:25 |
Félicien Vallet, Jim Uro, Jérémy Andriamakaoly, Hakim Nabi, Mathieu Derval and Jean Carrive |
Speech Trax: A Bottom to the Top Approach for Speaker Tracking and Indexing in an Archiving Context |
|
Session P25 - Crowdsourcing |
Chair: Monica Monachini |
11:45-13:05 |
Armin Hoenen |
Wikipedia Titles As Noun Tag Predictors |
11:45-13:05 |
Jun Harashima |
Japanese Word―Color Associations with and without Contexts |
11:45-13:05 |
Emiel van Miltenburg, Benjamin Timmermans and Lora Aroyo |
The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound Database |
11:45-13:05 |
Maria Sukhareva, Judith Eckle-Kohler, Ivan Habernal and Iryna Gurevych |
Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations |
11:45-13:05 |
Anna Feltracco, Simone Magnolini, Elisabetta Jezek and Bernardo Magnini |
Acquiring Opposition Relations among Italian Verb Senses using Crowdsourcing |
11:45-13:05 |
Andrew Caines, Christian Bentz, Calbert Graham, Tim Polzehl and Paula Buttery |
Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora |
11:45-13:05 |
Phil Bartie, William Mackaness, Dimitra Gkatzia and Verena Rieser |
The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban Scenes |
11:45-13:05 |
Simone Hantke, Erik Marchi and Björn Schuller |
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification |
|
Session P27 - Machine Translation (2) |
Chair: Aljoscha Burchardt |
11:45-13:05 |
Antoine Bourlon, Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi |
Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated Lexicons |
11:45-13:05 |
Diptesh Kanojia, Aditya Joshi, Pushpak Bhattacharyya and Mark James Carman |
That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models |
11:45-13:05 |
Toshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi and Hitoshi Isahara |
ASPEC: Asian Scientific Paper Excerpt Corpus |
11:45-13:05 |
Gorka Labaka, Iñaki Alegria and Kepa Sarasola |
Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation |
11:45-13:05 |
Xiaofeng Wu, Jinhua Du, Qun Liu and Andy Way |
ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool |
11:45-13:05 |
Jingyi Han and Núria Bel |
Towards producing bilingual lexica from monolingual corpora |
11:45-13:05 |
Luís Gomes and Gabriel Pereira Lopes |
First Steps Towards Coverage-Based Sentence Alignment |
11:45-13:05 |
Jeevanthi Liyanapathirana and Andrei Popescu-Belis |
Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation |
11:45-13:05 |
Frédéric Blain, Varvara Logacheva and Lucia Specia |
Phrase Level Segmentation and Labelling of Machine Translation Errors |
11:45-13:05 |
José Manuel Martínez Martínez and Mihaela Vela |
SubCo: A Learner Translation Corpus of Human and Machine Subtitles |
|
Session P28 - Multiword Expressions |
Chair: Irina Temnikova |
11:45-13:05 |
Diana Bogantes, Eric Rodríguez, Alejandro Arauco, Alejandro Rodríguez and Agata Savary |
Towards Lexical Encoding of Multi-Word Expressions in Spanish Dialects |
11:45-13:05 |
Ziqi Zhang, Jie Gao and Fabio Ciravegna |
JATE 2.0: Java Automatic Term Extraction with Apache Solr |
11:45-13:05 |
Francesca Strik Lievers and Chu-Ren Huang |
A lexicon of perception for the identification of synaesthetic metaphors in corpora |
11:45-13:05 |
Malgorzata Marciniak, Agnieszka Mykowiecka and Piotr Rychlik |
TermoPL - a Flexible Tool for Terminology Extraction |
11:45-13:05 |
Sabine Schulte im Walde, Anna Hätty, Stefan Bott and Nana Khvtisavrishvili |
GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds |
11:45-13:05 |
Carlos Ramisch, Alexis Nasr, André Valli and José Deulofeu |
DeQue: A Lexicon of Complex Prepositions and Conjunctions in French |
11:45-13:05 |
Gyri Smørdal Losnegaard, Federico Sangati, Carla Parra Escartín, Agata Savary, Sascha Bargmann and Johanna Monti |
PARSEME Survey on MWE Resources |
11:45-13:05 |
Rodrigo Wilkens, Marco Idiart and Aline Villavicencio |
Multiword Expressions in Child Language |
11:45-13:05 |
Dhouha Bouamor, Leonardo Campillos Llanos, Anne-Laure Ligozat, Sophie Rosset and Pierre Zweigenbaum |
Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality |
11:45-13:05 |
Sara Rodríguez-Fernández, Roberto Carlini, Luis Espinosa Anke and Leo Wanner |
Example-based Acquisition of Fine-grained Collocation Resources |
11:45-13:05 |
Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary and Petya Osenova |
MWEs in Treebanks: From Survey to Guidelines |
11:45-13:05 |
Dhirendra Singh, Sudha Bhingardive and Pushpak Bhattacharya |
Multiword Expressions Dataset for Indian Languages |
|
Session P30 - Linked Data |
Chair: Felix Sasaki |
14:55-16:35 |
Johann-Mattis List, Michael Cysouw and Robert Forkel |
Concepticon: A Resource for the Linking of Concept Lists |
14:55-16:35 |
Ingrid Falk and Achim Stein |
LVF-lemon ― Towards a Linked Data Representation of Les Verbes français |
14:55-16:35 |
Paloma Galvan, Virginia Francisco, Raquel Hervas and Gonzalo Mendez |
Riddle Generation using Word Associations |
14:55-16:35 |
Ewa Rudnicka, Wojciech Witkowski and Katarzyna Podlaska |
Challenges of Adjective Mapping between plWordNet and Princeton WordNet |
14:55-16:35 |
Aleksandra Gabryszak, Sebastian Krause, Leonhard Hennig, Feiyu Xu and Hans Uszkoreit |
Relation- and Phrase-level Linking of FrameNet with Sar-graphs |
14:55-16:35 |
Balázs Indig, Márton Miháltz and András Simonyi |
Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information Transfer |
14:55-16:35 |
Ravindra Harige and Paul Buitelaar |
Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text |
14:55-16:35 |
John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora and Jonathan Pool |
The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud |
14:55-16:35 |
Tatiana Lesnikova, Jérôme David and Jérôme Euzenat |
Cross-lingual RDF Thesauri Interlinking |
|
Session P31 - LR Infrastructures and Architectures (1) |
Chair: Yohei Murakami |
14:55-16:35 |
Georg Rehm |
The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources |
14:55-16:35 |
Jun Harashima, Michiaki Ariga, Kenta Murata and Masayuki Ioki |
A Large-scale Recipe and Meal Data Collection as Infrastructure for Food Research |
14:55-16:35 |
Siim Orasmaa, Timo Petmanson, Alexander Tkachenko, Sven Laur and Heiki-Jaan Kaalep |
EstNLTK - NLP Toolkit for Estonian |
14:55-16:35 |
Justus Roux |
South African National Centre for Digital Language Resources |
14:55-16:35 |
Verena Lyding and Karin Schöne |
Design and Development of the MERLIN Learner Corpus Platform |
14:55-16:35 |
Menzo Windhouwer, Marc Kemps-Snijders, Paul Trilsbeek, André Moreira, Bas Van der Veen, Guilherme Silva and Daniel Von Reihn |
FLAT: Constructing a CLARIN Compatible Home for Language Resources |
14:55-16:35 |
Jan Odijk |
CLARIAH in the Netherlands |
14:55-16:35 |
Claus Zinn, Thorsten Trippel, Steve Kaminski and Emanuel Dima |
Crosswalking from CMDI to Dublin Core and MARC 21 |
14:55-16:35 |
Denise DiPersio, Christopher Cieri and Daniel Jaquette |
Data Management Plans and Data Centers |
14:55-16:35 |
Udo Hahn, Franz Matthies, Erik Faessler and Johannes Hellrich |
UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP Pipelines |
14:55-16:35 |
Lene Offersgaard and Dorte Haltrup Hansen |
Facilitating Metadata Interoperability in CLARIN-DK |
14:55-16:35 |
Nancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen and Christopher Cieri |
The Language Application Grid and Galaxy |
|
Session P32 - Large Projects and Infrastructures (1) |
Chair: Zygmunt Vetulani |
14:55-16:35 |
Dan Tufiș, Verginica Barbu Mititelu, Elena Irimia, Ștefan Daniel Dumitrescu and Tiberiu Boroș |
The IPR-cleared Corpus of Contemporary Written and Spoken Romanian Language |
14:55-16:35 |
Michal Křen, Václav Cvrček, Tomáš Čapka, Anna Čermáková, Milena Hnátková, Lucie Chlumská, Tomáš Jelínek, Dominika Kováříková, Vladimír Petkevič, Pavel Procházka, Hana Skoumalová, Michal Škrabal, Petr Truneček, Pavel Vondřička and Adrian Jan Zasina |
SYN2015: Representative Corpus of Contemporary Written Czech |
14:55-16:35 |
Riccardo Del Gratta, Francesca Frontini, Monica Monachini, Gabriella Pardelli, Irene Russo, Roberto Bartolini, Fahad Khan, Claudia Soria and Nicoletta Calzolari |
LREC as a Graph: People and Resources in a Network |
14:55-16:35 |
Pawel Kamocki, Pavel Straňák and Michal Sedlák |
The Public License Selector:
Making Open Licensing Easier |
14:55-16:35 |
Daiva Vitkutė-Adžgauskienė, Andrius Utka, Darius Amilevičius and Tomas Krilavičius |
NLP Infrastructure for the Lithuanian Language |
14:55-16:35 |
Ulrike Krieg-Holz, Christian Schuschnig, Franz Matthies, Benjamin Redling and Udo Hahn |
CodE Alltag: A German-Language E-Mail Corpus |
|
Session P33 - Morphology (2) |
Chair: Felice dell'Orletta |
14:55-16:35 |
Seth Kulick and Ann Bies |
Rapid Development of Morphological Analyzers for Typologically Diverse Languages |
14:55-16:35 |
Abhisek Chakrabarty, Akshay Chaturvedi and Utpal Garain |
A Neural Lemmatizer for Bengali |
14:55-16:35 |
Francis Tyers, Aziyana Bayyr-ool, Aelita Salchak and Jonathan Washington |
A Finite-state Morphological Analyser for Tuvan |
14:55-16:35 |
Andrejs Spektors, Ilze Auziņa, Roberts Darģis, Normunds Grūzītis, Pēteris Paikens, Lauma Pretkalniņa, Laura Rituma and Baiba Saulīte |
Tēzaurs.lv: the Largest Open Lexical Database for Latvian |
14:55-16:35 |
Raveesh Motlani, Francis Tyers and Dipti Sharma |
A Finite-State Morphological Analyser for Sindhi |
14:55-16:35 |
Markus Forsberg and Mans Hulden |
Deriving Morphological Analyzers from Example Inflections |
14:55-16:35 |
Daniel Smith and Mans Hulden |
Morphological Analysis of Sahidic Coptic for Automatic Glossing |
14:55-16:35 |
Marcin Woliński and Witold Kieraś |
The on-line version of Grammatical Dictionary of Polish |
|
Session P34 - Semantic Lexicons |
Chair: Kiril Simov |
14:55-16:35 |
Maximilian Köper and Sabine Schulte im Walde |
Automatically Generated Affective Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German Lemmas |
14:55-16:35 |
Marco Passarotti, Berta González Saavedra and Christophe Onambele |
Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin |
14:55-16:35 |
Yoshihiko Hayashi |
A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic Resources |
14:55-16:35 |
Scott Piao, Paul Rayson, Dawn Archer, Francesca Bianchi, Carmen Dayrell, Mahmoud El-Haj, Ricardo-María Jiménez, Dawn Knight, Michal Křen, Laura Löfberg, Rao Muhammad Adeel Nawab, Jawad Shafi, Phoey Lee Teh and Olga Mudraya |
Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve Languages |
14:55-16:35 |
Gábor Recski |
Building Concept Graphs from Monolingual Dictionary Entries |
14:55-16:35 |
Elżbieta Hajnicz, Anna Andrzejczuk and Tomasz Bartosiak |
Semantic Layer of the Valence Dictionary of Polish Walenty |
14:55-16:35 |
Lucia Busso and Alessandro Lenci |
Italian VerbNet: A Construction-based Approach to Italian Verb Classification |
14:55-16:35 |
Natalia Grabar and Thierry Hamon |
A Large Rated Lexicon with French Medical Words |
14:55-16:35 |
Alexander Panchenko |
Best of Both Worlds: Making Word Sense Embeddings Interpretable |
14:55-16:35 |
Leonardo Zilio, Maria José Bocorny Finatto and Aline Villavicencio |
VerbLexPor: a lexical resource with semantic roles for Portuguese |
14:55-16:35 |
Maddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe and German Rigau |
A Multilingual Predicate Matrix |
14:55-16:35 |
Bryan Wilkinson and Oates Tim |
A Gold Standard for Scalar Adjectives |
14:55-16:35 |
Ivan Sekulić and Jan Šnajder |
VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian |
14:55-16:35 |
Alberto Simões, Xavier Gómez Guinovart and José João Almeida |
Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary |
|
Session P37 - Parallel and Comparable Corpora |
Chair: Jörg Tiedemann |
16:55-18:15 |
Huaxing Shi, Tiejun Zhao and Keh-Yih Su |
Building A Case-based Semantic English-Chinese Parallel Treebank |
16:55-18:15 |
Xuansong Li, Jennifer Tracey, Stephen Grimes and Stephanie Strassel |
Uzbek-English and Turkish-English Morpheme Alignment Corpora |
16:55-18:15 |
Chenhui Chu, Raj Dabre and Sadao Kurohashi |
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features |
16:55-18:15 |
Iñaki San Vicente, Iñaki Alegria, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martinez Garcia, Antonio Toral, Arkaitz Zubiaga and Nora Aranberri |
TweetMT: A Parallel Microblog Corpus |
16:55-18:15 |
Mariana Neves, Antonio Jimeno Yepes and Aurélie Névéol |
The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine |
16:55-18:15 |
Nikola Ljubešić, Miquel Esplà-Gomis, Antonio Toral, Sergio Ortiz Rojas and Filip Klubička |
Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair |
|
Session P38 - Social Media |
Chair: Fei Xia |
16:55-18:15 |
Dane Bell, Daniel Fried, Luwen Huangfu, Mihai Surdeanu and Stephen Kobourov |
Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness |
16:55-18:15 |
Bridget Sommerdijk, Eric Sanders and Antal van den Bosch |
Can Tweets Predict TV Ratings? |
16:55-18:15 |
SoHyun Park, Afsaneh Fazly, Annie Lee, Brandon Seibel, Wenjie Zi and Paul Cook |
Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media Corpus |
16:55-18:15 |
Shigeyuki Sakaki, Francine Chen, Mandy Korpusik and Yan-Ying Chen |
Corpus for Customer Purchase Behavior Prediction in Social Media |
16:55-18:15 |
Arda Celebi and Arzucan Özgür |
Segmenting Hashtags using Automatically Created Training Data |
16:55-18:15 |
Dirk Hovy and Anders Johannsen |
Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics |
16:55-18:15 |
Wanru Zhang, Andrew Caines, Dimitrios Alikaniotis and Paula Buttery |
Predicting Author Age from Weibo Microblog Posts |
16:55-18:15 |
Andrew Yates, Alek Kolcz, Nazli Goharian and Ophir Frieder |
Effects of Sampling on Twitter Trend Detection |
16:55-18:15 |
Nicolas Foucault and Antoine Courtin |
Automatic Classification of Tweets for Analyzing Communication Behavior of Museums |
|
Session P39 - Word Sense Disambiguation (2) |
Chair: Elisabetta Jezek |
16:55-18:15 |
Marko Bekavac and Jan Šnajder |
Graph-Based Induction of Word Senses in Croatian |
16:55-18:15 |
Richard Johansson, Yvonne Adesam, Gerlof Bouma and Karin Hedberg |
A Multi-domain Corpus of Swedish Word Sense Annotation |
16:55-18:15 |
Arantxa Otegi, Nora Aranberri, António Branco, Jan Hajic, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, João Silva and Steven Neale |
QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages |
16:55-18:15 |
Éva Mújdricza-Maydt, Silvana Hartmann, Iryna Gurevych and Anette Frank |
Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language Data |
16:55-18:15 |
Sudha Bhingardive, Rajita Shukla, Jaya Saraswati, Laxmi Kashyap, Dhirendra Singh and Pushpak Bhattacharya |
Synset Ranking of Hindi WordNet |
16:55-18:15 |
Andrey Kutuzov and Elizaveta Kuzmenko |
Neural Embedding Language Models in Semantic Clustering of Web Search Results |
|
Session P40 - Dialogue (1) |
Chair: Jens Edlund |
18:20-19:20 |
Ming Sun, Yun-Nung Chen, Zhenhao Hua, Yulian Tamres-Rudnicky, Arnab Dash and Alexander Rudnicky |
AppDialogue: Multi-App Dialogues for Intelligent Assistants |
18:20-19:20 |
Volha Petukhova, Christopher Stevens, Harmen de Weerd, Niels Taatgen, Fokie Cnossen and Andrei Malchanau |
Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus |
18:20-19:20 |
Vasily Konovalov, Ron Artstein, Oren Melamud and Ido Dagan |
The Negochat Corpus of Human-agent Negotiation Dialogues |
18:20-19:20 |
Ryuichiro Higashinaka, Kotaro Funakoshi, Yuka Kobayashi and Michimasa Inaba |
The dialogue breakdown detection challenge: Task description, datasets, and evaluation metrics |
18:20-19:20 |
Harry Bunt, Volha Petukhova, Andrei Malchanau, Kars Wijnhoven and Alex Fang |
The DialogBank |
18:20-19:20 |
Kris Liu, Jean Fox Tree and Marilyn Walker |
Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential Communication |
18:20-19:20 |
Leonardo Campillos Llanos, Dhouha Bouamor, Pierre Zweigenbaum and Sophie Rosset |
Managing Linguistic and Terminological Variation in a Medical Dialogue System |
18:20-19:20 |
Ajda Gokcen, Evan Jaffe, Johnsey Erdmann, Michael White and Douglas Danforth |
A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System |
18:20-19:20 |
Laurent Prévot, Jan Gorisch and Roxane Bertrand |
A CUP of CoFee: A large Collection of feedback Utterances Provided with communicative function annotations |
|
Session P42 - Less-Resourced Languages |
Chair: Laurette Pretorius |
18:20-19:20 |
Isabell Hubert, Antti Arppe, Jordan Lachler and Eddie Antonio Santos |
Training & Quality Assessment of an Optical Character Recognition Model for Northern Haida |
18:20-19:20 |
Dafydd Gibbon |
Legacy language atlas data mining: mapping Kru languages |
18:20-19:20 |
Kazushi Ohya |
Data Formats and Management Strategies from the Perspective of Language Resource Producers ― Personal Diachronic and Social Synchronic Data Sharing ― |
18:20-19:20 |
Henk van den Heuvel, Eric Sanders and Nicoline van der Sijs |
Curation of Dutch Regional Dictionaries |
18:20-19:20 |
Claudia Soria, Irene Russo, Valeria Quochi, Davyth Hicks, Antton Gurrutxaga, Anneli Sarhimaa and Matti Tuomisto |
Fostering digital representation of EU regional and minority languages: the Digital Language Diversity Project |
18:20-19:20 |
Delyth Prys, Gruffudd Prys and Dewi Bryn Jones |
Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar Checker |
18:20-19:20 |
Martijn Wieling, Eva Sassolini, Sebastiana Cucurullo and Simonetta Montemagni |
ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect Atlas |
18:20-19:20 |
Stephanie Strassel and Jennifer Tracey |
LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages |
18:20-19:20 |
Alina Maria Ciobanu and Liviu P. Dinu |
A Computational Perspective on the Romanian Dialects |
18:20-19:20 |
Sebastian Nordhoff, Siri Tuttle and Olga Lovick |
The Alaskan Athabascan Grammar Database |
18:20-19:20 |
Arbi Haza Nasution, Yohei Murakami and Toru Ishida |
Constraint-Based Bilingual Lexicon Induction for Closely Related Languages |
|
Session P43 - Named Entity Recognition |
Chair: Sara Tonelli |
18:20-19:20 |
Lubomir Otrusina and Pavel Smrz |
WTF-LOD - A New Resource for Large-Scale NER Evaluation |
18:20-19:20 |
Julian Bleicken, Thomas Hanke, Uta Salden and Sven Wagner |
Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus Data |
18:20-19:20 |
Milan Dojchinovski, Dinesh Reddy, Tomáš Kliegr, Tomas Vitvar and Harald Sack |
Crowdsourced Corpus with Entity Salience Annotations |
18:20-19:20 |
Sergio Oramas, Luis Espinosa Anke, Mohamed Sordo, Horacio Saggion and Xavier Serra |
ELMD: An Automatically Generated Entity Linking Gold Standard Dataset in the Music Domain |
18:20-19:20 |
Patrick Littell, David R. Mortensen, Kartik Goyal, Chris Dyer and Lori Levin |
Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik |
18:20-19:20 |
Halil Kilicoglu, Asma Ben Abacha, Yassine Mrabet, Kirk Roberts, Laritza Rodriguez, Sonya Shooshan and Dina Demner-Fushman |
Annotating Named Entities in Consumer Health Questions |
18:20-19:20 |
Adrian Brasoveanu, Lyndon J.B. Nixon, Albert Weichselbraun and Arno Scharl |
A Regional News Corpora for Contextualized Entity Discovery and Linking |
18:20-19:20 |
Martin Brümmer, Milan Dojchinovski and Sebastian Hellmann |
DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus |
18:20-19:20 |
Roald Eiselen |
Government Domain Named Entity Recognition for South African Languages |
18:20-19:20 |
Maud Ehrmann, Damien Nouvel and Sophie Rosset |
Named Entity Resources - Overview and Outlook |
18:20-19:20 |
Marcos Garcia |
Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level |
18:20-19:20 |
Octavia-Maria Şulea, Sergiu Nisioi and Liviu P. Dinu |
Using Word Embeddings to Translate Named Entities |
Day 3, Oral Sessions:
|
Session O37 - Robots and Conversational Agents Interaction |
Chairperson: Claude Barras |
9:45-10:05 |
Patrick Holthaus, Christian Leichsenring, Jasmin Bernotat, Viktor Richter, Marian Pohling, Birte Carlmeyer, Norman Köster, Sebastian Meyer zu Borgsen, René Zorn, Birte Schiffhauer, Kai Frederic Engelmann, Florian Lier, Simon Schulz, Philipp Cimiano, Friederike Eyssel, Thomas Hermann, Franz Kummert, David Schlangen, Sven Wachsmuth, Petra Wagner, Britta Wrede and Sebastian Wrede |
How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent Environment |
10:05-10:25 |
Zhichao Hu, Michelle Dick, Chung-Ning Chang, Kevin Bowden, Michael Neff, Jean Fox Tree and Marilyn Walker |
A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal Narratives |
10:25-10:45 |
Stavroula―Evita Fotinea, Eleni Efthimiou, Maria Koutsombogera, Athanasia-Lida Dimou, Theodore Goulas and Kyriaki Vasilaki |
Multimodal Resources for Human-Robot Communication Modelling |
10:45-11:05 |
Jackson Tolins, Kris Liu, Michael Neff, Marilyn Walker and Jean Fox Tree |
A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual Character |
11:05-11:25 |
Jackson Tolins, Kris Liu, Yingying Wang, Jean Fox Tree, Marilyn Walker and Michael Neff |
A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs |
|
Session O40 - Treebanks and Syntactic and Semantic Analysis |
Chairperson: Joakim Nivre |
9:45-10:05 |
Liesbeth Augustinus, Vincent Vandeghinste and Tom Vanallemeersch |
Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic Constructions |
10:05-10:25 |
Helge Dyvik, Paul Meurer, Victoria Rosén, Koenraad De Smedt, Petter Haugereid, Gyri Smørdal Losnegaard, Gunn Inger Lyse and Martha Thunes |
NorGramBank: A Deep Treebank for Norwegian |
10:25-10:45 |
Corentin Ribeyre, Eric Villemonte de la Clergerie and Djamé Seddah |
Accurate Deep Syntactic Parsing of Graphs: The Case of French |
10:45-11:05 |
Abdelati Hawwari, Mohammed Attia, Mahmoud Ghoneim and Mona Diab |
Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic |
11:05-11:25 |
|
|
|
Session O47 - Text Mining and Information Extraction |
Chairperson: Gregory Grefenstette |
14:55-15:15 |
Marieke van Erp, Pablo Mendes, Heiko Paulheim, Filip Ilievski, Julien Plu, Giuseppe Rizzo and Joerg Waitelonis |
Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better Job |
15:15-15:35 |
Daniel Preoţiuc-Pietro, P. K. Srijith, Mark Hepple and Trevor Cohn |
Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event Detection |
15:35-15:55 |
Luis Gerardo Mojica de la Vega and Vincent Ng |
Markov Logic Networks for Text Mining: A Qualitative and Empirical Comparison with Integer Linear Programming |
15:55-16:15 |
Ayman Al Zaatari, Rim El Ballouli, Shady ELbassouni, Wassim El-Hajj, Hazem Hajj, Khaled Shaban, Nizar Habash and Emad Yahya |
Arabic Corpora for Credibility Analysis |
Day 3, Poster Sessions
|
Session P44 - Corpus Creation and Querying (1) |
Chair: Cristina Bosco |
9:45-11:25 |
Anne-Kathrin Schumann and Stefan Fischer |
Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific Texts |
9:45-11:25 |
Nils Diewald, Michael Hanl, Eliza Margaretha, Joachim Bingel, Marc Kupietz, Piotr Banski and Andreas Witt |
KorAP Architecture ― Diving in the Deep Sea of Corpus Data |
9:45-11:25 |
Cyril Grouin |
Text Segmentation of Digitized Clinical Texts |
9:45-11:25 |
Elif Ahsen Acar, Deniz Zeyrek, Murathan Kurfalı and Cem Bozşahin |
A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability |
9:45-11:25 |
Steffen Remus and Chris Biemann |
Domain-Specific Corpus Expansion with Focused Webcrawling |
9:45-11:25 |
Nikola Ljubešić, Tomaž Erjavec and Darja Fišer |
Corpus-Based Diacritic Restoration for South Slavic Languages |
9:45-11:25 |
Daniel Couto-Vale, Stella Neumann and Paula Niemietz |
Automatic Recognition of Linguistic Replacements in Text Series Generated from Keystroke Logs |
9:45-11:25 |
Elena Manishina, Bassam Jabaian, Stéphane Huet and Fabrice Lefevre |
Automatic Corpus Extension for Data-driven Natural Language Generation |
9:45-11:25 |
Amal Htait, Sebastien Fournier and Patrice Bellot |
Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers |
9:45-11:25 |
Wajdi Zaghouani, Houda Bouamor, Abdelati Hawwari, Mona Diab, Ossama Obeid, Mahmoud Ghoneim, Sawsan Alqahtani and Kemal Oflazer |
Guidelines and Framework for a Large Scale Arabic Diacritized Corpus |
|
Session P46 - Information Extraction and Retrieval (3) |
Chair: Aurelie Neveol |
9:45-11:25 |
Muhammad Humayoun and Hwanjo Yu |
Analyzing Pre-processing Settings for Urdu Single-document Extractive Summarization |
9:45-11:25 |
Kata Gábor, Haifa Zargayouna, Davide Buscaldi, Isabelle Tellier and Thierry Charnois |
Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature |
9:45-11:25 |
Leon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood and Manuel Jung |
GATE-Time: Extraction of Temporal Expressions and Events |
9:45-11:25 |
Vincent Claveau and Ewa Kijak |
Distributional Thesauri for Information Retrieval and vice versa |
9:45-11:25 |
Justin Mott, Ann Bies, Zhiyi Song and Stephanie Strassel |
Parallel Chinese-English Entities, Relations and Events Corpora |
9:45-11:25 |
Tilia Ellendorff, Simon Foster and Fabio Rinaldi |
The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors |
9:45-11:25 |
Dean Fulgoni, Jordan Carpenter, Lyle Ungar and Daniel Preoţiuc-Pietro |
An Empirical Exploration of Moral Foundations Theory in Partisan News Sources |
9:45-11:25 |
Carmen Banea, Xi Chen and Rada Mihalcea |
Building a Dataset for Possessions Identification in Text |
9:45-11:25 |
Kira Griffitt and Stephanie Strassel |
The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information Retrieval |
9:45-11:25 |
Ting Liu, Kit Cho, Tomek Strzalkowski, Samira Shaikh and Mehrdad Mirzaei |
The Validation of MRCPD Cross-language Expansions on Imageability Ratings |
9:45-11:25 |
Dipawesh Pawar, Mohammed Hasanuzzaman and Asif Ekbal |
Building Tempo-HindiWordNet: A resource for effective temporal information access in Hindi |
|
Session P47 - Semantic Corpora |
Chair: Eneko Agirre |
9:45-11:25 |
Natalia Grabar and Iris Eshkol-Taravela |
Detection of Reformulations in Spoken French |
9:45-11:25 |
Rajendra Banjade and Vasile Rus |
DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in Context |
9:45-11:25 |
Kirk Roberts and Dina Demner-Fushman |
Annotating Logical Forms for EHR Questions |
9:45-11:25 |
Steven Bethard and Jonathan Parker |
A Semantically Compositional Annotation Scheme for Time Normalization |
9:45-11:25 |
Gözde Özbal, Carlo Strapparava and Serra Sinem Tekiroglu |
PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors |
9:45-11:25 |
Marianne Djemaa, Marie Candito, Philippe Muller and Laure Vieu |
Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology |
9:45-11:25 |
Anaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol and Delphine Battistelli |
Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility |
9:45-11:25 |
Laure Vieu, Philippe Muller, Marie Candito and Marianne Djemaa |
A General Framework for the Annotation of Causality Based on FrameNet |
9:45-11:25 |
Alakananda Vempala and Eduardo Blanco |
Annotating Temporally-Anchored Spatial Knowledge on Top of OntoNotes Semantic Roles |
9:45-11:25 |
Jana Götze and Johan Boye |
SpaceRef: A corpus of street-level geographic descriptions |
9:45-11:25 |
Azadeh Mirzaei and Amirsaeid Moloodi |
Persian Proposition Bank |
9:45-11:25 |
Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao and Akiko Aizawa |
Typed Entity and Relation Annotation on Computer Science Papers |
9:45-11:25 |
Volker Gast, Lennart Bierkandt, Stephan Druskat and Christoph Rzymski |
Enriching TimeBank: Towards a more precise annotation of temporal relations in a text |
|
Session P48 - Speech Processing (2) |
Chair: Denise DiPersio |
9:45-11:25 |
Imran Sheikh, Irina Illina and Dominique Fohr |
How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News |
9:45-11:25 |
Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin LU, Minglei Li, Dan Xiong, Roy Shing Yu and Vincent T.Y. Ng |
Syllable based DNN-HMM Cantonese Speech to Text System |
9:45-11:25 |
Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese and Uriel Pascal Elingui |
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof |
9:45-11:25 |
Joris Pelemans, Lyan Verwimp, Kris Demuynck, Hugo Van hamme and Patrick Wambacq |
SCALE: A Scalable Language Engineering Toolkit |
9:45-11:25 |
Sandrine Brognaux, Thomas Francois and Marco Saerens |
Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis |
9:45-11:25 |
Thomas Kisler, Uwe Reichel, Florian Schiel, Christoph Draxler, Bernhard Jackl and Nina Pörner |
BAS Speech Science Web Services - an Update of Current Developments |
9:45-11:25 |
Fernando Batista, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eugénio Ribeiro, Helena Moniz, David Martins de Matos and Ricardo Ribeiro |
SPA: Web-based Platform for easy Access to Speech Processing Modules |
9:45-11:25 |
Roberto Seara, Marta Martinez, Rocio Varela, Carmen García Mateo, Elisa Fernandez Rei and Xose Luis Regueira |
Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech |
|
Session P49 - Corpus Creation and Querying (2) |
Chair: Menzo Windhouwer |
11:45-13:25 |
Maarten Janssen |
TEITOK: Text-Faithful Annotated Corpora |
11:45-13:25 |
Mathias Schenner and Sebastian Nordhoff |
Extracting Interlinear Glossed Text from LaTeX Documents |
11:45-13:25 |
Talvany Carlotto, Zuhaitz Beloki, Xabier Artola and Aitor Soroa |
Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface |
11:45-13:25 |
Mohamed Al-Badrashiny, Arfath Pasha, Mona Diab, Nizar Habash, Owen Rambow, Wael Salloum and Ramy Eskander |
SPLIT: Smart Preprocessing (Quasi) Language Independent Tool |
11:45-13:25 |
Tanja Samardzic, Yves Scherrer and Elvira Glaser |
ArchiMob - A Corpus of Spoken Swiss German |
11:45-13:25 |
Timo Homburg and Christian Chiarcos |
Word Segmentation for Akkadian Cuneiform |
11:45-13:25 |
Cyril Grouin |
Controlled Propagation of Concept Annotations in Textual Corpora |
11:45-13:25 |
Koiti Hasida |
Graphical Annotation for Syntax-Semantics Mapping |
11:45-13:25 |
Mark Sammons, Christos Christodoulopoulos, Parisa Kordjamshidi, Daniel Khashabi, Vivek Srikumar and Dan Roth |
EDISON: Feature Extraction for NLP, Simplified |
|
Session P50 - Document Classification and Text Categorisation (2) |
Chair: Thierry Hamon |
11:45-13:25 |
Nora Al-Twairesh, Abeer Al-Dayel, Hend Al-Khalifa, Maha Al-Yahya, Sinaa Alageel, Nora Abanmy and Nouf Al-Shenaifi |
MADAD: A Readability Annotation Tool for Arabic Text |
11:45-13:25 |
Marcos Zampieri, Shervin Malmasi and Mark Dras |
Modeling Language Change in Historical Corpora: The Case of Portuguese |
11:45-13:25 |
Filip Graliński, Łukasz Borchmann and Piotr Wierzchoń |
He Said She Said ― a Male/Female Corpus of Polish |
11:45-13:25 |
Karin Sim Smith, Wilker Aziz and Lucia Specia |
Cohere: A Toolkit for Local Coherence |
11:45-13:25 |
James Ravenscroft, Anika Oellrich, Shyamasree Saha and Maria Liakata |
Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus |
11:45-13:25 |
Udochukwu Orizu and Yulan He |
Detecting Expressions of Blame or Praise in Text |
11:45-13:25 |
Stephan Tulkens, Chris Emmery and Walter Daelemans |
Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource |
11:45-13:25 |
Andre Quispersaravia and Walter Perez and Marco Sobrevilla and Fernando Alva-Manchengo |
Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish |
11:45-13:25 |
Alice Frain and Sander Wubben |
SatiricLR: a Language Resource of Satirical News Articles |
|
Session P51 - Multilingual Corpora |
Chair: Penny Labropoulou |
11:45-13:25 |
Marcus Klang and Pierre Nugues |
WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet Format |
11:45-13:25 |
David Vilares, Miguel A. Alonso and Carlos Gómez-Rodríguez |
EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis |
11:45-13:25 |
Darina Benikova and Chris Biemann |
SemRelData ― Multilingual Contextual Annotation of Semantic Relations between Nominals: Dataset and Guidelines |
11:45-13:25 |
Jérémy Ferrero, Frédéric Agnès, Laurent Besacier and Didier Schwab |
A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity Detection |
11:45-13:25 |
Younes Samih and Wolfgang Maier |
An Arabic-Moroccan Darija Code-Switched Corpus |
11:45-13:25 |
Navid Rekabsaz, Serwah Sabetghadam, Mihai Lupu, Linda Andersson and Allan Hanbury |
Standard Test Collection for English-Persian Cross-Lingual Word Sense Disambiguation |
11:45-13:25 |
Milan Dojchinovski, Felix Sasaki, Tatjana Gornostaja, Sebastian Hellmann, Erik Mannens, Frank Salliau, Michele Osella, Phil Ritchie, Giannis Stoitsis, Kevin Koidl, Markus Ackermann and Nilesh Chakraborty |
FREME: Multilingual Semantic Enrichment with Linked Data and Language Technologies |
11:45-13:25 |
Amir Hazem and Emmanuel Morin |
Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models |
11:45-13:25 |
Alexandre Berard, Christophe Servan, Olivier Pietquin and Laurent Besacier |
MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP |
11:45-13:25 |
Murad Abouammoh, Kashif Shah and Ahmet Aker |
Creation of comparable corpora for English-{Urdu, Arabic, Persian} |
11:45-13:25 |
Sergiu Nisioi, Ella Rabinovich, Liviu P. Dinu and Shuly Wintner |
A Corpus of Native, Non-native and Translated Texts |
11:45-13:25 |
Andrea Fischer, Klara Jagrova, Irina Stenger, Tania Avgustinova, Dietrich Klakow and Roland Marti |
Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility |
11:45-13:25 |
Ximena Gutierrez-Vasques, Gerardo Sierra and Isaac Hernandez Pompa |
Axolotl: a Web Accessible Parallel Corpus for Spanish-Nahuatl |
11:45-13:25 |
Özlem Çetinoğlu |
A Turkish-German Code-Switching Corpus |
11:45-13:25 |
Michael Mohler, Mary Brunson, Bryan Rink and Marc Tomlinson |
Introducing the LCC Metaphor Datasets |
11:45-13:25 |
Mona Diab, Mahmoud Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani and Mohamed Al-Badrashiny |
Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic Data |
11:45-13:25 |
Laurence Meurant, Maxime Gobert and Anthony Cleve |
Modelling a Parallel Corpus of French and French Belgian Sign Language |
11:45-13:25 |
Ines Cebović and Marko Tadić |
Building the Macedonian-Croatian Parallel Corpus |
11:45-13:25 |
Vladimír Benko |
Two Years of Aranea: Increasing Counts and Tuning the Pipeline |
11:45-13:25 |
Ichiro Umata, Koki Ijuin, Mitsuru Ishida, Moe Takeuchi and Seiichi Yamamoto |
Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 Conversations |
11:45-13:25 |
Karen Jones, Stephanie Strassel, Kevin Walker, David Graff and Jonathan Wright |
Multi-language Speech Collection for NIST LRE |
|
Session P53 - Dialogue (2) |
Chair: Thorsten Trippel |
14:55-16:15 |
Morena Danieli, Balamurali A R, Evgeny Stepanov, Benoit Favre, Frederic Bechet and Giuseppe Riccardi |
Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations |
14:55-16:15 |
Hanae Koiso, Tomoyuki Tsuchiya, Ryoko Watanabe, Daisuke Yokomori, Masao Aizawa and Yasuharu Den |
Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation |
14:55-16:15 |
Kalin Stefanov and Jonas Beskow |
A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction |
14:55-16:15 |
Rob Abbott, Brian Ecker, Pranav Anand and Marilyn Walker |
Internet Argument Corpus 2.0: An SQL schema for Dialogic Social Media and the Corpora to go with it |
14:55-16:15 |
Emer Gilmartin and Nick Campbell |
Capturing Chat: Annotation and Tools for Multiparty Casual Conversation. |
|
Session P54 - LR Infrastructures and Architectures (2) |
Chair: Koiti Hasida |
14:55-16:15 |
Pawel Kamocki and Jim O'Regan |
Privacy Issues in Online Machine Translation Services - European Perspective |
14:55-16:15 |
Christian Chiarcos, Christian Fäth, Heike Renner-Westermann, Frank Abromeit and Vanya Dimitrova |
Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open Data |
14:55-16:15 |
Ngoc Nguyen, Donghui Lin, Takao Nakaguchi and Toru Ishida |
Towards a Language Service Infrastructure for Mobile Environments |
14:55-16:15 |
Maristella Agosti, Emanuele Di Buccio, Giorgio Maria Di Nunzio, Cecilia Poletto and Esther Rinke |
Designing A Long Lasting Linguistic Project: The Case Study of ASIt |
14:55-16:15 |
Damir Cavar, Malgorzata Cavar and Lwin Moe |
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA) |
14:55-16:15 |
Stephan Druskat, Volker Gast, Thomas Krause and Florian Zipser |
corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic Corpora |
14:55-16:15 |
Roland Schäfer |
CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws |
14:55-16:15 |
Ioannis Manousos Katakis, Georgios Petasis and Vangelis Karkaletsis |
CLARIN-EL Web-based Annotation Tool |
14:55-16:15 |
Mathijs Kattenberg, Zuhaitz Beloki, Aitor Soroa, Xabier Artola, Antske Fokkens, Paul Huygen and Kees Verstoep |
Two Architectures for Parallel Processing of Huge Amounts of Text |
14:55-16:15 |
Steve Cassidy |
Publishing the Trove Newspaper Corpus |
14:55-16:15 |
Vladimir Popescu, Lin Liu, Riccardo Del Gratta, Khalid Choukri and Nicoletta Calzolari |
New Developments in the LRE Map |
|
Session P55 - Large Projects and Infrastructures (2) |
Chair: Dieter Van Uytvanck |
14:55-16:15 |
Jens Edlund and Joakim Gustafson |
Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives |
14:55-16:15 |
Valérie Mapelli, Vladimir Popescu, Lin Liu, Meritxell Fernández Barrera and Khalid Choukri |
The ELRA License Wizard |
14:55-16:15 |
Thibault Grouas, Valérie Mapelli and Quentin Samier |
Review on the Existing Language Resources for Languages of France |
14:55-16:15 |
Christopher Cieri, Mike Maxwell, Stephanie Strassel and Jennifer Tracey |
Selection Criteria for Low Resource Language Programs |
14:55-16:15 |
Meritxell Fernández Barrera, Vladimir Popescu, Antonio Toral, Federico Gaspari and Khalid Choukri |
Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and Opportunities |
|
Session P56 - Semantics (2) |
Chair: Yoshihiko Hayashi |
14:55-16:15 |
Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang |
Nine Features in a Random Forest to Learn Taxonomical Semantic Relations |
14:55-16:15 |
Enrico Santus, Alessandro Lenci, Tin-Shing Chiu, Qin Lu and Chu-Ren Huang |
What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets |
14:55-16:15 |
Marco Del Tredici and Nuria Bel |
Assessing the Potential of Metaphoricity of verbs using corpus data |
14:55-16:15 |
Mathieu Lafourcade and Lionel Ramadier |
Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports |
14:55-16:15 |
Liu Hongchao, Karl Neergaard, Enrico Santus and Chu-Ren Huang |
EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMs |
14:55-16:15 |
Maaz Anwar and Dipti Sharma |
Towards Building Semantic Role Labeler for Indian Languages |
14:55-16:15 |
Tanja Samardzic and Maja Miličević |
A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from Corpora |
14:55-16:15 |
Piroska Lendvai, Isabelle Augenstein, Kalina Bontcheva and Thierry Declerck |
Monolingual Social Media Datasets for Detecting Contradiction and Entailment |
14:55-16:15 |
James Pustejovsky and Nikhil Krishnaswamy |
VoxML: A Visualization Modeling Language |
14:55-16:15 |
Takehiro Teraoka |
Metonymy Analysis Using Associative Relations between Words |
14:55-16:15 |
Travis Goodwin and Sanda Harabagiu |
Embedding Open-domain Common-sense Knowledge from Text |
14:55-16:15 |
Eneldo Loza Mencía, Gerard de Melo and Jinseok Nam |
Medical Concept Embeddings via Labeled Background Corpora |
14:55-16:15 |
Corentin Dumont, Ran Tian and Kentaro Inui |
Question-Answering with Logic Specific to Video Games |
|
Session P57 - Speech Corpora and Databases (2) |
Chair: Satoshi Nakamura |
14:55-16:15 |
Arne Köhn, Florian Stegen and Timo Baumann |
Mining the Spoken Wikipedia for Speech Data and Beyond |
14:55-16:15 |
Robert Herms, Laura Seelig, Stefanie Münch and Maximilian Eibl |
A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR Evaluation |
14:55-16:15 |
Koichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama and Hiroshi G. Okuno |
Parallel Speech Corpora of Japanese Dialects |
14:55-16:15 |
Christine Meunier, Cecile Fougeron, Corinne Fredouille, Brigitte Bigi, Lise Crevier-Buchman, Elisabeth Delais-Roussarie, Laurianne Georgeton, Alain Ghio, Imed Laaridh, Thierry Legou, Claire Pillot-Loiseau and Gilles Pouchoulin |
The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous Styles |
14:55-16:15 |
Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits Van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel and David van Leeuwen |
A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching Research |
14:55-16:15 |
Andrej Zgank, Mirjam Sepesy Maucec and Darinka Verdonik |
The SI TEDx-UM speech database: a new Slovenian Spoken Language Resource |
14:55-16:15 |
Yurie Iribe, Norihide Kitaoka and Shuhei Segawa |
Speech Corpus Spoken by Young-old, Old-old and Oldest-old Japanese |
14:55-16:15 |
Agnieszka Wagner, Katarzyna Klessa and Jolanta Bachan |
Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm Analysis |
14:55-16:15 |
Peter Viszlay, Ján Staš, Tomáš Koctúr, Martin Lojka and Jozef Juhár |
An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic Annotation |
14:55-16:15 |
Malgorzata Cavar, Damir Cavar, Dov-Ber Kerler and Anya Quilitzsch |
Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM Project |
|
|