|
SESSIONS: Browse articles of the conference by session
DAY 1 Oral Sessions
|
Session O3 - Corpus Annotation & Tagging |
Chair: Tomaž Erjavec |
11.35-11.55 |
Oliver Hellwig, Heinrich Hettrich, Ashutosh Modi and Manfred Pinkal |
Multi-layer Annotation of the Rigveda
|
11.55-12.15 |
Richard Futrell, Edward Gibson, Harry J. Tily, Idan Blank, Anastasia Vishnevetsky, Steven Piantadosi and Evelina Fedorenko |
The Natural Stories Corpus
|
12.15-12.35 |
Younggyun Hahm, Jiseong Kim, Sunggoo Kwon and KEY-SUN CHOI |
Semi-automatic Korean FrameNet Annotation over KAIST Treebank
|
12.35-12.55 |
Géraldine Damnati, Jérémy Auguste, Alexis Nasr, Delphine Charlet, Johannes Heinecke and Frédéric Béchet |
Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text
|
12.55-13.15 |
Kareem Darwish, Hamdy Mubarak, Ahmed Abdelali, Mohamed Eldesouki, Younes Samih, Randah Alharbi, Mohammed Attia, Walid Magdy and Laura Kallmeyer |
Multi-Dialect Arabic POS Tagging: A CRF Approach
|
|
Session O4 - Dialogue |
Chair: Anna Rumshinsky |
11.35-11.55 |
Sushant Kafle and Matt Huenerfauth |
A Corpus for Modeling Word Importance in Spoken Dialogue Transcripts
|
11.55-12.15 |
David Traum, Cassidy Henry, Stephanie Lukin, Ron Artstein, Felix Gervits, Kimberly Pollard, Claire Bonial, Su Lei, Clare Voss, Matthew Marge, Cory Hayes and Susan Hill |
Dialogue Structure Annotation for Multi-Floor Interaction
|
12.15-12.35 |
Matthias Kraus, Johannes Kraus, Martin Baumann and Wolfgang Minker |
Effects of Gender Stereotypes on Trust and Likability in Spoken Human-Robot Interaction
|
12.35-12.55 |
Dimosthenis Kontogiorgos, Vanya Avramova, Simon Alexandersson, Patrik Jonell, Catharine Oertel, Jonas Beskow, Gabriel Skantze and Joakim Gustafson |
A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction
|
12.55-13.15 |
AbdelRahim Elmadany, Sherif Abdou and Mervat Gheith |
Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level
|
|
Session O5 - Language Resource Policies & Management |
Chair: Stelios Piperidis |
14.35-14.55 |
Pawel Kamocki, Valérie Mapelli and Khalid Choukri |
Data Management Plan (DMP) for Language Data under the New General Da-ta Protection Regulation (GDPR)
|
14.55-15.15 |
António Branco
|
We Are Depleting Our Research Subject as We Are Investigating It: In Language Technology, more Replication and Diversity Are Needed
|
15.15-15.35 |
Thorsten Trippel and Claus Zinn |
Lessons Learned: On the Challenges of Migrating a Research Data Repository from a Research Institution to a University Library.
|
15.35-15.55 |
Christopher Cieri, James Fiumara, Mark Liberman, Chris Callison-Burch and Jonathan Wright |
Introducing NIEUW: Novel Incentives and Workflows for Eliciting Linguistic Data
|
15.55-16.15 |
K. Bretonnel Cohen, Jingbo Xia, Pierre Zweigenbaum, Tiffany Callahan, Orin Hargraves, Foster Goss, Nancy Ide, Aurélie Névéol, Cyril Grouin and Lawrence E. Hunter |
Three Dimensions of Reproducibility in Natural Language Processing
|
|
Session O8 - Corpus Creation, Use & Evaluation (1) |
Chair: Patrizia Paggio |
14.35-14.55 |
Fan Xu, Mingwen Wang and Maoxi Li |
Building Parallel Monolingual Gan Chinese Dialects Corpus
|
14.55-15.15 |
Shachar Mirkin, Michal Jacovi, Tamar Lavee, Hong-Kwang Kuo, Samuel Thomas, Leslie Sager, Lili Kotlerman, Elad Venezian and Noam Slonim |
A Recorded Debating Dataset
|
15.15-15.35 |
Kyoko Sugisaki, Nicolas Wiedmer and Heiko Hausendorf |
Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging
|
15.35-15.55 |
Marcos García Salido, Marcos Garcia, Milka Villayandre-Llamazares and Margarita Alonso-Ramos |
A Lexical Tool for Academic Writing in Spanish based on Expert and Novice Corpora
|
15.55-16.15 |
Adrian Brasoveanu, Giuseppe Rizzo, Philipp Kuntschick, Albert Weichselbraun and Lyndon J.B. Nixon |
Framing Named Entity Linking Error Types
|
|
Session O9 - Bio-medical Corpora |
Chair: Paul Rayson |
16.35-16.55 |
Kirk Roberts, Yuqi Si, Anshul Gandhi and Elmer Bernstam |
A FrameNet for Cancer Information in Clinical Narratives: Schema and Annotation
|
16.55-17.15 |
Matthew Shardlow, Nhung Nguyen, Gareth Owen, Claire O'Donovan, Andrew Leach, John McNaught, Steve Turner and Sophia Ananiadou |
A New Corpus to Support Text Mining for the Curation of Metabolites in the ChEBI Database
|
17.15-17.35 |
Aurélie Névéol, Antonio Jimeno Yepes, Mariana Neves and Karin Verspoor |
Parallel Corpora for the Biomedical Domain
|
17.35-17.55 |
Markus Zlabinger, Linda Andersson, Allan Hanbury, Michael Andersson, Vanessa Quasnik and Jon Brassey |
Medical Entity Corpus with PICO elements and Sentiment Analysis
|
DAY 1 Poster Sessions
|
Session P1 - Anaphora, Coreference |
Chair: Scott Piao |
11.35-13.15 |
Montserrat Marimon, Lluís Padró and Jordi Turmo |
Coreference Resolution in FreeLing 4.0
|
11.35-13.15 |
Ina Roesiger
|
BASHI: A Corpus of Wall Street Journal Articles Annotated with Bridging Links
|
11.35-13.15 |
Bruno Oberle
|
SACR: A Drag-and-Drop Based Tool for Coreference Annotation
|
11.35-13.15 |
Bartłomiej Nitoń, Paweł Morawiecki and Maciej Ogrodniczuk |
Deep Neural Networks for Coreference Resolution for Polish
|
11.35-13.15 |
Veronika Vincze, Klára Hegedűs, Alex Sliz-Nagy and Richárd Farkas |
SzegedKoref: A Hungarian Coreference Corpus
|
11.35-13.15 |
Wasi Ahmad and Kai-Wei Chang |
A Corpus to Learn Refer-to-as Relations for Nominals
|
11.35-13.15 |
Julien Plu, Roman Prokofyev, Alberto Tonon, Philippe Cudré-Mauroux, Djellel Eddine Difallah, Raphael Troncy and Giuseppe Rizzo |
Sanaphor++: Combining Deep Neural Networks with Semantics for Coreference Resolution
|
11.35-13.15 |
Loïc Grobol, Isabelle Tellier, Eric De La Clergerie, Marco Dinarelli and Frédéric Landragin |
ANCOR-AS: Enriching the ANCOR Corpus with Syntactic Annotations
|
11.35-13.15 |
Ekaterina Lapshinova-Koltunski, Christian Hardmeier and Pauline Krielke |
ParCorFull: a Parallel Corpus Annotated with Full Coreference
|
|
Session P2 - Collaborative Resource Construction & Crowdsourcing |
Chair: Asad Sayeed |
11.35-13.15 |
Bartosz Ziółko, Piotr Żelasko, Ireneusz Gawlik, Tomasz Pędzimąż and Tomasz Jadczyk |
An Application for Building a Polish Telephone Speech Corpus
|
11.35-13.15 |
Shinnosuke Takamichi and Hiroshi Saruwatari |
CPJD Corpus: Crowdsourced Parallel Speech Corpus of Japanese Dialects
|
11.35-13.15 |
Kevin Yancey and Yves Lepage |
Korean L2 Vocabulary Prediction: Can a Large Annotated Corpus be Used to Train Better Models for Predicting Unknown Words?
|
11.35-13.15 |
Adeline Granet, Benjamin Hervy, Geoffrey Roman-Jimenez, Marouane Hachicha, Emmanuel Morin, Harold Mouchère, Solen Quiniou, Guillaume Raschia, Françoise Rubellin and Christian Viard-Gaudin |
Crowdsourcing-based Annotation of the Accounting Registers of the Italian Comedy
|
11.35-13.15 |
Leonidas Lefakis, Alan Akbik and Roland Vollgraf |
FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German
|
11.35-13.15 |
Alice Millour and Karën Fort |
Toward a Lightweight Solution for Less-resourced Languages: Creating a POS Tagger for Alsatian Using Voluntary Crowdsourcing
|
11.35-13.15 |
Akihiro Katsuta and Kazuhide Yamamoto |
Crowdsourced Corpus of Sentence Simplification with Core Vocabulary
|
11.35-13.15 |
Iris Hendrickx, Eirini Takoulidou, Thanasis Naskos, Katia Lida Kermanidis, Vilelmini Sosoni, Hugo De Vos, Maria Stasimioti, Menno Van Zaanen, Panayota Georgakopoulou, Valia Kordoni, Maja Popovic, Markus Egg and Antal Van den Bosch |
A Multilingual Wikified Data Set of Educational Material
|
11.35-13.15 |
Amarsanaa Ganbold, Altangerel Chagnaa and Gábor Bella |
Using Crowd Agreement for Wordnet Localization
|
11.35-13.15 |
Vilelmini Sosoni, Katia Lida Kermanidis, Maria Stasimioti, Thanasis Naskos, Eirini Takoulidou, Menno Van Zaanen, Sheila Castilho, Panayota Georgakopoulou, Valia Kordoni and Markus Egg |
Translation Crowdsourcing: Creating a Multilingual Corpus of Online Educational Content
|
11.35-13.15 |
Yo Ehara
|
Building an English Vocabulary Knowledge Dataset of Japanese English-as-a-Second-Language Learners Using Crowdsourcing
|
|
Session P3 - Information Extraction, Information Retrieval, Text Analytics (1) |
Chair: Hikaru Yokono |
11.35-13.15 |
Linrui Zhang and Dan Moldovan |
Chinese Relation Classification using Long Short Term Memory Networks
|
11.35-13.15 |
Binyang Li, Jun Xiang, Le Chen, Xu Han, Xiaoyan Yu, Ruifeng Xu, Tengjiao Wang and Kam-Fai Wong |
The UIR Uncertainty Corpus for Chinese: Annotating Chinese Microblog Corpus for Uncertainty Identification from Social Media
|
11.35-13.15 |
Tao Ge, Lei Cui, Baobao Chang, Zhifang Sui, Furu Wei and Ming Zhou |
EventWiki: A Knowledge Base of Major Events
|
11.35-13.15 |
Anna Koroleva and Patrick Paroubek |
Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)
|
11.35-13.15 |
Ryusei Matsumoto, Minoru Yoshida, Kazuyuki Matsumoto, Hironobu Matsuda and Kenji Kita |
Visualization of the occurrence trend of infectious diseases using Twitter
|
11.35-13.15 |
Matej Martinc and Senja Pollak |
Reusable workflows for gender prediction
|
11.35-13.15 |
Armin Hoenen and Niko Schenk |
Knowing the Author by the Company His Words Keep
|
11.35-13.15 |
Andrea Zielinski and Peter Mutschke |
Towards a Gold Standard Corpus for Variable Detection and Linking in Social Science Publications
|
11.35-13.15 |
Jannik Strötgen, Anne-Lyse Minard, Lukas Lange, Manuela Speranza and Bernardo Magnini |
KRAUTS: A German Temporally Annotated News Corpus
|
|
Session P4 - Infrastructural Issues/Large Projects (1) |
Chair: Denise Di Persio |
11.35-13.15 |
Daniel Khashabi, Mark Sammons, Ben Zhou, Tom Redman, Christos Christodoulopoulos, Vivek Srikumar, Nickolas Rizzolo, Lev Ratinov, Guanheng Luo, Quang Do, Chen-Tse Tsai, Subhro Roy, Stephen Mayhew, Zhili Feng, John Wieting, Xiaodong Yu, Yangqiu Song, Shashank Gupta, Shyam Upadhyay, Naveen Arivazhagan, Qiang Ning, Shaoshi Ling and Dan Roth |
CogCompNLP: Your Swiss Army Knife for NLP
|
11.35-13.15 |
Jan Nehring and Felix Sasaki |
A Framework for the Needs of Different Types of Users in Multilingual Semantic Enrichment
|
11.35-13.15 |
Roberto Bartolini, Sara Goggi, Monica Monachini and Gabriella Pardelli |
The LREC Workshops Map
|
11.35-13.15 |
Markus Gärtner, Uli Hahn and Sibylle Hermann |
Preserving Workflow Reproducibility: The RePlay-DH Client as a Tool for Process Documentation
|
11.35-13.15 |
Christian Chiarcos and Niko Schenk |
The ACoLi CoNLL Libraries: Beyond Tab-Separated Values
|
11.35-13.15 |
Balázs Indig, András Simonyi and Noémi Ligeti-Nagy |
What's Wrong, Python? -- A Visual Differ and Graph Library for NLP in Python
|
|
Session P6 - Opinion Mining / Sentiment Analysis (1) |
Chair: Cristina Bosco |
11.35-13.15 |
Michael Wiegand, Sylvette Loda and Josef Ruppenhofer |
Disambiguation of Verbal Shifters
|
11.35-13.15 |
Luwen Huangfu and Mihai Surdeanu |
Bootstrapping Polar-Opposite Emotion Dimensions from Online Reviews
|
11.35-13.15 |
Pavithra Rajendran, Danushka Bollegala and Simon Parsons |
Sentiment-Stance-Specificity (SSS) Dataset: Identifying Support-based Entailment among Opinions.
|
11.35-13.15 |
Rama Rohit Reddy Gangula and Radhika Mamidi |
Resource Creation Towards Automated Sentiment Analysis in Telugu (a low resource language) and Integrating Multiple Domain Sources to Enhance Sentiment Prediction
|
11.35-13.15 |
Mohammed Attia, Younes Samih, Ali Elkahky and Laura Kallmeyer |
Multilingual Multi-class Sentiment Classification Using Convolutional Neural Networks
|
11.35-13.15 |
Mikhail Khodak, Nikunj Saunshi and Kiran Vodrahalli |
A Large Self-Annotated Corpus for Sarcasm
|
11.35-13.15 |
Akari Asai, Sara Evensen, Behzad Golshan, Alon Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan and Yinzhan Xu |
HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments
|
11.35-13.15 |
Jeremy Barnes, Toni Badia and Patrik Lambert |
MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification
|
|
Session P9 - Conversational Systems/Dialogue/Chatbots/Human-Robot Interaction (1) |
Chair: Leo Wanner |
14.35-16.15 |
Patrik Jonell, Catharine Oertel, Dimosthenis Kontogiorgos, Jonas Beskow and Joakim Gustafson |
Crowdsourced Multimodal Corpora Collection Tool
|
14.35-16.15 |
Juliana Miehle, Nadine Gerstenlauer, Daniel Ostler, Hubertus Feußner, Wolfgang Minker and Stefan Ultes |
Expert Evaluation of a Spoken Dialogue System in a Clinical Operating Room
|
14.35-16.15 |
Kiyoaki Shirai and Tomotaka Fukuoka |
JAIST Annotated Corpus of Free Conversation
|
14.35-16.15 |
Volha Petukhova, Andrei Malchanau, Youssef Oualil, Dietrich Klakow, Saturnino Luz, Fasih Haider, Nick Campbell, Dimitris Koryzis, Dimitris Spiliotopoulos, Pierre Albert, Nicklas Linz and Jan Alexandersson |
The Metalogue Debate Trainee Corpus: Data Collection and Annotations
|
14.35-16.15 |
Andrei Malchanau, Volha Petukhova and Harry Bunt |
Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it
|
14.35-16.15 |
Andreas Liesenfeld
|
MYCanCor: A Video Corpus of spoken Malaysian Cantonese
|
14.35-16.15 |
Todd Shore, Theofronia Androulakaki and Gabriel Skantze |
KTH Tangrams: A Dataset for Research on Alignment and Conceptual Pacts in Task-Oriented Dialogue
|
14.35-16.15 |
Louisa Pragst, Niklas Rach, Wolfgang Minker and Stefan Ultes |
On the Vector Representation of Utterances in Dialogue Context
|
14.35-16.15 |
Laura García-Sardiña, Manex Serras and Arantza Del Pozo |
ES-Port: a Spontaneous Spoken Human-Human Technical Support Corpus for Dialogue Research in Spanish
|
14.35-16.15 |
Soumia Dermouche and Catherine Pelachaud |
From analysis to modeling of engagement as sequences of multimodal behaviors
|
|
Session P12 - Machine Translation, SpeechToSpeech Translation (1) |
Chair: Laurent Besacier |
14.35-16.15 |
Inigo Jauregi Unanue, Lierni Garmendia Arratibel, Ehsan Zare Borzeshi and Massimo Piccardi |
English-Basque Statistical and Neural Machine Translation
|
14.35-16.15 |
Vivien Macketanz, Renlong Ai, Aljoscha Burchardt and Hans Uszkoreit |
TQ-AutoTest – An Automated Test Suite for (Machine) Translation Quality
|
14.35-16.15 |
Yang Zhao, Jiajun Zhang and Chengqing Zong |
Exploiting Pre-Ordering for Neural Machine Translation
|
14.35-16.15 |
Gyu Hyeon Choi, Jong Hun Shin and Young Kil Kim |
Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
|
14.35-16.15 |
Zi-Yi Dou, Hao Zhou, Shu-Jian Huang, Xin-Yu Dai and Jia-Jun Chen |
Dynamic Oracle for Neural Machine Translation in Decoding Phase
|
14.35-16.15 |
Xiaoqing Li, Jiajun Zhang and Chengqing Zong |
One Sentence One Model for Neural Machine Translation
|
14.35-16.15 |
Go Inoue, Nizar Habash, Yuji Matsumoto and Hiroyuki Aoyama |
A Parallel Corpus of Arabic-Japanese News Articles
|
14.35-16.15 |
Marzieh Fadaee, Arianna Bisazza and Christof Monz |
Examining the Tip of the Iceberg: A Data Set for Idiom Translation
|
14.35-16.15 |
Mihael Arcan, Elena Montiel-Ponsoda, John Philip McCrae and Paul Buitelaar |
Automatic Enrichment of Terminological Resources: the IATE RDF Example
|
14.35-16.15 |
Winston Wu and David Yarowsky |
A Comparative Study of Extremely Low-Resource Transliteration of the World’s Languages
|
14.35-16.15 |
Adarsh Kumar, Sandipan Dandapat and Sushil Chordia |
Translating Web Search Queries into Natural Language Questions
|
|
Session P13 - Semantics (1) |
Chair: Kyoko Kanzaki |
14.35-16.15 |
Yuya Sakaizawa and Mamoru Komachi |
Construction of a Japanese Word Similarity Dataset
|
14.35-16.15 |
Olga Majewska, Diana McCarthy, Ivan Vulić and Anna Korhonen |
Acquiring Verb Classes Through Bottom-Up Semantic Verb Clustering
|
14.35-16.15 |
Haoyue Shi, Xihao Wang, Yuqi Sun and Junfeng Hu |
Constructing High Quality Sense-specific Corpus and Word Embedding via Unsupervised Elimination of Pseudo Multi-sense
|
14.35-16.15 |
Samar Haider
|
Urdu Word Embeddings
|
14.35-16.15 |
Mika Hasegawa, Tetsunori Kobayashi and Yoshihiko Hayashi |
Social Image Tags as a Source of Word Embeddings: A Task-oriented Evaluation
|
14.35-16.15 |
Rafael Anchiêta and Thiago Pardo |
Towards AMR-BR: A SemBank for Brazilian Portuguese Language
|
14.35-16.15 |
Scott Piao, Paul Rayson, Dawn Knight and Gareth Watkins |
Towards a Welsh Semantic Annotation System
|
14.35-16.15 |
Gabriel Marzinotto, Jeremy Auguste, Frederic Bechet, Géraldine Damnati and Alexis Nasr |
Semantic Frame Parsing for Information Extraction : the CALOR corpus
|
14.35-16.15 |
Kathleen Ahrens, Huiheng Zeng and Shun-han Rebekah Wong |
Using a Corpus of English and Chinese Political Speeches for Metaphor Analysis
|
14.35-16.15 |
João Sequeira, Teresa Gonçalves, Paulo Quaresma, Amália Mendes and Iris Hendrickx |
A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language
|
|
Session P14 - Word Sense Disambiguation |
Chair: Maite Melero |
14.35-16.15 |
Rui Suzuki, Kanako Komiya, Masayuki Asahara, Minoru Sasaki and Hiroyuki Shinnou |
All-words Word Sense Disambiguation Using Concept Embeddings
|
14.35-16.15 |
Stefano Melacci, Achille Globo and Leonardo Rigutini |
Enhancing Modern Supervised Word Sense Disambiguation Models by Semantic Lexical Resources
|
14.35-16.15 |
Dmitry Ustalov, Denis Teslenko, Alexander Panchenko, Mikhail Chersnoskutov, Chris Biemann and Simone Paolo Ponzetto |
An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages
|
14.35-16.15 |
Kijong Han, Sangha Nam, Jiseong Kim, Younggyun Hahm and KEY-SUN CHOI |
Unsupervised Korean Word Sense Disambiguation using CoreNet
|
14.35-16.15 |
Loïc Vial, Benjamin Lecouteux and Didier Schwab |
UFSAC: Unification of Sense Annotated Corpora and Tools
|
14.35-16.15 |
Steffen Remus and Chris Biemann |
Retrofitting Word Representations for Unsupervised Sense Aware Word Similarities
|
14.35-16.15 |
Tolga Uslu, Alexander Mehler, Daniel Baumartz, Alexander Henlein and Wahed Hemati |
FastSense: An Efficient Word Sense Disambiguation Classifier
|
|
Session P15 - Annotation Methods and Tools |
Chair: Ron Artstein |
16.35-17.55 |
Angus Forbes, Kristine Lee, Gus Hahn-Powell, Marco A. Valenzuela-Escarcega and Mihai Surdeanu |
Text Annotation Graphs: Annotating Complex Natural Language Phenomena
|
16.35-17.55 |
Arianne Reimerink and Pilar León-Araúz |
Manzanilla: An Image Annotation Tool for TKB Building
|
16.35-17.55 |
Rashel Fam and Yves Lepage |
Tools for The Production of Analogical Grids and a Resource of N-gram Analogical Grids in 11 Languages
|
16.35-17.55 |
Costanza Navarretta
|
The Automatic Annotation of the Semiotic Type of Hand Gestures in Obama' s Humorous Speeches
|
16.35-17.55 |
Fahad AlGhamdi and Mona Diab |
WASA: A Web Application for Sequence Annotation
|
16.35-17.55 |
Makoto Yamazaki, Yumi Miyazaki and Wakako Kashino |
Annotation and Quantitative Analysis of Speaker Information in Novel Conversation Sentences in Japanese
|
16.35-17.55 |
Hiroyuki Shindo, Yohei Munesada and Yuji Matsumoto |
PDFAnno: a Web-based Linguistic Annotation Tool for PDF Documents
|
16.35-17.55 |
Markus Gärtner and Jonas Kuhn |
A Lightweight Modeling Middleware for Corpus Processing
|
16.35-17.55 |
Adeline Nazarenko, Francois Levy and Adam Wyner |
An Annotation Language for Semantic Search of Legal Sources
|
16.35-17.55 |
Chantal Van Son, Oana Inel, Roser Morante, Lora Aroyo and Piek Vossen |
Resource Interoperability for Sustainable Benchmarking: The Case of Events
|
16.35-17.55 |
Salar Mohtaj, Behnam Roshanfekr, Atefeh Zafarian and Habibollah Asghari |
Parsivar: A Language Processing Toolkit for Persian
|
16.35-17.55 |
Erwan Moreau and Carl Vogel |
Multilingual Word Segmentation: Training Many Language-Specific Tokenizers Smoothly Thanks to the Universal Dependencies Corpus
|
16.35-17.55 |
Hamdy Mubarak
|
Build Fast and Accurate Lemmatization for Arabic
|
|
Session P16 - Corpus Creation, Annotation, Use (1) |
Chair: Prokopis Prokopidis |
16.35-17.55 |
Reid Pryzant, Youngjoo Chung, Dan Jurafsky and Denny Britz |
JESC: Japanese-English Subtitle Corpus
|
16.35-17.55 |
Ricelli Ramos, Georges Neto, Barbara Silva, Danielle Monteiro, Ivandré Paraboni and Rafael Dias |
Building a Corpus for Personality-dependent Natural Language Understanding and Generation
|
16.35-17.55 |
Marijn Schraagen, Feike Dietz and Marjo Van Koppen |
Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters
|
16.35-17.55 |
Takumi Maruyama and Kazuhide Yamamoto |
Simplified Corpus with Core Vocabulary
|
16.35-17.55 |
Shilei Huang and Jiangqin Wu |
A Pragmatic Approach for Classical Chinese Word Segmentation
|
16.35-17.55 |
Sandeep Mathias and Pushpak Bhattacharyya |
ASAP++: Enriching the ASAP Automated Essay Grading Dataset with Essay Attribute Scores
|
16.35-17.55 |
Behnam Sabeti, Hossein Abedi Firouzjaee, Ali Janalizadeh Choobbasti, Seyed hani elamahdi Mortazavi Najafabadi and Amir Vaheb |
MirasText: An Automatically Generated Text Corpus for Persian
|
16.35-17.55 |
Verginica Barbu Mititelu, Dan Tufiș and Elena Irimia |
The Reference Corpus of the Contemporary Romanian Language (CoRoLa)
|
16.35-17.55 |
Sarah Masud Preum, Md. Rizwan Parvez, Kai-Wei Chang and John Stankovic |
A Corpus of Drug Usage Guidelines Annotated with Type of Advice
|
16.35-17.55 |
Maria Mitrofan and Dan Tufis |
BioRo: The Biomedical Corpus for the Romanian Language
|
|
Session P17 - Emotion Recognition/Generation |
Chair: Lluís Padró |
16.35-17.55 |
Ian Wood, John Philip McCrae, Vladimir Andryushechkin and Paul Buitelaar |
A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set
|
16.35-17.55 |
Ankush Khandelwal, Sahil Swami, Syed Sarfaraz Akhtar and Manish Shrivastava |
Humor Detection in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System
|
16.35-17.55 |
Koichiro Yoshino, Yoko Ishikawa, Masahiro Mizukami, Yu Suzuki, Sakriani Sakti and Satoshi Nakamura |
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing
|
16.35-17.55 |
Ramy Eskander
|
SentiArabic: A Sentiment Analyzer for Standard Arabic
|
16.35-17.55 |
Dmitrii Fedotov, Denis Ivanko, Maxim Sidorov and Wolfgang Minker |
Contextual Dependencies in Time-Continuous Multidimensional Affect Recognition
|
16.35-17.55 |
Saif Mohammad and Svetlana Kiritchenko |
WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art
|
16.35-17.55 |
Paul Rodrigues, Valerie Novak, C. Anton Rytting, Julie Yelle and Jennifer Boutz |
Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction
|
16.35-17.55 |
Shabnam Tafreshi and Mona Diab |
Sentence and Clause Level Emotion Annotation, Detection, and Classification in a Multi-Genre Corpus
|
|
Session P19 - LR Infrastructures and Architectures |
Chair: Dieter van Uytvanck |
16.35-17.55 |
Riccardo Del Gratta, Sara Goggi, Gabriella Pardelli and Nicoletta Calzolari |
LREMap, a Song of Resources and Evaluation
|
16.35-17.55 |
Henk Van den Heuvel, Erwin Komen and Nelleke Oostdijk |
Metadata Collection Records for Language Resources
|
16.35-17.55 |
Stelios Piperidis, Penny Labropoulou, Miltos Deligiannis and Maria Giagkou |
Managing Public Sector Data for Multilingual Applications Development
|
16.35-17.55 |
Erhard Hinrichs, Nancy Ide, James Pustejovsky, Jan Hajic, Marie Hinrichs, Mohammad Fazleh Elahi, Keith Suderman, Marc Verhagen, Kyeongmin Rim, Pavel Stranak and Jozef Misutka |
Bridging the LAPPS Grid and CLARIN
|
16.35-17.55 |
Shu-Kai HSIEH, Yu-Hsiang Tseng, Chi-Yao Lee and Chiung-Yu Chiang |
Fluid Annotation: A Granularity-aware Annotation Tool for Chinese Word Fluidity
|
16.35-17.55 |
Tamás Váradi, Eszter Simon, Bálint Sass, Iván Mittelholcz, Attila Novák, Balázs Indig, Richárd Farkas and Veronika Vincze |
E-magyar -- A Digital Language Processing System
|
16.35-17.55 |
Andreas Niekler, Arnim Bleier, Christian Kahmann, Lisa Posch, Gregor Wiedemann, Kenan Erdogan, Gerhard Heyer and Markus Strohmaier |
ILCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data
|
16.35-17.55 |
Darja Fišer, Jakob Lenardič and Tomaž Erjavec |
CLARIN’s Key Resource Families
|
16.35-17.55 |
Juliano Efson Sales, Leonardo Souza, Siamak Barzegar, Brian Davis, André Freitas and Siegfried Handschuh |
Indra: A Word Embedding and Semantic Relatedness Server
|
16.35-17.55 |
Giuseppe Abrami and Alexander Mehler |
A UIMA Database Interface for Managing NLP-related Text Annotations
|
16.35-17.55 |
Andrea Lösch, Valérie Mapelli, Stelios Piperidis, Andrejs Vasiļjevs, Lilli Smal, Thierry Declerck, Eileen Schnur, Khalid Choukri and Josef Van Genabith |
European Language Resource Coordination: Collecting Language Resources for Public Sector Multilingual Information Management
|
DAY 2 Oral Sessions
|
Session O15 - Semantics & Lexicon (2) |
Chair: Reinhard Rapp |
9.45-10.05 |
Zdenka Uresova, Eva Fucikova, Eva Hajicova and Jan Hajic |
Creating a Verb Synonym Lexicon Based on a Parallel Corpus
|
10.05-10.25 |
Farhad Nooralahzadeh, Lilja Øvrelid and Jan Tore Lønning |
Evaluation of Domain-specific Word Embeddings using Knowledge Resources
|
10.25-10.45 |
Chaya Liebeskind, Ido Dagan and Jonathan Schler |
Automatic Thesaurus Construction for Modern Hebrew
|
10.45-11.05 |
Jiseong Kim, Younggyun Hahm, Sunggoo Kwon and KEY-SUN CHOI |
Automatic Wordnet Mapping: from CoreNet to Princeton WordNet
|
11.05-11.25 |
Tim O'Gorman, Sameer Pradhan, Martha Palmer, Julia Bonn, Kathryn Conger and James Gung |
The New Propbank: Aligning Propbank with AMR through POS Unification
|
|
Session O20 - Dialogue, Emotion, Multimodality |
Chair: Nick Campbell |
11.45-12.05 |
Austin Baird, Anissa Hamza and Daniel Hardt |
Classifying Sluice Occurrences in Dialogue
|
12.05-12.25 |
Masahiro Araki, Sayaka Tomimasu, Mikio Nakano, Kazunori Komatani, Shogo Okada, Shinya Fujie and Hiroaki Sugiyama |
Collection of Multimodal Dialog Data and Analysis of the Result of Annotation of Users' Interest Level
|
12.25-12.45 |
Alicia Lotz, Klas Ihme, Audrey Charnoz, Pantelis Maroudis, Ivan Dmitriev and Andreas Wendemuth |
Recognizing Behavioral Factors while Driving: A Real-World Multimodal Corpus to Monitor the Driver’s Affective State
|
12.45-13.05 |
Chao-Chun Hsu, Sheng-Yeh Chen, Chuan-Chun Kuo, Ting-Hao Huang and Lun-Wei Ku |
EmotionLines: An Emotion Corpus of Multi-Party Conversations
|
|
Session O22 - Less-Resourced & Ancient Languages |
Chair: Mark Liberman |
14.50-15.10 |
Pasindu Tennage, Prabath Sandaruwan, Malith Thilakarathne, Achini Herath and Surangika Ranathunga |
Handling Rare Word Problem using Synthetic Training Data for Sinhala and Tamil Neural Machine Translation
|
15.10-15.30 |
Egidio Marsico, Sebastien Flavier, Annemarie Verkerk and Steven Moran |
BDPROTO: A Database of Phonological Inventories from Ancient and Reconstructed Languages
|
15.30-15.50 |
Winston Wu, Nidhi Vyas and David Yarowsky |
Creating a Translation Matrix of the Bible’s Names Across 591 Languages
|
15.50-16.10 |
Vikas Reddy, Amrith Krishna, Vishnu Sharma, Prateek Gupta, Vineeth M R and Pawan Goyal |
Building a Word Segmenter for Sanskrit Overnight
|
16.10-16.30 |
Kira Griffitt, Jennifer Tracey, Ann Bies and Stephanie Strassel |
Simple Semantic Annotation and Situation Frames: Two Approaches to Basic Text Understanding in LORELEI
|
|
Session O23 - Semantics & Evaluation |
Chair: Gerard de Melo |
14.50-15.10 |
Claire Bonial, Bianca Badarau, Kira Griffitt, Ulf Hermjakob, Kevin Knight, Tim O'Gorman, Martha Palmer and Nathan Schneider |
Abstract Meaning Representation of Constructions: The More We Include, the Better the Representation
|
15.10-15.30 |
Rik Van Noord, Lasha Abzianidze, Hessel Haagsma and Johan Bos |
Evaluating Scoped Meaning Representations
|
15.30-15.50 |
Tommaso Pasini, Francesco Elia and Roberto Navigli |
Huge Automatically Extracted Training-Sets for Multilingual Word SenseDisambiguation
|
15.50-16.10 |
Alexis Conneau and Douwe Kiela |
SentEval: An Evaluation Toolkit for Universal Sentence Representations
|
16.10-16.30 |
Steven Neale
|
A Survey on Automatically-Constructed WordNets and their Evaluation: Lexical and Word Embedding-based Approaches
|
|
Session O25 - Social Media & Evaluation |
Chair: Nasredine Semmar |
16.50-17.10 |
Xuancheng Ren, Xu SUN, Ji Wen, Bingzhen Wei, Weidong Zhan and Zhiyuan Zhang |
Building an Ellipsis-aware Chinese Dependency Treebank for Web Text
|
17.10-17.30 |
Cyril Goutte, Yunli Wang, FangMing Liao, Zachary Zanussi, Samuel Larkin and Yuri Grinberg |
EuroGames16: Evaluating Change Detection in Online Conversation
|
17.30-17.50 |
Deepak Gupta, Asif Ekbal and Pushpak Bhattacharyya |
A Deep Neural Network based Approach for Entity Extraction in Code-Mixed Indian Social Media Text
|
17.50-18.10 |
Manuela Sanguinetti, Cristina Bosco, Alberto Lavelli, Alessandro Mazzei, Oronzo Antonelli and Fabio Tamburini |
PoSTWITA-UD: an Italian Twitter Treebank in Universal Dependencies
|
18.10-18.30 |
Vivek Reddy Doudagiri, Alakananda Vempala and Eduardo Blanco |
Annotating If the Authors of a Tweet are Located at the Locations They Tweet About
|
|
Session O27 - Treebanks & Parsing |
Chair: Wenliang Chen |
16.50-17.10 |
Alexander Panchenko, Eugen Ruppert, Stefano Faralli, Simone Paolo Ponzetto and Chris Biemann |
Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl
|
17.10-17.30 |
Masayuki Asahara, Hiroshi Kanayama, Takaaki Tanaka, Yusuke Miyao, Sumire Uematsu, Shinsuke Mori, Yuji Matsumoto, Mai Omura and Yugo Murawaki |
Universal Dependencies Version 2 for Japanese
|
17.30-17.50 |
Debopam Das and Manfred Stede |
Developing the Bangla RST Discourse Treebank
|
17.50-18.10 |
Marcin Woliński, Elżbieta Hajnicz and Tomasz Bartosiak |
A New Version of the Składnica Treebank of Polish Harmonised with the Walenty Valency Dictionary
|
18.10-18.30 |
Kira Droganova, Daniel Zeman, Jenna Kanerva and Filip Ginter |
Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions
|
|
Session O28 - Morphology & Lexicons |
Chair: Tamás Varádi |
16.50-17.10 |
Mateusz Lango, Magda Sevcikova and Zdeněk Žabokrtský |
Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish)
|
17.10-17.30 |
Benoît Sagot
|
A multilingual collection of CoNLL-U-compatible morphological lexicons
|
17.30-17.50 |
Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sebastian Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner and Mans Hulden |
UniMorph 2.0: Universal Morphology
|
17.50-18.10 |
Olga Lovick, Christopher Cox, Miikka Silfverberg, Antti Arppe and Mans Hulden |
A Computational Architecture for the Morphology of Upper Tanana
|
18.10-18.30 |
Piotr Żelasko
|
Expanding Abbreviations in a Strongly Inflected Language: Are Morphosyntactic Tags Sufficient?
|
DAY 2 Poster Sessions
|
Session P20 - Bibliometrics, Scientometrics, Infometrics |
Chair: Richard Eckart de Castilho |
9.45-11.25 |
Michael Färber, Alexander Thiemann and Adam Jatowt |
A High-Quality Gold Standard for Citation-based Tasks
|
9.45-11.25 |
Joseph Mariani, Gil Francopoulo and Patrick Paroubek |
Measuring Innovation in Speech and Language Processing Publications.
|
9.45-11.25 |
Daniel Ferrés, Horacio Saggion, Francesco Ronzano and Àlex Bravo |
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles
|
9.45-11.25 |
Eric Kergosien, Amin Farvardin, Maguelonne Teisseire, Marie-Noelle BESSAGNET, Joachim Schöpfel, Stéphane Chaudiron, Bernard Jacquemin, Annig Lacayrelle, Mathieu Roche, Christian Sallaberry and Jean-Philippe Tonneau |
Automatic Identification of Research Fields in Scientific Papers
|
|
Session P21 - Discourse Annotation, Representation and Processing (1) |
Chair: Silvia Pareti |
9.45-11.25 |
Andrej Kibrik and Olga Fedorova |
A «Portrait» Approach to Multichannel Discourse
|
9.45-11.25 |
Deniz Zeyrek, Amália Mendes and Murathan Kurfalı |
Multilingual Extension of PDTB-Style Annotation: The Case of TED Multilingual Discourse Bank
|
9.45-11.25 |
Xiaomin Chu, Feng Jiang, Sheng Xu and Qiaoming Zhu |
Building a Macro Chinese Discourse Treebank
|
9.45-11.25 |
Tuomo Hiippala and Serafina Orekhova |
Enhancing the AI2 Diagrams Dataset Using Rhetorical Structure Theory
|
9.45-11.25 |
Kordula De Kuthy, Nils Reiter and Arndt Riester |
QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation
|
9.45-11.25 |
José Lopes, Nils Hemmingsson and Oliver Åstrand |
The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions
|
9.45-11.25 |
Andre Cianflone and Leila Kosseim |
Attention for Implicit Discourse Relation Recognition
|
9.45-11.25 |
Chandrakant Bothe, Cornelius Weber, Sven Magg and Stefan Wermter |
A Context-based Approach for Dialogue Act Recognition using Simple Recurrent Neural Networks
|
9.45-11.25 |
Philipp Helfrich, Elias Rieb, Giuseppe Abrami, Andy Lücking and Alexander Mehler |
TreeAnnotator: Versatile Visual Annotation of Hierarchical Text Relations
|
9.45-11.25 |
Emer Gilmartin, Carl Vogel and Nick Campbell |
Chats and Chunks: Annotation and Analysis of Multiparty Long Casual Conversations
|
|
Session P22 - Evaluation Methodologies |
Chair: Edouard Geoffrois |
9.45-11.25 |
Ludovic Tanguy, Cécile Fabre and Laura Rivière |
Extending the gold standard for a lexical substitution task: is it worth it?
|
9.45-11.25 |
Maria Moritz and David Steding |
Lexical and Semantic Features for Cross-lingual Text Reuse Classification: an Experiment in English and Latin Paraphrases
|
9.45-11.25 |
YU Yuan and Serge Sharoff |
Investigating the Influence of Bilingual MWU on Trainee Translation Quality
|
9.45-11.25 |
Zsanett Ferenczi, Iván Mittelholcz, Eszter Simon and Tamás Váradi |
Evaluation of Dictionary Creating Methods for Finno-Ugric Minority Languages
|
9.45-11.25 |
Imed Laaridh, Christine Meunier and Corinne Fredouille |
Dysarthric speech evaluation: automatic and perceptual approaches
|
9.45-11.25 |
Patricia Braunger, Wolfgang Maier, Jan Wessling and Maria Schmidt |
Towards an Automatic Assessment of Crowdsourced Data for NLU
|
9.45-11.25 |
Jinyoung Yeo, Gyeongbok Lee, Gengyu Wang, Seungtaek Choi, Hyunsouk Cho, Reinald Kim Amplayo and Seung-won Hwang |
Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning
|
9.45-11.25 |
Filip Klubička, Giancarlo D. Salton and John D. Kelleher |
Is it worth it? Budget-related evaluation metrics for model selection
|
9.45-11.25 |
Patrick Huber, Jan Niehues and Alex Waibel |
Automated Evaluation of Out-of-Context Errors
|
9.45-11.25 |
Olivier Galibert, Guillaume Bernard, Agnes Delaborde, Sabrina Lecadre and Juliette Kahn |
Matics Software Suite: New Tools for Evaluation and Data Exploration
|
9.45-11.25 |
Mostafa Abdou, Artur Kulmizev and Vinit Ravishankar |
MGAD: Multilingual Generation of Analogy Datasets
|
|
Session P23 - Information Extraction, Information Retrieval, Text Analytics (2) |
Chair: Pierre Zweigenbaum |
9.45-11.25 |
Stefano Faralli, Els Lefever and Simone Paolo Ponzetto |
MIsA: Multilingual "IsA" Extraction from Corpora
|
9.45-11.25 |
Naiara Perez, Montse Cuadros and German Rigau |
Biomedical term normalization of EHRs with UMLS
|
9.45-11.25 |
William Lechelle and Phillippe Langlais |
Revisiting the Task of Scoring Open IE Relations
|
9.45-11.25 |
Rajdeep Sarkar, John Philip McCrae and Paul Buitelaar |
A supervised approach to taxonomy extraction using word embeddings
|
9.45-11.25 |
Yi Zhang and Sun Xu |
A Chinese Dataset with Negative Full Forms for General Abbreviation Prediction
|
9.45-11.25 |
Chae-Gyun Lim, Young-Seob Jeong and Ho-Jin Choi |
Korean TimeBank Including Relative Temporal Information
|
9.45-11.25 |
Nancy Ide, Keith Suderman and Jin-Dong Kim |
Mining Biomedical Publications With The LAPPS Grid
|
9.45-11.25 |
Rashmi Sankepally and Douglas W. Oard |
An Initial Test Collection for Ranked Retrieval of SMS Conversations
|
9.45-11.25 |
Nathalie Camelin, Géraldine Damnati, Abdessalam Bouchekif, Anais Landeau, Delphine Charlet and Yannick Esteve |
FrNewsLink : a corpus linking TV Broadcast News Segments and Press Articles
|
9.45-11.25 |
Nicolas Hernandez and Amir Hazem |
PyRATA, Python Rule-based feAture sTructure Analysis
|
|
Session P24 - Multimodality |
Chair: Martin Braschler |
9.45-11.25 |
Zbynek Zajic, Lucie Skorkovska, Petr Neduchal, Pavel Ircing, Josef V. Psutka, Marek Hruz, Ales Prazak, Daniel Soutner, Jan Švec, Lukas Bures and Ludek Muller |
Towards Processing of the Oral History Interviews and Related Printed Documents
|
9.45-11.25 |
Armin Hoenen
|
Multi Modal Distance - An Approach to Stemma Generation With Weighting
|
9.45-11.25 |
Ting Han and David Schlangen |
A Corpus of Natural Multimodal Spatial Scene Descriptions
|
9.45-11.25 |
Fernando T. Ito, Helena de Medeiros Caseli and Jander Moreira |
The Effects of Unimodal Representation Choices on Multimodal Learning
|
9.45-11.25 |
Nikhil Krishnaswamy and James Pustejovsky |
An Evaluation Framework for Multimodal Interaction
|
9.45-11.25 |
Ahmed Abdelali, Irina Temnikova, Samy Hedaya and Stephan Vogel |
The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic
|
9.45-11.25 |
Alina Wróblewska
|
Polish Corpus of Annotated Descriptions of Images
|
9.45-11.25 |
Stephanie Gross, Matthias Hirschmanner, Brigitte Krenn, Friedrich Neubarth and Michael Zillich |
Action Verb Corpus
|
9.45-11.25 |
Cédric Fayet, Arnaud Delhay, Damien Lolive and Pierre-françois Marteau |
EMO&LY (EMOtion and AnomaLY) : A new corpus for anomaly detection in an audiovisual stream with emotional context.
|
9.45-11.25 |
Keith Curtis, Nick Campbell and Gareth Jones |
Development of an Annotated Multimodal Dataset for the Investigation of Classification and Summarisation of Presentations using High-Level Paralinguistic Features
|
|
Session P25 - Parsing, Syntax, Treebank (1) |
Chair: Simonetta Montemagni |
9.45-11.25 |
Kiem-Hieu Nguyen
|
BKTreebank: Building a Vietnamese Dependency Treebank
|
9.45-11.25 |
Roman Schneider and Monica Fürbacher |
GeCoTagger: Annotation of German Verb Complements with Conditional Random Fields
|
9.45-11.25 |
Tatiana Bladier, Esther Seyffarth, Oliver Hellwig and Wiebke Petersen |
AET: Web-based Adjective Exploration Tool for German
|
9.45-11.25 |
Alan Akbik and Roland Vollgraf |
ZAP: An Open-Source Multilingual Annotation Projection Framework
|
9.45-11.25 |
Talha Javed, Nizar Habash and Dima Taji |
Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages
|
9.45-11.25 |
Stephen Tratz and Nhien Phan |
A Web-based System for Crowd-in-the-Loop Dependency Treebanking
|
9.45-11.25 |
Jayeol Chun, Na-Rae Han, Jena D. Hwang and Jinho D. Choi |
Building Universal Dependency Treebanks in Korean
|
9.45-11.25 |
Agnieszka Falenska, Kerstin Eckart and Jonas Kuhn |
Moving TIGER beyond Sentence-Level
|
9.45-11.25 |
Luis Chiruzzo and Dina Wonsever |
Spanish HPSG Treebank based on the AnCora Corpus
|
9.45-11.25 |
Binyam Ephrem Seyoum, Yusuke Miyao and Baye Yimam Mekonnen |
Universal Dependencies for Amharic
|
9.45-11.25 |
David Arps and Simon Petitjean |
A Parser for LTAG and Frame Semantics
|
9.45-11.25 |
KyungTae Lim, Niko Partanen and Thierry Poibeau |
Multilingual Dependency Parsing for Low-Resource Languages: Case Studies on North Saami and Komi-Zyrian
|
|
Session I-P1 - Industry Track - Industrial Systems |
Chair: Linne Ha |
11.45-13.05 |
Alexander Gutkin, Martin Jansche and Tatiana Merkulova |
FonBund: A Library for Combining Cross-lingual Phonological Segment Data
|
11.45-13.05 |
Pasindu De Silva, Theeraphol Wattanavekin, Tang Hao and Knot Pipatsrisawat |
Voice Builder: A Tool for Building Text-To-Speech Voices
|
11.45-13.05 |
Kazuma Takaoka, Sorami Hisamoto, Noriko Kawahara, Miho Sakamoto, Yoshitaka Uchida and Yuji Matsumoto |
Sudachi: a Japanese Tokenizer for Business
|
11.45-13.05 |
Kazunari Tanaka, Tomoya Iwakura, Yusuke Koyanagi, Noriko Ikeda, Hiroyuki Shindo and Yuji Matsumoto |
Chemical Compounds Knowledge Visualization with Natural Language Processing and Linked Data
|
|
Session P26 - Language Acquisition & CALL (1) |
Chair: Donghui Lin |
11.45-13.05 |
Shuyuan Cao and Harritxu Gete |
Using Discourse Information for Education with a Spanish-Chinese Parallel Corpus
|
11.45-13.05 |
Kay Berkling
|
A 2nd Longitudinal Corpus for Children's Writing with Enhanced Output for Specific Spelling Patterns
|
11.45-13.05 |
Masaya Yamaguchi, Masanori Kitamura and Naomi Yanagida |
Development of a Mobile Observation Support System for Students: FishWatchr Mini
|
11.45-13.05 |
Jan Odijk, Alexis Dimitriadis, Martijn Van der Klis, Marjo Van Koppen, Meie Otten and Remco Van der Veen |
The AnnCor CHILDES Treebank
|
11.45-13.05 |
Xuan-Nga Cao, Cyrille Dakhlia, Patricia Del Carmen, Mohamed-Amine Jaouani, Malik Ould-Arbi and Emmanuel Dupoux |
BabyCloud, a Technological Platform for Parents and Researchers
|
11.45-13.05 |
Yasuhiro Minami, Tessei Kobayashi and Yuko Okumura |
Infant Word Comprehension-to-Production Index Applied to Investigation of Noun Learning Predominance Using Cross-lingual CDI database
|
11.45-13.05 |
Lung-Hao Lee, Yuen-Hsien Tseng and Liping Chang |
Building a TOCFL Learner Corpus for Chinese Grammatical Error Diagnosis
|
11.45-13.05 |
Jean-Philippe Goldman and Sandra Schwab |
MIAPARLE: Online training for the discrimination of stress contrasts
|
11.45-13.05 |
Torsten Zesch and Andrea Horbach |
ESCRITO - An NLP-Enhanced Educational Scoring Toolkit
|
11.45-13.05 |
Muhamed Al Khalil, Hind Saddiki, Nizar Habash and Latifa Alfalasi |
A Leveled Reading Corpus of Modern Standard Arabic
|
|
Session P27 - Less-Resourced/Endangered Languages (1) |
Chair: Valérie Mapelli |
11.45-13.05 |
Rodrigo Agerri, Xavier Gómez Guinovart, German Rigau and Miguel Anxo Solla Portela |
Developing New Linguistic Resources and Tools for the Galician Language
|
11.45-13.05 |
Jordan Lachler, Lene Antonsen, Trond Trosterud, Sjur Moshagen and Antti Arppe |
Modeling Northern Haida Verb Morphology
|
11.45-13.05 |
Caitlin Richter, Matthew Wickes, Deniz Beser and Mitchell Marcus |
Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation
|
11.45-13.05 |
Hanna Hedeland, Timm Lehmberg, Felix Rau, Sophie Salffner, Mandana Seyfeddinipur and Andreas Witt |
Introducing the CLARIN Knowledge Centre for Linguistic Diversity and Language Documentation
|
11.45-13.05 |
Petra Galuscakova and Lucie Neuzilova |
Low Resource Methods for Medieval Document Sections Analysis
|
11.45-13.05 |
Ralf Grubenmann, Don Tuggener, Pius Von Däniken, Jan Deriu and Mark Cieliebak |
SB-CH: A Swiss German Corpus with Sentiment Annotations
|
11.45-13.05 |
Hajime Senuma and Akiko Aizawa |
Universal Dependencies for Ainu
|
|
Session P28 - Lexicon (2) |
Chair: Lionel Nicolas |
11.45-13.05 |
Steve Cassidy, Onno Crasborn, Henri Nieminen, Wessel Stoop, Micha Hulsbosch, Susan Even, Erwin Komen and Trevor Johnson |
Signbank: Software to Support Web Based Dictionaries of Sign Language
|
11.45-13.05 |
Kaoru Ito, Hiroyuki Nagai, Taro Okahisa, Shoko Wakamiya, Tomohide Iwao and Eiji Aramaki |
J-MeDic: A Japanese Disease Name Dictionary based on Real Clinical Usage
|
11.45-13.05 |
Kyoko Kanzaki and Hitoshi Isahara |
Building a List of Synonymous Words and Phrases of Japanese Compound Verbs
|
11.45-13.05 |
Pilar León-Araúz and Arianne Reimerink |
Evaluating EcoLexiCAT: a Terminology-Enhanced CAT Tool
|
11.45-13.05 |
Bolette Pedersen, Sanni Nimb, Anders Søgaard, Mareike Hartmann and Sussi Olsen |
A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier
|
11.45-13.05 |
Charles Jochim, Francesca Bonin, Roy Bar-Haim and Noam Slonim |
SLIDE - a Sentiment Lexicon of Common Idioms
|
11.45-13.05 |
Haris Bin Zia, Agha Ali Raza and Awais Athar |
PronouncUR: An Urdu Pronunciation Lexicon Generator
|
11.45-13.05 |
Agnieszka Mykowiecka, Malgorzata Marciniak and Piotr Rychlik |
SimLex-999 for Polish
|
11.45-13.05 |
João Rodrigues and António Branco |
Finely Tuned, 2 Billion Token Based Word Embeddings for Portuguese
|
|
Session P31 - MultiWord Expressions & Collocations |
Chair: Brigitte Krenn |
11.45-13.05 |
Diego Moussallem, Mohamed Ahmed Sherif, Diego Esteves, Marcos Zampieri and Axel-Cyrille Ngonga Ngomo |
LIdioms: A Multilingual Linked Idioms Data Set
|
11.45-13.05 |
Suguru Matsuyoshi, Hirotaka Kameko, Yugo Murawaki and Shinsuke Mori |
Annotating Modality Expressions and Event Factuality for a Japanese Chess Commentary Corpus
|
11.45-13.05 |
Menghan Jiang, Natalia Klyueva, Hongzhi Xu and Chu-Ren Huang |
Annotating Chinese Light Verb Constructions according to PARSEME guidelines
|
11.45-13.05 |
Cvetana Krstev, Branislava Šandrih, Ranka Stankovic and Miljana Mladenović |
Using English Baits to Catch Serbian Multi-Word Terminology
|
11.45-13.05 |
Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto |
Construction of Large-scale English Verbal Multiword Expression Annotated Corpus
|
11.45-13.05 |
Uxoa Iñurrieta, Itziar Aduriz, Arantza Diaz de Ilarraza, Gorka Labaka and Kepa Sarasola |
Konbitzul: an MWE-specific database for Spanish-Basque
|
11.45-13.05 |
Juliano Efson Sales, Siamak Barzegar, Wellington Franco, Bernhard Bermeitinger, Tiago Cunha, Brian Davis, André Freitas and Siegfried Handschuh |
A Multilingual Test Collection for the Semantic Search of Entity Categories
|
11.45-13.05 |
Melania Cabezas-García and Pilar León-Araúz |
Towards the Inference of Semantic Relations in Complex Nominals: a Pilot Study
|
11.45-13.05 |
Sara Rodríguez-Fernández, Roberto Carlini and Leo Wanner |
Generation of a Spanish Artificial Collocation Error Corpus
|
11.45-13.05 |
Dušan Variš and Natalia Klyueva |
Improving a Neural-based Tagger for Multiword Expressions Identification
|
11.45-13.05 |
Katsiaryna Aharodnik, Anna Feldman and Jing Peng |
Designing a Russian Idiom-Annotated Corpus
|
|
Session P32 - Document Classification, Text Categorisation (1) |
Chair: Monserrat Marimon |
14.50-16.30 |
Tobias Horsmann and Torsten Zesch |
DeepTC – An Extension of DKPro Text Classification for Fostering Reproducibility of Deep Learning Experiments
|
14.50-16.30 |
Steven Zimmerman, Udo Kruschwitz and Chris Fox |
Improving Hate Speech Detection with Deep Learning Ensembles
|
14.50-16.30 |
Amaru Cuba Gyllensten and Magnus Sahlgren |
Distributional Term Set Expansion
|
14.50-16.30 |
Núria Bel and Joel Pocostales |
Can Domain Adaptation be Handled as Analogies?
|
14.50-16.30 |
Fernando Hsieh, Rafael Dias and Ivandré Paraboni |
Author Profiling from Facebook Corpora
|
14.50-16.30 |
Liat Ein Dor, Alon Halfon, Yoav Kantor, Ran Levy, Yosi Mass, Ruty Rinott, Eyal Shnarch and Noam Slonim |
Semantic Relatedness of Wikipedia Concepts -- Benchmark Data and a Working Solution
|
14.50-16.30 |
Dainis Boumber, Yifan Zhang and Arjun Mukherjee |
Experiments with Convolutional Neural Networks for Multi-Label Authorship Attribution
|
|
Session P33 - Morphology (1) |
Chair: Piotr Banski |
14.50-16.30 |
Dat Quoc Nguyen, Dai Quoc Nguyen, Thanh Vu, Mark Dras and Mark Johnson |
A Fast and Accurate Vietnamese Word Segmenter
|
14.50-16.30 |
Francis Tyers, Sevilay Bayatli, Güllü Karanfil and Memduh Gokirmak |
Finite-state morphological analysis for Gagauz
|
14.50-16.30 |
Besim Kabashi and Thomas Proisl |
Albanian Part-of-Speech Tagging: Gold Standard and Evaluation
|
14.50-16.30 |
Sreelekha S and Pushpak Bhattacharyya |
Morphology Injection for English-Malayalam Statistical Machine Translation
|
14.50-16.30 |
Mohammed Attia, Vitaly Nikolaev and Ali Elkahky |
The Morpho-syntactic Annotation of Animacy for a Dependency Parser
|
14.50-16.30 |
Ossama Obeid, Salam Khalifa, Nizar Habash, Houda Bouamor, Wajdi Zaghouani and Kemal Oflazer |
MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction
|
14.50-16.30 |
Emily Chen and Lane Schwartz |
A Morphological Analyzer for St. Lawrence Island / Central Siberian Yupik
|
14.50-16.30 |
Christian Chiarcos, Kathrin Donandt, Maxim Ionov, Monika Rind-Pawlowski, Hasmik Sargsian, Jesse Wichers Schreur, Frank Abromeit and Christian Fäth |
Universal Morphologies for the Caucasus region
|
|
Session P34 - Opinion Mining / Sentiment Analysis (2) |
Chair: Patrick Paroubek |
14.50-16.30 |
Phan Duc-Anh and Yuji Matsumoto |
EMTC: Multilabel Corpus in Movie Domain for Emotion Analysis in Conversational Text
|
14.50-16.30 |
Stefania Pecore and Jeanne Villaneau |
Complex and Precise Movie and Book Annotations in French Language for Aspect Based Sentiment Analysis
|
14.50-16.30 |
Antonio Moreno-Ortiz and Chantal Pérez-Hernández |
Lingmotif-lex: a Wide-coverage, State-of-the-art Lexicon for Sentiment Analysis
|
14.50-16.30 |
Yiou Wang and Takuji Tahara |
A Japanese Corpus for Analyzing Customer Loyalty Information
|
14.50-16.30 |
Henny Sluyter-Gäthje, Pintu Lohar, Haithem Afli and Andy Way |
FooTweets: A Bilingual Parallel Corpus of World Cup Tweets
|
14.50-16.30 |
Thomas Gaillat, Manel Zarrouk, André Freitas and Brian Davis |
The SSIX Corpora: Three Gold Standard Corpora for Sentiment Analysis in English, Spanish and German Financial Microblogs
|
14.50-16.30 |
Aditya Joshi, Pranav Goel, Pushpak Bhattacharyya and Mark Carman |
Sarcasm Target Identification: Dataset and An Introductory Approach
|
14.50-16.30 |
Janaka Chathuranga, Shanika Ediriweera, Ravindu Hasantha, Pranidhith Munasinghe and Surangika Ranathunga |
Annotating Opinions and Opinion Targets in Student Course Feedback
|
14.50-16.30 |
Jacobo Rouces, Nina Tahmasebi, Lars Borin and Stian Rødven Eide |
Generating a Gold Standard for a Swedish Sentiment Lexicon
|
|
Session P36 - Question Answering and Machine Reading |
Chair: António Branco |
14.50-16.30 |
Aikaterini-Lida Kalouli, Katharina Kaiser, Annette Hautli-Janisz, Georg A. Kaiser and Miriam Butt |
A Multilingual Approach to Question Classification
|
14.50-16.30 |
Yiming Cui, Ting Liu, Zhipeng Chen, Wentao Ma, Shijin Wang and Guoping Hu |
Dataset for the First Evaluation on Chinese Machine Reading Comprehension
|
14.50-16.30 |
Amir Hazem, Basma El Amel Boussaha and Nicolas Hernandez |
A Multi-Domain Framework for Textual Similarity. A Case Study on Question-to-Question and Question-Answering Similarity Tasks
|
14.50-16.30 |
Peter Jansen, Elizabeth Wainwright, Steven Marmorstein and Clayton Morrison |
WorldTree: A Corpus of Explanation Graphs for Elementary Science Questions supporting Multi-hop Inference
|
14.50-16.30 |
Shun-ya Fukunaga, Hitoshi Nishikawa, Takenobu Tokunaga, Hikaru Yokono and Tetsuro Takahashi |
Analysis of Implicit Conditions in Database Search Dialogues
|
14.50-16.30 |
Jelte Van Waterschoot, Guillaume Dubuisson Duplessis, Lorenzo Gatti, Merijn Bruijnes and Dirk Heylen |
An Information-Providing Closed-Domain Human-Agent Interaction Corpus
|
14.50-16.30 |
Masashi Yokota and Hideki Nakayama |
Augmenting Image Question Answering Dataset by Exploiting Image Captions
|
14.50-16.30 |
Kyungjae Lee, Kyoungho Yoon, Sunghyun Park and Seung-won Hwang |
Semi-supervised Training Data Generation for Multilingual Question Answering
|
14.50-16.30 |
Andrei Dulceanu, Thang Le Dinh, Walter Chang, Trung Bui, Doo Soon Kim, Manh Chien Vu and Seokhwan Kim |
PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering
|
14.50-16.30 |
Dimitris Pappas, Ion Androutsopoulos and Haris Papageorgiou |
BioRead: A New Dataset for Biomedical Reading Comprehension
|
14.50-16.30 |
Deepak Gupta, Surabhi Kumari, Asif Ekbal and Pushpak Bhattacharyya |
MMQA: A Multi-domain Multi-lingual Question-Answering Framework for English and Hindi
|
|
Session P37 - Social Media Processing (2) |
Chair: Tetsuro Takahashi |
14.50-16.30 |
Nathan Green and Septina Larasati |
The First 100 Days: A Corpus Of Political Agendas on Twitter
|
14.50-16.30 |
Shweta Yadav, Asif Ekbal, Sriparna Saha and Pushpak Bhattacharyya |
Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System
|
14.50-16.30 |
Manuela Sanguinetti, Fabio Poletto, Cristina Bosco, Viviana Patti and Marco Stranisci |
An Italian Twitter Corpus of Hate Speech against Immigrants
|
14.50-16.30 |
Giorgia Di Tommaso, Stefano Faralli and Paola Velardi |
A Large Multilingual and Multi-domain Dataset for Recommender Systems
|
14.50-16.30 |
Rob Voigt, David Jurgens, Vinodkumar Prabhakaran, Dan Jurafsky and Yulia Tsvetkov |
RtGender: A Corpus for Studying Differential Responses to Gender
|
14.50-16.30 |
Sara Meftah and Nasredine Semmar |
A Neural Network Model for Part-Of-Speech Tagging of Social Media Texts
|
14.50-16.30 |
Valerij Fredriksen, Brage Jahren and Björn Gambäck |
Utilizing Large Twitter Corpora to Create Sentiment Lexica
|
|
Session P38 - Speech Resource/Database (1) |
Chair: Gilles Adda |
14.50-16.30 |
Laura Fernández Gallardo and Benjamin Weiss |
The Nautilus Speaker Characterization Corpus: Speech Recordings and Labels of Speaker Characteristics and Voice Descriptions
|
14.50-16.30 |
Florian Schiel and Thomas Zitzelsberger |
Evaluation of Automatic Formant Trackers
|
14.50-16.30 |
Luboš Šmídl, Jan Švec, Daniel Tihelka, Jindrich Matousek, Jan Romportl and Pavel Ircing |
Design and Development of Speech Corpora for Air Traffic Control Training
|
14.50-16.30 |
Ewald Van der westhuizen and Thomas Niesler |
A First South African Corpus of Multilingual Code-switched Soap Opera Speech
|
14.50-16.30 |
Nina Poerner and Florian Schiel |
A Web Service for Pre-segmenting Very Long Transcribed Speech Recordings
|
14.50-16.30 |
Estelle Delpech, Marion Laignelet, Christophe Pimm, Céline Raynal, Michal Trzos, Alexandre Arnold and Dominique Pronto |
A Real-life, French-accented Corpus of Air Traffic Control Communications
|
14.50-16.30 |
Askars Salimbajevs
|
Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data
|
14.50-16.30 |
Sunayana Sitaram, Varun Manjunath, Varun Bharadwaj, Monojit Choudhury, Kalika Bali and Michael Tjalve |
Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach
|
14.50-16.30 |
Tomoko Ohsuga, Yuichi Ishimoto, Tomoko Kajiyama, Shunsuke Kozawa, Kiyotaka Uchimoto and Shuichi Itahashi |
Extending Search System based on Interactive Visualization for Speech Corpora
|
14.50-16.30 |
Katrin Schweitzer, Kerstin Eckart, Markus Gärtner, Agnieszka Falenska, Arndt Riester, Ina Roesiger, Antje Schweitzer, Sabrina Stehwien and Jonas Kuhn |
German Radio Interviews: The GRAIN Release of the SFB732 Silver Standard Collection
|
14.50-16.30 |
Margot Mieskes and Andreas Stiegelmayr |
Preparing Data from Psychotherapy for Natural Language Processing
|
14.50-16.30 |
Amir Vaheb, Ali Janalizadeh Choobbasti, Mahdi Mortazavi, Saeid Safavi and Behnam Sabeti |
MirasVoice: A bilingual (English-Persian) speech corpus
|
|
Session P39 - Conversational Systems/Dialogue/Chatbots/Human-Robot Interaction (2) |
Chair: Johannes Kraus |
16.50-18.30 |
Silvia Pareti and Tatiana Lando |
Dialog Intent Structure: A Hierarchical Schema of Linked Dialog Acts
|
16.50-18.30 |
Tetsuaki Nakamura and Daisuke Kawahara |
JDCFC: A Japanese Dialogue Corpus with Feature Changes
|
16.50-18.30 |
Koichiro Yoshino, Hiroki Tanaka, Kyoshiro Sugiyama, Makoto Kondo and Satoshi Nakamura |
Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags
|
16.50-18.30 |
Ron Artstein, Jill Boberg, Alesia Gainer, Jonathan Gratch, Emmanuel Johnson, Anton Leuski, Gale Lucas and David Traum |
The Niki and Julie Corpus: Collaborative Multimodal Dialogues between Humans, Robots, and Virtual Agents
|
16.50-18.30 |
Nan Wang, Yan Song and Fei Xia |
Constructing a Chinese Medical Conversation Corpus Annotated with Conversational Structures and Actions
|
16.50-18.30 |
Ryo Ishii, Ryuichiro Higashinaka and Junji Tomita |
Predicting Nods by using Dialogue Acts in Dialogue
|
16.50-18.30 |
Maria Koutsombogera and Carl Vogel |
Modeling Collaborative Multimodal Behavior in Group Dialogues: The MULTISIMO Corpus
|
16.50-18.30 |
Magalie Ochs, Philippe Blache, Grégoire De Montcheuil, Jean-Marie Pergandi, Jorane Saubesty, Daniel Francon and Daniel Mestre |
A Semi-autonomous System for Creating a Human-Machine Interaction Corpus in Virtual Reality: Application to the ACORFORMed System for Training Doctors to Break Bad News
|
16.50-18.30 |
Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Lestari and Satoshi Nakamura |
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas
|
16.50-18.30 |
Vadim Sheinin, Elahe Khorasani, Hangu Yeo, Kun Xu, Ngoc Phuoc An Vo and Octavian Popescu |
QUEST: A Natural Language Interface to Relational Databases
|
|
Session P41 - Natural Language Generation |
Chair: Ineke Schuurman |
16.50-18.30 |
Danillo Rocha and Ivandré Paraboni |
Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation
|
16.50-18.30 |
Alex Lan and Ivandré Paraboni |
Definite Description Lexical Choice: taking Speaker's Personality into account
|
16.50-18.30 |
André Mariotti and Ivandré Paraboni |
Referring Expression Generation in time-constrained communication
|
16.50-18.30 |
Natsuda Laokulrat, Naoaki Okazaki and Hideki Nakayama |
Incorporating Semantic Attention in Video Description Generation
|
16.50-18.30 |
François Lareau, Florie Lambrey, Ieva Dubinskaite, Daniel Galarreta-Piquette and Maryam Nejat |
GenDR: A Generic Deep Realizer with Complex Lexicalization
|
16.50-18.30 |
Sanja Štajner and Sergiu Nisioi |
A Detailed Evaluation of Neural Sequence-to-Sequence Models for In-domain and Cross-domain Text Simplification
|
16.50-18.30 |
Piek Vossen, Filip Ilievski, Marten Postma and Roxane Segers |
Don't Annotate, but Validate: a Data-to-Text Method for Capturing Event Data
|
16.50-18.30 |
Diego Moussallem, Thiago Ferreira, Marcos Zampieri, Maria Cláudia Cavalcanti, Geraldo Xexéo, Mariana Neves and Axel-Cyrille Ngonga Ngomo |
RDF2PT: Generating Brazilian Portuguese Texts from RDF Data
|
16.50-18.30 |
Michele Berlingerio and Francesca Bonin |
Towards a music-language mapping
|
16.50-18.30 |
Amy Isard, Jon Oberlander and Claire Grover |
Up-cycling Data for Natural Language Generation
|
|
Session P42 - Semantics (2) |
Chair: Cécile Fabre |
16.50-18.30 |
Minh Le and Antske Fokkens |
Neural Models of Selectional Preferences for Implicit Semantic Role Labeling
|
16.50-18.30 |
Adrien Barbaresi, Lothar Lemnitzer and Alexander Geyken |
A database of German definitory contexts from selected web sources
|
16.50-18.30 |
Noelia Migueles-Abraira, Rodrigo Agerri and Arantza Diaz de Ilarraza |
Annotating Abstract Meaning Representations for Spanish
|
16.50-18.30 |
Marie-Claude L' Homme, Benoît Robichaud and Nathalie Prévil |
Browsing the Terminological Structure of a Specialized Domain: A Method Based on Lexical Functions and their Classification
|
16.50-18.30 |
Asad Sayeed, Pavel Shkadzko and Vera Demberg |
Rollenwechsel-English: a large-scale semantic role corpus
|
16.50-18.30 |
Girishkumar Ponkiya, Kevin Patel, Pushpak Bhattacharyya and Girish K. Palshikar |
Towards a Standardized Dataset for Noun Compound Interpretation
|
16.50-18.30 |
Yuchen Zhang and Nianwen Xue |
Structured Interpretation of Temporal Relations
|
16.50-18.30 |
Xi Victoria Lin, Chenglong Wang, Luke Zettlemoyer and Michael D. Ernst |
NL2Bash: A Corpus and Semantic Parser for Natural Language Interface to the Linux Operating System
|
16.50-18.30 |
Charles Welch, Jonathan K. Kummerfeld, Song Feng and Rada Mihalcea |
World Knowledge for Abstract Meaning Representation Parsing
|
|
Session P43 - Speech Processing |
Chair: Sebastian Stüker |
16.50-18.30 |
Michael Gref, Joachim Köhler and Almut Leh |
Improved Transcription and Indexing of Oral History Interviews for Digital Humanities Research
|
16.50-18.30 |
Weicheng Ma, Kai Cao, Zhaoheng Ni, Peter Chin and Xiang Li |
Sound Signal Processing with Seq2Tree Network
|
16.50-18.30 |
Anna Björk Nikulásdóttir, Inga Rún Helgadóttir, Matthías Pétursson and Jón Guðnason |
Open ASR for Icelandic: Resources and a Baseline System
|
16.50-18.30 |
Zhao Meng, Lili Mou and Zhi Jin |
Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
|
16.50-18.30 |
Junqing He, Xian Huang, Xuemin Zhao, Yan Zhang and Yonghong Yan |
Discriminating between Similar Languages on Imbalanced Conversational Texts
|
16.50-18.30 |
Michael Stadtschnitzer and Christoph Schmidt |
Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition
|
16.50-18.30 |
Edwin Simonnet, Sahar Ghannay, Nathalie Camelin and Yannick Estève |
Simulating ASR errors for training SLU systems
|
16.50-18.30 |
Natalia Tomashenko and Yannick Estève |
Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models
|
16.50-18.30 |
Ingmar Steiner and Sébastien Le Maguer |
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform
|
16.50-18.30 |
Akira Hayakawa, Carl Vogel, Saturnino Luz and Nick Campbell |
Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task
|
|
Session P44 - Summarisation |
Chair: Senja Pollak |
16.50-18.30 |
Christopher Tauchmann, Thomas Arnold, Andreas Hanselowski, Christian M. Meyer and Margot Mieskes |
Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data
|
16.50-18.30 |
Elvys Linhares Pontes, Juan-Manuel Torres-Moreno, Stéphane Huet and Andréa carneiro Linhares |
A New Annotated Portuguese/Spanish Corpus for the Multi-Sentence Compression Task
|
16.50-18.30 |
Avinesh PVS, Maxime Peyrard and Christian M. Meyer |
Live Blog Corpus for Summarization
|
16.50-18.30 |
Minh-Tien Nguyen, Dac Viet Lai, Huy-Tien Nguyen and Minh-Le Nguyen |
TSix: A Human-involved-creation Dataset for Tweet Summarization
|
16.50-18.30 |
Nisarg Jhaveri, Manish Gupta and Vasudeva Varma |
A Workbench for Rapid Generation of Cross-Lingual Summaries
|
16.50-18.30 |
Takashi Yamamura and Kazutaka Shimada |
Annotation and Analysis of Extractive Summaries for the Kyutech Corpus
|
16.50-18.30 |
Franck Dernoncourt, Mohammad Ghassemi and Walter Chang |
A Repository of Corpora for Summarization
|
16.50-18.30 |
Markus Zopf
|
Auto-hMDS: Automatic Construction of a Large Heterogeneous Multilingual Multi-Document Summarization Corpus
|
16.50-18.30 |
Yanjun Gao, Andrew Warner and Rebecca Passonneau |
PyrEval: An Automated Method for Summary Content Analysis
|
DAY 3 Oral Sessions
|
Session O29 - Language Resource Infrastructures |
Chair: Takenobu Tokunaga |
9.45-10.05 |
Franciska De Jong, Bente Maegaard, Koenraad De Smedt, Darja Fišer and Dieter Van Uytvanck |
CLARIN: Towards FAIR and Responsible Data Science Using Language Resources
|
10.05-10.25 |
Christopher Cieri, Mark Liberman, Stephanie Strassel, Denise DiPersio, Jonathan Wright and Andrea Mazzucchi |
From ‘Solved Problems’ to New Challenges: A Report on LDC Activities
|
10.25-10.45 |
Valérie Mapelli, Victoria Arranz, Hélène Mazo, Pawel Kamocki and Vladimir Popescu |
New directions in ELRA activities
|
10.45-11.05 |
Donghui Lin, Yohei Murakami and Toru Ishida |
A Framework for Multi-Language Service Design with the Language Grid
|
11.05-11.25 |
Georg Rehm and Stefanie Hegele |
Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs
|
|
Session O30 - Digital Humanities & Text Analytics |
Chair: Thierry Declerck |
9.45-10.05 |
Boyang Li, Beth Cardier, Tong Wang and Florian Metze |
Annotating High-Level Structures of Short Stories and Personal Anecdotes
|
10.05-10.25 |
Els Lefever, Iris Hendrickx, Ilja Croijmans, Antal Van den Bosch and Asifa Majid |
Discovering the Language of Wine Reviews: A Text Mining Account
|
10.25-10.45 |
Francis Bond and Graham Matthews |
Toward An Epic Epigraph Graph
|
10.45-11.05 |
Thomas Proisl, Stefan Evert, Fotis Jannidis, Christof Schöch, Leonard Konle and Steffen Pielström |
Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods
|
11.05-11.25 |
Edward Newell, Drew Margolin and Derek Ruths |
An Attribution Relations Corpus for Political News
|
|
Session O31 - Crowdsourcing & Collaborative Resource Construction |
Chair: Steve Cassidy |
9.45-10.05 |
Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A Farrugia, Claudia Borg, Kenneth Camilleri, Mike Rosner and Lonneke Van der Plas |
Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions
|
10.05-10.25 |
Ivan Habernal, Patrick Pauli and Iryna Gurevych |
Adapting Serious Game for Fallacious Argumentation to German: Pitfalls, Insights, and Best Practices
|
10.25-10.45 |
Jean-Philippe Goldman, Yves Scherrer, Julie Glikman, Mathieu Avanzi, Christophe Benzitoun and Philippe Boula de Mareüil |
Crowdsourcing Regional Variation Data and Automatic Geolocalisation of Speakers of European French
|
10.45-11.05 |
Maximiliana Behnke, Antonio Valerio Miceli Barone, Rico Sennrich, Vilelmini Sosoni, Thanasis Naskos, Eirini Takoulidou, Maria Stasimioti, Menno Van Zaanen, Sheila Castilho, Federico Gaspari, Panayota Georgakopoulou, Valia Kordoni, Markus Egg and Katia Lida Kermanidis |
Improving Machine Translation of Educational Content via Crowdsourcing
|
11.05-11.25 |
Rebecca Sharp, Mithun Paul, Ajay Nagesh, Dane Bell and Mihai Surdeanu |
Grounding Gradable Adjectives through Crowdsourcing
|
|
Session O32 - Less-Resourced Languages Speech & Multimodal Corpora |
Chair: Shyam Agrawal |
9.45-10.05 |
Oliver Adams, Trevor Cohn, Graham Neubig, Hilaria Cruz, Steven Bird and Alexis MICHAUD |
Evaluation Phonemic Transcription of Low-Resource Tonal Languages for Language Documentation
|
10.05-10.25 |
Pierre Godard, Gilles Adda, Martine Adda-Decker, Juan Benjumea, Laurent Besacier, Jamison Cooper-Leavitt, Guy-Noel Kouarata, Lori Lamel, Hélène Maynard, Markus Mueller, Annie Rialland, Sebastian Stueker, François Yvon and Marcely Zanon Boito |
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments
|
10.25-10.45 |
Jacqueline Brixey, Eli Pincus and Ron Artstein |
Chahta Anumpa: A multimodal corpus of the Choctaw Language
|
10.45-11.05 |
Fatima Hamlaoui, Emmanuel-Moselly Makasso, Markus Müller, Jonas Engelmann, Gilles Adda, Alex Waibel and Sebastian Stüker |
BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools
|
11.05-11.25 |
Kristiina Jokinen
|
Researching Less-Resourced Languages – the DigiSami Corpus
|
|
Session O33 - Lexicon |
Chair: Simon Krek |
11.45-12.05 |
Houda Bouamor, Nizar Habash, Mohammad Salameh, Wajdi Zaghouani, Owen Rambow, Dana Abdulrahim, Ossama Obeid, Salam Khalifa, Fadhl Eryani, Alexander Erdmann and Kemal Oflazer |
The MADAR Arabic Dialect Corpus and Lexicon
|
12.05-12.25 |
Arbi Haza Nasution, Yohei Murakami and Toru Ishida |
Designing a Collaborative Process to Create Bilingual Dictionaries of Indonesian Ethnic Languages
|
12.25-12.45 |
Edward Newell and Jackie Chi Kit Cheung |
Constructing a Lexicon of Relational Nouns
|
12.45-13.05 |
Winston Wu and David Yarowsky |
Creating Large-Scale Multilingual Cognate Tables
|
13.05-13.25 |
Patrick Drouin, Marie-Claude L'Homme and Benoît Robichaud |
Lexical Profiling of Environmental Corpora
|
|
Session O34 - Knowledge Discovery |
Chair: German Rigau |
11.45-12.05 |
Marcus Klang and Pierre Nugues |
Linking, Searching, and Visualizing Entities in Wikipedia
|
12.05-12.25 |
Chin-Ho Lin, Hen-Hsen Huang and Hsin-Hsi Chen |
Learning to Map Natural Language Statements into Knowledge Base Representations for Knowledge Base Construction
|
12.25-12.45 |
Vivian Silva, André Freitas and Siegfried Handschuh |
Building a Knowledge Graph from Natural Language Definitions for Interpretable Text Entailment Recognition
|
12.45-13.05 |
Arnaud Ferré, Louise Deléger, Pierre Zweigenbaum and Claire Nédellec |
Combining rule-based and embedding-based approaches to normalize textual entities with an ontology
|
13.05-13.25 |
Hady Elsahar, Pavlos Vougiouklis, Arslen Remaci, Christophe Gravier, Jonathon Hare, Frederique Laforest and Elena Simperl |
T-REx: A Large Scale Alignment of Natural Language with Knowledge Base Triples
|
|
Session O36 - Corpus Creation, Use & Evaluation (2) |
Chair: Satoshi Nakamura |
11.45-12.05 |
Edouard Grave, Piotr Bojanowski, Prakhar Gupta, Armand Joulin and Tomas Mikolov |
Learning Word Vectors for 157 Languages
|
12.05-12.25 |
Milan Straka, Nikita Mediankin, Tom Kocmi, Zdeněk Žabokrtský, Vojtěch Hudeček and Jan Hajic |
SumeCzech: Large Czech News-Based Summarization Dataset
|
12.25-12.45 |
Carmen Klaussner and Carl Vogel |
A Diachronic Corpus for Literary Style Analysis
|
12.45-13.05 |
Carolina Scarton, Gustavo Paetzold and Lucia Specia |
Text Simplification from Professionally Produced Corpora
|
13.05-13.25 |
Jacky Visser, Rory Duthie, John Lawrence and Chris Reed |
Intertextual Correspondence for Integrating Corpora
|
|
Session O37 - Anaphora & Coreference |
Chair: Claire Bonial |
14.45-15.05 |
Olga Seminck and Pascal Amsili |
A Gold Anaphora Annotation Layer on an Eye Movement Corpus
|
15.05-15.25 |
Yoshihiko Asao, Ryu Iida and Kentaro Torisawa |
Annotating Zero Anaphora for Question Answering
|
15.25-15.45 |
Rodrigo Agerri, Yiling Chung, Itziar Aldabe, Nora Aranberri, Gorka Labaka and German Rigau |
Building Named Entity Recognition Taggers via Parallel Corpora
|
15.45-16.05 |
Zhiyi Song, Ann Bies, Justin Mott, Xuansong Li, Stephanie Strassel and Christopher Caruso |
Cross-Document, Cross-Language Event Coreference Annotation Using Event Hoppers
|
DAY 3 Poster Sessions
|
Session P46 - Dialects |
Chair: Claudia Soria |
9.45-11.25 |
Yo Sato and Kevin Heffernan |
Creating dialect sub-corpora by clustering: a case in Japanese for an adaptive method
|
9.45-11.25 |
Roeland Van Hout, Nicoline Van der Sijs, Erwin Komen and Henk Van den Heuvel |
A Fast and Flexible Webinterface for Dialect Research in the Low Countries
|
9.45-11.25 |
Mahmoud El-Haj, Paul Rayson and Mariam Aboelezz |
Arabic Dialect Identification in the Context of Bivalency and Code-Switching
|
9.45-11.25 |
Nizar Habash, Fadhl Eryani, Salam Khalifa, Owen Rambow, Dana Abdulrahim, Alexander Erdmann, Reem Faraj, Wajdi Zaghouani, Houda Bouamor, Nasser Zalmout, Sara Hassan, Faisal Al shargi, Sakhar Alkhereyf, Basma Abdulkareem, Ramy Eskander, Mohammad Salameh and Hind Saddiki |
Unified Guidelines and Resources for Arabic Dialect Orthography
|
9.45-11.25 |
Houda SAADANE, Hosni Seffih, Christian Fluhr, Khalid Choukri and Nasredine SEMMAR |
Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach
|
9.45-11.25 |
Chatrine Qwaider, Motaz Saad, Stergios Chatzikyriakidis and Simon Dobnik |
Shami: A Corpus of Levantine Arabic Dialects
|
9.45-11.25 |
Muhammad Abdul-Mageed, Hassan Alhuzali and Mohamed Elaraby |
You Tweet What You Speak: A City-Level Dataset of Arabic Dialects
|
9.45-11.25 |
Ada Wan
|
Visualizing the "Dictionary of Regionalisms of France" (DRF)
|
9.45-11.25 |
Israa Alsarsour, Esraa Mohamed, Reem Suwaileh and Tamer Elsayed |
DART: A Large Dataset of Dialectal Arabic Tweets
|
9.45-11.25 |
Samantha Wray
|
Classification of Closely Related Sub-dialects of Arabic Using Support-Vector Machines
|
|
Session P48 - Information Extraction, Information Retrieval, Text Analytics (3) |
Chair: Patrick Drouin |
9.45-11.25 |
Anne-Kathrin Schumann and Héctor Martínez Alonso |
Automatic Annotation of Semantic Term Types in the Complete ACL Anthology Reference Corpus
|
9.45-11.25 |
Piotr Andruszkiewicz and Rafal Hazan |
Annotated Corpus of Scientific Conference's Homepages for Information Extraction
|
9.45-11.25 |
Yang Yu and Vincent Ng |
Improving Unsupervised Keyphrase Extraction using Background Knowledge
|
9.45-11.25 |
Rüdiger Gleim, Alexander Mehler and Sung Y. Song |
WikiDragon: A Java Framework For Diachronic Content And Network Analysis Of MediaWikis
|
9.45-11.25 |
Antske Fokkens, Nel Ruigrok, Camiel Beukeboom, Gagestein Sarah and Wouter Van Attveldt |
Studying Muslim Stereotyping through Microportrait Extraction
|
9.45-11.25 |
Verónica Pérez-Rosas, Xuetong Sun, Christy Li, Yuchen Wang, Kenneth Resnicow and Rada Mihalcea |
Analyzing the Quality of Counseling Conversations: the Tell-Tale Signs of High-quality Counseling
|
9.45-11.25 |
Denys Katerenchuk, David Guy Brizan and Andrew Rosenberg |
Interpersonal Relationship Labels for the CALLHOME Corpus
|
9.45-11.25 |
Suemi Higuchi, Cláudia Freitas, Bruno Cuconato and Alexandre Rademaker |
Text Mining for History: first steps on building a large dataset
|
9.45-11.25 |
Lorraine Goeuriot, Josiane Mothe, Philippe Mulhem and Eric SanJuan |
Building Evaluation Datasets for Cultural Microblog Retrieval
|
|
Session P49 - Machine Translation, SpeechToSpeech Translation (2) |
Chair: Mona Diab |
9.45-11.25 |
Matīss Rikters, Mārcis Pinnis and Rihards Krišlauks |
Training and Adapting Multilingual NMT for Less-resourced and Morphologically Rich Languages
|
9.45-11.25 |
YU Yuan, Yuze Gao, Yue Zhang and Serge Sharoff |
Cross-lingual Terminology Extraction for Translation Quality Estimation
|
9.45-11.25 |
Pierre-Edouard Honnet, Andrei Popescu-Belis, Claudiu Musat and Michael Baeriswyl |
Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German
|
9.45-11.25 |
Fathima Farhath, Pranavan Theivendiram, Surangika Ranathunga, Sanath Jayasena and Gihan Dias |
Improving domain-specific SMT for low-resourced languages using data from different domains
|
9.45-11.25 |
Vassilis Papavassiliou, Prokopis Prokopidis and Stelios Piperidis |
Discovering Parallel Language Resources for Training MT Engines
|
9.45-11.25 |
Laura Van Brussel, Arda Tezcan and Lieve Macken |
A fine-grained error analysis of NMT, SMT and RBMT output for English-to-Dutch
|
9.45-11.25 |
Injy Hamed, Mohamed Elmahdy and Slim Abdennadher |
Collection and Analysis of Code-switch Egyptian Arabic-English Speech Corpus
|
9.45-11.25 |
Chiraag Lala and Lucia Specia |
Multimodal Lexical Translation
|
9.45-11.25 |
Isabel Lacruz, Michael Carl and Masaru Yamada |
Literality and cognitive effort: Japanese and Spanish
|
9.45-11.25 |
Marlies Van der Wees, Arianna Bisazza and Christof Monz |
Evaluation of Machine Translation Performance Across Multiple Genres and Languages
|
9.45-11.25 |
Pierre Zweigenbaum, Serge Sharoff and Reinhard Rapp |
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
|
9.45-11.25 |
Aibek Makazhanov, Bagdat Myrzakhmetov and Zhenisbek Assylbekov |
Manual vs Automatic Bitext Extraction
|
|
Session P50 - Morphology (2) |
Chair: Amália Mendes |
9.45-11.25 |
Salam Khalifa, Nizar Habash, Fadhl Eryani, Ossama Obeid, Dana Abdulrahim and Meera Al Kaabi |
A Morphologically Annotated Corpus of Emirati Arabic
|
9.45-11.25 |
Amir More, Özlem Çetinoğlu, Çağrı Çöltekin, Nizar Habash, Benoît Sagot, Djamé Seddah, Dima Taji and Reut Tsarfaty |
CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing
|
9.45-11.25 |
Witold Kieraś and Marcin Woliński |
Manually Annotated Corpus of Polish Texts Published between 1830 and 1918
|
9.45-11.25 |
Claudia Marzi, Marcello Ferro, Ouafae Nahli, Patrizia Belik, Stavros Bompolas and Vito Pirrelli |
Evaluating Inflectional Complexity Crosslinguistically: a Processing Perspective
|
9.45-11.25 |
Patrick Littell, Tom McCoy, Na-Rae Han, Shruti Rijhwani, Zaid Sheikh, David R. Mortensen, Teruko Mitamura and Lori Levin |
Parser combinators for Tigrinya and Oromo morphology
|
9.45-11.25 |
Winston Wu and David Yarowsky |
Massively Translingual Compound Analysis and Translation Discovery
|
9.45-11.25 |
Petra Steiner and Josef Ruppenhofer |
Building a Morphological Treebank for German from a Linguistic Database
|
|
Session P51 - Multilinguality |
Chair: Pavel Straňák |
9.45-11.25 |
Željko Agić and Natalie Schluter |
Baselines and Test Data for Cross-Lingual Inference
|
9.45-11.25 |
Sanja Štajner, Marc Franco-Salvador, Paolo Rosso and Simone Paolo Ponzetto |
CATS: A Tool for Customized Alignment of Text Simplification Corpora
|
9.45-11.25 |
Thanh-Le Ha, Jan Niehues, Matthias Sperber, Ngoc Quan Pham and Alexander Waibel |
KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus
|
9.45-11.25 |
Alfred Sliwa, Yuan Man, Ruishen Liu, Niravkumar Borad, Seyedeh Ziyaei, Mina Ghobadi, Firas Sabbah and Ahmet Aker |
Multi-lingual Argumentative Corpora in English, Turkish, Greek, Albanian, Croatian, Serbian, Macedonian, Bulgarian, Romanian and Arabic
|
9.45-11.25 |
Siamak Barzegar, Brian Davis, Manel Zarrouk, Siegfried Handschuh and André Freitas |
SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages
|
|
Session P52 - Part-of-Speech Tagging |
Chair: Andreas Witt |
9.45-11.25 |
Delphine Bernhard, Anne-Laure Ligozat, Fanny Martin, Myriam Bras, Pierre Magistry, Marianne Vergez-Couret, Lucie Steiblé, Pascale Erhart, Nabil Hathout, Dominique Huck, Christophe Rey, Philippe Reynés, Sophie Rosset, Jean Sibille and Thomas Lavergne |
Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard
|
9.45-11.25 |
Randah Alharbi, Walid Magdy, Kareem Darwish, Ahmed Abdelali and Hamdy Mubarak |
Part-of-Speech Tagging for Arabic Gulf Dialect Using Bi-LSTM
|
9.45-11.25 |
Abdulrahman Alosaimy and Eric Atwell |
Web-based Annotation Tool for Inflectional Language Resources
|
9.45-11.25 |
Fabian Barteld, Sarah Ihden, Katharina Dreessen and Ingrid Schröder |
HiNTS: A Tagset for Middle Low German
|
9.45-11.25 |
Steven Neale, Kevin Donnelly, Gareth Watkins and Dawn Knight |
Leveraging Lexical Resources and Constraint Grammar for Rule-Based Part-of-Speech Tagging in Welsh
|
9.45-11.25 |
Mokanarangan Thayaparan, Surangika Ranathunga and Uthayasanker Thayasivam |
Graph Based Semi-Supervised Learning Approach for Tamil POS tagging
|
|
Session P53 - Conversational Systems/Dialogue/Chatbots/Human-Robot Interaction (3) |
Chair: Kalika Bali |
11.45-13.25 |
Juliana Miehle, Wolfgang Minker and Stefan Ultes |
What Causes the Differences in Communication Styles? A Multicultural Study on Directness and Elaborateness
|
11.45-13.25 |
Patrik Jonell, Mattias Bystedt, Per Fallgren, Dimosthenis Kontogiorgos, José Lopes, Zofia Malisz, Samuel Mascarenhas, Catharine Oertel, Eran Raveh and Todd Shore |
FARMI: A FrAmework for Recording Multi-Modal Interactions
|
11.45-13.25 |
Kazuki Sakai, Akari Inago, Ryuichiro Higashinaka, Yuichiro Yoshikawa, Hiroshi Ishiguro and Junji Tomita |
Creating Large-Scale Argumentation Structures for Dialogue Systems
|
11.45-13.25 |
Marilyn Walker, Albry Smither, Shereen Oraby, Vrindavan Harrison and Hadar Shemtov |
Exploring Conversational Language Generation for Rich Content about Hotels
|
11.45-13.25 |
Sarah Fillwock and David Traum |
Identification of Personal Information Shared in Chat-Oriented Dialogue
|
11.45-13.25 |
Thi Lan Ngo, Pham Khac Linh and Takeda Hideaki |
A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard
|
11.45-13.25 |
Nishitha Guntakandla and Nielsen Rodney |
Annotating Reflections for Health Behavior Change Therapy
|
|
Session P54 - Discourse Annotation, Representation and Processing (2) |
Chair: Bruno Cartoni |
11.45-13.25 |
Amal Alsaif, Tasniem Alyahya, Madawi Alotaibi, Huda Almuzaini and Abeer Algahtani |
Annotating Attribution Relations in Arabic
|
11.45-13.25 |
Emer Gilmartin, Christian Saam, Brendan Spillane, Maria O'Reilly, Ketong Su, Arturo Calvo, Loredana Cerrato, Killian Levacher, Nick Campbell and Vincent Wade |
The ADELE Corpus of Dyadic Social Text Conversations:Dialog Act Annotation with ISO 24617-2
|
11.45-13.25 |
Deniz Zeyrek and Murathan Kurfalı |
An Assessment of Explicit Inter- and Intra-sentential Discourse Connectives in Turkish Discourse Bank
|
11.45-13.25 |
Alicia Burga, Monica Dominguez, Mireia Farrús and Leo Wanner |
Compilation of Corpora for the Study of the Information Structure–Prosody Interface
|
11.45-13.25 |
Rui Sakaida, Ryosaku Makino and Mayumi Bono |
Preliminary Analysis of Embodied Interactions between Science Communicators and Visitors Based on a Multimodal Corpus of Japanese Conversations in a Science Museum
|
11.45-13.25 |
Yudai Kishimoto, Shinnosuke Sawada, Yugo Murawaki, Daisuke Kawahara and Sadao Kurohashi |
Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations
|
11.45-13.25 |
Azadeh Mirzaei and Pegah Safari |
Persian Discourse Treebank and coreference corpus
|
11.45-13.25 |
Yuanliang Meng, Anna Rumshisky and Florence Sullivan |
Automatic Labeling of Problem-Solving Dialogues for Computational Microgenetic Learning Analytics
|
11.45-13.25 |
Milagro Teruel, Cristian Cardellino, Fernando Cardellino, Laura Alonso Alemany and Serena Villata |
Increasing Argument Annotation Reproducibility by Using Inter-annotator Agreement to Improve Guidelines
|
|
Session P57 - Opinion Mining / Sentiment Analysis (3) |
Chair: Rodrigo Agerri |
11.45-13.25 |
Henrico Brum and Maria das Graças Volpe Nunes |
Building a Sentiment Corpus of Tweets in Brazilian Portuguese
|
11.45-13.25 |
Gavin Abercrombie and Riza Batista-Navarro |
'Aye' or 'No'? Speech-level Sentiment Analysis of Hansard UK Parliamentary Debate Transcripts
|
11.45-13.25 |
Jon Chamberlain, Udo Kruschwitz and Orland Hoeber |
Scalable Visualisation of Sentiment and Stance
|
11.45-13.25 |
Erik Velldal, Lilja Øvrelid, Eivind Alexander Bergem, Cathrine Stadsnes, Samia Touileb and Fredrik Jørgensen |
NoReC: The Norwegian Review Corpus
|
11.45-13.25 |
Jacobo Rouces, Nina Tahmasebi, Lars Borin and Stian Rødven Eide |
SenSALDO: Creating a Sentiment Lexicon for Swedish
|
11.45-13.25 |
Daniel Peñaloza, Juanjosé Tenorio, Rodrigo López, Héctor Gomez, Arturo Oncevay and Marco Antonio Sobrevilla Cabezudo |
Corpus Building and Evaluation of Aspect-based Opinion Summaries from Tweets in Spanish
|
11.45-13.25 |
Alessandra Teresa Cignarella, Cristina Bosco, Viviana Patti and Mirko Lai |
Application and Analysis of a Multi-layered Scheme for Irony on the Italian Twitter Corpus TWITTIRÒ
|
11.45-13.25 |
Jan Kocoń, Arkadiusz Janz and Maciej Piasecki |
Classifier-based Polarity Propagation in a WordNet
|
|
Session P58 - Sign Language |
Chair: Thomas Hanke |
11.45-13.25 |
Sarah Ebling, Necati Cihan Camgöz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi and Mathew Magimai-Doss |
SMILE Swiss German Sign Language Dataset
|
11.45-13.25 |
Vadim Kimmelman, Anna Klezovich and George Moroz |
IPSL: A Database of Iconicity Patterns in Sign Languages. Creation and Use
|
11.45-13.25 |
Shi Yu, Carlo Geraci and Natasha Abner |
Sign Languages and the Online World Online Dictionaries & Lexicostatistics
|
11.45-13.25 |
Michael Filhol and Mohamed Nassime Hadjadj |
Elicitation protocol and material for a corpus of long prepared monologues in Sign Language
|
11.45-13.25 |
Heike Brock and Kazuhiro Nakadai |
Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions
|
11.45-13.25 |
Mohamed nassime Hadjadj, Michael Filhol and Annelies Braffort |
Modeling French Sign Language: a proposal for a semantically compositional system
|
|
Session P59 - Speech Resource/Database (2) |
Chair: Christoph Draxler |
11.45-13.25 |
Hanae Koiso, Yasuharu Den, Yuriko Iseki, Wakako Kashino, Yoshiko Kawabata, Ken'ya Nishikawa, Yayoi Tanaka and Yasuyuki Usuda |
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report
|
11.45-13.25 |
Corine Astésano, Mathieu Balaguer, Jérôme Farinas, Corinne Fredouille, Pascal Gaillard, Alain Ghio, Imed Laaridh, Muriel Lalain, Benoît Lepage, Julie Mauclair, Olivier Nocaudie, Julien Pinquier, Oriol Pont, Gilles Pouchoulin, Michèle Puech, Danièle Robert, Etienne Sicard and Virginie Woisard |
Carcinologic Speech Severity Index Project: A Database of Speech Disorder Productions to Assess Quality of Life Related to Speech After Cancer
|
11.45-13.25 |
Annie Rialland, Martine Adda-Decker, Guy-Noël Kouarata, Gilles Adda, Laurent Besacier, Lori Lamel, Elodie Gauthier, Pierre Godard and Jamison Cooper-Leavitt |
Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville)
|
11.45-13.25 |
Arif Khan, Ingmar Steiner, Yusuke Sugano, Andreas Bulling and Ross Macdonald |
A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
|
11.45-13.25 |
Zhongxi Cai, Koichiro Ryu and Shigeki Matsubara |
Statistical Analysis of Missing Translation in Simultaneous Interpretation Using A Large-scale Bilingual Speech Corpus
|
11.45-13.25 |
Aghilas SINI, Damien Lolive, Gaëlle Vidal, Marie Tahon and Élisabeth Delais-Roussarie |
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis.
|
11.45-13.25 |
Piotr Pęzik
|
Increasing the Accessibility of Time-Aligned Speech Corpora with Spokes Mix
|
11.45-13.25 |
Cécile Fougeron, Veronique Delvaux, Lucie Ménard and Marina Laganaro |
The MonPaGe_HA Database for the Documentation of Spoken French Throughout Adulthood
|
11.45-13.25 |
Per Fallgren, Zofia Malisz and Jens Edlund |
Bringing Order to Chaos: A Non-Sequential Approach for Browsing Large Sets of Found Audio Data
|
11.45-13.25 |
Robert Herms, Maria Wirzberger, Maximilian Eibl and Günter Daniel Rey |
CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task
|
11.45-13.25 |
Jennifer Tracey and Stephanie Strassel |
VAST: A Corpus of Video Annotation for Speech Technologies
|
|
Session P60 - Corpus Creation, Annotation, Use (2) |
Chair: Beatrice Daille |
14.45-16.05 |
Ramesh Manuvinakurike, Jacqueline Brixey, Trung Bui, Walter Chang, Doo Soon Kim, Ron Artstein and Kallirroi Georgila |
Edit me: A Corpus and a Framework for Understanding Natural Language Image Editing
|
14.45-16.05 |
Anna Feltracco, Elisabetta Jezek and Bernardo Magnini |
Enriching a Lexicon of Discourse Connectives with Corpus-based Data
|
14.45-16.05 |
Carolina Scarton, Gustavo Paetzold and Lucia Specia |
SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain
|
14.45-16.05 |
Jorge Alberto Wagner Filho, Rodrigo Wilkens, Marco Idiart and Aline Villavicencio |
The brWaC Corpus: A New Open Resource for Brazilian Portuguese
|
14.45-16.05 |
Pavel Kral and Ladislav Lenc |
Czech Text Document Corpus v 2.0
|
14.45-16.05 |
Lydia Müller, Uwe Quasthoff and Maciej Sumalvico |
Corpora of Typical Sentences
|
14.45-16.05 |
Marc Kupietz, Harald Lüngen, Pawel Kamocki and Andreas Witt |
The German Reference Corpus DeReKo: New Developments – New Opportunities
|
14.45-16.05 |
Steinþór Steingrímsson, Sigrún Helgadóttir, Eiríkur Rögnvaldsson, Starkaður Barkarson and Jon Gudnason |
Risamálheild: A Very Large Icelandic Text Corpus
|
|
Session P61 - Lexicon (3) |
Chair: John McCrae |
14.45-16.05 |
Federica Vezzani, Giorgio Maria Di Nunzio and Geneviève Henrot |
TriMED: A Multilingual Terminological Database
|
14.45-16.05 |
Sonja Bosch, Thomas Eckart, Bettina Klimek, Dirk Goldhahn and Uwe Quasthoff |
Preparation and Usage of Xhosa Lexicographical Data for a Multilingual, Federated Environment
|
14.45-16.05 |
Amália Mendes, Iria Del Río Gayo, Manfred Stede and Felix Dombek |
A Lexicon of Discourse Markers for Portuguese – LDM-PT
|
14.45-16.05 |
Fahad Khan, Andrea Bellandi, Francesca Frontini and Monica Monachini |
One Language to rule them all: modelling Morphological Patterns in a Large Scale Italian Lexicon with SWRL
|
14.45-16.05 |
Gerard De Melo
|
Metaphor Suggestions based on a Semantic Metaphor Repository
|
14.45-16.05 |
Aleksander Wawer and Justyna Sarzyńska |
The Linguistic Category Model in Polish (LCM-PL)
|
14.45-16.05 |
Diego Maguiño Valencia, Arturo Oncevay and Marco Antonio Sobrevilla Cabezudo |
WordNet-Shp: Towards the Building of a Lexical Database for a Peruvian Minority Language
|
14.45-16.05 |
Alexsandro Fonseca, Fatiha Sadat and François Lareau |
Retrieving Information from the French Lexical Network in RDF/OWL Format
|
|
Session P62 - Named Entity Recognition |
Chair: Gilles Francopoulo |
14.45-16.05 |
Abbas Ghaddar and Phillippe Langlais |
Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus
|
14.45-16.05 |
Halidanmu Abudukelimu, Adudoukelimu Abulizi, Boliang Zhang, Xiaoman Pan, Di Lu, Heng Ji and Yang Liu |
Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges
|
14.45-16.05 |
Hanieh Poostchi, Ehsan Zare Borzeshi and Massimo Piccardi |
BiLSTM-CRF for Persian Named-Entity Recognition ArmanPersoNERCorpus: the First Entity-Annotated Persian Dataset
|
14.45-16.05 |
Juyeon Kang and Jungyeul Park |
Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
|
14.45-16.05 |
Martin Schiersch, Veselina Mironova, Maximilian Schmitt, Philippe Thomas, Aleksandra Gabryszak and Leonhard Hennig |
A German Corpus for Fine-Grained Named Entity Recognition and Relation Extraction of Traffic and Industry Events
|
14.45-16.05 |
Saskia Schön, Veselina Mironova, Aleksandra Gabryszak and Leonhard Hennig |
A Corpus Study and Annotation Schema for Named Entity Recognition and Relation Extraction of Business Products
|
14.45-16.05 |
Juliana Pirovani and Elias Oliveira |
Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars
|
14.45-16.05 |
Qi Lu, YaoSheng Yang, Zhenghua Li, Wenliang Chen and Min Zhang |
M-CNER: A Corpus for Chinese Named Entity Recognition in Multi-Domains
|
14.45-16.05 |
Kevin Bowden, Jiaqi Wu, Shereen Oraby, Amita Misra and Marilyn Walker |
SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems
|
14.45-16.05 |
Ji Young Lee, Franck Dernoncourt and Peter Szolovits |
Transfer Learning for Named-Entity Recognition with Neural Networks
|
|
Session P63 - Parsing, Syntax, Treebank (2) |
Chair: Jan Hajič |
14.45-16.05 |
Marie Mikulová and Eduard Bejček |
ForFun 1.0: Prague Database of Forms and Functions -- An Invaluable Resource for Linguistic Research
|
14.45-16.05 |
Lilja Øvrelid, Andre Kåsen, Kristin Hagen, Anders Nøklestad, Per Erik Solberg and Janne Bondi Johannessen |
The LIA Treebank of Spoken Norwegian Dialects
|
14.45-16.05 |
Guillaume Wisniewski
|
Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project
|
14.45-16.05 |
Shubham Bhardwaj, Neelamadhav Gantayat, Nikhil Chaturvedi, Rahul Garg and Sumeet Agarwal |
SandhiKosh: A Benchmark Corpus for Evaluating Sanskrit Sandhi Tools
|
14.45-16.05 |
Vincent Kríž and Barbora Hladka |
Czech Legal Text Treebank 2.0
|
14.45-16.05 |
Normunds Gruzitis, Lauma Pretkalnina, Baiba Saulite, Laura Rituma, Gunta Nespore-Berzkalne, Arturs Znotins and Peteris Paikens |
Creation of a Balanced State-of-the-Art Multilayer Corpus for NLU
|
14.45-16.05 |
Manjuan Duan and William Schuler |
Test Sets for Chinese Nonlocal Dependency Parsing
|
14.45-16.05 |
Sebastien Delecraz, Alexis Nasr, FREDERIC BECHET and Benoit Favre |
Adding Syntactic Annotations to Flickr30k Entities Corpus for Multimodal Ambiguous Prepositional-Phrase Attachment Resolution
|
14.45-16.05 |
Christian Chiarcos, Benjamin Kosmehl, Christian Fäth and Maria Sukhareva |
Analyzing Middle High German Syntax with RDF and SPARQL
|
14.45-16.05 |
Djamé Seddah, Eric De La Clergerie, Benoît Sagot, Héctor Martínez Alonso and Marie Candito |
Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer
|
14.45-16.05 |
Chiara Alzetta, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi |
Universal Dependencies and Quantitative Typological Trends. A Case Study on Word Order
|
|
Session P64 - Wordnets and Ontologies |
Chair: Elena Montiel-Ponsoda |
14.45-16.05 |
Koki Washio and Tsuneaki Kato |
Undersampling Improves Hypernymy Prototypicality Learning
|
14.45-16.05 |
Vanya Dimitrova, Christian Fäth, Christian Chiarcos, Heike Renner-Westermann and Frank Abromeit |
Interoperability of Language-related Information: Mapping the BLL Thesaurus to Lexvo and Glottolog
|
14.45-16.05 |
António Branco, Ruben Branco, Chakaveh Saedi and João Silva |
Browsing and Supporting Pluricentric Global Wordnet, or just your Wordnet of Interest
|
14.45-16.05 |
Javier Alvez, Itziar Gonzalez-Dios and German Rigau |
Cross-checking WordNet and SUMO Using Meronymy
|
14.45-16.05 |
Wei-Yun Ma and Yueh-Yin Shih |
Extended HowNet 2.0 – An Entity-Relation Common-Sense Representation Model
|
14.45-16.05 |
Roxane Segers, Tommaso Caselli and Piek Vossen |
The Circumstantial Event Ontology (CEO) and ECB+/CEO: an Ontology and Corpus for Implicit Causal Relations between Events
|
14.45-16.05 |
Mahmoud El-Haj, Paul Rayson, Scott Piao and Jo Knight |
Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger
|
14.45-16.05 |
Felix Gervits and Matthias Scheutz |
Towards a Conversation-Analytic Taxonomy of Speech Overlap
|
14.45-16.05 |
Diptesh Kanojia, Kevin Patel and Pushpak Bhattacharyya |
Indian Language Wordnets and their Linkages with Princeton WordNet
|
|
|