The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

PROGRAMME


For each paper, the video presentation plus the slides and the poster are available when provided by the author(s).

Wednesday, 22 May 2024

 Day 1
09:00 - 10:40 Opening Session
[Video]
 Address by General Chairs
Nicoletta Calzolari & Min-Yen Kan
 Address by ICCL President
Junichi Tsujii
 Address by ELRA President
Simon Krek
 Address by ELRA Secretary General & ELDA CEO
Khalid Choukri
 Address by Program Chairs
Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
 Address by Local Chairs
Valerio Basile, Cristina Bosco, Viviana Patti
10:40 - 11:00 Coffee break
D1-S2-R1 - Corpora and Annotation I (Chair: Siyao Peng)
11:00 - 11:20 Geographically-Informed Language Identification
[Slides] [Video]
Jonathan Dunn and Lane Edwards-Brown
11:20 - 11:40 Emotags: Computer-Assisted Verbal Labelling of Expressive Audiovisual Utterances for Expressive Multimodal TTS
[Slides] [Video]
Gérard Bailly, Romain Legrand, Martin Lenglet, Frédéric Elisei, Maëva Hueber and Olivier Perrotin
11:40 - 12:00 Data Collection Pipeline for Low-Resource Languages: A Case Study on Constructing a Tetun Text Corpus
[Slides] [Video]
Gabriel de Jesus and Sérgio Sobral Nunes
12:00 - 12:20 GlotScript: A Resource and Tool for Low Resource Writing System Identification
[Slides] [Video]
Amir Hossein Kargaran, François Yvon and Hinrich Schütze
12:20 - 12:40 Collecting Linguistic Resources for Assessing Children’s Pronunciation of Nordic Languages
[Video]
Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Tamás Grósz, Xinwei Cao, Torbjørn Svendsen and Giampiero Salvi
D1-S2-R2 - Applications Involving LRs and Evaluation I (Chair: David Adelani)
11:00 - 11:20 Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents
[Slides] [Video]
Ramona Christen, Anastassia Shaitarova, Matthias Stürmer and Joel Niklaus
11:20 - 11:40 Qsnail: A Questionnaire Dataset for Sequential Question Generation
[Slides] [Video]
Yan Lei, Liang Pang, Yuanzhuo Wang, Huawei Shen and Xueqi Cheng
11:40 - 12:00 Self-reported Demographics and Discourse Dynamics in a Persuasive Online Forum
[Slides] [Video]
Agnieszka Falenska, Eva Maria Vecchi and Gabriella Lapesa
12:00 - 12:20 Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation
[Slides] [Video]
Sugyeong Eo, Jungwoo Lim, Chanjun Park, DaHyun Jung, Seonmin Koo, Hyeonseok Moon, Jaehyung Seo and Heuiseok Lim
12:20 - 12:40 OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement
[Video]
Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon and Donald Metzler
D1-S2-R3 - Natural Language Generation, Summarization and Simplification I (Chair: David Traum)
11:00 - 11:20 SciNews: From Scholarly Complexities to Public Narratives – a Dataset for Scientific News Report Generation
[Slides] [Video]
Dongqi Pu, Yifan Wang, Jia E. Loy and Vera Demberg
11:20 - 11:40 LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
[Slides] [Video]
Jennifer A. Bishop, Sophia Ananiadou and Qianqian Xie
11:40 - 12:00 Enhancing Court View Generation with Knowledge Injection and Guidance
[Slides] [Video]
Ang Li, Yiquan Wu, Yifei Liu, Kun Kuang, Fei Wu and Ming Cai
12:00 - 12:20 Diversifying Question Generation over Knowledge Base via External Natural Questions
[Slides] [Video]
Shasha Guo, Jing Zhang, Xirui Ke, Cuiping Li and Hong Chen
12:20 - 12:40 EROS: Entity-Driven Controlled Policy Document Summarization
[Slides] [Video]
Joykirat Singh, Sehban Fazili, Rohan Jain and Md. Shad Akhtar
D1-S2-R4 - Knowledge Discovery / Representation II (Chair: Ruochen Zhang)
11:00 - 11:20 Few-shot Link Prediction on Hyper-relational Facts
[Video]
Jiyao Wei, Saiping Guan, Xiaolong Jin, Jiafeng Guo and Xueqi Cheng
11:20 - 11:40 EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs
[Video]
Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song and Zheng Zhang
11:40 - 12:00 RENN: A Rule Embedding Enhanced Neural Network Framework for Temporal Knowledge Graph Completion
[Slides] [Video]
Linlin Zong, Zhenrong Xie, Chi Ma, Xinyue Liu, Xianchao Zhang and Bo Xu
12:00 - 12:20 CMNEE: A Large-Scale Document-Level Event Extraction Dataset Based on Open-Source Chinese Military News
[Slides] [Video]
Mengna Zhu, Zijie Xu, Kaisheng Zeng, Kaiming Xiao, Mao Wang, Wenjun Ke and Hongbin Huang
12:20 - 12:40 From Linguistic Linked Data to Big Data
[Slides] [Video]
Dimitar Trajanov, Elena Apostol, Radovan Garabik, Katerina Gkirtzou, Dagmar Gromann, Chaya Liebeskind, Cosimo Palma, Michael Rosner, Alexia Sampri, Gilles Sérasset, Blerina Spahiu, Ciprian-Octavian Truică and Giedre Valunaite Oleskeviciene
D1-S2-R5 - Multilinguality, Machine Translation, and Translation Aids I (Chair: Chenhui Chu)
11:00 - 11:20 TAeKD: Teacher Assistant Enhanced Knowledge Distillation for Closed-Source Multilingual Neural Machine Translation
[Video]
Bo Lv, Xin Liu, Kaiwen Wei, Ping Luo and Yue Yu
11:20 - 11:40 Teaching Large Language Models to Translate on Low-resource Languages with Textbook Prompting
[Video]
Ping Guo, Yubing Ren, Yue Hu, Yunpeng Li, Jiarui Zhang, Xingsheng Zhang and Heyan Huang
11:40 - 12:00 CORI: CJKV Benchmark with Romanization Integration - a Step towards Cross-lingual Transfer beyond Textual Scripts
[Slides] [Video]
Hoang Nguyen, Chenwei Zhang, Ye Liu, Natalie Parde, Eugene Rohrbaugh and Philip S. Yu
12:00 - 12:20 Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems
[Slides] [Video]
Bo-Han Lu, Yi-Hsuan Lin, Annie Lee and Richard Tzong-Han Tsai
12:20 - 12:40 Humanistic Buddhism Corpus: A Challenging Domain-Specific Dataset of English Translations for Classical and Modern Chinese
[Video]
Youheng W. Wong, Natalie Parde and Erdem Koyuncu
D1-S2-R6 - Offensive and Harmful Language Detection and Analysis (Chair: Preslav Nakov)
11:00 - 11:20 Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean
[Slides] [Video]
Seungyoon Lee, Chanjun Park, DaHyun Jung, Hyeonseok Moon, Jaehyung Seo, Sugyeong Eo and Heuiseok Lim
11:20 - 11:40 Enhance Robustness of Language Models against Variation Attack through Graph Integration
Zi Xiong, Lizhi Qing, Yangyang Kang, Jiawei Liu, Hongsong Li, Changlong Sun, Xiaozhong Liu and Wei Lu
11:40 - 12:00 Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
[Slides] [Video]
Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof and Svetlana Kiritchenko
12:00 - 12:20 Humans Need Context, What about Machines? Investigating Conversational Context in Abusive Language Detection
[Slides] [Video]
Tom Bourgeade, Zongmin Li, Farah Benamara, Véronique Moriceau, Jian Su and Aixin Sun
12:20 - 12:40 PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets
[Video]
Arianna Muti, Federico Ruggeri, Cagri Toraman, Alberto Barrón-Cedeño, Samuel Algherini, Lorenzo Musetti, Silvia Ronchi, Gianmarco Saretto and Caterina Zapparoli
11:00 - 12:40 D1-S2-P1 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics I (Chair: Sara Tonelli)
Evaluating Prompting Strategies for Grammatical Error Correction Based on Language Proficiency
[Poster] [Slides] [Video]
Min Zeng, Jiexin Kuang, Mengyang Qiu, Jayoung Song and Jungyeul Park
Principal Component Analysis as a Sanity Check for Bayesian Phylolinguistic Reconstruction
[Poster] [Video]
Yugo Murawaki
An Argument for Symmetric Coordination from Dependency Length Minimization: A Replication Study
[Poster] [Slides] [Video]
Adam Przepiórkowski, Magdalena Borysiak and Adam Głowacki
The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer’s Disease Detection from Spontaneous Speech
[Slides] [Video]
Jonathan Heitz, Gerold Schneider and Nicolas Langer
Multimodal Language Models Show Evidence of Embodied Simulation
[Poster] [Video]
Cameron R. Jones and Sean Trott
Task-Oriented Paraphrase Analytics
[Slides] [Video]
Marcel Gohsen, Matthias Hagen, Martin Potthast and Benno Stein
Do Neural Language Models Inferentially Compose Concepts the Way Humans Can?
[Poster] [Video]
Amilleah Rodriguez, Shaonan Wang and Liina Pylkkänen
The Contextual Variability of English Nouns: The Impact of Categorical Specificity beyond Conceptual Concreteness
[Slides] [Video]
Giulia Rambelli and Marianna Bolognesi
Towards Comprehensive Language Analysis for Clinically Enriched Spontaneous Dialogue
[Slides] [Video]
Baris Karacan, Ankit Aich, Avery Quynh, Amy Pinkham, Philip Harvey, Colin Depp and Natalie Parde
Context Shapes Emergent Communication about Concepts at Different Levels of Abstraction
[Slides] [Video]
Kristina Kobrock, Xenia Isabel Ohmer, Elia Bruni and Nicole Gotzner
A Study on How Attention Scores in the BERT Model Are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark
[Poster] [Slides] [Video]
Dongjun Jang, Sungjoo Byun and Hyopil Shin
Fine-grained Classification of Circumstantial Meanings within the Prague Dependency Treebank Annotation Scheme
[Poster] [Video]
Marie Mikulova
C-Journal: A Journaling Application for Detecting and Classifying Cognitive Distortions Using Deep-Learning Based on a Crowd-sourced Dataset
[Slides] [Video]
Nada Elsharawi and Alia El Bolock
11:00 - 12:40 D1-S2-P1 - Digital Humanities and Cultural Heritage I (Chair: Sara Tonelli)
Towards Building the LEMI Readability Platform for Children’s Literature in the Romanian Language
[Slides] [Video]
Madalina Chitez, Mihai Dascalu, Aura Cristina Udrea, Cosmin Strilețchi, Karla Csürös, Roxana Rogobete and Alexandru Oravițan
Re-evaluating the Tomes for the Times
[Slides] [Video]
Ryan Brate, Marieke van Erp and Antal van den Bosch
Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach
[Poster] [Slides] [Video]
Siyu Duan, Jun Wang and Qi Su
The Swedish Parliament Corpus 1867 – 2022
[Slides] [Video]
Väinö Aleksi Yrjänäinen, Fredrik Mohammadi Norén, Robert Borges, Johan Jarlbrink, Lotta Åberg Brorsson, Anders P. Olsson, Pelle Snickars and Måns Magnusson
Introducing a Parsed Corpus of Historical High German
[Poster] [Slides] [Video]
Christopher D. Sapp, Elliott Evans, Rex Sprouse and Daniel Dakota
A Computational Analysis of the Dehumanisation of Migrants from Syria and Ukraine in Slovene News Media
[Poster] [Slides] [Video]
Jaya Caporusso, Damar Hoogland, Mojca Brglez, Boshko Koloski, Matthew Purver and Senja Pollak
Lemmatisation of Medieval Greek: Against the Limits of Transformer’s Capabilities?
[Video]
Colin Swaelens, Pranaydeep Singh, Ilse de Vos and Els Lefever
Revisiting the Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems
[Video]
Aditya Narayan Sankaran, Vigneshwaran Shankaran, Sampath Lonka and Rajesh Sharma
Mapping the Past: Geographically Linking an Early 20th Century Swedish Encyclopedia with Wikidata
[Video]
Axel Ahlin, Alfred Myrne Blåder and Pierre Nugues
A Dataset for Named Entity Recognition and Entity Linking in Chinese Historical Newspapers
[Video]
Baptiste Blouin, Cécile Armand and Christian Henriot
Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles
[Slides] [Video]
Agnieszka Karlinska, Cezary Rosiński, Marek Kubis, Patryk Hubar and Jan Wieczorek
A Matter of Perspective: Building a Multi-Perspective Annotated Dataset for the Study of Literary Quality
[Slides] [Video]
Yuri Bizzoni, Pascale Feldkamp Moreira, Ida Marie S. Lassen, Mads Rosendahl Thomsen and Kristoffer Nielbo
Deciphering Emotional Landscapes in the Iliad: A Novel French-Annotated Dataset for Emotion Recognition
[Video]
Davide Picca and John Pavlopoulos
Linguistic Survey of India and Polyglotta Africana: Two Retrostandardized Digital Editions of Large Historical Collections of Multilingual Wordlists
[Slides] [Video]
Robert Forkel, Johann-Mattis List, Christoph Rzymski and Guillaume Segerer
11:00 - 12:40 D1-S2-P1 - Discourse and Pragmatics (Chair: Sara Tonelli)
How to Do Politics with Words: Investigating Speech Acts in Parliamentary Debates
[Slides] [Video]
Ines Reinig, Ines Rehbein and Simone Paolo Ponzetto
Annotating Customer-Oriented Behaviour in Call Centre Sales Dialogues
[Slides] [Video]
Jutta Stock, Volha Petukhova and Dietrich Klakow
Developing a Rhetorical Structure Theory Treebank for Czech
[Slides] [Video]
Lucie Polakova, Jiří Mírovský, Šárka Zikánová and Eva Hajicova
Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles
[Poster] [Slides] [Video]
Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Arthur Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard and Nikhil Krishnaswamy
Announcing the Prague Discourse Treebank 3.0
[Poster] [Slides] [Video]
Pavlína Synková, Jiří Mírovský, Lucie Poláková and Magdaléna Rysová
An Empirical Study of Synthetic Data Generation for Implicit Discourse Relation Recognition
[Slides] [Video]
Kazumasa Omura, Fei Cheng and Sadao Kurohashi
Enhancing Unrestricted Cross-Document Event Coreference with Graph Reconstruction Networks
[Slides] [Video]
Loic de Langhe, Orphee de Clercq and Veronique Hoste
Intention and Face in Dialog
[Slides] [Video]
Adil Soubki and Owen Rambow
Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study
[Video]
Yaxin Fan, Feng Jiang, Peifeng Li and Haizhou Li
Cost-Effective Discourse Annotation in the Prague Czech–English Dependency Treebank
[Poster] [Slides] [Video]
Jiří Mírovský, Pavlína Synková, Lucie Polakova and Marie Paclíková
SIGA: A Naturalistic NLI Dataset of English Scalar Implicatures with Gradable Adjectives
[Slides] [Video]
Rashid Nizamani, Sebastian Schuster and Vera Demberg
How Diplomats Dispute: The UN Security Council Conflict Corpus
[Poster] [Slides] [Video]
Karolina Zaczynska, Peter Bourgonje and Manfred Stede
J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution
[Poster] [Slides] [Video]
Nobuhiro Ueda, Hideko Habe, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi and Koichiro Yoshino
Universal Anaphora: The First Three Years
[Slides] [Video]
Massimo Poesio, Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Amir Zeldes, Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský and Daniel Zeman
QCAW 1.0: Building a Qatari Corpus of Student Argumentative Writing
[Slides] [Video]
Wajdi Zaghouani, Abdelhamid Ahmed, Xiao Zhang and Lameya Rezk
DiscoGeM 2.0: A Parallel Corpus of English, German, French and Czech Implicit Discourse Relations
[Slides] [Video]
Frances Yung, Merel Scholman, Sarka Zikanova and Vera Demberg
Experimental versus In-Corpus Variation in Referring Expression Choice
[Slides] [Video]
T. Mark Ellison and Fahime Same
Multilingual Coreference Resolution in Low-resource South Asian Languages
[Slides] [Video]
Ritwik Mishra, Pooja Desur, Rajiv Ratn Shah and Ponnurangam Kumaraguru
Building a Database of Conversational Routines
[Video]
Polina Bychkova, Alyaxey Yaskevich, Serafima Gyulasaryan and Ekaterina Rakhilina
Polish Discourse Corpus (PDC): Corpus Design, ISO-Compliant Annotation, Data Highlights, and Parser Development
[Poster] [Slides] [Video]
Maciej Ogrodniczuk, Aleksandra Tomaszewska, Daniel Ziembicki, Sebastian Żurowski, Ryszard Tuora and Aleksandra Zwierzchowska
Conceptual Pacts for Reference Resolution Using Small, Dynamically Constructed Language Models: A Study in Puzzle Building Dialogues
[Slides] [Video]
Julian Hough, Sina Zarrieß, Casey Kennington, David Schlangen and Massimo Poesio
11:00 - 12:40 D1-S2-P1 - Policy issues, Ethics, Legal Issues, Bias Analysis (Chair: Sara Tonelli)
To Share or Not to Share: What Risks Would Laypeople Accept to Give Sensitive Data to Differentially-Private NLP Systems?
[Poster] [Video]
Christopher Weiss, Frauke Kreuter and Ivan Habernal
Evidence-guided Inference for Neutralized Zero-shot Transfer
[Slides] [Video]
Xiaotong Feng, Meng-Fen Chiang, Wang-Chien Lee and Zixin Kuang
A Canonical Form for Flexible Multiword Expressions
[Poster] [Video]
Jan Odijk
Do Large Language Models Understand Mansplaining? Well, Actually...
[Slides] [Video]
Carla Perez Almendros and Jose Camacho-Collados
RuBia: A Russian Language Bias Detection Dataset
[Slides] [Video]
Veronika Grigoreva, Anastasiia Ivanova, Ilseyar Alimova and Ekaterina Artemova
European Language Grid: One Year after
[Slides] [Video]
Georg Rehm, Stelios Piperidis, Dimitris Galanis, Penny Labropoulou, Maria Giagkou, Miltos Deligiannis, Leon Voukoutis, Martin Courtois, Julian Moreno-Schneider and Katrin Marheinecke
Is Gender Reference Gender-specific? Studies in a Polar Domain
[Slides] [Video]
Manfred Klenner and Dylan Massey
Curation of Benchmark Templates for Measuring Gender Bias in Named Entity Recognition Models
[Poster] [Video]
Ana Cimitan, Ana Alves Pinto and Michaela Geierhos
LinguaMeta: Unified Metadata for Thousands of Languages
[Poster] [Slides] [Video]
Sandy Ritchie, Daan van Esch, Uche Okonkwo, Shikhar Vashishth and Emily Drummond
Quite Good, but Not Enough: Nationality Bias in Large Language Models - a Case Study of ChatGPT
[Video]
Shucheng Zhu, Weikang Wang and Ying Liu
Pseudonymization Categories across Domain Boundaries
[Poster] [Slides] [Video]
Maria Irena Szawerna, Simon Dobnik, Therese Lindström Tiedemann, Ricardo Muñoz Sánchez, Xuan-Son Vu and Elena Volodina
Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language We Prompt Them in
[Poster] [Slides] [Video]
Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal and Monojit Choudhury
ABLE: Agency-BeLiefs Embedding to Address Stereotypical Bias through Awareness Instead of Obliviousness
[Poster] [Slides] [Video]
Michelle YoungJin Kim, Junghwan Kim and Kristen Johnson
Language Technologies as If People Mattered: Centering Communities in Language Technology Development
[Slides] [Video]
Nina Markl, Lauren Hall-Lew and Catherine Lai
Your Stereotypical Mileage May Vary: Practical Challenges of Evaluating Biases in Multiple Languages and Cultural Contexts
[Slides] [Video]
Karen Fort, Laura Alonso Alemany, Luciana Benotti, Julien Bezançon, Claudia Borg, Marthese Borg, Yongjian Chen, Fanny Ducel, Yoann Dupont, Guido Ivetta, Zhijian Li, Margot Mieskes, Marco Naguib, Yuyan Qian, Matteo Radaelli, Wolfgang S. Schmeisser-Nieto, Emma Raimundo Schulz, Thiziri Saci, Sarah Saidi, Javier Torroba Marchante, Shilin Xie, Sergio E. Zanotto and Aurélie Névéol
Large Language Models Are Echo Chambers
[Poster] [Slides] [Video]
Jan Nehring, Aleksandra Gabryszak, Pascal Jürgens, Aljoscha Burchardt, Stefan Schaffer, Matthias Spielkamp and Birgit Stark
Common European Language Data Space
[Slides] [Video]
Georg Rehm, Stelios Piperidis, Khalid Choukri, Andrejs Vasiļjevs, Katrin Marheinecke, Victoria Arranz, Aivars Bērziņš, Miltos Deligiannis, Dimitris Galanis, Maria Giagkou, Katerina Gkirtzou, Dimitris Gkoumas, Annika Grützner-Zahn, Athanasia Kolovou, Penny Labropoulou, Andis Lagzdiņš, Elena Leitner, Valérie Mapelli, Hélène Mazo, Simon Ostermann, Stefania Racioppa, Mickaël Rigault and Leon Voukoutis
A Luxembourgish Corpus as a Gender Bias Evaluation Testset
[Video]
Dimitra Anastasiou, Carole Blond-Hanten and Marie Gallais
Impoverished Language Technology: The Lack of (Social) Class in NLP
[Slides] [Video]
Amanda Cercas Curry, Zeerak Talat and Dirk Hovy
Are Text Classifiers Xenophobic? A Country-Oriented Bias Detection Method with Least Confounding Variables
[Slides] [Video]
Valentin Barriere and Sebastian Cifuentes
11:00 - 12:40 D1-S2-P1 - Speech Resources and Processing I (Chair: Sara Tonelli)
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate
[Slides] [Video]
Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nate B. Carlson, Nathaniel Romney Robinson, Mrinmaya Sachan and David R. Mortensen
CoANZSE Audio: Creation of an Online Corpus for Linguistic and Phonetic Analysis of Australian and New Zealand Englishes
[Video]
Steven Coats
Samrómur Milljón: An ASR Corpus of One Million Verified Read Prompts in Icelandic
[Poster] [Slides] [Video]
Carlos Daniel Hernandez Mena, Þorsteinn Daði Gunnarsson and Jon Gudnason
Becoming a High-Resource Language in Speech: The Catalan Case in the Common Voice Corpus
[Poster] [Slides] [Video]
Carme Armentano-Oller, Montserrat Marimon and Marta Villegas
TunArTTS: Tunisian Arabic Text-To-Speech Corpus
[Slides] [Video]
Imen Laouirine, Rami Kammoun and Fethi Bougares
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
[Slides] [Video]
Heyang Liu, Yanfeng Wang and Yu Wang
Correcting Pronoun Homophones with Subtle Semantics in Chinese Speech Recognition
[Slides] [Video]
Zhaobo Zhang, Rui Gan, Pingpeng Yuan and Hai Jin
Evaluating Workflows for Creating Orthographic Transcripts for Oral Corpora by Transcribing from Scratch or Correcting ASR-Output
[Poster] [Slides] [Video]
Jan Gorisch and Thomas Schmidt
Exploring Pathological Speech Quality Assessment with ASR-Powered Wav2Vec2 in Data-Scarce Context
[Poster] [Slides] [Video]
Tuan Nguyen, Corinne Fredouille, Alain Ghio, Mathieu Balaguer and Virginie Woisard
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
[Slides] [Video]
Chloe Sekkat, Fanny Leroy, Salima Mdhaffar, Blake Perry Smith, Yannick Estève, Joseph Dureau and Alice Coucke
TARIC-SLU: A Tunisian Benchmark Dataset for Spoken Language Understanding
[Video]
Salima Mdhaffar, Fethi Bougares, Renato de Mori, Salah Zaiem, Mirco Ravanelli and Yannick Estève
ALLIES: A Speech Corpus for Segmentation, Speaker Diarization, Speech Recognition and Speaker Change Detection
[Video]
Marie Tahon, Anthony Larcher, Martin Lebourdais, Fethi Bougares, Anna Silnova and Pablo Gimeno
ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation
[Slides] [Video]
Zheng Byron Yuan, Dorina de Jong, Ruitao Feng, Štefan Beňuš, Noël Nguyen, Róbert Sabo, Luciano Fadiga and Alessandro D’Ausilio
Code-Mixed Text Augmentation for Latvian ASR
[Slides] [Video]
Martins Kronis, Askars Salimbajevs and Mārcis Pinnis
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness
[Slides] [Video]
Xincan Feng and Akifumi Yoshimoto
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
[Slides] [Video]
Yash Jain, David M. Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran and Shalini Ghosh
12:40 - 13:20 D1-S1-RE1 - Applications Involving LRs and Evaluation I
IndicFinNLP: Financial Natural Language Processing for Indian Languages
[Slides] [Video]
Sohom Ghosh, Arnab Maji, Aswartha Narayana and Sudip Kumar Naskar
User Guide for KOTE: Korean Online That-gul Emotions Dataset
[Video]
Duyoung Jeon, Junho Lee and Cheongtag Kim
Positive and Risky Message Assessment for Music Products
[Slides] [Video]
Yigeng Zhang, Mahsa Shafaei, Fabio Gonzalez and Thamar Solorio
Interpreting Themes from Educational Stories
[Slides] [Video]
Yigeng Zhang, Fabio Gonzalez and Thamar Solorio
12:40 - 13:20 D1-S1-RE1 - Applications Involving LRs and Evaluation II
Towards More Realistic Chinese Spell Checking with New Benchmark and Specialized Expert Model
[Slides] [Video]
Yue Wang, Zilong Zheng, Juntao Li, Zhihui Liu, Jinxiong Chang, Qishen Zhang, Zhongyi Liu, Guannan Zhang and Min Zhang
AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports
[Slides] [Video]
Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Chandra Pujari and Annemarie Friedrich
Prompt-based Generation of Natural Language Explanations of Synthetic Lethality for Cancer Drug Discovery
[Video]
Ke Zhang, Yimiao Feng and Jie Zheng
Reference-less Analysis of Context Specificity in Translation with Personalised Language Models
[Video]
Sebastian Vincent, Rowanne Sumner, Alice Dowek, Charlotte Prescott, Emily Preston, Chris Bayliss, Chris Oakley and Carolina Scarton
12:40 - 13:20 D1-S1-RE1 - Applications Involving LRs and Evaluation III
WordNet under Scrutiny: Dictionary Examples in the Era of Large Language Models
[Video]
Fatemah Yousef Almeman, Steven Schockaert and Luis Espinosa Anke
Using Persuasive Writing Strategies to Explain and Detect Health Misinformation
[Slides] [Video]
Danial Kamali, Joseph D. Romain, Huiyi Liu, Wei Peng, Jingbo Meng and Parisa Kordjamshidi
Sarcasm Detection in a Disaster Context
[Slides] [Video]
Tiberiu Sosea, Junyi Jessy Li and Cornelia Caragea
MKeCL: Medical Knowledge-Enhanced Contrastive Learning for Few-shot Disease Diagnosis
[Slides] [Video]
Yutian Zhao, Huimin Wang, Xian Wu and Yefeng Zheng
12:40 - 13:20 D1-S1-RE1 - Applications Involving LRs and Evaluation IV
Assessing Online Writing Feedback Resources: Generative AI vs. Good Samaritans
[Slides] [Video]
Shabnam Behzad, Omid Kashefi and Swapna Somasundaran
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
Zehan Li, Jianfei Zhang, Chuantao Yin, Yuanxin Ouyang and Wenge Rong
Tree-Instruct: A Preliminary Study of the Intrinsic Relationship between Complexity and Alignment
[Slides] [Video]
Yingxiu Zhao, Bowen Yu, Binyuan Hui, Haiyang Yu, Minghao Li, Fei Huang, Nevin L. Zhang and Yongbin Li
Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs
[Video]
Fengbin Zhu, Chao Wang, Fuli Feng, Zifeng Ren, Moxin Li and Tat-Seng Chua
12:40 - 13:20 D1-S1-RE1 - Applications Involving LRs and Evaluation V
Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection
[Slides] [Video]
Sourya Dipta Das, Yash A. Vadi and Kuldeep Yadav
Fast Adaptation via Prompted Data: An Efficient Cross-Domain Fine-tuning Method for Large Language Models
Yiming Zhang, Hantao Yang, Haobo Wang and Jake Zhao
Evaluating the Efficacy of Large Acoustic Model for Documenting Non-Orthographic Tribal Languages in India
[Slides] [Video]
Tonmoy Rajkhowa, Amartya Roy Chowdhury, Hrishikesh Ravindra Karande and S. R. Mahadeva Prasanna
Error-Robust Retrieval for Chinese Spelling Check
[Slides] [Video]
Xunjian Yin, Xinyu Hu, Jin Jiang and Xiaojun Wan
12:40 - 13:20 D1-S1-RE2 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics I
Which Sense Dominates Multisensory Semantic Understanding? A Brain Decoding Study
Dandan Huang, Lu Cao, Zhenting Li and Yue Zhang
Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases
[Slides] [Video]
Yuqi Liu, Guanyi Chen and Kees van Deemter
A Quantum-Inspired Matching Network with Linguistic Theories for Metaphor Detection
[Video]
Wenbo Qiao, Peng Zhang and ZengLai Ma
Phonotactic Complexity across Dialects
[Slides] [Video]
Ryan Soh-Eun Shim, Kalvin Chang and David R. Mortensen
12:40 - 13:20 D1-S1-RE2 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics II
Error Analysis of NLP Models and Non-Native Speakers of English Identifying Sarcasm in Reddit Comments
[Slides] [Video]
Oliver Cakebread-Andrews, Le An Ha, Ingo Frommholz and Burcu Can
Context Matters: Enhancing Metaphor Recognition in Proverbs
[Slides] [Video]
Gamze Goren and Carlo Strapparava
NutFrame: Frame-based Conceptual Structure Induction with LLMs
[Slides] [Video]
Shaoru Guo, Yubo Chen, Kang Liu, Ru Li and Jun Zhao
Decoding Probing: Revealing Internal Linguistic Structures in Neural Language Models Using Minimal Pairs
[Video]
Linyang He, Peili Chen, Ercong Nie, Yuanning Li and Jonathan R. Brennan
12:40 - 13:20 D1-S1-RE3 - Corpora and Annotation I
Automatic Data Visualization Generation from Chinese Natural Language Questions
[Slides] [Video]
Yan Ge, Victor Junqiu Wei, Yuanfeng Song, Jason Chen Zhang and Raymond Chi-Wing Wong
Analyzing the Dynamics of Climate Change Discourse on Twitter: A New Annotated Corpus and Multi-Aspect Classification
[Slides] [Video]
Shuvam Shiwakoti, Surendrabikram Thapa, Kritesh Rauniyar, Akshyat Shah, Aashish Bhandari and Usman Naseem
A Corpus and Method for Chinese Named Entity Recognition in Manufacturing
[Slides] [Video]
Ruiting Li, Peiyan Wang, Libang Wang, Danqingxin Yang and Dongfeng Cai
Evaluation Dataset for Lexical Translation Consistency in Chinese-to-English Document-level Translation
[Video]
Xiangyu Lei, Junhui Li, Shimin Tao and Hao Yang
12:40 - 13:20 D1-S1-RE3 - Corpora and Annotation II
Multi-Tiered Cantonese Word Segmentation
[Video]
Charles Lam, Chaak-ming Lau and Jackson L. Lee
MDS: A Fine-Grained Dataset for Multi-Modal Dialogue Summarization
[Slides] [Video]
Zhipeng Liu, Xiaoming Zhang, Litian Zhang and Zelong Yu
MedQA-SWE - a Clinical Question & Answer Dataset for Swedish
[Slides] [Video]
Niclas Hertzberg and Anna Lokrantz
A Persona-Based Corpus in the Diabetes Self-Care Domain - Applying a Human-Centered Approach to a Low-Resource Context
[Slides] [Video]
Rossana Cunha, Thiago Castro Ferreira, Adriana Pagano and Fabio Alves
12:40 - 13:20 D1-S1-RE3 - Corpora and Annotation III
XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates
[Slides] [Video]
Haopeng Zhang, Hayate Iso, Sairam Gurajada and Nikita Bhutani
InferBR: A Natural Language Inference Dataset in Portuguese
[Slides] [Video]
Luciana Bencke, Francielle Vasconcellos Pereira, Moniele Kunrath Santos and Viviane Moreira
Limitations of Human Identification of Automatically Generated Text
[Video]
Nadège Alavoine, Maximin Coavoux, Emmanuelle Esperança-Rodier, Romane Gallienne, Carlos Gonzalez Gallardo, Jérôme Goulian, Jose G. Moreno, Aurélie Névéol, Didier Schwab, Vincent Segonne and Johanna Simoens
UQA: Corpus for Urdu Question Answering
[Slides] [Video]
Samee Arif, Sualeha Farid, Awais Athar and Agha Ali Raza
12:40 - 13:20 D1-S1-RE3 - Corpora and Annotation IV
EPOQUE: An English-Persian Quality Estimation Dataset
[Slides] [Video]
Mohammed Hossein Jafari Harandi, Fatemeh Azadi, Mohammad Javad Dousti and Heshaam Faili
GOLEM: GOld Standard for Learning and Evaluation of Motifs
[Slides] [Video]
W. Victor Yarlott, Anurag Acharya, Diego Castro Estrada, Diana Gomez and Mark Finlayson
Khan Academy Corpus: A Multilingual Corpus of Khan Academy Lectures
[Slides] [Video]
Dominika Ďurišková, Daniela Jurášová, Matúš Žilinec, Eduard Šubert and Ondřej Bojar
PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents
[Video]
Nan Zhang, Connor Heaton, Sean Timothy Okonsky, Prasenjit Mitra and Hilal Ezgi Toraman
12:40 - 13:20 D1-S1-RE3 - Corpora and Annotation V
DiaSet: An Annotated Dataset of Arabic Conversations
[Video]
Abraham Israeli, Aviv Naaman, Guy Maduel, Rawaa Makhoul, Dana Qaraeen, Amir Ejmail, Dina Lisnanskey, Julian Jubran, Shai Fine and Kfir Bar
Reflections & Resonance: Two-Agent Partnership for Advancing LLM-based Story Annotation
[Video]
Yuetian Chen and Mei Si
German Parliamentary Corpus (GerParCor) Reloaded
[Video]
Giuseppe Abrami, Mevlüt Bagci and Alexander Mehler
Murre24: Dialect Identification of Finnish Internet Forum Messages
[Slides] [Video]
Olli Kuparinen
12:40 - 13:20 D1-S1-RE3 - Corpora and Annotation VI
Universal Dependencies for Learner Russian
[Slides] [Video]
Alla Rozovskaya
My Science Tutor (MyST) – a Large Corpus of Children’s Conversational Speech
Sameer Pradhan, Ronald A. Cole and Wayne H. Ward
12:40 - 13:20 D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction I
CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering
[Slides] [Video]
Hongbin Na
Deriving Entity-Specific Embeddings from Multi-Entity Sequences
[Video]
Connor Heaton and Prasenjit Mitra
UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt
[Slides] [Video]
Yucheng Cai, Wentao Ma, Yuchuan Wu, Shuzheng Si, Yuan Shao, Zhijian Ou and Yongbin Li
Deep Reinforcement Learning with Hierarchical Action Exploration for Dialogue Generation
[Slides] [Video]
Itsugun Cho, Ryota Takahashi, Yusaku Yanase and Hiroaki Saito
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction II
Out-of-Domain Intent Detection Considering Multi-Turn Dialogue Contexts
[Slides] [Video]
Hao Lang, Yinhe Zheng, Binyuan Hui, Fei Huang and Yongbin Li
Detection, Diagnosis, and Explanation: A Benchmark for Chinese Medical Hallucination Evaluation
[Video]
Chengfeng Dou, Ying Zhang, Yanyuan Chen, Zhi Jin, Wenpin Jiao, Haiyan Zhao and Yu Huang
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
[Slides] [Video]
Xiangci Li, Linfeng Song, Lifeng Jin, Haitao Mi, Jessica Ouyang and Dong Yu
Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models
[Slides] [Video]
Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Fei Huang and Yongbin Li
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction III
CAMAL: A Novel Dataset for Multi-label Conversational Argument Move Analysis
[Slides] [Video]
Viet Dac Lai, Duy Ngoc Pham, Jonathan Steinberg, Jamie Mikeska and Thien Huu Nguyen
BERT-BC: A Unified Alignment and Interaction Model over Hierarchical BERT for Response Selection
[Slides] [Video]
Zhenfei Yang, Beiming Yu, Yuan Cui, Shi Feng, Daling Wang and Yifei Zhang
EmoTrans: Emotional Transition-based Model for Emotion Recognition in Conversation
[Slides] [Video]
Zhongquan Jian, Ante Wang, Jinsong Su, Junfeng Yao, Meihong Wang and Qingqiang Wu
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
[Slides] [Video]
Xiang Luo, Zhiwen Tang, Jin Wang and Xuejie Zhang
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction IV
EmpCRL: Controllable Empathetic Response Generation via In-Context Commonsense Reasoning and Reinforcement Learning
[Slides] [Video]
Mingxiu Cai, Daling Wang, Shi Feng and Yifei Zhang
Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
[Slides] [Video]
Hossam Zawbaa, Wael Rashwan, Sourav Dutta and Haytham Assem
CLHA: A Simple Yet Effective Contrastive Learning Framework for Human Alignment
[Slides] [Video]
Feiteng Fang, Liang Zhu, Xi Feng, Jinchang Hou, Qixuan Zhao, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang
Seeing Is Believing! Towards Knowledge-Infused Multi-modal Medical Dialogue Generation
[Slides] [Video]
Abhisek Tiwari, Shreyangshu Bera, Preeti Verma, Jaithra Varma Manthena, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar and Sarbajeet Tiwari
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction V
How Susceptible Are LLMs to Logical Fallacies?
[Video]
Amirreza Payandeh, Dan Pluth, Jordan Hosier, Xuesu Xiao and Vijay K. Gurbani
Enhancing Text-to-SQL Capabilities of Large Language Models through Tailored Promptings
[Slides] [Video]
Zhao Tan, Xiping Liu, Qing Shu, Xi Li, Changxuan Wan, Dexi Liu, Qizhi Wan and Guoqiong Liao
New Intent Discovery with Attracting and Dispersing Prototype
[Video]
Shun Zhang, Jian Yang, Jiaqi Bai, Chaoran Yan, Tongliang Li, Zhao Yan and Zhoujun Li
Granular Change Accuracy: A More Accurate Performance Metric for Dialogue State Tracking
[Slides] [Video]
Taha Aksu and Nancy Chen
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction VI
Adding SPICE to Life: Speaker Profiling in Multiparty Conversations
Shivani Kumar, Rishabh Gupta, Md. Shad Akhtar and Tanmoy Chakraborty
Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection
[Slides] [Video]
Pei Wang, Keqing He, Yejie Wang, Xiaoshuai Song, Yutao Mou, Jingang Wang, Yunsen Xian, Xunliang Cai and Weiran Xu
S3Prompt: Instructing the Model with Self-calibration, Self-recall and Self-aggregation to Improve In-context Learning
[Slides] [Video]
Junda Chen and Jianting Liu
ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling
[Video]
Omama Hamad, Khaled Shaban and Ali Hamdi
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction VII
MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking
[Slides] [Video]
Tianwen Tang, Tong Zhu, Haodong Liu, Yin Bai, Jia Cheng and Wenliang Chen
What Are the Implications of Your Question? Non-Information Seeking Question-Type Identification in CNN Transcripts
[Slides] [Video]
Yao Sun, Anastasiia Tatlubaeva, Zhihan Li and Chester Palen-Michel
BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses
[Slides] [Video]
Weihao Zeng, Keqing He, Yejie Wang, Dayuan Fu and Weiran Xu
Exploring the Impact of Human Evaluator Group on Chat-Oriented Dialogue Evaluation
[Video]
Sarah E. Finch, James D. Finch and Jinho D. Choi
12:40 - 13:20D1-S1-RE4 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction VIII
LANID: LLM-assisted New Intent Discovery
[Video]
Lu Fan, Jiashu Pu, Rongsheng Zhang and Xiao-Ming Wu
Beyond Linguistic Cues: Fine-grained Conversational Emotion Recognition via Belief-Desire Modelling
[Slides] [Video]
Bo Xu, Longjiao Li, Wei Luo, Mehdi Naseriparsa, Zhehuan Zhao, Hongfei Lin and Feng Xia
Automatic Coding of Contingency in Child-Caregiver Conversations
[Slides] [Video]
Abhishek Agrawal, Mitja Nikolaus, Benoit Favre and Abdellah Fourtassi
Reranking Overgenerated Responses for End-to-End Task-Oriented Dialogue Systems
[Slides] [Video]
Songbo Hu, Ivan Vulić, Fangyu Liu and Anna Korhonen
12:40 - 13:20D1-S1-RE5 - Digital Humanities and Cultural Heritage
An Unsupervised Framework for Adaptive Context-aware Simplified-Traditional Chinese Character Conversion
[Slides] [Video]
Wei Li, Shutan Huang and Yanqiu Shao
Detecting Sexual Content at the Sentence Level in First Millennium Latin Texts
[Slides] [Video]
Thibault Clerice
Agenda-Driven Question Generation: A Case Study in the Courtroom Domain
[Video]
Yi Fung, Anoop Kumar, Aram Galstyan, Heng Ji and Prem Natarajan
Producing a Parallel Universal Dependencies Treebank of Ancient Hebrew and Ancient Greek via Cross-Lingual Projection
[Slides] [Video]
Daniel G. Swanson, Bryce D. Bussert and Francis Tyers
12:40 - 13:20D1-S1-RE6 - Discourse and Pragmatics
Action and Reaction Go Hand in Hand! A Multi-modal Dialogue Act Aided Sarcasm Identification
[Slides] [Video]
Mohit Singh Tomar, Tulika Saha, Abhisek Tiwari and Sriparna Saha
Global and Local Hierarchical Prompt Tuning Framework for Multi-level Implicit Discourse Relation Recognition
[Slides] [Video]
Lei Zeng, Ruifang He, Haowen Sun, Jing Xu, Chang Liu and Bo Wang
12:40 - 13:20D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval I
Enhanced Facet Generation with LLM Editing
[Slides] [Video]
Joosung Lee and Jinhong Kim
Logic Rules as Explanations for Legal Case Retrieval
[Slides] [Video]
ZhongXiang Sun, Kepu Zhang, Weijie Yu, Haoyu Wang and Jun Xu
NER-guided Comprehensive Hierarchy-aware Prompt Tuning for Hierarchical Text Classification
[Slides] [Video]
Fuhan Cai, Duo Liu, Zhongqiang Zhang, Ge Liu, Xiaozhe Yang and Xiangzhong Fang
Well Begun Is Half Done: An Implicitly Augmented Generative Framework with Distribution Modification for Hierarchical Text Classification
[Slides] [Video]
Huawen Feng, Jingsong Yan, Junlong Liu, Junhao Zheng and Qianli Ma
12:40 - 13:20D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval II
Coarse-Tuning for Ad-hoc Document Retrieval Using Pre-trained Language Models
[Video]
Atsushi Keyaki and Ribeka Keyaki
M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval
[Slides] [Video]
Yang Bai, Anthony Colas, Christan Grant and Zhe Wang
KnowVrDU: A Unified Knowledge-aware Prompt-Tuning Framework for Visually-rich Document Understanding
[Slides] [Video]
Yunqi Zhang, Yubo Chen, Jingzhe Zhu, Jinyu Xu, Shuai Yang, Zhaoliang Wu, Liang Huang, Yongfeng Huang and Shuai Chen
Multimodal Cross-lingual Phrase Retrieval
[Video]
Chuanqi Dong, Wenjie Zhou, Xiangyu Duan, Yuqi Zhang and Min Zhang
12:40 - 13:20D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval III
IDC: Boost Text-to-image Retrieval via Indirect and Direct Connections
[Video]
Guowei Ge, Kuangrong Hao and Lingguang Hao
Pre-training Cross-Modal Retrieval by Expansive Lexicon-Patch Alignment
Yang Yiyuan, Guodong Long, Michael Blumenstein, Xiubo Geng, Chongyang Tao, Tao Shen and Daxin Jiang
Tackling Long Code Search with Splitting, Encoding, and Aggregating
[Slides] [Video]
Fan Hu, Yanlin Wang, Lun Du, Hongyu Zhang, Dongmei Zhang and Xirong Li
ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation
[Slides] [Video]
Sayar Ghosh Roy and Jiawei Han
12:40 - 13:20D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval IV
Recommending Missed Citations Identified by Reviewers: A New Task, Dataset and Baselines
[Video]
Kehan Long, Shasha Li, Pancheng Wang, Chenlong Bao, Jintao Tang and Ting Wang
Searching by Code: A New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
[Slides] [Video]
Ivan Sedykh, Nikita Sorokin, Dmitry Abulkhanov, Sergey I. Nikolenko and Valentin Malykh
ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
[Slides] [Video]
Yuanhang Zheng, Peng Li, Wei Liu, Yang Liu, Jian Luan and Bin Wang
Automatic Authorship Analysis in Human-AI Collaborative Writing
[Video]
Aquia Richburg, Calvin Bao and Marine Carpuat
12:40 - 13:20D1-S1-RE7 - Document Classification, Information Retrieval and Cross-lingual Retrieval V
Knowledge Enhanced Pre-training for Cross-lingual Dense Retrieval
[Video]
Hang Zhang, Yeyun Gong, Dayiheng Liu, Shunyu Zhang, Xingwei He, Jiancheng Lv and Jian Guo
PLAES: Prompt-generalized and Level-aware Learning Framework for Cross-prompt Automated Essay Scoring
[Video]
Yuan Chen and Xia Li
HYRR: Hybrid Infused Reranking for Passage Retrieval
[Slides] [Video]
Jing Lu, Keith Hall, Ji Ma and Jianmo Ni
Event-enhanced Retrieval in Real-time Search
[Slides] [Video]
Yanan Zhang, Xiaoling Bai and Tianhua Zhou
IR2: Information Regularization for Information Retrieval
[Slides] [Video]
Jianyou Wang, Kaicheng Wang, Xiaoyue Wang, Weili Cao, Ramamohan Paturi and Leon Bergen
12:40 - 13:20D1-S1-RE8 - Evaluation and Validation Methodologies I
ChatGPT Rates Natural Language Explanation Quality like Humans: But on Which Scales?
[Slides] [Video]
Fan Huang, Haewoon Kwak, Kunwoo Park and Jisun An
Evaluation of Really Good Grammatical Error Correction
[Slides] [Video]
Robert Östling, Katarina Gillholm, Murathan Kurfalı, Marie Mattson and Mats Wirén
SignBLEU: Automatic Evaluation of Multi-channel Sign Language Translation
[Slides] [Video]
Jung-Ho Kim, Mathew John Huerta-Enochian, Changyong Ko and Du Hui Lee
Keyphrase Generation: Lessons from a Reproducibility Study
[Slides] [Video]
Edwin Thomas and Sowmya Vajjala
12:40 - 13:20D1-S1-RE8 - Evaluation and Validation Methodologies II
Prompting Large Language Models for Counterfactual Generation: An Empirical Study
[Slides] [Video]
Yongqi Li, Mayi Xu, Xin Miao, Shen Zhou and Tieyun Qian
How Good Are LLMs at Out-of-Distribution Detection?
[Video]
Bo Liu, Li-Ming Zhan, Zexin Lu, Yujie Feng, Lei Xue and Xiao-Ming Wu
Is LLM a Reliable Reviewer? A Comprehensive Evaluation of LLM on Automatic Paper Reviewing Tasks
[Video]
Ruiyang Zhou, Lu Chen and Kai Yu
Can Multiple-choice Questions Really Be Useful in Detecting the Abilities of LLMs?
[Slides] [Video]
Wangyue Li, Liangzhi Li, Tong Xiang, Xiao Liu, Wei Deng and Noa Garcia
12:40 - 13:20D1-S1-RE8 - Evaluation and Validation Methodologies III
Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT
[Slides] [Video]
Amirhossein Abaskohi, Sara Baruni, Mostafa Masoudi, Nesa Abbasi, Mohammad Hadi Babalou, Ali Edalat, Sepehr Kamahi, Samin Mahdizadeh Sani, Nikoo Naghavian, Danial Namazifard, Pouya Sadeghi and Yadollah Yaghoobzadeh
LFED: A Literary Fiction Evaluation Dataset for Large Language Models
[Slides] [Video]
Linhao Yu, Qun Liu and Deyi Xiong
REFeREE: A REference-FREE Model-Based Metric for Text Simplification
[Slides] [Video]
Yichen Huang and Ekaterina Kochmar
Who Said What: Formalization and Benchmarks for the Task of Quote Attribution
[Video]
Wenjie Zhong, Jason Naradowsky, Hiroya Takamura, Ichiro Kobayashi and Yusuke Miyao
12:40 - 13:20D1-S1-RE8 - Evaluation and Validation Methodologies IV
Measuring Cross-Text Cohesion for Segmentation Similarity Scoring
[Slides] [Video]
Gerardo Ocampo Diaz and Jessica Ouyang
Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem
[Slides] [Video]
YuHong Sun, Zhangyue Yin, Qipeng Guo, Jiawen Wu, Xipeng Qiu and Hui Zhao
Assessing the Efficacy of Grammar Error Correction: A Human Evaluation Approach in the Japanese Context
[Video]
Qiao Wang and Zheng Yuan
Transfer Fine-tuning for Quality Estimation of Text Simplification
[Slides] [Video]
Yuki Hironaka, Tomoyuki Kajiwara and Takashi Ninomiya
12:40 - 13:20D1-S1-RE8 - Evaluation and Validation Methodologies V
Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models
[Video]
Zhuoqun Li, Hongyu Lin, Yaojie Lu, Hao Xiang, Xianpei Han and Le Sun
A Typology of Errors for User Utterances in Chatbots
[Slides] [Video]
Anu Singh and Esme Manandise
New Evaluation Methodology for Qualitatively Comparing Classification Models
[Slides] [Video]
Ahmad Aljanaideh
Towards Human-aligned Evaluation for Linear Programming Word Problems
[Slides] [Video]
Linzi Xing, Xinglu Wang, Yuxi Feng, Zhenan Fan, Jing Xiong, Zhijiang Guo, Xiaojin Fu, Rindra Ramamonjison, Mahdi Mostajabdaveh, Xiongwei Han, Zirui Zhou and Yong Zhang
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering I
LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles
[Slides] [Video]
Shulin Huang, Shirong Ma, Yinghui Li, Mengzuo Huang, Wuhe Zou, Weidong Zhang and Haitao Zheng
No Need for Large-Scale Search: Exploring Large Language Models in Complex Knowledge Base Question Answering
[Slides] [Video]
Shouhui Wang and Biao Qin
PRIMO: Progressive Induction for Multi-hop Open Rule Generation
[Slides] [Video]
Jianyu Liu, Sheng Bi and Guilin Qi
Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database
[Slides] [Video]
Minjun Zhu, Yixuan Weng, Shizhu He, Kang Liu, Haifeng Liu, Yang jun Jun and Jun Zhao
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering II
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
[Slides] [Video]
Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han and Dongmei Zhang
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
[Video]
Jiashuo Sun, Hang Zhang, Chen Lin, Xiangdong Su, Yeyun Gong and Jian Guo
Can Language Models Learn Embeddings of Propositional Logic Assertions?
[Video]
Nurul Fajrin Ariyani, Zied Bouraoui, Richard Booth and Steven Schockaert
Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process
[Slides] [Video]
Guangming Huang, Yunfei Long, Cunjin Luo, Jiaxing Shen and Xia Sun
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering III
KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion
[Slides] [Video]
Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang and Xicheng Lu
Empowering Tree-structured Entailment Reasoning: Rhetorical Perception and LLM-driven Interpretability
[Slides] [Video]
Longyin Zhang, Bowei Zou and Ai Ti Aw
RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners
[Slides] [Video]
Chi Hu, Yuan Ge, Xiangnan Ma, Hang Cao, Qiang Li, Yonghua Yang, Tong Xiao and Jingbo Zhu
Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering
[Slides] [Video]
Chenhao Wang, Pengfei Cao, Jiachun Li, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering IV
Enhancing Large Language Models through Transforming Reasoning Problems into Classification Tasks
[Slides] [Video]
Tarun Raheja, Raunak Sinha, Advit Deepak, Will Healy, Jayanth Srinivasa, Myungjin Lee and Ramana Kompella
Robust and Scalable Model Editing for Large Language Models
[Video]
Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang and Maosong Sun
Step Feasibility-Aware and Error-Correctable Entailment Tree Generation
[Slides] [Video]
Junyue Song, Xin Wu and Yi Cai
QDMR-based Planning-and-Solving Prompting for Complex Reasoning Tasks
[Slides] [Video]
Jinfeng Huang, Qiaoqiao She, Wenbin Jiang, Hua Wu, Yang Hao, Tong Xu and Feng Wu
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering V
What Factors Influence LLMs’ Judgments? A Case Study on Question Answering
[Slides] [Video]
Lei Chen, Bobo Li, Li Zheng, Haining Wang, Zixiang Meng, Runfeng Shi, Hao Fei, Jun Zhou, Fei Li, Chong Teng and Donghong Ji
An Event-based Abductive Learning for Hard Time-sensitive Question Answering
[Slides] [Video]
Shaojuan Wu, Jitong Li, Xiaowang Zhang and Zhiyong Feng
ControversialQA: Exploring Controversy in Question Answering
[Video]
Zhen Wang, Peide Zhu and Jie Yang
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
[Slides] [Video]
Qiushi Sun, Chengcheng Han, Nuo Chen, Renyu Zhu, Jingyang Gong, Xiang Li and Ming Gao
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering VI
Abstract-level Deductive Reasoning for Pre-trained Language Models
[Video]
Xin Wu, Yi Cai and Ho-fung Leung
Biomedical Entity Linking as Multiple Choice Question Answering
[Slides] [Video]
Zhenxi Lin, Ziheng Zhang, Xian Wu and Yefeng Zheng
Probe Then Retrieve and Reason: Distilling Probing and Reasoning Capabilities into Smaller Language Models
[Slides] [Video]
Yichun Zhao, Shuheng Zhou and Huijia Zhu
Dealing with Data Scarcity in Spoken Question Answering
[Video]
Merve Ünlü Menevşe, Yusufcan Manav, Ebru Arisoy and Arzucan Özgür
12:40 - 13:20D1-S1-RE9 - Inference, Reasoning, Question Answering VII
MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning
[Video]
Zhenwen Liang, Dian Yu, Xiaoman Pan, Wenlin Yao, Qingkai Zeng, Xiangliang Zhang and Dong Yu
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
[Slides] [Video]
Jooyoung Lee, Fan Yang, Thanh Tran, Qian Hu, Emre Barut and Kai-Wei Chang
Find-the-Common: A Benchmark for Explaining Visual Patterns from Images
[Slides] [Video]
Yuting Shi, Naoya Inoue, Houjing Wei, Yufeng Zhao and Tao Jin
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining I
CARE: Co-Attention Network for Joint Entity and Relation Extraction
[Slides] [Video]
Wenjun Kong and Yamei Xia
Know-Adapter: Towards Knowledge-Aware Parameter-Efficient Transfer Learning for Few-shot Named Entity Recognition
[Video]
Binling Nie, Yiming Shao and Yigang Wang
Event Representation Learning with Multi-Grained Contrastive Learning and Triple-Mixture of Experts
[Slides] [Video]
Tianqi Hu, Lishuang Li, Xueyang Qin and Yubo Feng
Federated Document-Level Biomedical Relation Extraction with Localized Context Contrast
[Slides] [Video]
Yan Xiao, Yaochu Jin and Kuangrong Hao
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining II
ChatEL: Entity Linking with Chatbots
[Slides] [Video]
Yifan Ding, Qingkai Zeng and Tim Weninger
Relation Classification via Bidirectional Prompt Learning with Data Augmentation by Large Language Model
[Slides] [Video]
Yizhi Jiang, Jinlong Li and Huanhuan Chen
MCIL: Multimodal Counterfactual Instance Learning for Low-resource Entity-based Multimodal Information Extraction
[Slides] [Video]
Baohang Zhou, Ying Zhang, Kehui Song, Hongru Wang, Yu Zhao, Xuhui Sui and Xiaojie Yuan
Extracting Financial Events from Raw Texts via Matrix Chunking
[Video]
Yusheng Huang, Ning Hu, Kunping Li, Nan Wang and Zhouhan Lin
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining III
Prompt Tuning for Few-shot Relation Extraction via Modeling Global and Local Graphs
[Slides] [Video]
Zirui Zhang, Yiyu Yang and Benhui Chen
A Regularization-based Transfer Learning Method for Information Extraction via Instructed Graph Decoder
[Slides] [Video]
Kedi Chen, Jie Zhou, Qin Chen, Shunyu Liu and Liang He
On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation
[Slides] [Video]
Di Wu, Wasi U. Ahmad and Kai-Wei Chang
A Streamlined Span-based Factorization Method for Few Shot Named Entity Recognition
[Slides] [Video]
Wenjie Xu, Yidan Chen and Jianquan Ouyang
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining IV
Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors
[Slides] [Video]
Shengkun Ma, Jiale Han, Yi Liang and Bo Cheng
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
[Slides] [Video]
Deepak Gupta, Kush Attal and Dina Demner-Fushman
KCL: Few-shot Named Entity Recognition with Knowledge Graph and Contrastive Learning
[Slides] [Video]
Shan Zhang, Bin Cao and Jing Fan
TECA: A Two-stage Approach with Controllable Attention Soft Prompt for Few-shot Nested Named Entity Recognition
[Slides] [Video]
Yuanyuan Xu, Linhai Zhang and Deyu Zhou
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining V
MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training
[Slides] [Video]
Xiaojing Du, Hanjie Zhao, Danyan Xing, Yuxiang Jia and Hongying Zan
Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods
[Slides] [Video]
Yujuan Fu, Giridhar Kaushik Ramachandran, Nicholas J. Dobbins, Namu Park, Michael Leu, Abby R. Rosenberg, Kevin Lybarger, Fei Xia, Özlem Uzuner and Meliha Yetisgen
Hierarchical Selection of Important Context for Generative Event Causality Identification with Optimal Transports
[Slides] [Video]
Hieu Man, Chien Van Nguyen, Nghia Trung Ngo, Linh Ngo, Franck Dernoncourt and Thien Huu Nguyen
Document-Level Event Extraction via Information Interaction Based on Event Relation and Argument Correlation
[Video]
Bangze Pan, Yang Li, Suge Wang, Xiaoli Li, Deyu Li, Jian Liao and Jianxing Zheng
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining VI
Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction
[Video]
Yuanzhen Luo, Qingyu Zhou and Feng Zhou
Few-Shot Relation Extraction with Hybrid Visual Evidence
[Slides] [Video]
Jiaying Gong and Hoda Eldardiry
ESCP: Enhancing Emotion Recognition in Conversation with Speech and Contextual Prefixes
[Slides] [Video]
Xiujuan Xu, Xiaoxiao Shi, Zhehuan Zhao and Yu Liu
HS-GC: Holistic Semantic Embedding and Global Contrast for Effective Text Clustering
[Slides] [Video]
Chen Yang, Bin Cao and Jing Fan
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining VII
DocScript: Document-level Script Event Prediction
[Slides] [Video]
Puneet Mathur, Vlad I. Morariu, Aparna Garimella, Franck Dernoncourt, Jiuxiang Gu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha and Rajiv Jain
Improving Multi-view Document Clustering: Leveraging Multi-structure Processor and Hybrid Ensemble Clustering Module
[Slides] [Video]
Ruina Bai and Qi Bai
Hierarchical Topic Modeling via Contrastive Learning and Hyperbolic Embedding
[Slides] [Video]
Zhicheng Lin, HeGang Chen, Yuyin Lu, Yanghui Rao, Hao Xu and Hanjiang Lai
On the Use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction
[Slides] [Video]
Jianwei Wang, Tianyin Wang and Ziqian Zeng
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining VIII
Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents
[Video]
Hao Wang, Tang Li, Chenhui Chu, Rui Wang and Pinpin Zhu
MixRED: A Mix-lingual Relation Extraction Dataset
[Slides] [Video]
Lingxing Kong, Yougang Chu, Zheng Ma, Jianbing Zhang, Liang He and Jiajun Chen
Distilling Causal Effect of Data in Continual Few-shot Relation Learning
[Slides] [Video]
Weihang Ye, Peng Zhang, Jing Zhang, Hui Gao and Moyao Wang
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining IX
Enhancing Knowledge Selection via Multi-level Document Semantic Graph
[Slides] [Video]
Haoran Zhang and Tan Yongmei
Efficient and Accurate Contextual Re-Ranking for Knowledge Graph Question Answering
[Slides] [Video]
Kexuan Sun, Nicolaas Paul Jedema, Karishma Sharma, Ruben Janssen, Jay Pujara, Pedro Szekely and Alessandro Moschitti
CWTM: Leveraging Contextualized Word Embeddings from BERT for Neural Topic Modeling
[Video]
Zheng Fang, Yulan He and Rob Procter
Class-Incremental Few-Shot Event Detection
[Video]
Kailin Zhao, Xiaolong Jin, Long Bai, Jiafeng Guo and Xueqi Cheng
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining X
Can We Learn Question, Answer, and Distractors All from an Image? A New Task for Multiple-choice Visual Question Answering
[Slides] [Video]
Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt and Zhenglu Yang
Continual Few-shot Event Detection via Hierarchical Augmentation Networks
[Slides] [Video]
Chenlong Zhang, Pengfei Cao, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun and Jun Zhao
Prompt-based Zero-shot Relation Extraction with Semantic Knowledge Augmentation
[Slides] [Video]
Jiaying Gong and Hoda Eldardiry
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XI
WkNER: Enhancing Named Entity Recognition with Word Segmentation Constraints and kNN Retrieval
[Video]
Yanchun Li, Senlin Deng, Dongsu Shen, Shujuan Tian and Saiqin Long
Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information
[Slides] [Video]
Qiang Gao, Bobo Li, Zixiang Meng, Yunlong Li, Jun Zhou, Fei Li, Chong Teng and Donghong Ji
Towards Few-shot Entity Recognition in Document Images: A Graph Neural Network Approach Robust to Image Manipulation
[Slides] [Video]
Prashant Krishnan, Zilong Wang, Yangkun Wang and Jingbo Shang
TacoERE: Cluster-aware Compression for Event Relation Extraction
[Slides] [Video]
Yong Guan, Xiaozhi Wang, Lei Hou, Juanzi Li, Jeff Z. Pan, Jiaoyan Chen and Freddy Lecue
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XII
Emancipating Event Extraction from the Constraints of Long-Tailed Distribution Data Utilizing Large Language Models
[Slides] [Video]
Zhigang Kan, Liwen Peng, Linbo Qiao and Dongsheng Li
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction
[Slides] [Video]
Ziyang Xu, Keqin Peng, Liang Ding, Dacheng Tao and Xiliang Lu
LA-UCL: LLM-Augmented Unsupervised Contrastive Learning Framework for Few-Shot Text Classification
[Video]
Jing Zhang, Hui Gao, Peng Zhang, Boda Feng, Wenmin Deng and Yuexian Hou
Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic
[Slides] [Video]
Hao An, Zhihong Zhu, Xuxin Cheng, Zhiqi Huang and Yuexian Zou
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XIII
Leveraging Linguistically Enhanced Embeddings for Open Information Extraction
[Slides] [Video]
Fauzan Nayeem Farooqui, Thanmay Jayakumar, Pulkit Mathur and Mansi A. Radke
ChatUIE: Exploring Chat-based Unified Information Extraction Using Large Language Models
[Slides] [Video]
Jun Xu, Mengshu Sun, Zhiqiang Zhang and Jun Zhou
Enhancing Distantly Supervised Named Entity Recognition with Strong Label Guided Lottery Training
[Video]
Zhiyuan Ma, Jintao Du, Changhua Meng and Weiqiang Wang
CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction
[Video]
Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu and Wenjuan Han
12:40 - 13:20D1-S1-RE10 - Information Extraction, Knowledge Extraction, and Text Mining XIV
BKEE: Pioneering Event Extraction in the Vietnamese Language
[Slides] [Video]
Thi-Nhung Nguyen, Bang Tien Tran, Trong-Nghia Luu, Thien Huu Nguyen and Kiem-Hieu Nguyen
Zero-shot Event Detection Using a Textual Entailment Model as an Enhanced Annotator
[Slides] [Video]
Ziqian Zeng, Runyu Wu, Yuxiang Xiao, Xiaoda Zhong, Hanlin Wang, Zhengdong Lu and Huiping Zhuang
Analyzing Large Language Models’ Capability in Location Prediction
[Slides] [Video]
Zhaomin Xiao, Eduardo Blanco and Yan Huang
Demonstration Retrieval-Augmented Generative Event Argument Extraction
[Video]
Shiming He, Yu Hong, Shuai Yang, Jianmin Yao and Guodong Zhou
12:40 - 13:20D1-S1-RE11 - Integrated Systems and Applications I
Knowledge-aware Attention Network for Medication Effectiveness Prediction
[Video]
Yingying Zhang, Xian Wu, Yu Zhang and Yefeng Zheng
Continuous Relational Diffusion Driven Topic Model with Multi-grained Text for Microblog
[Slides] [Video]
Chenhao Wu, Ruifang He, Chang Liu and Bo Wang
TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills
[Slides] [Video]
Qiushi Sun, Nuo Chen, Jianing Wang, Ming Gao and Xiang Li
CoBaLD Annotation: The Enrichment of the Enhanced Universal Dependencies with the Semantical Pattern
[Slides] [Video]
Maria Andreevna Petrova, Alexandra M. Ivoylova and Anastasia Tishchenkova
12:40 - 13:20D1-S1-RE11 - Integrated Systems and Applications II
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework
[Slides] [Video]
Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan and Jun Huang
A Trusted Multi-View Evidential Fusion Framework for Commonsense Reasoning
[Video]
Shuo Yang
LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction
[Slides] [Video]
Yixuan Wang, Baoxin Wang, Yijun Liu, Dayong Wu and Wanxiang Che
First Steps Towards the Integration of Resources on Historical Glossing Traditions in the History of Chinese: A Collection of Standardized Fǎnqiè Spellings from the Guǎngyùn
[Video]
Michele Pulini and Johann-Mattis List
AuRoRA: A One-for-all Platform for Augmented Reasoning and Refining with Task-Adaptive Chain-of-Thought Prompting
[Video]
Anni Zou, Zhuosheng Zhang and Hai Zhao
12:40 - 13:20D1-S1-RE12 - Knowledge Discovery / Representation I
Deep Reinforcement Learning-based Dialogue Policy with Graph Convolutional Q-network
[Slides] [Video]
Kai Xu, Zhengyu Wang, Yuxuan Long and Qiaona Zhao
A Decade of Scholarly Research on Open Knowledge Graphs
[Slides] [Video]
Houcemeddine Turki, Abraham Toluwase Owodunni, Mohamed Ali Hadj Taieb, René Fabrice Bile and Mohamed Ben Aouicha
Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion
[Slides] [Video]
Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen and Wen Zhang
Bring Invariant to Variant: A Contrastive Prompt-based Framework for Temporal Knowledge Graph Forecasting
[Slides] [Video]
Ying Zhang, Xinying Qian, Yu Zhao, Baohang Zhou, Kehui Song and Xiaojie Yuan
12:40 - 13:20D1-S1-RE12 - Knowledge Discovery / Representation II
Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models
[Slides] [Video]
Derong Xu, Ziheng Zhang, Zhenxi Lin, Xian Wu, Zhihong Zhu, Tong Xu, Xiangyu Zhao, Yefeng Zheng and Enhong Chen
Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation
[Video]
Ali Mousavi, Xin Zhan, He Bai, Peng Shi, Theodoros Rekatsinas, Benjamin Han, Yunyao Li, Jeffrey Pound, Joshua M. Susskind, Natalie Schluter, Ihab F. Ilyas and Navdeep Jaitly
DET: A Dual-Encoding Transformer for Relational Graph Embedding
[Slides] [Video]
Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Qiang Zhang and Huajun Chen
Prior Relational Schema Assists Effective Contrastive Learning for Inductive Knowledge Graph Completion
[Slides] [Video]
Ruilin Luo, Jiayi Li, Jianghangfan Zhang, Jing Xiao and Yujiu Yang
12:40 - 13:20D1-S1-RE12 - Knowledge Discovery / Representation III
Self-Knowledge Distillation for Knowledge Graph Embedding
[Slides] [Video]
Haotian Xu, Yuhua Wang and Jiahui Fan
Hyperbolic Graph Neural Network for Temporal Knowledge Graph Completion
[Slides] [Video]
Yancong Li, Xiaoming Zhang, Ying Cui and Shuai Ma
Prompt-fused Framework for Inductive Logical Query Answering
[Slides] [Video]
Zezhong Xu, Wen Zhang, Peng Ye, Lei Liang and Huajun Chen
Hypergraph-Based Session Modeling: A Multi-Collaborative Self-Supervised Approach for Enhanced Recommender Systems
[Slides] [Video]
Xiangping Zheng, Bo Wu, Alex X. Zhang and Wei Li
Access Control Framework for Language Collections
[Slides] [Video]
Ben Foley, Peter Sefton, Simon Musgrave and Moises Sacal Bonequi
12:40 - 13:20D1-S1-RE13 - Language Modeling I
TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models
[Slides] [Video]
Junbing Yan, Chengyu Wang, Taolin Zhang, Xiaofeng He, Jun Huang, Wei Zhang, Longtao Huang and Hui Xue
Enhancing Parameter-efficient Fine-tuning with Simple Calibration Based on Stable Rank
[Slides] [Video]
Peiyu Liu, Ze-Feng Gao, Xiao Zhang, Wayne Xin Zhao and Ji-Rong Wen
Sinkhorn Distance Minimization for Knowledge Distillation
[Video]
Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou and Houqiang Li
EpiGEN: An Efficient Multi-Api Code GENeration Framework under Enterprise Scenario
[Video]
Sijie Li, Sha Li, Hao Zhang, Shuyang Li, Kai Chen, Jianyong Yuan, Yi Cao and Lvqing Yang
12:40 - 13:20D1-S1-RE13 - Language Modeling II
MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property
[Video]
Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, Bowen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li and Xiping Hu
Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation
[Slides] [Video]
Xumeng Liu, Wenya Guo, Ying Zhang, Xubo Liu, Yu Zhao, Shenglong Yu and Xiaojie Yuan
Exploring and Mitigating Shortcut Learning for Generative Large Language Models
[Slides] [Video]
Zechen Sun, Yisheng Xiao, Juntao Li, Yixin Ji, Wenliang Chen and Min Zhang
Token-length Bias in Minimal-pair Paradigm Datasets
[Slides] [Video]
Naoya Ueda, Masato Mita, Teruaki Oka and Mamoru Komachi
12:40 - 13:20D1-S1-RE13 - Language Modeling III
Mixture-of-LoRAs: An Efficient Multitask Tuning Method for Large Language Models
[Slides] [Video]
Wenfeng Feng, Chuzhan Hao, Yuewei Zhang, Yu Han and Hao Wang
Structure-aware Fine-tuning for Code Pre-trained Models
[Video]
Jiayi Wu, Renyu Zhu, Nuo Chen, Qiushi Sun, Xiang Li and Ming Gao
Enhancing Hindi Feature Representation through Fusion of Dual-Script Word Embeddings
[Video]
Lianxi Wang, Yujia Tian and Zhuowei Chen
GPT-SW3: An Autoregressive Language Model for the Scandinavian Languages
[Slides] [Video]
Ariel Ekgren, Amaru Cuba Gyllensten, Felix Stollenwerk, Joey Öhman, Tim Isbister, Evangelia Gogoulou, Fredrik Carlsson, Judit Casademont and Magnus Sahlgren
12:40 - 13:20D1-S1-RE13 - Language Modeling IV
Analyzing Occupational Distribution Representation in Japanese Language Models
[Slides] [Video]
Katsumi Ibaraki, Winston Wu, Lu Wang and Rada Mihalcea
Improving Bengali and Hindi Large Language Models
[Video]
Arif Shahriar and Denilson Barbosa
Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study
[Slides] [Video]
Peiyu Liu, Zikang Liu, Ze-Feng Gao, Dawei Gao, Wayne Xin Zhao, Yaliang Li, Bolin Ding and Ji-Rong Wen
Representation Degeneration Problem in Prompt-based Models for Natural Language Understanding
[Video]
Qingyan Zhao, Ruifang He, Jinpeng Zhang, Chang Liu and Bo Wang
Sequence Reducible Holdout Loss for Language Model Pretraining
[Video]
Raghuveer Thirukovalluru, Nicholas Monath, Bhuwan Dhingra and Sam Wiseman
13:20 - 14:40Lunch
14:40 - 15:40Keynote Speaker 1: Roger Levy - Chair: Veronique Hoste
Large Language Models and Human Cognition
[Video]
D1-S3-R1 - Multimodal Applications, Grounded Language Acquisition, and HRI I (Chair: Nikhil Krishnaswamy)
15:50 - 16:10Seeing Eye-to-Eye: Cross-Modal Coherence Relations Inform Eye-gaze Patterns During Comprehension & Production
[Slides] [Video]
Mert Inan and Malihe Alikhani
16:10 - 16:30Select and Reorder: A Novel Approach for Neural Sign Language Production
[Slides] [Video]
Harry Walsh, Ben Saunders and Richard Bowden
16:30 - 16:50MM-IGLU: Multi-Modal Interactive Grounded Language Understanding
[Slides] [Video]
Claudiu Daniel Hromei, Daniele Margiotta, Danilo Croce and Roberto Basili
16:50 - 17:10A Tool for Determining Distances and Overlaps between Multimodal Annotations
[Slides] [Video]
Camila Antonio Barros, Jorge Francisco Ciprián-Sánchez and Saulo Mendes Santos
D1-S3-R2 - Applications Involving LRs and Evaluation II (Chair: Leonardo Ranaldi)
15:50 - 16:10Zero-shot Learning for Multilingual Discourse Relation Classification
[Slides] [Video]
Eleni Metheniti, Philippe Muller, Chloé Braud and Margarita Hernández Casas
16:10 - 16:30LoSST-AD: A Longitudinal Corpus for Tracking Alzheimer’s Disease Related Changes in Spontaneous Speech
[Slides] [Video]
Ulla Petti and Anna Korhonen
16:30 - 16:50Unsupervised Grouping of Public Procurement Similar Items: Which Text Representation Should I Use?
[Slides] [Video]
Pedro P. V. Brum, Mariana O. Silva, Gabriel P. Oliveira, Lucas G. L. Costa, Anisio Lacerda and Gisele Pappa
16:50 - 17:10Exploring the Generalization of Cancer Clinical Trial Eligibility Classifiers across Diseases
[Slides] [Video]
Yumeng Yang
D1-S3-R3 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction I (Chair: Larry Heck)
15:50 - 16:10COMICORDA: Dialogue Act Recognition in Comic Books
[Slides] [Video]
Jiri Martinek, Pavel Kral, Ladislav Lenc and Josef Baloun
16:10 - 16:30Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling
[Video]
Zhihong Zhu, Xuxin Cheng, Guimin Hu, Yaowei Li, Zhiqi Huang and Yuexian Zou
16:30 - 16:50Multilingual Turn-taking Prediction Using Voice Activity Projection
[Video]
Koji Inoue, Bing’er Jiang, Erik Ekstedt, Tatsuya Kawahara and Gabriel Skantze
16:50 - 17:10JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset
[Slides] [Video]
Atsumoto Ohashi, Ryu Hirai, Shinya Iizuka and Ryuichiro Higashinaka
D1-S3-R4 - Information Extraction, Knowledge Extraction, and Text Mining I (Chair: Ayla Rigouts Terryn)
15:50 - 16:10Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer
[Slides] [Video]
Youmi Ma, An Wang and Naoaki Okazaki
16:10 - 16:30Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification
[Slides] [Video]
Xindi Wang, Robert E. Mercer and Frank Rudzicz
16:30 - 16:50MNER-MI: A Multi-image Dataset for Multimodal Named Entity Recognition in Social Media
[Slides] [Video]
Shizhou Huang, Bo Xu, Changqun Li, Jiabo Ye and Xin Lin
16:50 - 17:10TED-EL: A Corpus for Speech Entity Linking
[Slides] [Video]
Silin Li, Ruoyu Song, Tianwei Lan, Zeming Liu and Yuhang Guo
D1-S3-R5 - Inference, Reasoning, Question Answering I (Chair: Bernardo Magnini)
15:50 - 16:10ChatGPT Is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models
[Slides] [Video]
Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang and Bin Dong
16:10 - 16:30Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering
[Video]
Yuan Gao, Yiheng Zhu, Yuanbin Cao, Yinzhi Zhou, Zhen Wu, Yujie Chen, Shenglan Wu, Haoyuan Hu and Xinyu Dai
16:30 - 16:50Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models
[Slides] [Video]
Yantao Liu, Zijun Yao, Xin Lv, Yuchen Fan, Shulin Cao, Jifan Yu, Lei Hou and Juanzi Li
16:50 - 17:10JEMHopQA: Dataset for Japanese Explainable Multi-Hop Question Answering
[Slides] [Video]
Ai Ishii, Naoya Inoue, Hisami Suzuki and Satoshi Sekine
D1-S3-R6 - Document Classification, Information Retrieval and Cross-lingual Retrieval II (Chair: Liana Ermakova)
15:50 - 16:10Scalable Patent Classification with Aggregated Multi-View Ranking
[Slides] [Video]
Dan Li, Vikrant Yadav, Zi Long Zhu, Maziar Moradi Fard, Zubair Afzal and George Tsatsaronis
16:10 - 16:30A Closer Look at Clustering Bilingual Comparable Corpora
[Slides] [Video]
Anna Laskina, Eric Gaussier and Gaelle Calvary
16:30 - 16:50PromptStream: Self-Supervised News Story Discovery Using Topic-Aware Article Representations
[Slides] [Video]
Arezoo Hatefi, Anton Eklund and Mona Forsman
16:50 - 17:10Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM
[Video]
Xuan Zhang and Wei Gao
15:50 - 17:10D1-S3-P2 - Digital Humanities and Cultural Heritage II (Chair: Eva Maria Vecchi)
BLN600: A Parallel Corpus of Machine/Human Transcribed Nineteenth Century Newspaper Texts
[Slides] [Video]
Callum William Booth, Alan Thomas and Robert Gaizauskas
Training BERT Models to Carry over a Coding System Developed on One Corpus to Another
[Poster] [Video]
Dalma Galambos and Pal Zsamboki
Linking Named Entities in Diderot’s Encyclopédie to Wikidata
[Slides] [Video]
Pierre Nugues
Development and Evaluation of Pre-trained Language Models for Historical Danish and Norwegian Literary Texts
[Poster] [Slides] [Video]
Ali Al-Laith, Alexander Conroy, Jens Bjerring-Hansen and Daniel Hershcovich
Converting Legacy Data to CLDF: A FAIR Exit Strategy for Linguistic Web Apps
[Slides] [Video]
Robert Forkel, Daniel G. Swanson and Steven Moran
HoLM: Analyzing the Linguistic Unexpectedness in Homeric Poetry
[Slides] [Video]
John Pavlopoulos, Ryan Sandell, Maria Konstantinidou and Chiara Bozzone
Exploring Neural Topic Modeling on a Classical Latin Corpus
[Slides] [Video]
Ginevra Martinelli, Paola Impicciché, Elisabetta Fersini, Francesco Mambrini and Marco Passarotti
Reflecting the Male Gaze: Quantifying Female Objectification in 19th and 20th Century Novels
[Slides] [Video]
Kexin Luo, Yue Mao, Bei Zhang and Sophie Hao
GENTRAC: A Tool for Tracing Trauma in Genocide and Mass Atrocity Court Transcripts
[Video]
Miriam Schirmer, Christian Brechenmacher and Juergen Pfeffer
The Onomastic Repertoire of the Roman d’Alexandre (ORNARE). Designing an Integrated Digital Onomastic Tool for Medieval French Romance
[Video]
Marta Milazzo and Giorgio Maria Di Nunzio
A Large Annotated Reference Corpus of New High German Poetry
[Slides] [Video]
Thomas Haider
15:50 - 17:10D1-S3-P2 - Evaluation and Validation Methodologies I (Chair: Eva Maria Vecchi)
From Technology to Market. Bilingual Corpus on the Evaluation of Technology Opportunity Discovery
[Slides] [Video]
Amir Hazem, Kazuyuki Motohashi and Chen Zhu
Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets
[Slides] [Video]
Yida Mu, Xingyi Song, Kalina Bontcheva and Nikolaos Aletras
HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models
[Slides] [Video]
Guijin Son, Hanwool Lee, Suwan Kim, Huiseo Kim, Jae cheol Lee, Je Won Yeom, Jihyu Jung, Jung woo Kim and Songseong Kim
SilverAlign: MT-Based Silver Data Algorithm for Evaluating Word Alignment
[Video]
Abdullatif Koksal, Silvia Severini and Hinrich Schütze
An Untold Story of Preprocessing Task Evaluation: An Alignment-based Joint Evaluation Approach
[Poster] [Slides] [Video]
Eunkyul Leah Jo, Angela Yoonseo Park, Grace Tianjiao Zhang, Izia Xiaoxiao Wang, Junrui Wang, MingJia Mao and Jungyeul Park
Tug-of-War between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models
[Slides] [Video]
Zhuoran Jin, Pengfei Cao, Yubo Chen, Kang Liu, Xiaojian Jiang, Jiexin Xu, Li Qiuxia and Jun Zhao
Distribution Aware Metrics for Conditional Natural Language Generation
[Poster] [Slides] [Video]
David M. Chan, Yiming Ni, David Ross, Sudheendra Vijayanarasimhan, Austin Myers and John Canny
Automatic Speech Recognition System-Independent Word Error Rate Estimation
[Poster] [Slides] [Video]
Chanho Park, Mingjie Chen and Thomas Hain
Multilingual Generation in Abstractive Summarization: A Comparative Study
[Slides] [Video]
Jinpeng Li, Jiaze Chen, Huadong Chen, Dongyan Zhao and Rui Yan
When Cohesion Lies in the Embedding Space: Embedding-Based Reference-Free Metrics for Topic Segmentation
[Slides] [Video]
Iacopo Ghinassi, Lin Wang, Chris Newell and Matthew Purver
EsCoLA: Spanish Corpus of Linguistic Acceptability
[Poster] [Video]
Nuria Bel, Marta Punsola and Valle Ruíz-Fernández
BiVert: Bidirectional Vocabulary Evaluation Using Relations for Machine Translation
[Slides] [Video]
Carinne Cherf and Yuval Pinter
Meta-Evaluation of Sentence Simplification Metrics
[Slides] [Video]
Noof Abdullah Alfear, Dimitar Kazakov and Hend Al-Khalifa
SimLex-999 for Dutch
[Slides] [Video]
Lizzy Brans and Jelke Bloem
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
[Poster] [Slides] [Video]
Rui Mao, Guanyi Chen, Xulang Zhang, Frank Guerin and Erik Cambria
15:50 - 17:10D1-S3-P2 - Integrated Systems and Applications (Chair: Eva Maria Vecchi)
Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction
[Poster] [Slides] [Video]
Unggi Lee, Sungjun Yoon, Joon Seo Yun, Kyoungsoo Park, YoungHoon Jung, Damji Stratton and Hyeoncheol Kim
Estimating Lexical Complexity from Document-Level Distributions
[Poster] [Slides] [Video]
Sondre Wold, Petter Mæhlum and Oddbjørn Hove
A Community-Driven Data-to-Text Platform for Football Match Summaries
[Poster] [Slides] [Video]
Pedro Fernandes, Sérgio Nunes and Luís Santos
Improved Neural Protoform Reconstruction via Reflex Prediction
[Slides] [Video]
Liang Lu, Jingzhi Wang and David R. Mortensen
Linking Judgement Text to Court Hearing Videos: UK Supreme Court as a Case Study
[Slides] [Video]
Hadeel Saadany, Constantin Orasan, Sophie Walker and Catherine Breslin
INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation
[Slides] [Video]
Harshita Diddee, Anurag Shukla, Tanuja Ganu, Vivek Seshadri, Sandipan Dandapat, Monojit Choudhury and Kalika Bali
Knowledge-augmented Graph Neural Networks with Concept-aware Attention for Adverse Drug Event Detection
[Slides] [Video]
Ya Gao, Shaoxiong Ji and Pekka Marttinen
MHGRL: An Effective Representation Learning Model for Electronic Health Records
[Slides] [Video]
Feiyan Liu, Liangzhi Li, Xiaoli Wang, Feng Luo, Chang Liu, Jinsong Su and Yiming Qian
text2story: A Python Toolkit to Extract and Visualize Story Components of Narrative Text
[Poster] [Slides] [Video]
Evelin Amorim, Ricardo Campos, Alipio Jorge, Pedro Mota and Rúben Almeida
LexAbSumm: Aspect-based Summarization of Legal Decisions
[Poster] [Slides] [Video]
Santosh T.Y.S.S., Mahmoud Aly and Matthias Grabmair
Extending the Discourse Analysis Tool Suite with Whiteboards for Visual Qualitative Analysis
[Poster] [Slides] [Video]
Tim Fischer, Florian Schneider, Fynn Petersen-Frey, Anja Silvia Mollah Haque, Isabel Eiser, Gertraud Koch and Chris Biemann
Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model
[Slides] [Video]
Shirin Dabbaghi Varnosfaderani, Canasai Kruengkrai, Ramin Yahyapour and Junichi Yamagishi
tasksource: A Large Collection of NLP tasks with a Structured Dataset Preprocessing Framework
[Slides] [Video]
Damien Sileo
15:50 - 17:10D1-S3-P2 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation I (Chair: Eva Maria Vecchi)
Hybrid of Spans and Table-Filling for Aspect-Level Sentiment Triplet Extraction
[Poster] [Slides] [Video]
Minghua Nuo and Chaofan Guo
Good or Bad News? Exploring GPT-4 for Sentiment Analysis for Faroese on a Public News Corpora
[Video]
Iben Nyholm Debess, Annika Simonsen and Hafsteinn Einarsson
Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training
[Slides] [Video]
Chaojun Xiao, Yutao Sun, Yuan Yao, Xu Han, Wenbin Zhang, Zhiyuan Liu and Maosong Sun
DMON: A Simple Yet Effective Approach for Argument Structure Learning
[Slides] [Video]
Sun Wei, Mingxiao Li, Jingyuan Sun, Jesse Davis and Marie-Francine Moens
EmoProgress: Cumulated Emotion Progression Analysis in Dreams and Customer Service Dialogues
[Poster] [Slides] [Video]
Eileen Wemmer, Sofie Labat and Roman Klinger
Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
[Slides] [Video]
Flor Miriam Plaza-del-Arco, Alba A. Cercas Curry, Amanda Cercas Curry and Dirk Hovy
Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks
[Poster] [Slides] [Video]
Jakub Šmíd, Pavel Přibáň, Ondrej Prazak and Pavel Kral
A Two-Stage Framework with Self-Supervised Distillation for Cross-Domain Text Classification
[Poster] [Slides] [Video]
Yunlong Feng, Bohan Li, Libo Qin, Xiao Xu and Wanxiang Che
Source-free Domain Adaptation for Aspect-based Sentiment Analysis
[Poster] [Slides] [Video]
Zishuo Zhao, Ziyang Ma, Zhenzhou Lin, Jingyou Xie, Yinghui Li and Ying Shen
Autonomous Aspect-Image Instruction a2II: Q-Former Guided Multimodal Sentiment Classification
[Slides] [Video]
Junjia Feng, Mingqian Lin, Lin Shang and Xiaoying Gao
The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings
[Poster] [Video]
Michal Mochtak, Peter Rupnik and Nikola Ljubešić
STEntConv: Predicting Disagreement between Reddit Users with Stance Detection and a Signed Graph Convolutional Network
[Slides] [Video]
Isabelle Lorge, Li Zhang, Xiaowen Dong and Janet Pierrehumbert
Investigating the Robustness of Modelling Decisions for Few-Shot Cross-Topic Stance Detection: A Preregistered Study
[Poster] [Slides] [Video]
Myrthe Reuver, Suzan Verberne and Antske Fokkens
Stories and Personal Experiences in the COVID-19 Discourse
[Slides] [Video]
Neele Falk and Gabriella Lapesa
"Barking up the Right Tree", a GAN-Based Pun Generation Model through Semantic Pruning
[Video]
JingJie Zeng, Liang Yang, Jiahao Kang, Yufeng Diao, Zhihao Yang and Hongfei Lin
Human and System Perspectives on the Expression of Irony: An Analysis of Likelihood Labels and Rationales
[Poster] [Slides] [Video]
Aaron Maladry, Alessandra Teresa Cignarella, Els Lefever, Cynthia van Hee and Veronique Hoste
In-Context Example Retrieval from Multi-Perspectives for Few-Shot Aspect-Based Sentiment Analysis
[Slides] [Video]
Qianlong Wang, Hongling Xu, Keyang Ding, Bin Liang and Ruifeng Xu
15:50 - 17:10D1-S3-P2 - Speech Resources and Processing II (Chair: Eva Maria Vecchi)
BlendX: Complex Multi-Intent Detection with Blended Patterns
[Poster] [Slides] [Video]
Yejin Yoon, Jungyeon Lee, Kangsan Kim, Chanhee Park and Taeuk Kim
Leveraging the Interplay between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation
[Slides] [Video]
Yejin Jeon, Yunsu Kim and Gary Geunbae Lee
Annotation of Transition-Relevance Places and Interruptions for the Description of Turn-Taking in Conversations in French Media Content
[Video]
Rémi Uro, Marie Tahon, Jane Wottawa, David Doukhan, Albert Rilliard and Antoine Laurent
Audiocite.net: A Large Spoken Read Dataset in French
[Video]
Soline Felice, Solene Virginie Evain, Solange Rossato and François Portet
NB Uttale: A Norwegian Pronunciation Lexicon with Dialect Variation
[Slides] [Video]
Marie Iversdatter Røsok and Ingerid Løyning Dale
Gos 2: A New Reference Corpus of Spoken Slovenian
[Slides] [Video]
Darinka Verdonik, Kaja Dobrovoljc, Tomaž Erjavec and Nikola Ljubešić
Is Spoken Hungarian Low-resource?: A Quantitative Survey of Hungarian Speech Data Sets
[Slides] [Video]
Peter Mihajlik, Katalin Mády, Anna Kohári, Fruzsina Sára Fruzsina, Gábor Kiss, Tekla Etelka Gráczi and A. Seza Doğruöz
Ensembles of Hybrid and End-to-End Speech Recognition
[Slides] [Video]
Aditya Kamlesh Parikh, Louis ten Bosch and Henk van den Heuvel
The Role of Creaky Voice in Turn Taking and the Perception of Speaker Stance: Experiments Using Controllable TTS
[Poster] [Slides] [Video]
Harm Lameris, Eva Szekely and Joakim Gustafson
Evaluating Self-Supervised Speech Representations for Indigenous American Languages
[Video]
Chih-Chen Chen, William Chen, Rodolfo Joel Zevallos and John E. Ortega
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition
[Slides] [Video]
Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola Garcia, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey and Sanjeev Khudanpur
nEMO: Dataset of Emotional Speech in Polish
[Slides] [Video]
Iwona Christop
ÌròyìnSpeech: A Multi-purpose Yorùbá Speech Corpus
Tolulope Ogunremi, Kola Tubosun, Anuoluwapo Aremu, Iroro Orife and David Ifeoluwa Adelani
PRODIS - a Speech Database and a Phoneme-based Language Model for the Study of Predictability Effects in Polish
[Video]
Zofia Malisz, Jan Foremski and Małgorzata Kul
17:10 - 17:30Coffee break
D1-S4-R1 - Corpora and Annotation II (Chair: Serge Sharoff)
17:30 - 17:50Why Voice Biomarkers of Psychiatric Disorders Are Not Used in Clinical Practice? Deconstructing the Myth of the Need for Objective Diagnosis
[Video]
Vincent P. Martin and Jean-Luc Rouas
17:50 - 18:10Annotate the Way You Think: An Incremental Note Generation Framework for the Summarization of Medical Conversations
[Slides] [Video]
Longxiang Zhang, Caleb D. Hart, Susanne Burger and Thomas Schaaf
18:10 - 18:30Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
[Slides] [Video]
Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova and Barbara Plank
18:30 - 18:50KGConv, a Conversational Corpus Grounded in Wikidata
[Video]
Quentin Brabant, Lina M. Rojas Barahona, Gwénolé Lecorvé and Claire Gardent
18:50 - 19:10EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection
[Slides] [Video]
Francesca Grasso, Stefano Locci, Giovanni Siragusa and Luigi Di Caro
D1-S4-R2 - Evaluation and Validation Methodologies I (Chair: Alessandra Zarcone)
17:30 - 17:50HuLU: Hungarian Language Understanding Benchmark Kit
[Slides] [Video]
Noémi Ligeti-Nagy, Gergő Ferenczi, Enikő Héja, László János Laki, Noémi Vadász, Zijian Győző Yang and Tamás Váradi
17:50 - 18:10KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark
[Slides] [Video]
Seongbo Jang, Seonghyeon Lee and Hwanjo Yu
18:10 - 18:30Does ChatGPT Know That It Does Not Know? Evaluating the Black-Box Calibration of ChatGPT
[Slides] [Video]
Youliang Yuan, Wenxuan Wang, Qingshuo Guo, Yiming Xiong, Chihao Shen and Pinjia He
18:30 - 18:50Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
[Slides] [Video]
Xiao Pu, Mingqi Gao and Xiaojun Wan
18:50 - 19:10A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking the Privacy-Utility Trade-off
[Video]
Stephen Joseph Meisenbacher, Nihildev Nandakumar, Alexandra Klymenko and Florian Matthes
D1-S4-R3 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation I (Chair: Henning Wachsmuth)
17:30 - 17:50Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis
[Slides] [Video]
Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker and Sheak Rashed Haider Noori
17:50 - 18:10Learning Intrinsic Dimension via Information Bottleneck for Explainable Aspect-based Sentiment Analysis
[Video]
Zhenxiao Cheng, Jie Zhou, Wen Wu, Qin Chen and Liang He
18:10 - 18:30DEEM: Dynamic Experienced Expert Modeling for Stance Detection
[Slides] [Video]
Xiaolong Wang, Yile Wang, Sijie Cheng, Peng Li and Yang Liu
18:30 - 18:50Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis
[Video]
Siyin Wang, Jie Zhou, Qin Chen, Qi Zhang, Tao Gui and Xuanjing Huang
18:50 - 19:10EmoPrompt-ECPE: Emotion Knowledge-aware Prompt-tuning for Emotion-Cause Pair Extraction
[Slides] [Video]
Xue Gu, Zhihan Zhou, Ziyao Meng, Jian Li, Tiago Gomes, Adriano Tavares and Hao Xu
D1-S4-R4 - Speech Resources and Processing II (Chair: Jan Odijk)
17:30 - 17:50Corpus Creation and Automatic Alignment of Historical Dutch Dialect Speech
[Slides] [Video]
Martijn Bentum, Eric Sanders, Antal P.J. van den Bosch, Douwe Zeldenrust and Henk van den Heuvel
17:50 - 18:10Speech Analysis of Language Varieties in Italy
[Slides] [Video]
Moreno La Quatra, Alkis Koudounas, Elena Baralis and Sabato Marco Siniscalchi
18:10 - 18:30Phonetic Segmentation of the UCLA Phonetics Lab Archive
[Slides] [Video]
Eleanor Chodroff, Blaž Pažon, Annie Baker and Steven Moran
18:30 - 18:50myMediCon: End-to-End Burmese Automatic Speech Recognition for Medical Conversations
[Slides] [Video]
Hay Man Htun, Ye Kyaw Thu, Hutchatai Chanlekha, Kotaro Funakoshi and Thepchai Supnithi
18:50 - 19:10Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model
[Video]
Siyang Wang and Eva Szekely
D1-S4-R5 - Discourse and Pragmatics (Chair: Maciej Ogrodniczuk)
17:30 - 17:50DISRPT: A Multilingual, Multi-domain, Cross-framework Benchmark for Discourse Processing
[Slides] [Video]
Chloé Braud, Amir Zeldes, Laura Rivière, Yang Janet Liu, Philippe Muller, Damien Sileo and Tatsuya Aoyama
17:50 - 18:10Discourse Structure for the Minecraft Corpus
[Video]
Kate Thompson, Julie Hunter and Nicholas Asher
18:10 - 18:30Linear Cross-document Event Coreference Resolution with X-AMR
[Slides] [Video]
Shafiuddin Rehan Ahmed, George Arthur Baker, Evi Judge, Michael Reagan, Kristin Wright-Bettner, Martha Palmer and James H. Martin
18:30 - 18:50SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolution
[Slides] [Video]
Yilun Zhu, Siyao Peng, Sameer Pradhan and Amir Zeldes
18:50 - 19:10To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese
[Slides] [Video]
Yukiko Ishizuki, Tatsuki Kuribayashi, Yuichiroh Matsubayashi, Ryohei Sasano and Kentaro Inui
D1-S4-R6 - Integrated Systems and Applications (Chair: Chris Biemann)
17:30 - 17:50To Err Is Human, How about Medical Large Language Models? Comparing Pre-trained Language Models for Medical Assessment Errors and Reliability
[Slides] [Video]
Wen-wai Yim, Yujuan Fu, Asma Ben Abacha and Meliha Yetisgen
17:50 - 18:10MedMT5: An Open-Source Multilingual Text-to-Text LLM for the Medical Domain
[Slides] [Video]
Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata and Andrea Zaninello
18:10 - 18:30Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents
[Slides] [Video]
Santosh T.Y.S.S., Hassan Sarwat, Ahmed Mohamed Abdelaal Abdou and Matthias Grabmair
18:30 - 18:50Distractor Generation Using Generative and Discriminative Capabilities of Transformer-based Models
[Slides] [Video]
Shiva Taslimipoor, Luca Benedetto, Mariano Felice and Paula Buttery
18:50 - 19:10Towards Autonomous Tool Utilization in Language Models: A Unified, Efficient and Scalable Framework
[Slides] [Video]
Zhi Li, Yicheng Li, Hequan Ye and Yin Zhang
17:30 - 19:10D1-S3-P3 - Document Classification, Information Retrieval and Cross-lingual Retrieval (Chair: François Yvon)
FaGANet: An Evidence-Based Fact-Checking Model with Integrated Encoder Leveraging Contextual Information
[Video]
Weiyao Luo, Junfeng Ran, Zailong Tian, Sujian Li and Zhifang Sui
PIRB: A Comprehensive Benchmark of Polish Dense and Hybrid Text Retrieval Methods
[Slides] [Video]
Slawomir Dadas, Michał Perełkiewicz and Rafał Poświata
Enhancing Few-Shot Topic Classification with Verbalizers. A Study on Automatic Verbalizer and Ensemble Methods
[Slides] [Video]
Quang Anh Nguyen, Nadi Tomeh, Mustapha Lebbah, Thierry Charnois, Hanene Azzag and Santiago Cordoba Muñoz
Incorporating Word-level Phonemic Decoding into Readability Assessment
[Poster] [Slides] [Video]
Christine Pinney, Casey Kennington, Maria Soledad Pera, Katherine Landau Wright and Jerry Alan Fails
Document Set Expansion with Positive-Unlabeled Learning Using Intractable Density Estimation
[Slides] [Video]
Haiyang Zhang, Qiuyi Chen, Yanjie Zou, Jia Wang, Yushan Pan and Mark Stevenson
UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
[Slides] [Video]
Hongru Wang, Boyang Xue, Baohang Zhou, Rui Wang, Fei Mi, Weichao Wang, Yasheng Wang and Kam-Fai Wong
Large Language Models for Generative Recommendation: A Survey and Visionary Discussions
[Poster] [Slides] [Video]
Lei Li, Yongfeng Zhang, Dugang Liu and Li Chen
From Graph to Word Bag: Introducing Domain Knowledge to Confusing Charge Prediction
[Poster] [Slides] [Video]
Ang Li, Qiangchao Chen, Yiquan Wu, Xiang Zhou, Kun Kuang, Fei Wu and Ming Cai
Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion
[Video]
Shi Yu, Chenghao Fan, Chenyan Xiong, David Jin, Zhiyuan Liu and Zhenghao Liu
Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation
[Poster] [Slides] [Video]
Kyohoon Jin, Junho Lee, Juhwan Choi, Sangmin Song and Youngbin Kim
JLBert: Japanese Light BERT for Cross-Domain Short Text Classification
[Poster] [Slides] [Video]
Chandrai Kayal, Sayantan Chattopadhyay, Aryan Gupta, Satyen Abrol and Archie Gugol
CLASSLA-web: Comparable Web Corpora of South Slavic Languages Enriched with Linguistic and Genre Annotation
[Poster] [Slides] [Video]
Nikola Ljubešić and Taja Kuzman
SPICED: News Similarity Detection Dataset with Multiple Topics and Complexity Levels
[Poster] [Slides] [Video]
Elena Shushkevich, Long Thanh Mai, Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya
An LCF-IDF Document Representation Model Applied to Long Document Classification
[Poster] [Slides] [Video]
Renzo Arturo Alva Principe, Nicola Chiarini and Marco Viviani
Lessons from Deploying the First Bilingual Peruvian Sign Language - Spanish Online Dictionary
[Video]
Joe Huamani-Malca, Miguel Rodriguez Mondoñedo, Francisco Cerna-Herrera, Gissella Bejarano, Carlos Vásquez Roque, Cesar Augusto Ramos Cantu and Sabina Oporto Pérez
Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data
[Slides] [Video]
Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko and Shervin Malmasi
17:30 - 19:10D1-S3-P3 - Inference, Reasoning, Question Answering II (Chair: François Yvon)
Mitigating Misleading Chain-of-Thought Reasoning with Selective Filtering
[Poster] [Slides] [Video]
Yexin Wu, Zhuosheng Zhang and Hai Zhao
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
[Slides] [Video]
Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen
Visual-Textual Entailment with Quantities Using Model Checking and Knowledge Injection
[Slides] [Video]
Nobuyuki Iokawa and Hitomi Yanaka
MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts
[Slides] [Video]
Xiang Li, Shizhu He, Jiayu Wu, Zhao Yang, Yao Xu, Yang jun Jun, Haifeng Liu, Kang Liu and Jun Zhao
SGCM: Salience-Guided Context Modeling for Question Generation
[Video]
Chuyao Ding, Yu Hong and Jianmin Yao
TAPASGO: Transfer Learning towards a German-Language Tabular Question Answering Model
[Video]
Dominik Andreas Kowieski, Michael Hellwig and Thomas Feilhauer
Non-Essential Is NEcessary: Order-agnostic Multi-hop Question Generation
[Slides] [Video]
Kyungho Kim, Seongmin Park, Junseo Lee and Jihwa Lee
Does the Generator Mind Its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
[Slides] [Video]
Xinshuo Hu, Dongfang Li, Xiaoguang Li, Yuxiang Wu, Lifeng Shang and Baotian Hu
Generating Multiple-choice Questions for Medical Question Answering with Distractors and Cue-masking
[Slides] [Video]
Damien Sileo, Kanimozhi Uma and Marie-Francine Moens
How Robust Are the QA Models for Hybrid Scientific Tabular Data? A Study Using Customized Dataset
[Slides] [Video]
Akash Ghosh, Venkata Sahith Bathini, Niloy Ganguly, Pawan Goyal and Mayank Singh
Choice-75: A Dataset on Decision Branching in Script Learning
[Slides] [Video]
Zhaoyi Hou, Li Zhang and Chris Callison-Burch
Denoising Table-Text Retrieval for Open-Domain Question Answering
[Slides] [Video]
Deokhyung Kang, Baikjin Jung, Yunsu Kim and Gary Geunbae Lee
EEE-QA: Exploring Effective and Efficient Question-Answer Representations
[Poster] [Slides] [Video]
Zhanghao Hu, Yijun Yang, Junjie Xu, Yifu Qiu and Pinzhen Chen
17:30 - 19:10D1-S3-P3 - Language Modeling (Chair: François Yvon)
Code Defect Detection Using Pre-trained Language Models with Encoder-Decoder via Line-Level Defect Localization
[Slides] [Video]
Jimin An, YunSeok Choi and Jee-Hyong Lee
JCoLA: Japanese Corpus of Linguistic Acceptability
[Video]
Taiga Someya, Yushi Sugimoto and Yohei Oseki
NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages
[Video]
Francois Meyer, Haiyue Song, Abhisek Chakrabarty, Jan Buys, Raj Dabre and Hideki Tanaka
How Important Is Tokenization in French Medical Masked Language Models?
[Poster] [Slides] [Video]
Yanis Labrak, Adrien Bazoge, Béatrice Daille, Mickael Rouvier and Richard Dufour
On the Relationship between Skill Neurons and Robustness in Prompt Tuning
[Poster] [Slides] [Video]
Leon Ackermann and Xenia Isabel Ohmer
Jargon: A Suite of Language Models and Evaluation Tasks for French Specialized Domains
[Video]
Vincent Segonne, Aidan Mannion, Laura Cristina Alonzo Canul, Alexandre Daniel Audibert, Xingyu Liu, Cécile Macaire, Adrien Pupier, Yongxin Zhou, Mathilde Aguiar, Felix E. Herron, Magali Norré, Massih R Amini, Pierrette Bouillon, Iris Eshkol-Taravella, Emmanuelle Esperança-Rodier, Thomas François, Lorraine Goeuriot, Jérôme Goulian, Mathieu Lafourcade, Benjamin Lecouteux, François Portet, Fabien Ringeval, Vincent Vandeghinste, Maximin Coavoux, Marco Dinarelli and Didier Schwab
CoCoMIC: Code Completion by Jointly Modeling In-file and Cross-file Context
[Slides] [Video]
Yangruibo Ding, Zijian Wang, Wasi U. Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth and Bing Xiang
IAD: In-Context Learning Ability Decoupler of Large Language Models in Meta-Training
[Video]
Yuhan Liu, Xiuying Chen, Gao Xing, Ji Zhang and Rui Yan
Question Answering over Tabular Data with DataBench: A Large-Scale Empirical Evaluation of LLMs
[Slides] [Video]
Jorge Osés Grijalba, L. Alfonso Ureña-López, Eugenio Martínez Cámara and Jose Camacho-Collados
NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption
[Video]
Hao Gu, Jiangyan Yi, Zheng Lian, Jianhua Tao and Xinrui Yan
Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)
[Slides] [Video]
Alessio Miaschi, Felice Dell’Orletta and Giulia Venturi
Linguistic Rule Induction Improves Adversarial and OOD Robustness in Large Language Models
[Slides] [Video]
Shuoran Jiang, Qingcai Chen, Yang Xiang, Youcheng Pan and Yukang Lin
FLOR: On the Effectiveness of Language Adaptation
[Slides] [Video]
Severino Da Dalt, Joan Llop, Irene Baucells, Marc Pamies, Yishi Xu, Aitor Gonzalez-Agirre and Marta Villegas
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
[Video]
Ahmad Idrissi-Yaghir, Amin Dada, Henning Schäfer, Kamyar Arzideh, Giulia Baldini, Jan Trienes, Max Hasin, Jeanette Bewersdorff, Cynthia S. Schmidt, Marie Bauer, Kaleb E. Smith, Jiang Bian, Yonghui Wu, Jörg Schlötterer, Torsten Zesch, Peter A. Horn, Christin Seifert, Felix Nensa, Jens Kleesiek and Christoph M. Friedrich
Deconstructing In-Context Learning: Understanding Prompts via Corruption
[Slides] [Video]
Namrata Shivagunde, Vladislav Lialin, Sherin Muckatira and Anna Rumshisky
Improving the Robustness of Large Language Models via Consistency Alignment
[Poster] [Slides] [Video]
Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Shuaiqiang Wang, Chong Meng, Zhicong Cheng, Zhaochun Ren and Dawei Yin
LlamaCare: An Instruction Fine-Tuned Large Language Model for Clinical NLP
[Video]
Rumeng Li, Xun Wang and Hong Yu
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
[Slides] [Video]
Oana Ignat, Zhijing Jin, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Ece Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan C. Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu and Rada Mihalcea
Disambiguating Homographs and Homophones Simultaneously: A Regrouping Method for Japanese
[Video]
Yo Sato
Release of Pre-Trained Models for the Japanese Language
[Slides] [Video]
Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki and Koh Mitsuda
Agent-based Modeling of Language Change in a Small-world Network
[Poster] [Slides] [Video]
Dalmo Buzato and Evandro Cunha
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
[Poster] [Video]
Christophe Servan, Sahar Ghannay and Sophie Rosset
Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models
[Slides] [Video]
Boxi Cao, Qiaoyu Tang, Hongyu Lin, Shanshan Jiang, Bin Dong, Xianpei Han, Jiawei Chen, Tianshu Wang and Le Sun
17:30 - 19:10D1-S3-P3 - Less-Resourced/Endangered/Less-studied Languages I (Chair: François Yvon)
POS Tagging for the Endangered Dagur Language
[Poster] [Slides] [Video]
Joanna Dolińska and Delphine Bernhard
The ParCoLab Parallel Corpus and Its Extension to Four Regional Languages of France
[Poster] [Slides] [Video]
Dejan Stosic, Saša Marjanović, Delphine Bernhard, Myriam Bras, Laurent Kevers, Stella Retali-Medori, Marianne Vergez-Couret and Carole Werner
Towards Equitable Natural Language Understanding Systems for Dialectal Cohorts: Debiasing Training Data
[Poster] [Slides] [Video]
Khadige Abboud and Gokmen Oz
Mitigating Translationese in Low-resource Languages: The Storyboard Approach
[Slides] [Video]
Garry Kuwanto, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga, Teresiah W. Nyoike, Aniefon D. Akpan, Nsima Ab Udouboh, Idongesit Udeme Archibong, Idara Effiong Moses, Ifeoluwatayo A. Ige, Benjamin Ajibade, Olumide Benjamin Awokoya, Idris Abdulmumin, Saminu Mohammad Aliyu, Ruqayya Nasir Iro, Ibrahim Said Ahmad, Deontae Smith, Praise-EL Michaels, David Ifeoluwa Adelani, Derry Tanti Wijaya and Anietie Andy
Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants
[Poster] [Slides] [Video]
Miriam Winkler, Virginija Juozapaityte, Rob van der Goot and Barbara Plank
Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
[Video]
Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez and David Chiang
Speech Recognition Corpus of the Khinalug Language for Documenting Endangered Languages
[Slides] [Video]
Zhaolin Li, Monika Rind-Pawlowski and Jan Niehues
Extending AZee with Non-manual Gesture Rules for French Sign Language
[Poster] [Slides] [Video]
Camille Challant and Michael Filhol
Agettivu, Aggitivu o Aghjettivu? POS Tagging Corsican Dialects
[Slides] [Video]
Alice Millour, Lorenza Brasile, Alberto Ghia and Laurent Kevers
A Workflow for HTR-Postprocessing, Labeling and Classifying Diachronic and Regional Variation in Pre-Modern Slavic Texts
[Slides] [Video]
Piroska Lendvai, Maarten van Gompel, Anna Jouravel, Elena Renje, Uwe Reichel, Achim Rabus and Eckhart Arnold
Bootstrapping UMR Annotations for Arapaho from Language Documentation Resources
[Slides] [Video]
Matthew J. Buchholz, Julia Bonn, Claire Benet Post, Andrew Cowell and Alexis Palmer
Development of Community-Oriented Text-to-Speech Models for Māori ’Avaiki Nui (Cook Islands Māori)
[Slides] [Video]
Jesin James, Rolando Coto-Solano, Sally Akevai Nicholas, Joshua Zhu, Bovey Yu, Fuki Babasaki, Jenny Tyler Wang and Nicholas Derby
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation
[Video]
Atnafu Lambebo Tonja, Israel Abebe Azime, Tadesse Destaw Belay, Mesay Gemeda Yigezu, Moges Ahmed Ah Mehamed, Abinew Ali Ayele, Ebrahim Chekol Jibril, Michael Melese Woldeyohannis, Olga Kolesnikova, Philipp Slusallek, Dietrich Klakow and Seid Muhie Yimam
Evaluating Performance of Pre-trained Word Embeddings on Assamese, a Low-resource Language
[Slides] [Video]
Dhrubajyoti Pathak, Sukumar Nandi and Priyankoo Sarmah
Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction
[Poster] [Video]
MohanRaj Chanthran, Lay-Ki Soon, Huey Fang Ong and Bhawani Selvaretnam
Learning from Wrong Predictions in Low-Resource Neural Machine Translation
[Slides] [Video]
Jia Cheng Hu, Roberto Cavicchioli, Giulia Berardinelli and Alessandro Capotondi
Transferring BERT Capabilities from High-Resource to Low-Resource Languages Using Vocabulary Matching
[Slides] [Video]
Piotr Rybak
NSina: A News Corpus for Sinhala
[Poster] [Video]
Hansi Hettiarachchi, Damith Premasiri, Lasitha Randunu Chandrakantha Uyangodage and Tharindu Ranasinghe
17:30 - 19:10D1-S3-P3 - Machine Learning Models and Techniques for CL/NLP I (Chair: François Yvon)
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
[Video]
Danqing Luo, Chen Zhang, Yan Zhang and Haizhou Li
Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation
[Slides] [Video]
Heegon Jin, Seonil Son, Jemin Park, Youngseok Kim, Hyungjong Noh and Yeonsoo Lee
Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language behind
[Poster] [Slides] [Video]
Hongchuan Zeng, Hongshen Xu, Lu Chen and Kai Yu
Semantic Role Labeling Guided Out-of-distribution Detection
[Video]
Jinan Zou, Maihao Guo, Yu Tian, Yuhao Lin, Haiyao Cao, Lingqiao Liu, Ehsan Abbasnejad and Javen Qinfeng Shi
Evaluating Code-Switching Translation with Large Language Models
[Slides] [Video]
Muhammad Huzaifah, Weihua Zheng, Nattapol Chanpaisit and Kui Wu
Task-agnostic Distillation of Encoder-Decoder Language Models
[Slides] [Video]
Chen Zhang, Yang Yang, Qiuchi Li, Jingang Wang and Dawei Song
DDxGym: Online Transformer Policies in a Knowledge Graph Based Natural Language Environment
[Slides] [Video]
Benjamin Winter, Alexei Gustavo Figueroa Rosero, Alexander Loeser, Felix Alexander Gers, Nancy Katerina Figueroa Rosero and Ralf Krestel
TaiChi: Improving the Robustness of NLP Models by Seeking Common Ground While Reserving Differences
[Slides] [Video]
Huimin Chen, Chengyu Wang, Yanhao Wang, Cen Chen and Yinggui Wang
Rebalancing Label Distribution While Eliminating Inherent Waiting Time in Multi Label Active Learning Applied to Transformers
[Slides] [Video]
Maxime Arens, Lucile Callebert, Mohand Boughanem and Jose G. Moreno
FCDS: Fusing Constituency and Dependency Syntax into Document-Level Relation Extraction
[Video]
Xudong Zhu, Zhao Kang and Bei Hui
LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders
[Slides] [Video]
Xingwu Sun, Zhen Yang, Ruobing Xie, Fengzong Lian, Zhanhui Kang and Chengzhong Xu
Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks
[Slides] [Video]
Ileana Rugina, Rumen Dangovski, Li Jing, Preslav Nakov and Marin Soljacic
Article Classification with Graph Neural Networks and Multigraphs
[Slides] [Video]
Khang Ly, Yury Kashnitsky, Savvas Chamezopoulos and Valeria Krzhizhanovskaya
Predictive and Distinctive Linguistic Features in Schizophrenia-Bipolar Spectrum Disorders
[Slides] [Video]
Martina Katalin Szabó, Veronika Vincze, Bernadett Dam, Csenge Guba, Anita Bagi and István Szendi
Towards Understanding the Relationship between In-context Learning and Compositional Generalization
[Slides] [Video]
Sungjun Han and Sebastian Padó
Multi-Channel Spatio-Temporal Transformer for Sign Language Production
[Slides] [Video]
Xiaohan Ma, Rize Jin and Tae-Sun Chung
Using Pre-Trained Language Models in an End-to-End Pipeline for Antithesis Detection
[Poster] [Video]
Ramona Kühn, Khouloud Saadi, Jelena Mitrović and Michael Granitzer
Evolving Knowledge Distillation with Large Language Models and Active Learning
[Slides] [Video]
Chengyuan Liu, Fubang Zhao, Kun Kuang, Yangyang Kang, Zhuoren Jiang, Changlong Sun and Fei Wu
How Speculative Can Speculative Decoding Be?
[Slides] [Video]
Zhuorui Liu, Chen Zhang and Dawei Song
Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization
[Video]
Shuo Yang and Gjergji Kasneci
19:10 - 19:50ELRA Members Meeting
[Video]
20:00 - 22:00Welcome Reception
 End of Day 1
  

Thursday, 23 May 2024

 Day 2
D2-S1-R1 - Corpora and Annotation III (Chair: Giulia Venturi)
09:00 - 09:20Motivational Interviewing Transcripts Annotated with Global Scores
[Slides] [Video]
Ben Cohen, Moreah Zisquit, Stav Yosef, Doron Friedman and Kfir Bar
09:20 - 09:40Project MOSLA: Recording Every Moment of Second Language Acquisition
[Slides] [Video]
Masato Hagiwara and Joshua B. Tanner
09:40 - 10:00SarcNet: A Multilingual Multimodal Sarcasm Detection Dataset
[Slides] [Video]
Tan Yue, Xuzhao Shi, Rui Mao, Zonghai Hu and Erik Cambria
10:00 - 10:20Controllable Paraphrase Generation for Semantic and Lexical Similarities
[Slides] [Video]
Yuya Ogasa, Tomoyuki Kajiwara and Yuki Arase
10:20 - 10:40ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
[Slides] [Video]
Injy Hamed, Fadhl Eryani, David Palfreyman and Nizar Habash
D2-S1-R2 - Natural Language Generation, Summarization and Simplification II (Chair: Hiroya Takamura)
09:00 - 09:20Replace, Paraphrase or Fine-tune? Evaluating Automatic Simplification for Medical Texts in Spanish
[Slides] [Video]
Leonardo Campillos-Llanos, Ana Rosa Terroba, Rocío Bartolomé, Ana Valverde-Mateos, Cristina González, Adrián Capllonch-Carrión and Jonathan Heras
09:20 - 09:40Is It Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models
[Slides] [Video]
Asma Farajidizaji, Vatsal Raina and Mark Gales
09:40 - 10:00PSentScore: Evaluating Sentiment Polarity in Dialogue Summarization
[Slides] [Video]
Yongxin Zhou, Fabien Ringeval and François Portet
10:00 - 10:20Effective Distillation of Table-based Reasoning Ability from LLMs
[Slides] [Video]
Bohao Yang, Chen Tang, Kun Zhao, Chenghao Xiao and Chenghua Lin
10:20 - 10:40Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
[Slides] [Video]
Taiji Li, Zhi Li and Yin Zhang
D2-S1-R3 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation II (Chair: Orphee De Clercq)
09:00 - 09:20Enhanced Coherence-Aware Network with Hierarchical Disentanglement for Aspect-Category Sentiment Analysis
[Slides] [Video]
Jin Cui, Fumiyo Fukumoto, Xinfeng Wang, Yoshimi Suzuki, Jiyi Li, Noriko Tomuro and Wanzeng Kong
09:20 - 09:40IDEM: The IDioms with EMotions Dataset for Emotion Recognition
[Slides] [Video]
Alexander Prochnow, Johannes E. Bendler, Caroline Lange, Foivos Ioannis Tzavellos, Bas Marco Göritzer, Marijn ten Thij and Riza Batista-Navarro
09:40 - 10:00Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives
[Slides] [Video]
Gustave Cortal
10:00 - 10:20SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity
[Slides] [Video]
Jaemin Kim, Yohan Na, Kangmin Kim, Sang-Rak Lee and Dong-Kyu Chae
10:20 - 10:40Diffusion Based Counterfactual Augmentation for Dual Sentiment Classification
[Slides] [Video]
Dancheng Xin, Jiawei Yuan and Yang Li
D2-S1-R4 - Machine Learning Models and Techniques for CL/NLP I (Chair: Menno van Zaanen)
09:00 - 09:20Beyond Canonical Fine-tuning: Leveraging Hybrid Multi-Layer Pooled Representations of BERT for Automated Essay Scoring
[Slides] [Video]
Eujene Nikka V. Boquio and Prospero C. Naval, Jr.
09:20 - 09:40ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
[Slides] [Video]
Paramita Mirza, Viju Sudhi, Soumya Ranjan Sahoo and Sinchana Ramakanth Bhat
09:40 - 10:00Did You Get It? A Zero-Shot Approach to Locate Information Transfers in Conversations
[Slides] [Video]
Eliot Maës, Hossam Boudraa, Philippe Blache and Leonor Becerra-Bonache
10:00 - 10:20Recognizing Value Resonance with Resonance-Tuned RoBERTa Task Definition, Experimental Validation, and Robust Modeling
[Video]
Noam K. Benkler, Scott Friedman, Sonja Schmer-Galunder, Drisana Marissa Mosaphir, Robert P. Goldman, Ruta Wheelock, Vasanth Sarathy, Pavan Kantharaju and Matthew D. McLure
10:20 - 10:40EFTNAS: Searching for Efficient Language Models in First-Order Weight-Reordered Super-Networks
[Video]
Juan Pablo Munoz, Yi Zheng and Nilesh Jain
D2-S1-R5 - Special Session Industrial Track I
[Video]
09:00 - 09:20SmartBic: Harvesting Smart Bilingual Corpora from the Internet
09:20 - 09:40Customer at the Core: Transforming Gruner and Jahr with NLP in a Digital Age
09:40 - 10:00Better language resources for better machine learning models and language technology techniques
10:00 - 10:20How Many Annotators Does it Take to Have an LLM?
10:20 - 10:40Evaluating automated translation flows
D2-S1-R6 - Language Modeling (Chair: Fabio Tamburini)
09:00 - 09:20Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
[Slides] [Video]
ChangSu Choi, Yongbin Jeong, Seoyoon Park, Inho Won, HyeonSeok Lim, SangMin Kim, Yejee Kang, Chanhyuk Yoon, Jaewan Park, Yiseul Lee, HyeJin Lee, Younggyun Hahm, Hansaem Kim and KyungTae Lim
09:20 - 09:40Adaptive Reinforcement Tuning Language Models as Hard Data Generators for Sentence Representation
[Slides] [Video]
Bo Xu, Yifei Wu, Shouang Wei, Ming Du and Hongya Wang
09:40 - 10:00A Family of Pretrained Transformer Language Models for Russian
[Slides] [Video]
Dmitry Zmitrovich, Aleksandr Abramov, Andrey Kalmykov, Vitaly Kadulin, Maria Tikhonova, Ekaterina Taktasheva, Danil Astafurov, Mark Baushenko, Artem Snegirev, Tatiana Shavrina, Sergei S. Markov, Vladislav Mikhailov and Alena Fenogenova
10:00 - 10:20How Well Can BERT Learn the Grammar of an Agglutinative and Flexible-Order Language? The Case of Basque
[Slides] [Video]
Gorka Urbizu, Muitze Zulaika, Xabier Saralegi and Ander Corral
10:20 - 10:40KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
[Slides] [Video]
Dongyang Li, Taolin Zhang, Longtao Huang, Chengyu Wang, Xiaofeng He and Hui Xue
09:00 - 10:40D2-S1-P4 - Applications Involving LRs and Evaluation I (Chair: Atul Kr. Ojha)
Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition
[Poster] [Slides] [Video]
David Gimeno-Gómez and Carlos-D. Martínez-Hinarejos
Medical Vision-Language Pre-Training for Brain Abnormalities
[Video]
Masoud Monajatipoor, Zi-Yi Dou, Aichi Chien, Nanyun Peng and Kai-Wei Chang
CuSINeS: Curriculum-driven Structure Induced Negative Sampling for Statutory Article Retrieval
[Poster] [Slides] [Video]
Santosh T.Y.S.S., Kristina Kaiser and Matthias Grabmair
NumHG: A Dataset for Number-Focused Headline Generation
[Poster] [Slides] [Video]
Jian-Tao Huang, Chung-Chi Chen, Hen-Hsen Huang and Hsin-Hsi Chen
A Dual-View Approach to Classifying Radiology Reports by Co-Training
[Slides] [Video]
Yutong Han, Yan Yuan and Lili Mou
Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese
[Slides] [Video]
Yikun Sun, Zhen Wan, Nobuhiro Ueda, Sakiko Yahata, Fei Cheng, Chenhui Chu and Sadao Kurohashi
A Concept Based Approach for Translation of Medical Dialogues into Pictographs
[Slides] [Video]
Johanna Gerlach, Pierrette Bouillon, Jonathan Mutal and Hervé Spechbach
Grammatical Error Correction for Code-Switched Sentences by Learners of English
[Poster] [Slides] [Video]
Kelvin Wey Han Chan, Christopher Bryant, Li Nguyen, Andrew Caines and Zheng Yuan
A Benchmark Evaluation of Clinical Named Entity Recognition in French
[Poster] [Video]
Nesrine Bannour, Christophe Servan, Aurélie Névéol and Xavier Tannier
A Virtual Patient Dialogue System Based on Question-Answering on Clinical Records
[Video]
Janire Arana, Mikel Idoyaga, Maitane Urruela, Elisa Espina, Aitziber Atutxa Salazar and Koldo Gojenola
Using Speech Technology to Test Theories of Phonetic and Phonological Typology
[Slides] [Video]
Anisia Popescu, Lori Lamel and Ioana Vasilescu
When Argumentation Meets Cohesion: Enhancing Automatic Feedback in Student Writing
[Slides] [Video]
Yuning Ding, Omid Kashefi, Swapna Somasundaran and Andrea Horbach
SpreadNaLa: A Naturalistic Code Generation Evaluation Dataset of Spreadsheet Formulas
[Slides] [Video]
Sebastian Schuster, Ayesha Ansar, Om Agarwal and Vera Demberg
Exploring the Usability of Persuasion Techniques for Downstream Misinformation-related Classification Tasks
[Slides] [Video]
Nikolaos Nikolaidis, Jakub Piskorski and Nicolas Stefanovitch
Evaluating ChatGPT against Functionality Tests for Hate Speech Detection
[Poster] [Slides] [Video]
Mithun Das, Saurabh Kumar Pandey and Animesh Mukherjee
09:00 - 10:40D2-S1-P4 - Corpora and Annotation I (Chair: Atul Kr. Ojha)
Aligning the Norwegian UD Treebank with Entity and Coreference Information
[Poster] [Slides] [Video]
Tollef Emil Jørgensen and Andre Kåsen
Validating and Exploring Large Geographic Corpora
[Slides] [Video]
Jonathan Dunn
Data Drift in Clinical Outcome Prediction from Admission Notes
[Slides] [Video]
Paul Grundmann, Jens-Michalis Papaioannou, Tom Oberhauser, Thomas Steffek, Amy Siu, Wolfgang Nejdl and Alexander Loeser
JFLD: A Japanese Benchmark for Deductive Reasoning Based on Formal Logic
[Video]
Terufumi Morishita, Atsuki Yamaguchi, Gaku Morio, Hikaru Tomonari, Osamu Imaichi and Yasuhiro Sogawa
NAIST-SIC-Aligned: An Aligned English-Japanese Simultaneous Interpretation Corpus
[Slides] [Video]
Jinming Zhao, Katsuhito Sudoh, Satoshi Nakamura, Yuka Ko, Kosuke Doi and Ryo Fukuda
Specifying Genericity through Inclusiveness and Abstractness Continuous Scales
[Video]
Claudia Collacciani, Andrea Amelio Ravelli and Marianna Bolognesi
Logging Keystrokes in Writing by English Learners
[Slides] [Video]
Georgios Velentzas, Andrew Caines, Rita Borgo, Erin Pacquetet, Clive Hamilton, Taylor Arnold, Diane Nicholls, Paula Buttery, Thomas Gaillat, Nicolas Ballier and Helen Yannakoudakis
FReND: A French Resource of Negation Data
[Slides] [Video]
Hafida Le Cloirec - Ait Yahya, Olga Seminck and Pascal Amsili
FRASIMED: A Clinical French Annotated Resource Produced through Crosslingual BERT-Based Annotation Projection
[Poster] [Slides] [Video]
Jamil Zaghir, Mina Bjelogrlic, Jean-Philippe Goldman, Soukaïna Aananou, Christophe Gaudet-Blavignac and Christian Lovis
Automatically Estimating Textual and Phonemic Complexity for Cued Speech: How to See the Sounds from French Texts
Núria Gala, Brigitte Bigi and Marie Bauer
Leveraging Domain Corpora for Enhanced Terminology: The Case of Estonian-English Remote Sensing Termbase
[Poster] [Slides] [Video]
Liisi Jakobson, Jelena Kallas and Erko Jakobson
Dataset of Quotation Attribution in German News Articles
[Poster] [Video]
Fynn Petersen-Frey and Chris Biemann
New Datasets for Automatic Detection of Textual Entailment and of Contradictions between Sentences in French
[Video]
Maximos Skandalis, Richard Moot, Christian Retoré and Simon Robillard
MentalRiskES: A New Corpus for Early Detection of Mental Disorders in Spanish
[Poster] [Slides] [Video]
Alba M. Mármol Romero, Adrián Moreno Muñoz, Flor Miriam Plaza-del-Arco, M. Dolores Molina González, María Teresa Martín Valdivia, L. Alfonso Ureña-López and Arturo Montejo Ráez
Multimodal Behaviour in an Online Environment: The GEHM Zoom Corpus Collection
[Slides] [Video]
Patrizia Paggio, Manex Agirrezabal, Costanza Navarretta and Leo Vitasovic
Tell Me Again! A Large-Scale Dataset of Multiple Summaries for the Same Story
[Poster] [Slides] [Video]
Hans Ole Hatzel and Chris Biemann
Who Did You Blame When Your Project Failed? Designing a Corpus for Presupposition Generation in Cross-Examination Dialogues
[Poster] [Video]
Maria Francis, Julius Steuer, Dietrich Klakow and Volha Petukhova
Corpus Services: A Framework to Curate XML Corpus Data
[Video]
Aleksandr Riaposov and Elena Lazarenko
The Slovak Autistic and Non-Autistic Child Speech Corpus: Task-Oriented Child-Adult Interactions
Joanna Kruyt, Róbert Sabo, Katarína Polónyiová, Daniela Ostatníková and Štefan Beňuš
RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict
[Poster] [Slides] [Video]
Yirong Zeng, Xiao Ding, Yi Zhao, Xiangyu Li, Jie Zhang, Chao Yao, Ting Liu and Bing Qin
SOBR: A Corpus for Stylometry, Obfuscation, and Bias on Reddit
[Slides] [Video]
Chris Emmery, Marilù Miotto, Sergey Kramp and Bennett Kleinberg
09:00 - 10:40D2-S1-P4 - Lexicon and Semantics I (Chair: Atul Kr. Ojha)
ZeLa: Advancing Zero-Shot Multilingual Semantic Parsing with Large Language Models and Chain-of-Thought Strategies
[Poster] [Slides] [Video]
Truong Dinh Do, Phuong Minh Nguyen and Minh Nguyen
LexComSpaL2: A Lexical Complexity Corpus for Spanish as a Foreign Language
[Poster] [Slides] [Video]
Jasper Degraeuwe and Patrick Goethals
Sense of the Day: Short Timeframe Temporal-Aware Word Sense Disambiguation
[Video]
Yuchen Wei and Milton King
Reassessing Semantic Knowledge Encoded in Large Language Models through the Word-in-Context Task
[Poster] [Video]
Yoshihiko Hayashi
Multi-modal Semantic Understanding with Contrastive Cross-modal Feature Alignment
[Poster] [Slides] [Video]
Ming Zhang, Ke Chang and Yunfang Wu
ISO 24617-12: A New Standard for Semantic Annotation
[Poster] [Slides] [Video]
Harry Bunt
Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds
[Slides] [Video]
Annerose Eichel, Tana Deeg, Andre Blessing, Milena Belosevic, Sabine Arndt-Lappe and Sabine Schulte im Walde
Complex Word Identification: A Comparative Study between ChatGPT and a Dedicated Model for This Task
[Slides] [Video]
Abdelhak Kelious, Mathieu Constant and Christophe Coeur
Towards a Danish Semantic Reasoning Benchmark - Compiled from Lexical-Semantic Resources for Assessing Selected Language Understanding Capabilities of Large Language Models
[Video]
Bolette Pedersen, Nathalie Sørensen, Sussi Olsen, Sanni Nimb and Simon Gray
Multilingual Substitution-based Word Sense Induction
[Slides] [Video]
Denis Kokosinskii and Nikolay Arefyev
Morpheme Sense Disambiguation: A New Task Aiming for Understanding the Language at Character Level
[Slides] [Video]
Yue Wang, Hua Zheng, Yaqi Yin, Hansi Wang, Qiliang Liang and Yang Liu
Language Models and Semantic Relations: A Dual Relationship
[Poster] [Slides] [Video]
Olivier Ferret
Analyzing Homonymy Disambiguation Capabilities of Pretrained Language Models
[Poster] [Slides] [Video]
Lorenzo Proietti, Stefano Perrella, Simone Tedeschi, Giulia Vulpis, Leonardo Lavalle, Andrea Sanchietti, Andrea Ferrari and Roberto Navigli
To Learn or Not to Learn: Replaced Token Detection for Learning the Meaning of Negation
[Slides] [Video]
Gunjan Bhattarai and Katrin Erk
MultiLexBATS: Multilingual Dataset of Lexical Semantic Relations
[Slides] [Video]
Dagmar Gromann, Hugo Goncalo Oliveira, Lucia Pitarch, Elena-Simona Apostol, Jordi Bernad, Eliot Bytyçi, Chiara Cantone, Sara Carvalho, Francesca Frontini, Radovan Garabik, Jorge Gracia, Letizia Granata, Fahad Khan, Timotej Knez, Penny Labropoulou, Chaya Liebeskind, Maria Pia Di Buono, Ana Ostroški Anić, Sigita Rackevičienė, Ricardo Rodrigues, Gilles Sérasset, Linas Selmistraitis, Mahammadou Sidibé, Purificação Silvano, Blerina Spahiu, Enriketa Sogutlu, Ranka Stanković, Ciprian-Octavian Truică, Giedre Valunaite Oleskeviciene, Slavko Zitnik and Katerina Zdravkova
ChainNet: Structured Metaphor and Metonymy in WordNet
[Slides] [Video]
Rowan Hall Maudslay, Simone Teufel, Francis Bond and James Pustejovsky
Leveraging AMR Graph Structure for Better Sequence-to-Sequence AMR Parsing
[Slides] [Video]
Linyu Fan, Wu Wu Yiheng, Jun Xie, Junhui Li, Fang Kong and Guodong Zhou
Frame2: A FrameNet-based Multimodal Dataset for Tackling Text-image Interactions in Video
[Poster] [Slides] [Video]
Frederico Belcavello, Tiago Timponi Torrent, Ely E. Matos, Adriana S. Pagano, Maucha Gamonal, Natalia Sigiliano, Lívia Vicente Dutra, Helen de Andrade Abreu, Mairon Samagaio, Mariane Carvalho, Franciany Campos, Gabrielly Azalim, Bruna Mazzei, Mateus Fonseca de Oliveira, Ana Carolina Loçasso Luz, Lívia Pádua Ruiz, Júlia Bellei, Amanda Pestana, Josiane Costa, Iasmin Rabelo, Anna Beatriz Silva, Raquel Roza, Mariana Souza and Igor Oliveira
09:00 - 10:40D2-S1-P4 - Offensive and Harmful Language Detection and Analysis (Chair: Atul Kr. Ojha)
Human vs. Machine Perceptions on Immigration Stereotypes
[Slides] [Video]
Wolfgang S. Schmeisser-Nieto, Pol Pastells, Simona Frenda and Mariona Taule
Detecting Offensive Language in an Open Chatbot Platform
[Slides] [Video]
Hyeonho Song, Jisu Hong, Chani Jung, Hyojin Chin, Mingi Shin, Yubin Choi, Junghoi Choi and Meeyoung Cha
The Corpus AIKIA: Using Ranking Annotation for Offensive Language Detection in Modern Greek
[Video]
Stella Markantonatou, Vivian Stamou, Christina Christodoulou, Georgia Apostolopoulou, Antonis Balas and George Ioannakis
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
[Slides] [Video]
Yiping Jin, Leo Wanner and Alexander Shvets
CyberAgressionAdo-v2: Leveraging Pragmatic-Level Information to Decipher Online Hate in French Multiparty Chats
[Poster] [Video]
Anais Ollagnier
IDEATE: Detecting AI-Generated Text Using Internal and External Factual Structures
[Video]
Quan Wang, Licheng Zhang, Zikang Guo and Zhendong Mao
GERMS-AT: A Sexism/Misogyny Dataset of Forum Comments from an Austrian Online Newspaper
[Video]
Brigitte Krenn, Johann Petrak, Marina Kubina and Christian Burger
GerDISDETECT: A German Multilabel Dataset for Disinformation Detection
[Video]
Mina Schütz, Daniela Pisoiu, Daria Liakhovets, Alexander Schindler and Melanie Siegel
QUEEREOTYPES: A Multi-Source Italian Corpus of Stereotypes towards LGBTQIA+ Community Members
[Poster] [Video]
Alessandra Teresa Cignarella, Manuela Sanguinetti, Simona Frenda, Andrea Marra, Cristina Bosco and Valerio Basile
Detecting Cybercrimes in Accordance with Pakistani Law: Dataset and Evaluation Using PLMs
[Slides] [Video]
Faizad Ullah, Ali Faheem, Ubaid Azam, Muhammad Sohaib Ayub, Faisal Kamiran and Asim Karim
Multi-domain Hate Speech Detection Using Dual Contrastive Learning and Paralinguistic Features
[Slides] [Video]
Somaiyeh Dehghan and Berrin Yanıkoğlu
On Zero-Shot Counterspeech Generation by LLMs
[Slides] [Video]
Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann and Animesh Mukherjee
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
[Poster] [Video]
Jaione Bengoetxea, Yi-Ling Chung, Marco Guerini and Rodrigo Agerri
BAN-PL: A Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl Web Service
[Slides] [Video]
Anna Kolos, Inez Okulska, Kinga Głąbińska, Agnieszka Karlinska, Emilia Wisnios, Paweł Ellerik and Andrzej Prałat
Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles
[Slides] [Video]
Maram Hasanain, Fatema Ahmad and Firoj Alam
The Challenges of Creating a Parallel Multilingual Hate Speech Corpus: An Exploration
[Slides] [Video]
Katerina Korre, Arianna Muti and Alberto Barrón-Cedeño
JL-Hate: An Annotated Dataset for Joint Learning of Hate Speech and Target Detection
[Slides] [Video]
Kaan Büyükdemirci, Izzet Emre Kucukkaya, Eren Ölmez and Cagri Toraman
How Much Do Robots Understand Rudeness? Challenges in Human-Robot Interaction
[Slides] [Video]
Michael Andrew Orme, Yanchao Yu and Zhiyuan Tan
Beyond Binary: Towards Embracing Complexities in Cyberbullying Detection and Intervention - a Position Paper
Kanishk Verma, Kolawole John Adebayo, Joachim Wagner, Megan Reynolds, Rebecca Umbach, Tijana Milosevic and Brian Davis
Denoising Labeled Data for Comment Moderation Using Active Learning
[Video]
Andraž Pelicon, Mladen Karan, Ravi Shekhar, Matthew Purver and Senja Pollak
09:00 - 10:40D2-S1-P4 - Evaluation and Validation Methodologies II (Chair: Atul Kr. Ojha)
Small Language Models Are Good Too: An Empirical Study of Zero-Shot Classification
[Poster] [Slides] [Video]
Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan and Sophie Rosset
A Web Portal about the State of the Art of NLP Tasks in Spanish
[Video]
Enrique Amigó, Jorge Carrillo-de-Albornoz, Andrés Fernández, Julio Gonzalo, Guillermo Marco, Roser Morante, Laura Plaza and Jacobo Pedrosa
Latent vs Explicit Knowledge Representation: How ChatGPT Answers Questions about Low-Frequency Entities
[Slides] [Video]
Arianna Graciotti, Valentina Presutti and Rocco Tripodi
Beyond Static Evaluation: A Dynamic Approach to Assessing AI Assistants’ API Invocation Capabilities
[Poster] [Slides] [Video]
Honglin Mu, Yang Xu, Yunlong Feng, Xiaofeng Han, Yitong Li, Yutai Hou and Wanxiang Che
Text Style Transfer Evaluation Using Large Language Models
[Poster] [Slides] [Video]
Phil Sidney Ostheimer, Mayank Kumar Nagda, Marius Kloft and Sophie Fellenz
Pointing Out the Shortcomings of Relation Extraction Models with Semantically Motivated Adversarials
[Slides] [Video]
Gennaro Nolano, Moritz Blum, Basil Ell and Philipp Cimiano
Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation
[Slides] [Video]
Matthias Sperber, Ondřej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh and Marco Turchi
Making Sentence Embeddings Robust to User-Generated Content
[Slides] [Video]
Lydia Nishimwe, Benoît Sagot and Rachel Bawden
Projective Methods for Mitigating Gender Bias in Pre-trained Language Models
[Slides] [Video]
Hillary Dawkins, Isar Nejadgholi, Daniel Gillis and Judi McCuaig
Two Counterexamples to Tokenization and the Noiseless Channel
[Slides] [Video]
Marco Cognetta, Vilém Zouhar, Sangwhan Moon and Naoaki Okazaki
Automatic Decomposition of Text Editing Examples into Primitive Edit Operations: Toward Analytic Evaluation of Editing Systems
[Slides] [Video]
Daichi Yamaguchi, Rei Miyata, Atsushi Fujita, Tomoyuki Kajiwara and Satoshi Sato
Analyzing Interpretability of Summarization Model with Eye-gaze Information
[Poster] [Slides] [Video]
Fariz Ikhwantri, Hiroaki Yamada and Takenobu Tokunaga
Anchor and Broadcast: An Efficient Concept Alignment Approach for Evaluation of Semantic Graphs
[Slides] [Video]
Haibo Sun and Nianwen Xue
COMET for Low-Resource Machine Translation Evaluation: A Case Study of English-Maltese and Spanish-Basque
[Slides] [Video]
Júlia Falcão, Claudia Borg, Nora Aranberri and Kurt Abela
Unraveling Spontaneous Speech Dimensions for Cross-Corpus ASR System Evaluation for French
[Video]
Solene Virginie Evain, Solange Rossato and François Portet
Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science
[Slides] [Video]
Yida Mu, Ben P. Wu, William Thorne, Ambrose Robinson, Nikolaos Aletras, Carolina Scarton, Kalina Bontcheva and Xingyi Song
DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain
[Poster] [Slides] [Video]
Yanis Labrak, Adrien Bazoge, Oumaima El Khettari, Mickael Rouvier, Pacome Constant dit Beaufils, Natalia Grabar, Béatrice Daille, Solen Quiniou, Emmanuel Morin, Pierre-Antoine Gourraud and Richard Dufour
10:40 - 11:00Coffee break
D2-S2-R1 - Corpora and Annotation IV (Chair: Dominique Brunato)
11:00 - 11:20Language Pivoting from Parallel Corpora for Word Sense Disambiguation of Historical Languages: A Case Study on Latin
[Slides] [Video]
Iacopo Ghinassi, Simone Tedeschi, Paola Marongiu, Roberto Navigli and Barbara McGillivray
11:20 - 11:40NarrativeTime: Dense Temporal Annotation on a Timeline
[Slides] [Video]
Anna Rogers, Marzena Karpinska, Ankita Gupta, Vladislav Lialin, Gregory Smelkov and Anna Rumshisky
11:40 - 12:00Do Language Models Care about Text Quality? Evaluating Web-Crawled Corpora across 11 Languages
[Slides] [Video]
Rik van Noord, Taja Kuzman, Peter Rupnik, Nikola Ljubešić, Miquel Esplà-Gomis, Gema Ramírez-Sánchez and Antonio Toral
12:00 - 12:20New Methods for Exploring Intonosyntax: Introducing an Intonosyntactic Treebank for Nigerian Pidgin
[Video]
Emmett Strickland, Anne Lacheret-Dujour, Sylvain Kahane, Marc Evrard, Perrine Quennehen, Bernard Caron, Francis Egbokhare and Bruno Guillaume
12:20 - 12:40Towards a Unified Taxonomy of Deep Syntactic Relations
[Video]
Kira Droganova and Daniel Zeman
D2-S2-R2 - Multimodal Applications, Grounded Language Acquisition, and HRI II (Chair: Syrielle Montariol)
11:00 - 11:20Neural Multimodal Topic Modeling: A Comprehensive Evaluation
[Slides] [Video]
Felipe Gonzalez-Pizarro and Giuseppe Carenini
11:20 - 11:40Intrinsic Subgraph Generation for Interpretable Graph Based Visual Question Answering
[Slides] [Video]
Pascal Tilli and Ngoc Thang Vu
11:40 - 12:00Releasing the Capacity of GANs in Non-Autoregressive Image Captioning
[Slides] [Video]
Da Ren and Qing Li
12:00 - 12:20DGS-Fabeln-1: A Multi-Angle Parallel Corpus of Fairy Tales between German Sign Language and German Text
Fabrizio Nunnari, Eleftherios Avramidis, Cristina España-Bonet, Marco González, Anna Hennes and Patrick Gebhard
12:20 - 12:40Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
[Slides] [Video]
Philipp Sadler, Sherzod Hakimov and David Schlangen
D2-S2-R3 - Less-Resourced/Endangered/Less-studied Languages I (Chair: Stelios Piperidis)
11:00 - 11:20Parsing for Mauritian Creole Using Universal Dependencies
[Slides] [Video]
Neha Ramsurrun, Rolando Coto-Solano and Michael Gonzalez
11:20 - 11:40Strengthening the WiC: New Polysemy Dataset in Hindi and Lack of Cross Lingual Transfer
[Video]
Haim Dubossarsky and Farheen Dairkee
11:40 - 12:00KazQAD: Kazakh Open-Domain Question Answering Dataset
[Slides] [Video]
Rustem Yeshpanov, Pavel Efimov, Leonid Boytsov, Ardak Shalkarbayuli and Pavel Braslavski
12:00 - 12:20The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment
[Slides] [Video]
Chris Chinenye Emezue, Ifeoma Okoh, Chinedu Emmanuel Mbonu, Chiamaka Chukwuneke, Daisy Monika Lal, Ignatius Ezeani, Paul Rayson, Ijemma Onwuzulike, Chukwuma Onyebuchi Okeke, Gerald Okey Nweya, Bright Ikechukwu Ogbonna, Chukwuebuka Uchenna Oraegbunam, Esther Chidinma Awo-Ndubuisi and Akudo Amarachukwu Osuagwu
12:20 - 12:40AssameseBackTranslit: Back Transliteration of Romanized Assamese Social Media Text
[Slides] [Video]
Hemanta Baruah, Sanasam Ranbir Singh and Priyankoo Sarmah
D2-S2-R4 - Trustworthy, Interpretability, and Explainability of Neural Models I (Chair: Emmanuele Chersoni)
11:00 - 11:20Pre-Trained Language Models Represent Some Geographic Populations Better than Others
[Slides] [Video]
Jonathan Dunn, Benjamin Adams and Harish Tayyar Madabushi
11:20 - 11:40Causal Intersectionality and Dual Form of Gradient Descent for Multimodal Analysis: A Case Study on Hateful Memes
[Slides] [Video]
Yosuke Miyanishi and Minh Le Nguyen
11:40 - 12:00Jump to Conclusions: Short-Cutting Transformers with Linear Transformations
[Slides] [Video]
Alexander Yom Din, Taelin Karidi, Leshem Choshen and Mor Geva
12:00 - 12:20Distillation with Explanations from Large Language Models
[Video]
Hanyu Zhang, Xiting Wang, Xiang Ao and Qing He
12:20 - 12:40Evaluating Saliency Explanations in NLP by Crowdsourcing
[Slides] [Video]
Xiaotian Lu, Jiyi Li, Zhen Wan, Xiaofeng Lin, Koh Takeuchi and Hisashi Kashima
D2-S2-R5 - Special Session Industrial Track II
[Video]
11:20 - 11:40Research in NLP @Orange and first results on Efficient Domain Adaptation of LLMs for the Telco domain
11:40 - 12:00Creating more inclusive language technologies at Google and beyond
12:00 - 12:20Retrieval-Augmented Generation in Baidu Search
D2-S2-R6 - Lexicon and Semantics (Chair: Aina Garí Soler)
11:00 - 11:20Beyond Model Performance: Can Link Prediction Enrich French Lexical Graphs?
[Slides] [Video]
Hee-Soo Choi, Priyansh Trivedi, Mathieu Constant, Karen Fort and Bruno Guillaume
11:20 - 11:40Towards Semantic Tagging for Irish
[Slides] [Video]
Tim Czerniak and Elaine Uí Dhonnchadha
11:40 - 12:00When Your Cousin Has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages
[Slides] [Video]
Niyati Bafna, Cristina España-Bonet, Josef van Genabith, Benoît Sagot and Rachel Bawden
12:00 - 12:20On Modelling Corpus Citations in Computational Lexical Resources
[Video]
Fahad Khan, Maxim Ionov, Christian Chiarcos, Laurent Romary, Gilles Sérasset and Besim Kabashi
12:20 - 12:40Bridging Computational Lexicography and Corpus Linguistics: A Query Extension for OntoLex-FrAC
[Video]
Christian Chiarcos, Ranka Stanković, Maxim Ionov and Gilles Sérasset
11:00 - 12:40D2-S2-P5 - Corpora and Annotation II (Chair: Enrica Troiano)
A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages
[Poster] [Slides] [Video]
Jorge Palomar-Giner, Jose Javier Saiz, Ferran Espuña, Mario Mina, Severino Da Dalt, Joan Llop, Malte Ostendorff, Pedro Ortiz Suarez, Georg Rehm, Aitor Gonzalez-Agirre and Marta Villegas
The Syntactic Acceptability Dataset (Preview): A Resource for Machine Learning and Linguistic Analysis of English
[Video]
Tom S Juzek
Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset
[Poster] [Slides] [Video]
Santosh T.Y.S.S., Nina Baumgartner, Matthias Stürmer, Matthias Grabmair and Joel Niklaus
Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition
[Poster] [Slides] [Video]
Sungjoo Byun, Jiseung Hong, Sumin Park, Dongjun Jang, Jean Seo, Minseok Kim, Chaeyoung Oh and Hyopil Shin
A Corpus of Spontaneous L2 English Speech for Real-situation Speaking Assessment
[Poster] [Slides] [Video]
Sylvain Coulange, Marie-Hélène Fries, Monica Masperi and Solange Rossato
CONAN-MT-SP: A Spanish Corpus for Counternarrative Using GPT Models
[Poster] [Slides] [Video]
María Estrella Vallecillo Rodríguez, Maria Victoria Cantero Romero, Isabel Cabrera De Castro, Arturo Montejo Ráez and María Teresa Martín Valdivia
Puntuguese: A Corpus of Puns in Portuguese with Micro-edits
[Slides] [Video]
Marcio Lima Inacio, Gabriela Wick-Pedro, Renata Ramisch, Luís Espírito Santo, Xiomara S. Q. Chacon, Roney Santos, Rogério Sousa, Rafael Anchiêta and Hugo Goncalo Oliveira
DARIUS: A Comprehensive Learner Corpus for Argument Mining in German-Language Essays
[Video]
Nils-Jonathan Schaller, Andrea Horbach, Lars Ingver Höft, Yuning Ding, Jan Luca Bahr, Jennifer Meyer and Thorben Jansen
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
[Video]
Jing Han Sun and Ali Emami
PyRater: A Python Toolkit for Annotation Analysis
[Video]
Angelo Basile, Marc Franco-Salvador and Paolo Rosso
A Tulu Resource for Machine Translation
[Poster] [Slides] [Video]
Manu Narayanan and Noëmi Aepli
DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark
[Slides] [Video]
Enes Yavuz Ugan, Ngoc-Quan Pham and Alexander Waibel
PolyNERE: A Novel Ontology and Corpus for Named Entity Recognition and Relation Extraction in Polymer Science Domain
[Video]
Van-Thuy Phi, Hiroki Teranishi, Yuji Matsumoto, Hiroyuki Oka and Masashi Ishii
GIL-GALaD: Gender Inclusive Language - German Auto-Assembled Large Database
[Slides] [Video]
Anna-Katharina Dick, Matthias Drews, Valentin Pickard and Victoria Pierz
Mathematical Entities: Corpora and Benchmarks
[Slides] [Video]
Jacob Collard, Valeria de Paiva and Eswaran Subrahmanian
Language Variety Identification with True Labels
[Slides] [Video]
Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair and Yash Mahesh Bangera
FoRC4CL: A Fine-grained Field of Research Classification and Annotated Dataset of NLP Articles
[Poster] [Slides] [Video]
Raia Abu Ahmad, Ekaterina Borisova and Georg Rehm
FRACAS: a FRench Annotated Corpus of Attribution relations in newS
[Slides] [Video]
Ange Richard, Laura Cristina Alonzo Canul and François Portet
Text Filtering Classifiers for Medium-Resource Languages
[Poster] [Slides] [Video]
Jón Daðason and Hrafn Loftsson
I Remember You!: SUI Corpus for Remembering and Utilizing Users’ Information in Chat-oriented Dialogue Systems
[Slides] [Video]
Yuiko Tsunomori and Ryuichiro Higashinaka
11:00 - 12:40D2-S2-P5 - Information Extraction, Knowledge Extraction, and Text Mining I (Chair: Enrica Troiano)
Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency
[Poster] [Slides] [Video]
Toyin D. Aguda, Suchetha Siddagangappa, Elena Kochkina, Simerjot Kaur, Dongsheng Wang and Charese Smiley
Schema-based Data Augmentation for Event Extraction
[Video]
Xiaomeng Jin and Heng Ji
DEIE: Benchmarking Document-level Event Information Extraction with a Large-scale Chinese News Dataset
[Video]
Yubing Ren, Yanan Cao, Hao Li, Yingjie Li, Zixuan ZM Ma, Fang Fang, Ping Guo and Wei Ma
Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction
[Slides] [Video]
Guozheng Li, Wenjun Ke, Peng Wang, Zijie Xu, Ke Ji, Jiajun Liu, Ziyu Shang and Qiqing Luo
Distill, Fuse, Pre-train: Towards Effective Event Causality Identification with Commonsense-Aware Pre-trained Model
[Poster] [Slides] [Video]
Peixin Huang, Xiang Zhao, Minghao Hu, Zhen Tan and Weidong Xiao
Few-Shot Multimodal Named Entity Recognition Based on Multimodal Causal Intervention Graph
[Slides] [Video]
Feihong Lu, Xiaocui Yang, Qian Li, Qingyun Sun, Ke Jiang, Cheng Ji and Jianxin Li
Domain-aware and Co-adaptive Feature Transformation for Domain Adaption Few-shot Relation Extraction
[Slides] [Video]
Yijun Liu, Feifei Dai, Xiaoyan Gu, Minghui Zhai, Bo Li and Meiou Zhang
JRC-Names-Retrieval: A Standardized Benchmark for Name Search
[Slides] [Video]
Philip Blair and Kfir Bar
Knowledge Triplets Derivation from Scientific Publications via Dual-Graph Resonance
[Poster] [Slides] [Video]
Kai Zhang, Pengcheng Li, Kaisong Song, Xurui Li, Yangyang Kang, Xuhong Zhang and Xiaozhong Liu
Prototype-based Prompt-Instance Interaction with Causal Intervention for Few-shot Event Detection
[Slides] [Video]
Jingyao Tang, Lishuang Li, Hongbin Lu, Xueyang Qin, Beibei Zhang and Haiming Wu
SciDMT: A Large-Scale Corpus for Detecting Scientific Mentions
[Poster] [Slides] [Video]
Huitong Pan, Qi Zhang, Cornelia Caragea, Eduard Dragut and Longin Jan Latecki
Character-level Language Models for Abbreviation and Long-form Detection
[Poster] [Slides] [Video]
Leonardo Zilio, Shenbin Qian, Diptesh Kanojia and Constantin Orasan
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
[Slides] [Video]
Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang and Jihun Choi
Labeling Results of Topic Models: Word Sense Disambiguation as Key Method for Automatic Topic Labeling with GermaNet
[Poster] [Slides] [Video]
Jennifer Ecker
CAGK: Collaborative Aspect Graph Enhanced Knowledge-based Recommendation
[Slides] [Video]
Xiaotong Song, Huiping Lin, Jiatao Zhu and Xinyi Gong
What Is Needed for Intra-document Disambiguation of Math Identifiers?
[Slides] [Video]
Takuto Asakura and Yusuke Miyao
11:00 - 12:40D2-S2-P5 - Multilinguality, Machine Translation, and Translation Aids I (Chair: Enrica Troiano)
Pluggable Neural Machine Translation Models via Memory-augmented Adapters
[Poster] [Video]
Yuzhuang Xu, Shuo Wang, Peng Li, Xuebo Liu, Xiaolong Wang, Weidong Liu and Yang Liu
Improving Cross-lingual Transfer with Contrastive Negative Learning and Self-training
[Slides] [Video]
Guanlin Li, Xuechen Zhao, Amir Jafari, Wenhao Shao, Reza Farahbakhsh and Noel Crespi
Applying Transfer Learning to German Metaphor Prediction
[Poster] [Slides] [Video]
Maria Berger, Sebastian Michael Reimann and Nieke Marie Kiwitt
MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation
[Slides] [Video]
Jianhui Pang, Baosong Yang, Derek F. Wong, Dayiheng Liu, Xiangpeng Wei, Jun Xie and Lidia S. Chao
Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean
[Slides] [Video]
Dojun Park and Sebastian Padó
ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages Using Wikidata
[Video]
Jonne Sälevä and Constantine Lignos
Unmasking Biases: Exploring Gender Bias in English-Catalan Machine Translation through Tokenization Analysis and Novel Dataset
[Slides] [Video]
Audrey Mash, Carlos Escolano, Aleix Sant, Maite Melero and Francesca de Luca Fornaciari
Gradient Consistency-based Parameter Allocation for Multilingual Neural Machine Translation
[Slides] [Video]
Wenshuai Huo, Xiaocheng Feng, Yichong Huang, Chengpeng Fu, Hui Wang and Bing Qin
The Effects of Pretraining in Video-Guided Machine Translation
[Slides] [Video]
Ammon Shurtz, Lawry Sorenson and Stephen D. Richardson
UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding
[Poster] [Slides] [Video]
Dongyang Li, Taolin Zhang, Jiali Deng, Longtao Huang, Chengyu Wang, Xiaofeng He and Hui Xue
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
[Poster] [Slides] [Video]
Jian Yang, Hongcheng Guo, Yuwei Yin, Jiaqi Bai, Bing Wang, Jiaheng Liu, Xinnian Liang, LinZheng Chai, Liqun Yang and Zhoujun Li
A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models
[Video]
Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong and Longyue Wang
Can Large Language Models Learn Translation Robustness from Noisy-Source In-context Demonstrations?
[Slides] [Video]
Leiyu Pan, Yongqi Leng and Deyi Xiong
Improving Unsupervised Neural Machine Translation via Training Data Self-Correction
[Poster] [Slides] [Video]
Jinliang Lu and Jiajun Zhang
An Empirical Study on the Robustness of Massively Multilingual Neural Machine Translation
[Video]
Supryadi Supryadi, Leiyu Pan and Deyi Xiong
11:00 - 12:40D2-S2-P5 - Parsing, Tagging, Chunking, Grammar, Syntax, Morphosyntax, Morphology (Chair: Enrica Troiano)
New Proposal of Greenberg’s Universal 14 from Typometrics
[Video]
Antoni Brosa-Rodríguez and Sylvain Kahane
NLPre: A Revised Approach towards Language-centric Benchmarking of Natural Language Preprocessing Systems
[Slides] [Video]
Martyna Wiącek, Piotr Rybak, Łukasz Pszenny and Alina Wróblewska
Integrating Headedness Information into an Auto-generated Multilingual CCGbank for Improved Semantic Interpretation
[Slides] [Video]
Tu-Anh Tran and Yusuke Miyao
Analyzing the Understanding of Morphologically Complex Words in Large Language Models
[Poster] [Slides] [Video]
Marion Weller-Di Marco and Alexander Fraser
Chinese Morpheme-informed Evaluation of Large Language Models
[Video]
Yaqi Yin, Yue Wang and Yang Liu
OOVs in the Spotlight: How to Inflect Them?
[Slides] [Video]
Tomáš Sourada, Jana Straková and Rudolf Rosa
Parsing Headed Constituencies
[Video]
Katarzyna Krasnowska-Kieraś and Marcin Woliński
Evaluating Shortest Edit Script Methods for Contextual Lemmatization
[Poster] [Slides] [Video]
Olia Toporkov and Rodrigo Agerri
A Computational Model of Latvian Morphology
[Slides] [Video]
Peteris Paikens, Lauma Pretkalniņa and Laura Rituma
Gramble: A Tabular Programming Language for Collaborative Linguistic Modeling
[Slides] [Video]
Patrick Littell, Darlene Stewart, Fineen Davis, Aidan Pine and Roland Kuhn
Arabic Diacritization Using Morphologically Informed Character-Level Model
[Slides] [Video]
Muhammad Morsy Elmallah, Mahmoud Reda, Kareem Darwish, Abdelrahman El-Sheikh, Ashraf Hatim Elneima, Murtadha Aljubran, Nouf Alsaeed, Reem Mohammed and Mohamed Al-Badrashiny
Palmyra 3.0: A User-Friendly Cloud-Based Platform for Morphology and Dependency Syntax Annotation
[Poster] [Video]
Muhammed AbuOdeh, Long Phan, Ahmed Farouk Zakaria Elshabrawy and Nizar Habash
End-to-end Parsing of Procedural Text into Flow Graphs
[Video]
Dhaivat J. Bhatt, Seyed Ahmad Abdollahpouri Hosseini, Federico Fancellu and Afsaneh Fazly
11:00 - 12:40D2-S2-P5 - Social Media Processing (Chair: Enrica Troiano)
ESDM: Early Sensing Depression Model in Social Media Streams
[Poster] [Slides] [Video]
Bichen Wang, Yuzhe Zi, Yanyan Zhao, Pengfei Deng and Bing Qin
Exploring the Emotional Dimension of French Online Toxic Content
[Poster] [Slides] [Video]
Valentina Dragos, Delphine Battistelli, Fatou Sow and Aline Etienne
Who Is Bragging More Online? A Large Scale Analysis of Bragging in Social Media
[Poster] [Slides] [Video]
Mali Jin, Daniel Preotiuc-Pietro, A. Seza Doğruöz and Nikolaos Aletras
Identifying Fine-grained Depression Signs in Social Media Posts
[Slides] [Video]
Augusto R. Mendes and Helena Caseli
Metaphors in Online Religious Communication: A Detailed Dataset and Cross-Genre Metaphor Detection
[Slides] [Video]
Sebastian Reimann and Tatjana Scheffler
AcnEmpathize: A Dataset for Understanding Empathy in Dermatology Conversations
[Video]
Gyeongeun Lee and Natalie Parde
Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media
[Video]
Gregorios Katsios, Ning Sa, Ankita Bhaumik and Tomek Strzalkowski
Social Convos: Capturing Agendas and Emotions on Social Media
[Slides] [Video]
Ankita Bhaumik, Ning Sa, Gregorios Katsios and Tomek Strzalkowski
Knowledge Graphs for Real-World Rumour Verification
[Slides] [Video]
John Dougrez-Lewis, Elena Kochkina, Maria Liakata and Yulan He
Classifying Social Media Users before and after Depression Diagnosis via Their Language Usage: A Dataset and Study
[Poster] [Slides] [Video]
Falwah Alhamed, Julia Ive and Lucia Specia
BigNLI: Native Language Identification with Big Bird Embeddings
[Slides] [Video]
Sergey Kramp, Giovanni Cassani and Chris Emmery
DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
[Slides] [Video]
Lorenzo Lupo, Paul Bose, Mahyar Habibi, Dirk Hovy and Carlo Schwarz
Can We Identify Stance without Target Arguments? A Study for Rumour Stance Classification
[Video]
Yue Li and Carolina Scarton
MentalHelp: A Multi-Task Dataset for Mental Health in Social Media
[Slides] [Video]
Nishat Raihan, Sadiya Sayara Chowdhury Puspo, Shafkat Farabi, Ana-Maria Bucur, Tharindu Ranasinghe and Marcos Zampieri
Annotations for Exploring Food Tweets from Multiple Aspects
[Poster] [Slides] [Video]
Matiss Rikters, Rinalds Vīksna and Edison Marrese-Taylor
12:40 - 13:20Antonio Zampolli Prize Talk - Chair: Nicoletta Calzolari
[Video]
13:20 - 14:40Lunch
14:40 - 15:20Invited Local Talk: Michele Loporcaro - Chair: Alessandro Lenci
The Language Landscape of Italy as a Linguistic Data Mine
[Video]
D2-S3-R1 - Applications Involving LRs and Evaluation IV (Chair: Natalie Parde)
15:30 - 15:50Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence
[Slides] [Video]
Anthony James Hughes and Xingyi Song
15:50 - 16:10Generating Clarification Questions for Disambiguating Contracts
[Slides] [Video]
Anmol Singhal, Chirag Jain, Preethu Rose Anish, Arkajyoti Chakraborty and Smita Ghaisas
16:10 - 16:30Hierarchical Graph Convolutional Network Approach for Detecting Low-Quality Documents
[Slides] [Video]
Jaeyoung Lee, Joonwon Jang and Misuk Kim
16:30 - 16:50HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
[Slides] [Video]
Juraj Vladika, Phillip Schneider and Florian Matthes
16:50 - 17:10Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling
[Slides] [Video]
Yida Mu, Chun Dong, Kalina Bontcheva and Xingyi Song
D2-S3-R2 - Less-Resourced/Endangered/Less-studied Languages II (Chair: Lauriane Aufrant)
15:30 - 15:50Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets
[Slides] [Video]
Shadi Manafi and Nikhil Krishnaswamy
15:50 - 16:10Pruning before Fine-tuning: A Retraining-free Compression Framework for Pre-trained Language Models
[Slides] [Video]
Pingjie Wang, Hongcheng Liu, Yanfeng Wang and Yu Wang
16:10 - 16:30Empowering Oneida Language Revitalization: Development of an Oneida Verb Conjugator
[Slides] [Video]
Yanfei Lu, Patrick Littell and Keren Rice
16:30 - 16:50Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages
[Slides] [Video]
Daan van Esch, Sandy Ritchie, Sebastian Ruder, Julia Kreutzer, Clara Rivera, Ishank Saxena and Isaac Caswell
16:50 - 17:10Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish
[Slides] [Video]
Recep Firat Cekinel, Çağrı Çöltekin and Pinar Karagoz
D2-S3-R3 - Evaluation and Validation Methodologies II (Chair: John Ortega)
15:30 - 15:50DanteLLM: Let’s Push Italian LLM Research Forward!
[Video]
Andrea Bacciu, Cesare Campagnano, Giovanni Trappolini and Fabrizio Silvestri
15:50 - 16:10Halwasa: Quantify and Analyze Hallucinations in Large Language Models: Arabic as a Case Study
[Slides] [Video]
Hamdy Mubarak, Hend Al-Khalifa and Khaloud Suliman Alkhalefah
16:10 - 16:30SaGE: Evaluating Moral Consistency in Large Language Models
[Slides] [Video]
Vamshi Krishna Bonagiri, Sreeram Vennam, Priyanshul Govil, Ponnurangam Kumaraguru and Manas Gaur
16:30 - 16:50PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search
[Slides] [Video]
Minzheng Wang, Nan Xu, Jiahao Zhao, Yin Luo and Wenji Mao
16:50 - 17:10LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
[Video]
Chuang Liu, Renren Jin, Yuqi Ren and Deyi Xiong
D2-S3-R4 - Document Classification, Information Retrieval and Cross-lingual Retrieval I (Chair: Giorgio Maria Di Nunzio)
15:30 - 15:50Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer
[Slides] [Video]
Pranav Arora, Selen Pehlivan and Jorma Laaksonen
15:50 - 16:10Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification
[Slides] [Video]
Zihan Wang, Peiyi Wang and Houfeng Wang
16:10 - 16:30MAGIC: Multi-Argument Generation with Self-Refinement for Domain Generalization in Automatic Fact-Checking
[Slides] [Video]
Wei-Yu Kao and An-Zi Yen
16:30 - 16:50Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification
[Slides] [Video]
Zhipeng Xie and Yahe Li
16:50 - 17:10ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights
[Slides] [Video]
Santosh T.Y.S.S., Rashid Haddad and Matthias Grabmair
D2-S3-R5 - Parsing, Tagging, Chunking, Grammar, Syntax, Morphosyntax, Morphology I (Chair: Kaja Dobrovoljc)
15:30 - 15:50Null Subjects in Spanish as a Machine Translation Problem
[Slides] [Video]
Jose Diego Suarez and Luis Chiruzzo
15:50 - 16:10Bits and Pieces: Investigating the Effects of Subwords in Multi-task Parsing across Languages and Domains
[Slides] [Video]
Daniel Dakota and Sandra Kübler
16:10 - 16:30Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing
[Slides] [Video]
Jiangming Liu
16:30 - 16:50Improving Implicit Discourse Relation Recognition with Semantics Confrontation
[Slides] [Video]
Mingyang Cai, Zhen Yang and Ping Jian
16:50 - 17:10UDMorph: Morphosyntactically Tagged UD Corpora
[Slides] [Video]
Maarten Janssen
D2-S3-R6 - Digital Humanities and Cultural Heritage (Chair: Jaap Kamps)
15:30 - 15:50CHisIEC: An Information Extraction Corpus for Ancient Chinese History
[Slides] [Video]
Xuemei Tang, Qi Su, Jun Wang and Zekun Deng
15:50 - 16:10A Lightweight Approach to a Giga-Corpus of Historical Periodicals: The Story of a Slovenian Historical Newspaper Collection
[Slides] [Video]
Filip Dobranić, Bojan Evkoski and Nikola Ljubešić
16:10 - 16:30From Text to Historical Ecological Knowledge: The Construction and Application of the Shan Jing Knowledge Base
[Slides] [Video]
Ke Liang, Chu-Ren Huang and Xin-Lan Jiang
16:30 - 16:50Reconstruction of Cuneiform Literary Texts as Text Matching
[Slides] [Video]
Fabian Simonjetz, Jussi Laasonen, Yunus Cobanoglu, Alexander Fraser and Enrique Jiménez
16:50 - 17:10At the Crossroad of Cuneiform and NLP: Challenges for Fine-grained Part-of-speech Tagging
[Slides] [Video]
Gustav Ryberg Smidt, Els Lefever and Katrien de Graef
D2-S3-R7 - Special Session Industrial Track III
15:30 - 15:50L2 for Language Nerds: Tapping into Natural Intelligence in the Age of AI
15:50 - 16:10It's all about transparency
16:10 - 16:30Challenges, needs, approaches and points of view for industry grade AI
16:30 - 16:50TALIA presents three research and innovative running projects
16:50 - 17:10NLP initiatives in Intesa Sanpaolo
15:30 - 17:10D2-S3-P6 - Corpora and Annotation III (Chair: Maja Buljan)
QueryNER: Segmentation of E-commerce Queries
[Slides] [Video]
Chester Palen-Michel, Lizzie Liang, Zhe Wu and Constantine Lignos
KazParC: Kazakh Parallel Corpus for Machine Translation
[Poster] [Slides] [Video]
Rustem Yeshpanov, Alina Polonskaya and Huseyin Atakan Varol
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures
[Slides] [Video]
Agrima Seth, Sanchit Ahuja, Kalika Bali and Sunayana Sitaram
FastSpell: The LangId Magic Spell
[Poster] [Slides] [Video]
Marta Bañón, Gema Ramírez-Sánchez, Jaume Zaragoza-Bernabeu and Sergio Ortiz Rojas
Automatic Extraction of Nominal Phrases from German Learner Texts of Different Proficiency Levels
[Slides] [Video]
Ronja Laarmann-Quante, Marco Müller and Eva Belke
Spanish Resource Grammar Version 2023
[Video]
Olga Zamaraeva, Lorena S. Allegue and Carlos Gómez-Rodríguez
Building Question-Answer Data Using Web Register Identification
[Slides] [Video]
Anni Eskelinen, Amanda Myntti, Erik Henriksson, Sampo Pyysalo and Veronika Laippala
RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts
[Slides] [Video]
Hongzheng Li, Ruojin Wang, Ge Shi, Xing Lv, Lei Lei, Chong Feng, Fang Liu, Jinkun Lin, Yangguang Mei and Linnan Xu
Can Humans Identify Domains?
[Video]
Maria Barrett, Max Müller-Eberstein, Elisa Bassignana, Amalie Brogaard Pauli, Mike Zhang and Rob van der Goot
A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages
[Poster] [Video]
Lisa Raithel, Hui-Syuan Yeh, Shuntaro Yada, Cyril Grouin, Thomas Lavergne, Aurélie Névéol, Patrick Paroubek, Philippe Thomas, Tomohiro Nishiyama, Sebastian Möller, Eiji Aramaki, Yuji Matsumoto, Roland Roller and Pierre Zweigenbaum
Text2Story Lusa: A Dataset for Narrative Analysis in European Portuguese News Articles
[Poster] [Slides] [Video]
Sérgio Nunes, Alípio Mario Jorge, Evelin Amorim, Hugo Sousa, António Leal, Purificação Moura Silvano, Inês Cantante and Ricardo Campos
PSE v1.0: The First Open Access Corpus of Public Service Encounters
[Video]
Ingrid Espinoza, Steffen Frenzel, Laurin Friedrich, Wassiliki Siskou, Steffen Eckhard and Annette Hautli-Janisz
A Benchmark for Recipe Understanding in Artificial Agents
[Video]
Jens Nevens, Robin de Haes, Rachel Ringe, Mihai Pomarlan, Robert Porzel, Katrien Beuls and Paul van Eecke
JaParaPat: A Large-Scale Japanese-English Parallel Patent Application Corpus
[Slides] [Video]
Masaaki Nagata, Makoto Morishita, Katsuki Chousa and Norihito Yasuda
FinCorpus-DE10k: A Corpus for the German Financial Domain
[Poster] [Video]
Serhii Hamotskyi, Nata Kozaeva and Christian Hänig
M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets
[Slides] [Video]
Gaurish Thakkar, Sherzod Hakimov and Marko Tadić
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
[Poster] [Slides] [Video]
Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao and Min Zhang
FAIRification of LeiLanD
[Video]
Eric Sanders, Sara Petrollino, Gilles R. Scheifer, Henk van den Heuvel and Christopher Handy
A Linguistically-Informed Annotation Strategy for Korean Semantic Role Labeling
[Poster] [Slides] [Video]
Yige Chen, KyungTae Lim and Jungyeul Park
An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation
[Poster] [Slides] [Video]
Ahmet Gunduz, Kamer Ali Yuksel, Kareem Darwish, Golara Javadi, Fabio Minazzi, Nicola Sobieski and Sébastien Bratières
CSSWiki: A Chinese Sentence Simplification Dataset with Linguistic and Content Operations
[Video]
Fengkai Liu and John S. Y. Lee
15:30 - 17:10D2-S3-P6 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction I (Chair: Maja Buljan)
Combining Discourse Coherence with Large Language Models for More Inclusive, Equitable, and Robust Task-Oriented Dialogue
[Slides] [Video]
Katherine Atwell, Mert Inan, Anthony B. Sicilia and Malihe Alikhani
Emstremo: Adapting Emotional Support Response with Enhanced Emotion-Strategy Integrated Selection
[Video]
Junlin Li, Bo Peng and Yu-Yin Hsu
Theoretical and Empirical Advantages of Dense-Vector to One-Hot Encoding of Intent Classes in Open-World Scenarios
[Slides] [Video]
Paulo Cavalin and Claudio Santos Pinhanez
BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation
[Slides] [Video]
Yuhong He, Yongqi Zhang, Shizhu He and Jun Wan
CTSM: Combining Trait and State Emotions for Empathetic Response Model
[Poster] [Slides] [Video]
Yufeng Wang, Chao Chen, Zhou Yang, Shuhui Wang and Xiangwen Liao
CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite
[Video]
Yifei Yuan, Chen Shi, Wang Runze, Liyi Chen, Renjun Hu, Zengming Zhang, Feijun Jiang and Wai Lam
Counterfactual Dialog Mixing as Data Augmentation for Task-Oriented Dialog Systems
[Slides] [Video]
Sebastian Steindl, Ulrich Schäfer and Bernd Ludwig
SPOTTER: A Framework for Investigating Convention Formation in a Visually Grounded Human-Robot Reference Task
[Slides] [Video]
Jaap Kruijt, Peggy van Minkelen, Lucia Donatelli, Piek T.J.M. Vossen, Elly Konijn and Thomas Baier
Persona-aware Multi-party Conversation Response Generation
[Slides] [Video]
Khyati Mahajan and Samira Shaikh
Chitchat as Interference: Adding User Backstories to Task-Oriented Dialogues
[Video]
Armand Stricker and Patrick Paroubek
Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units
[Slides] [Video]
Biswesh Mohapatra, Seemab Hassan, Laurent Romary and Justine Cassell
Modeling the Quality of Dialogical Explanations
[Poster] [Slides] [Video]
Milad Alshomary, Felix Lange, Meisam Booshehri, Meghdut Sengupta, Philipp Cimiano and Henning Wachsmuth
ADEA: An Argumentative Dialogue Dataset on Ethical Issues Concerning Future A.I. Applications
[Poster] [Video]
Christian Hauptmann, Adrian Krenzer, Antonia Krause and Frank Puppe
Characteristic AI Agents via Large Language Models
[Slides] [Video]
Xi Wang, Hongliang Dai, Shen Gao and Piji Li
Analysis of Sensation-transfer Dialogues in Motorsports
[Video]
Takeru Isaka, Atsushi Otsuka and Iwaki Toshima
Multi-Grained Conversational Graph Network for Retrieval-based Dialogue Systems
[Video]
Quan Tu, Chongyang Tao and Rui Yan
DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space
[Slides] [Video]
Jianxiang Xiang, Zhenhua Liu, Haodong Liu, Yin Bai, Jia Cheng and Wenliang Chen
OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog
[Poster] [Slides] [Video]
Adnen Abdessaied, Manuel Hochmeister and Andreas Bulling
Reimagining Intent Prediction: Insights from Graph-Based Dialogue Modeling and Sentence Encoders
[Slides] [Video]
Daria Romanovna Ledneva and Denis Pavlovich Kuznetsov
15:30 - 17:10D2-S3-P6 - Information Extraction, Knowledge Extraction, and Text Mining II (Chair: Maja Buljan)
MaintIE: A Fine-Grained Annotation Schema and Benchmark for Information Extraction from Maintenance Short Texts
[Video]
Tyler K. Bikaun, Tim French, Michael Stewart, Wei Liu and Melinda Hodkiewicz
Few-shot Named Entity Recognition via Superposition Concept Discrimination
[Slides] [Video]
Jiawei Chen, Hongyu Lin, Xianpei Han, Yaojie Lu, Shanshan Jiang, Bin Dong and Le Sun
Asking and Answering Questions to Extract Event-Argument Structures
[Slides] [Video]
Md Nayem Uddin, Enfa Rose George, Eduardo Blanco and Steven R. Corman
ToNER: Type-oriented Named Entity Recognition with Generative Language Model
[Video]
Guochao Jiang, Ziqin Luo, Yuchen Shi, Dixuan Wang, Jiaqing Liang and Deqing Yang
Temporal Knowledge Graph Reasoning with Dynamic Hypergraph Embedding
[Slides] [Video]
Xinyue Liu, Jianan Zhang, Chi Ma, Wenxin Liang, Bo Xu and Linlin Zong
CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English
[Slides] [Video]
Andrew Rueda, Elena Alvarez-Mellado and Constantine Lignos
EDEN: A Dataset for Event Detection in Norwegian News
[Slides] [Video]
Samia Touileb, Jeanett Murstad, Petter Mæhlum, Lubos Steskal, Lilja Charlotte Storset, Huiling You and Lilja Øvrelid
Improving Low-Resource Keyphrase Generation through Unsupervised Title Phrase Generation
[Video]
Byungha Kang and Youhyun Shin
Enriching a Time-Domain Astrophysics Corpus with Named Entity, Coreference and Astrophysical Relationship Annotations
[Video]
Atilla Kaan Alkan, Felix Grezes, Cyril Grouin, Fabian Schussler and Pierre Zweigenbaum
Active Learning Design Choices for NER with Transformers
[Poster] [Slides] [Video]
Robert Vacareanu, Enrique Noriega-Atala, Gus Hahn-Powell, Marco A. Valenzuela-Escarcega and Mihai Surdeanu
Retrieval-based Question Answering with Passage Expansion Using a Knowledge Graph
[Poster] [Slides] [Video]
Benno Kruit, Yiming Xu and Jan-Christoph Kalo
Deep Learning Based Named Entity Recognition Models for Recipes
[Slides] [Video]
Ayush Agarwal, Janak Kapuriya, Shubham Agrawal, Akhil Vamshi Konam, Mansi Goel, Rishabh Gupta, Shrey Rastogi, Niharika Niharika and Ganesh Bagler
A Novel Corpus of Annotated Medical Imaging Reports and Information Extraction Results Using BERT-based Language Models
[Poster] [Slides] [Video]
Namu Park, Kevin Lybarger, Giridhar Kaushik Ramachandran, Spencer Lewis, Aashka Damani, Özlem Uzuner, Martin Gunn and Meliha Yetisgen
Topic Detection and Tracking with Time-Aware Document Embeddings
[Video]
Hang Jiang, Doug Beeferman, Weiquan Mao and Deb Roy
Information Extraction with Differentiable Beam Search on Graph RNNs
[Slides] [Video]
Niama El Khbir, Nadi Tomeh and Thierry Charnois
Guided Distant Supervision for Multilingual Relation Extraction Data: Adapting to a New Language
[Video]
Alistair Plum, Tharindu Ranasinghe and Christoph Purschke
15:30 - 17:10D2-S3-P6 - Lexicon and Semantics II (Chair: Maja Buljan)
J-SNACS: Adposition and Case Supersenses for Japanese Joshi
[Slides] [Video]
Tatsuya Aoyama, Chihiro Taguchi and Nathan Schneider
Framed Multi30K: A Frame-Based Multimodal-Multilingual Dataset
[Poster] [Slides] [Video]
Marcelo Viridiano, Arthur Lorenzi, Tiago Timponi Torrent, Ely E. Matos, Adriana S. Pagano, Natália Sathler Sigiliano, Maucha Gamonal, Helen de Andrade Abreu, Lívia Vicente Dutra, Mairon Samagaio, Mariane Carvalho, Franciany Campos, Gabrielly Azalim, Bruna Mazzei, Mateus Fonseca de Oliveira, Ana Carolina Luz, Livia Padua Ruiz, Júlia Bellei, Amanda Pestana, Josiane Costa, Iasmin Rabelo, Anna Beatriz Silva, Raquel Roza, Mariana Souza Mota, Igor Oliveira and Márcio Henrique Pelegrino de Freitas
AMenDeD: Modelling Concepts by Aligning Mentions, Definitions and Decontextualised Embeddings
[Slides] [Video]
Amit Gajbhiye, Zied Bouraoui, Luis Espinosa Anke and Steven Schockaert
The ELCo Dataset: Bridging Emoji and Lexical Composition
[Slides] [Video]
Zi Yun Yang, Ziqing Zhang and Yisong Miao
Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
[Slides] [Video]
Zichen Wu, Hsiu-Yuan Huang, Fanyi Qu and Yunfang Wu
Are Large Language Models Good at Lexical Semantics? A Case of Taxonomy Learning
[Slides] [Video]
Viktor Moskvoretskii, Alexander Panchenko and Irina Nikishina
Comparing Static and Contextual Distributional Semantic Models on Intrinsic Tasks: An Evaluation on Mandarin Chinese Datasets
[Slides] [Video]
A Pranav, Yan Cong, Emmanuele Chersoni, Yu-Yin Hsu and Alessandro Lenci
Towards Standardized Annotation and Parsing for Korean FrameNet
[Poster] [Slides] [Video]
Yige Chen, Jae Ihn, KyungTae Lim and Jungyeul Park
What Can Diachronic Contexts and Topics Tell Us about the Present-Day Compositionality of English Noun Compounds?
[Poster] [Slides] [Video]
Samin Mahdizadeh Sani, Malak Rassem, Chris W. Jenkins, Filip Miletić and Sabine Schulte im Walde
Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset
[Slides] [Video]
Ivana Filipović Petrović, Miguel López Otal and Slobodan Beliga
Loflòc: A Morphological Lexicon for Occitan using Universal Dependencies
[Poster] [Video]
Marianne Vergez-Couret, Myriam Bras, Aleksandra Miletić and Clamença Poujade
Negation Scope Conversion: Towards a Unified Negation-Annotated Dataset
[Poster] [Slides] [Video]
Asahi Yoshida, Yoshihide Kato and Shigeki Matsubara
Qabas: An Open-Source Arabic Lexicographic Database
[Slides] [Video]
Mustafa Jarrar and Tymaa Hasanain Hammouda
Transformer-based Swedish Semantic Role Labeling through Transfer Learning
[Poster] [Video]
Dana Dannélls, Richard Johansson and Lucy Yang Buhr
Automatic Animacy Classification for Romanian Nouns
[Slides] [Video]
Maria Tepei and Jelke Bloem
Textual Coverage of Eventive Entries in Lexical Semantic Resources
[Poster] [Slides] [Video]
Eva Fučíková, Cristina Fernández Alcaina, Jan Hajič and Zdeňka Urešová
Exploring Interpretability of Independent Components of Word Embeddings with Automated Word Intruder Test
[Video]
Tomáš Musil and David Mareček
15:30 - 17:10D2-S3-P6 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation II (Chair: Maja Buljan)
CEPT: A Contrast-Enhanced Prompt-Tuning Framework for Emotion Recognition in Conversation
[Poster] [Slides] [Video]
Qingqing Gao, Jiuxin Cao, Biwei Cao, Xin Guan and Bo Liu
Opinion Mining Using Pre-Trained Large Language Models: Identifying the Type, Polarity, Intensity, Expression, and Source of Private States
[Poster] [Slides] [Video]
Saeed Ahmadnia, Arash Yousefi Jordehi, Mahsa Hosseini Khasheh Heyran, SeyedAbolghasem Mirroshandel and Owen Rambow
A Hybrid Approach to Aspect Based Sentiment Analysis Using Transfer Learning
[Slides] [Video]
Gaurav Negi, Rajdeep Sarkar, Omnia Zayed and Paul Buitelaar
Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning
[Slides] [Video]
Maksym Taranukhin, Vered Shwartz and Evangelos Milios
FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis
[Video]
Songhua Yang, Xinke Jiang, Hanjie Zhao, Wenxuan Zeng, Hongde Liu and Yuxiang Jia
STAGE: Simple Text Data Augmentation by Graph Exploration
[Slides] [Video]
Ho-Seung Kim, YongHoon Kang and Jee-Hyong Lee
CAM 2.0: End-to-End Open Domain Comparative Question Answering System
[Poster] [Slides] [Video]
Ahmad Shallouf, Hanna Herasimchyk, Mikhail Salnikov, Rudy Alexandro Garrido Veliz, Natia Mestvirishvili, Alexander Panchenko, Chris Biemann and Irina Nikishina
Modelling Argumentation for an User Opinion Aggregation Tool
[Video]
Pablo Weingart, Thiemo Wambsganss and Matthias Soellner
KPatch: Knowledge Patch to Pre-trained Language Model for Zero-Shot Stance Detection on Social Media
[Poster] [Slides] [Video]
Shuohao Lin, Wei Chen, Yunpeng Gao, Zhishu Jiang, Mengqi Liao, Zhiyu Zhang, Shuyuan Zhao and Huaiyu Wan
Argument Quality Assessment in the Age of Instruction-Following Large Language Models
[Slides] [Video]
Henning Wachsmuth, Gabriella Lapesa, Elena Cabrio, Anne Lauscher, Joonsuk Park, Eva Maria Vecchi, Serena Villata and Timon Ziegenbein
TACO – Twitter Arguments from COnversations
[Poster] [Video]
Marc Feger and Stefan Dietze
Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish
[Slides] [Video]
Marta Lango, Borys Naglik, Mateusz Lango and Iwo Naglik
A Corpus for Sentence-Level Subjectivity Detection on English News Articles
[Poster] [Slides] [Video]
Francesco Antici, Federico Ruggeri, Andrea Galassi, Katerina Korre, Arianna Muti, Alessandra Bardi, Alice Fedotova and Alberto Barrón-Cedeño
Learning Strategies for Robust Argument Mining: An Analysis of Variations in Language and Domain
[Slides] [Video]
Ramon Ruiz-Dolz, Chr-Jr Chiu, Chung-Chi Chen, Noriko Kando and Hsin-Hsi Chen
Segmentation of Complex Question Turns for Argument Mining: A Corpus-based Study in the Financial Domain
[Video]
Giulia D’Agostino, Chris A. Reed and Daniele Puccinelli
EMOLIS App and Dataset to Find Emotionally Close Cartoons
[Slides] [Video]
Soëlie Lerch, Patrice Bellot, Elisabeth Murisasco and Emmanuel Bruno
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments
[Slides] [Video]
Huadai Liu, Xu Wenqiang, Xuan Lin, Jingjing Huo, Hong Chen and Zhou Zhao
17:10 - 17:30Coffee break
D2-S4-R1 - Corpora and Annotation V (Chair: Miquel Esplà)
17:30 - 17:50Constructing Korean Learners’ L2 Speech Corpus of Seven Languages for Automatic Pronunciation Assessment
[Slides] [Video]
Seunghee Han, Sunhee Kim and Minhwa Chung
17:50 - 18:10CHICA: A Developmental Corpus of Child-Caregiver’s Face-to-face vs. Video Call Conversations in Middle Childhood
[Video]
Dhia Elhak Goumri, Abhishek Agrawal, Mitja Nikolaus, Hong Duc Thang Vu, Kübra Bodur, Elias Emmar, Cassandre Armand, Chiara Mazzocconi, Shreejata Gupta, Laurent Prévot, Benoit Favre, Leonor Becerra-Bonache and Abdellah Fourtassi
18:10 - 18:30Schema Learning Corpus: Data and Annotation Focused on Complex Events
[Slides] [Video]
Song Chen, Jennifer Tracey, Ann Bies and Stephanie Strassel
18:30 - 18:50Language and Speech Technology for Central Kurdish Varieties
[Video]
Sina Ahmadi, Daban Jaff, Md Mahfuz Ibn Alam and Antonios Anastasopoulos
18:50 - 19:10FUSE - FrUstration and Surprise Expressions: A Subtle Emotional Multimodal Language Corpus
[Video]
Rajesh Titung and Cecilia Ovesdotter Alm
D2-S4-R2 - Information Extraction, Knowledge Extraction, and Text Mining II (Chair: Elena Cabrio)
17:30 - 17:50Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks
[Video]
Manuel V. Loureiro, Steven Derby and Tri Kurniawan Wijaya
17:50 - 18:10Synergetic Interaction Network with Cross-task Attention for Joint Relational Triple Extraction
[Slides] [Video]
Da Luo, Run Lin, Qiao Liu, Yuxiang Cai, Xueyi Liu, Yanglei Gan and Rui Hou
18:10 - 18:30Leveraging Information Redundancy of Real-World Data through Distant Supervision
[Slides] [Video]
Ariel Cohen, Alexandrine Lanson, Emmanuelle Kempf and Xavier Tannier
18:30 - 18:50Event Extraction in Basque: Typologically Motivated Cross-Lingual Transfer-Learning Analysis
[Slides] [Video]
Mikel Zubillaga, Oscar Sainz, Ainara Estarrona, Oier Lopez de Lacalle and Eneko Agirre
18:50 - 19:10Selective Temporal Knowledge Graph Reasoning
[Video]
Zhongni Hou, Xiaolong Jin, Zixuan Li, Long Bai, Jiafeng Guo and Xueqi Cheng
D2-S4-R3 - Machine Learning Models and Techniques for CL/NLP III (Chair: Sabine Schulte im Walde)
17:30 - 17:50LoNAS: Elastic Low-Rank Adapters for Efficient Large Language Models
[Video]
Juan Pablo Munoz, Jinjie Yuan, Yi Zheng and Nilesh Jain
17:50 - 18:10Contextual Modeling for Document-level ASR Error Correction
[Slides] [Video]
Jin Jiang, Xunjian Yin, Xiaojun Wan, Wei Peng, Rongjun Li, Jingyuan Yang and Yanquan Zhou
18:10 - 18:30SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models
[Slides] [Video]
Zekun Wang, Jingchang Chen, Wangchunshu Zhou, Haichao Zhu, Jiafeng Liang, Liping Shan, Ming Liu, Dongliang Xu, Qing Yang and Bing Qin
18:30 - 18:50Automatic Identification of COVID-19-Related Conspiracy Narratives in German Telegram Channels and Chats
[Slides] [Video]
Philipp Heinrich, Andreas Blombach, Bao Minh Doan Dang, Leonardo Zilio, Linda Havenstein, Nathan Dykes, Stephanie Evert and Fabian Schäfer
18:50 - 19:10Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification
[Video]
Muhammad ElNokrashy, Badr AlKhamissi and Mona Diab
D2-S4-R4 - Inference, Reasoning, Question Answering II (Chair: Lucia Passaro)
17:30 - 17:50Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies
[Slides] [Video]
Flavio Petruzzellis, Alberto Testolin and Alessandro Sperduti
17:50 - 18:10Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences
[Slides] [Video]
Sai Koneru, Jian Wu and Sarah Rajtmajer
18:10 - 18:30A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation
[Slides] [Video]
Li Yuan, Yi Cai, Haopeng Ren and Jiexin Wang
18:30 - 18:50HyperMR: Hyperbolic Hypergraph Multi-hop Reasoning for Knowledge-based Visual Question Answering
[Slides] [Video]
Bin Wang, Fuyong Xu, Peiyu Liu and Zhenfang Zhu
18:50 - 19:10Sequential and Repetitive Pattern Learning for Temporal Knowledge Graph Reasoning
[Slides] [Video]
Xuefei Li, Huiwei Zhou, Weihong Yao, Wenchu Li, Yingyu Lin and Lei Du
D2-S4-R5 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics I (Chair: Vito Pirrelli)
17:30 - 17:50A Computational Approach to Quantifying Grammaticization of English Deverbal Prepositions
[Slides] [Video]
Ryo Nagata, Yoshifumi Kawasaki, Naoki Otani and Hiroya Takamura
17:50 - 18:10ContrastWSD: Enhancing Metaphor Detection with Word Sense Disambiguation Following the Metaphor Identification Procedure
[Video]
Mohamad MZ Elzohbi and Richard Zhao
18:10 - 18:30Automatic Annotation of Grammaticality in Child-Caregiver Conversations
[Slides] [Video]
Mitja Nikolaus, Abhishek Agrawal, Petros Kaklamanis, Alex Warstadt and Abdellah Fourtassi
18:30 - 18:50Every Verb in Its Right Place? A Roadmap for Operationalizing Developmental Stages in the Acquisition of L2 German
[Slides] [Video]
Josef Ruppenhofer, Matthias Schwendemann, Annette Portmann, Katrin Wisniewski and Torsten Zesch
18:50 - 19:10Endowing Neural Language Learners with Human-like Biases: A Case Study on Dependency Length Minimization
[Slides] [Video]
Yuqing Zhang, Tessa Verhoef, Gertjan van Noord and Arianna Bisazza
D2-S4-R6 - Policy issues, Ethics, Legal Issues, Bias Analysis (Chair: Penny Labropoulou)
17:30 - 17:50Unpacking Bias: An Empirical Study of Bias Measurement Metrics, Mitigation Algorithms, and Their Interactions
[Slides] [Video]
Felipe Bravo-Marquez and Maria Jose Zambrano
17:50 - 18:10Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels
[Slides] [Video]
Panatchakorn Anantaprayoon, Masahiro Kaneko and Naoaki Okazaki
18:10 - 18:30The Ethical Question – Use of Indigenous Corpora for Large Language Models
[Slides] [Video]
Linda Wiechetek, Flammie A. Pirinen, Børre Gaup, Trond Trosterud, Maja Lisa Kappfjell and Sjur Moshagen
18:30 - 18:50Gendered Grammar or Ingrained Bias? Exploring Gender Bias in Icelandic Language Models
[Slides] [Video]
Steinunn Rut Friðriksdóttir and Hafsteinn Einarsson
18:50 - 19:10Analyzing Effects of Learning Downstream Tasks on Moral Bias in Large Language Models
[Slides] [Video]
Niklas Kiehne, Alexander Ljapunov, Marc Bätje and Wolf-Tilo Balke
17:30 - 17:50D2-S4-R7 - Special Session Industrial Track IV
17:30 - 19:10D2-S4-P7 - Document Classification, Information Retrieval and Cross-lingual Retrieval (Chair: Frederic Bechet)
Typos Correction Training against Misspellings from Text-to-Text Transformers
[Slides] [Video]
Guicai Xie, Ke Zhang, Lei Duan, Wei Zhang and Zeqian Huang
Language Models for Text Classification: Is In-Context Learning Enough?
[Poster] [Slides] [Video]
Aleksandra Edwards and Jose Camacho-Collados
Evaluating Topic Model on Asymmetric and Multi-Domain Financial Corpus
Corentin Masson and Patrick Paroubek
Domain Adaptation for Dense Retrieval and Conversational Dense Retrieval through Self-Supervision by Meticulous Pseudo-Relevance Labeling
[Slides] [Video]
Minghan Li and Eric Gaussier
BEIR-PL: Zero Shot Information Retrieval Benchmark for the Polish Language
[Video]
Konrad Wojtasik, Kacper Wołowiec, Vadim Shishkin, Arkadiusz Janz and Maciej Piasecki
Mapping Work Task Descriptions from German Job Ads on the O*NET Work Activities Ontology
[Slides] [Video]
Ann-Sophie Gnehm and Simon Clematide
Enhancing Writing Proficiency Classification in Developmental Education: The Quest for Accuracy
[Video]
Miguel Da Corte and Jorge Baptista
Silver Retriever: Advancing Neural Passage Retrieval for Polish Question Answering
[Slides] [Video]
Piotr Rybak and Maciej Ogrodniczuk
SuperST: Superficial Self-Training for Few-Shot Text Classification
[Slides] [Video]
Ju-Hyoung Lee, Joonghyuk Hahn, Hyeon-Tae Seo, Jiho Park and Yo-Sub Han
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding
[Slides] [Video]
Masato Fujitake
Detecting Impact Relevant Sections in Scientific Research
[Poster] [Slides] [Video]
Maria Becker, Kanyao Han, Antonina Werthmann, Rezvaneh Rezapour, Haejin Lee and Jana Diesner
Few-Shot Learning for Cold-Start Recommendation
[Video]
Mingming Li, Songlin Hu, Fuqing Zhu and Qiannan Zhu
Bridging the Code Gap: A Joint Learning Framework across Medical Coding Systems
[Video]
Geunyeong Jeong, Seokwon Jeong, Juoh Sun and Harksoo Kim
On an Intermediate Task for Classifying URL Citations on Scholarly Papers
[Slides] [Video]
Kazuhiro Wada, Masaya Tsunokake and Shigeki Matsubara
Contribution of Move Structure to Automatic Genre Identification: An Annotated Corpus of French Tourism Websites
[Slides] [Video]
Rémi Cardon, Trang Tran Hanh Pham, Julien Zakhia Doueihi and Thomas François
RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion
[Slides] [Video]
Sung-Min Lee, Eunhwan Park, DongHyeon Jeon, Inho Kang and Seung-Hoon Na
17:30 - 19:10D2-S4-P7 - Trustworthy, Interpretability, and Explainability of Neural Models (Chair: Frederic Bechet)
Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models
[Slides] [Video]
Julia Rozanova, Marco Valentino and André Freitas
The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement
[Slides] [Video]
Jonathan Kamp, Lisa Beinborn and Antske Fokkens
Harnessing the Power of Large Language Model for Uncertainty Aware Graph Processing
[Video]
Zhenyu Qian, Yiming Qian, Yuting Song, Fei Gao, Hai Jin, Chen Yu and Xia Xie
PAD: A Robustness Enhancement Ensemble Method via Promoting Attention Diversity
[Video]
Yuting Yang, Pei Huang, Feifei Ma, Juan Cao and Jintao Li
What Do Transformers Know about Government?
[Slides] [Video]
Jue Hou, Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu and Roman Yangarber
Code-Mixed Probes Show How Pre-Trained Models Generalise on Code-Switched Text
[Slides] [Video]
Frances Adriana Laureano De Leon, Harish Tayyar Madabushi and Mark Lee
MVP: Minimal Viable Phrase for Long Text Understanding
[Video]
Louis Clouatre, Amal Zouaq and Sarath Chandar
Backdoor NLP Models via AI-Generated Text
[Poster] [Slides] [Video]
Wei Du, Tianjie Ju, Ge Ren, GaoLei Li and Gongshen Liu
Towards a Framework for Evaluating Explanations in Automated Fact Verification
[Slides] [Video]
Neema Kotonya and Francesca Toni
When Do "More Contexts" Help with Sarcasm Recognition?
[Poster] [Slides] [Video]
Ojas Nimase and Sanghyun Hong
What Happens to a Dataset Transformed by a Projection-based Concept Removal Method?
[Poster] [Slides] [Video]
Richard Johansson
Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction
[Poster] [Video]
Masahiro Kaneko and Naoaki Okazaki
On the Scaling Laws of Geographical Representation in Language Models
[Slides] [Video]
Nathan Godey, Éric de la Clergerie and Benoît Sagot
Detecting Conceptual Abstraction in LLMs
[Poster] [Slides] [Video]
Michaela Regneri, Alhassan Abdelhalim and Soeren Laue
From Text to Source: Results in Detecting Large Language Model-Generated Content
[Slides] [Video]
Wissam Antoun, Benoît Sagot and Djamé Seddah
17:30 - 19:10D2-S4-P7 - Less-Resourced/Endangered/Less-studied Languages II (Chair: Frederic Bechet)
Evaluating the Potential of Language-family-specific Generative Models for Low-resource Data Augmentation: A Faroese Case Study
[Video]
Barbara Scalvini and Iben Nyholm Debess
RoBERTa Low Resource Fine Tuning for Sentiment Analysis in Albanian
[Poster] [Slides] [Video]
Krenare Pireva Nuci, Paul Landes and Barbara Di Eugenio
Constructing Indonesian-English Travelogue Dataset
[Slides] [Video]
Eunike Andriani Kardinata, Hiroki Ouchi and Taro Watanabe
Disentangling Pretrained Representation to Leverage Low-Resource Languages in Multilingual Machine Translation
[Poster] [Slides] [Video]
Frederikus Hudi, Zhi Qu, Hidetaka Kamigaito and Taro Watanabe
Towards Universal Dependencies for Ancash Quechua
[Video]
Johanna Cordova
Experiments on Speech Synthesis for Teochew, Can Taiwanese Help?
[Video]
Pierre Magistry, Ilaine Wang and Ty Eng Lim
An Evaluation of Croatian ASR Models for Čakavian Transcription
[Poster] [Video]
Shulin Zhang, John Hale, Margaret Renwick, Zvjezdana Vrzić and Keith Langston
Monolingual Paraphrase Detection Corpus for Low Resource Pashto Language at Sentence Level
[Poster] [Slides] [Video]
Iqra Ali, Hidetaka Kamigaito and Taro Watanabe
Fine-Tuning a Pre-Trained Wav2Vec2 Model for Automatic Speech Recognition - Experiments with De Zahrar Sproche
[Slides] [Video]
Andrea Gulli, Francesco Costantini, Diego Sidraschi and Emanuela Li Destri
A Treebank of Asia Minor Greek
[Slides] [Video]
Eleni Vligouridou, Inessa Iliadou and Çağrı Çöltekin
Creating Terminological Resources in the Digital Age for Less-resourced Languages
[Poster] [Video]
Mercè Vàzquez
BalsuTalka.lv - Boosting the Common Voice Corpus for Low-Resource Languages
[Video]
Roberts Dargis, Arturs Znotins, Ilze Auzina, Baiba Saulite, Sanita Reinsone, Raivis Dejus, Antra Klavinska and Normunds Gruzitis
DORE: A Dataset for Portuguese Definition Generation
[Poster] [Slides] [Video]
Anna Beatriz Dimas Furtado, Tharindu Ranasinghe, Frederic Blain and Ruslan Mitkov
The Low Saxon LSDC Dataset at Universal Dependencies
[Slides] [Video]
Janine Siewert and Jack Rueter
Topic Classification and Headline Generation for Maltese Using a Public News Corpus
[Slides] [Video]
Amit Kumar Chaudhary, Kurt Micallef and Claudia Borg
Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin
[Slides] [Video]
Pin-Jie Lin, Merel Scholman, Muhammed Saeed and Vera Demberg
UzbekVerbDetection: Rule-based Detection of Verbs in Uzbek Texts
[Slides] [Video]
Maksud Sharipov, Elmurod Kuriyozov, Ollabergan Yuldashev and Ogabek Sobirov
17:30 - 19:10D2-S4-P7 - Less-Resourced/Endangered/Less-studied Languages II (Chair: Frederic Bechet)
Flexible Lexicalization in Rule-based Text Realization
[Poster] [Video]
Avril Gazeau and Francois Lareau
PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization
[Video]
Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao and Nan Duan
XVD: Cross-Vocabulary Differentiable Training for Generative Adversarial Attacks
[Slides] [Video]
Tom Roth, Inigo Jauregi Unanue, Alsharif Abuadbba and Massimo Piccardi
Improving Factual Consistency in Abstractive Summarization with Sentence Structure Pruning
[Slides] [Video]
Dingxin Hu, Xuanyu Zhang, Xingyue Zhang, Yiyang Li, Dongsheng Chen, Marina Litvak, Natalia Vanetik, Qing Yang, Dongliang Xu, Yanquan Zhou, Lei Li, Yuze Li and Yingqi Zhu
Distantly Supervised Contrastive Learning for Low-Resource Scripting Language Summarization
[Video]
Junzhe Liang, Haifeng Sun, Zirui Zhuang, Qi Qi, Jingyu Wang and Jianxin Liao
Controllable Sentence Simplification in Swedish Using Control Prefixes and Mined Paraphrases
[Video]
Julius Monsen and Arne Jonsson
EpLSA: Synergy of Expert-prefix Mixtures and Task-Oriented Latent Space Adaptation for Diverse Generative Reasoning
[Poster] [Slides] [Video]
Fujun Zhang, Xiangdong Su, Jiang Li, Rong Yan and Guanglai Gao
A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
[Slides] [Video]
Yunxin Li, Baotian Hu, Wenhan Luo, Lin Ma, Yuxin Ding and Min Zhang
Speech Corpus for Korean Children with Autism Spectrum Disorder: Towards Automatic Assessment Systems
[Video]
Seonwoo Lee, Jihyun Mun, Sunhee Kim and Minhwa Chung
A Preliminary Study of ChatGPT for Spanish E2R Text Adaptation
[Slides]
Margot Madina, Itziar Gonzalez-Dios and Melanie Siegel
WikiSplit++: Easy Data Refinement for Split and Rephrase
[Slides] [Video]
Hayato Tsukagoshi, Tsutomu Hirao, Makoto Morishita, Katsuki Chousa, Ryohei Sasano and Koichi Takeda
Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods
[Poster] [Video]
Tatiana Passali and Grigorios Tsoumakas
Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
[Video]
Feifan Song, Bowen Yu, Hao Lang, Haiyang Yu, Fei Huang, Houfeng Wang and Yongbin Li
Rationale-based Learning Using Self-Supervised Narrative Events for Text Summarisation of Interactive Digital Narratives
[Slides] [Video]
Ashwathy T Revi, Stuart E. Middleton and David E. Millard
17:30 - 19:10D2-S4-P7 - Parsing, Tagging, Chunking, Grammar, Syntax, Morphosyntax, Morphology (Chair: Frederic Bechet)
High-order Joint Constituency and Dependency Parsing
[Slides] [Video]
Yanggan Gu, Yang Hou, Zhefeng Wang, Xinyu Duan and Zhenghua Li
MWE-Finder: A Demonstration
[Poster] [Video]
Jan Odijk, Martin Kroon, Tijmen Baarda, Ben Bonfil and Sheean Spoel
Enough Is Enough! A Case Study on the Effect of Data Size for Evaluation Using Universal Dependencies
[Slides] [Video]
Rob van der Goot, Zoey Liu and Max Müller-Eberstein
What Has LeBenchmark Learnt about French Syntax?
[Slides] [Video]
Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux and Maximin Coavoux
EMAD: A Bridge Tagset for Unifying Arabic POS Annotations
[Poster] [Slides] [Video]
Omar Kallas, Go Inoue and Nizar Habash
Leveraging Syntactic Dependencies in Disambiguation: The Case of African American English
[Slides] [Video]
Wilermine Previlon, Alice Rozet, Jotsna Gowda, Bill Dyer, Kevin Tang and Sarah Moeller
Camel Morph MSA: A Large-Scale Open-Source Morphological Analyzer for Modern Standard Arabic
[Poster] [Slides] [Video]
Christian Khairallah, Salam Khalifa, Reham Marzouk, Mayar Nassar and Nizar Habash
Efficient AMR Parsing with CLAP: Compact Linearization with an Adaptable Parser
[Poster] [Video]
Abelardo Carlos Martinez Lorenzo and Roberto Navigli
Categorial Grammar Induction with Stochastic Category Selection
[Video]
Christian Clark and William Schuler
Soft Well-Formed Semantic Parsing with Score-Based Selection
[Slides] [Video]
Jiangming Liu
17:30 - 19:10D2-S4-P7 - Multimodal Applications, Grounded Language Acquisition, and HRI I (Chair: Frederic Bechet)
Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
[Poster] [Slides] [Video]
Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu and Guoqing Zhao
A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions
[Slides] [Video]
Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi and Koichiro Yoshino
Motion Capture Analysis of Verb and Adjective Types in Austrian Sign Language (ÖGS)
[Poster] [Video]
Julia Krebs, Evguenia A. Malaia, Isabella Fessl, Hans-Peter Wiesinger, Dietmar Roehm, Ronnie Wilbur and Hermann Schwameder
Word-Aware Modality Stimulation for Multimodal Fusion
[Poster] [Slides] [Video]
Shuhei Tateishi, Makoto Nakatsuji and Yasuhito Osugi
TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling
[Poster] [Slides] [Video]
Weiran Chen, Xin Li, Jiaqi Su, Guiqian Zhu, Ying Li, Yi Ji and Chunping Liu
A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation
[Slides] [Video]
Cécile Macaire, Chloé Dion, Jordan Arrigo, Claire Lemaire, Emmanuelle Esperança-Rodier, Benjamin Lecouteux and Didier Schwab
High-Order Semantic Alignment for Unsupervised Fine-Grained Image-Text Retrieval
[Video]
Rui Gao, Miaomiao Cheng, Xu Han and Wei Song
SwissSLi: The Multi-parallel Sign Language Corpus for Switzerland
[Video]
Zifan Jiang, Anne Göhring, Amit Moryossef, Rico Sennrich and Sarah Ebling
Automated Extraction of Prosodic Structure from Unannotated Sign Language Video
[Video]
Antonio F. G. Sevilla, José María Lahoz-Bengoechea and Alberto Diaz
Encoding Gesture in Multimodal Dialogue: Creating a Corpus of Multimodal AMR
[Slides] [Video]
Kenneth Lai, Richard Brutti, Lucia Donatelli and James Pustejovsky
The Key Points: Using Feature Importance to Identify Shortcomings in Sign Language Recognition Models
[Slides] [Video]
Ruth M. Holmes, Ellen Rushe and Anthony Ventresque
Semantic Map-based Generation of Navigation Instructions
[Poster] [Slides] [Video]
Chengzu Li, Chao Zhang, Simone Teufel, Rama Sanand Doddipatla and Svetlana Stoyanchev
Saliency-Aware Interpolative Augmentation for Multimodal Financial Prediction
[Slides] [Video]
Samyak Jain, Parth Chhabra, Atula Tejaswi Neerkaje, Puneet Mathur, Ramit Sawhney, Shivam Agarwal, Preslav Nakov, Sudheer Chava and Dinesh Manocha
 End of Day 2
  

Friday, 24 May 2024

 Day 3
D3-S1-R1 - Corpora and Annotation VI (Chair: Simonetta Montemagni)
09:00 - 09:20Appraisal Framework for Clinical Empathy: A Novel Application to Breaking Bad News Conversations
[Slides] [Video]
Allison Claire Lahnala, Béla Neuendorf, Alexander Thomin, Charles Welch, Tina Stibane and Lucie Flek
09:20 - 09:40Out of the Mouths of MPs: Speaker Attribution in Parliamentary Debates
[Slides] [Video]
Ines Rehbein, Josef Ruppenhofer, Annelen Brunner and Simone Paolo Ponzetto
09:40 - 10:00LCGbank: A Corpus of Syntactic Analyses Based on Proof Nets
[Slides] [Video]
Aditya Bhargava, Timothy A. D. Fowler and Gerald Penn
10:00 - 10:20Russian Learner Corpus: Towards Error-Cause Annotation for L2 Russian
[Slides] [Video]
Daniil Kosakin, Sergei Obiedkov, Ivan Smirnov, Ekaterina Rakhilina, Anastasia Vyrenkova and Ekaterina Zalivina
10:20 - 10:40TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu
[Slides] [Video]
Gopichand Kanumolu, Lokesh Madasu, Nirmal Surange and Manish Shrivastava
D3-S1-R2 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction II (Chair: Simon Dobnik)
09:00 - 09:20Collecting Human-Agent Dialogue Dataset with Frontal Brain Signal toward Capturing Unexpressed Sentiment
[Slides] [Video]
Shun Katada, Ryu Takeda and Kazunori Komatani
09:20 - 09:40RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education
[Slides] [Video]
Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn and Alice Oh
09:40 - 10:00Generating Hard-Negative Out-of-Scope Data with ChatGPT for Intent Classification
[Video]
Zhijian Li, Stefan Larson and Kevin Leach
10:00 - 10:20Towards a Zero-Data, Controllable, Adaptive Dialog System
[Slides] [Video]
Dirk Väth, Lindsey Vanderlyn and Ngoc Thang Vu
10:20 - 10:40A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances
[Slides] [Video]
Nigel Ward and Divette Marco
D3-S1-R3 - Multilinguality, Machine Translation, and Translation Aids II (Chair: Jan Niehues)
09:00 - 09:20CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization
[Video]
Ruochen Zhang and Carsten Eickhoff
09:20 - 09:40A New Massive Multilingual Dataset for High-Performance Language Technologies
[Slides] [Video]
Ona de Gibert, Graeme Nail, Nikolay Arefyev, Marta Bañón, Jelmer van der Linde, Shaoxiong Ji, Jaume Zaragoza-Bernabeu, Mikko Aulamo, Gema Ramírez-Sánchez, Andrey Kutuzov, Sampo Pyysalo, Stephan Oepen and Jörg Tiedemann
09:40 - 10:00Exploring Geometric Representational Disparities between Multilingual and Bilingual Translation Models
[Video]
Neha Verma, Kenton Murray and Kevin Duh
10:00 - 10:20Identifying Source Language Expressions for Pre-editing in Machine Translation
[Slides] [Video]
Norizo Sakaguchi, Yugo Murawaki, Chenhui Chu and Sadao Kurohashi
10:20 - 10:40Neural Machine Translation between Low-Resource Languages with Synthetic Pivoting
[Slides] [Video]
Khalid Ahmed and Jan Buys
D3-S1-R4 - Less-Resourced/Endangered/Less-studied Languages III (Chair: Constantine Lignos)
09:00 - 09:20MaCmS: Magahi Code-mixed Dataset for Sentiment Analysis
[Slides] [Video]
Priya Rani, Theodorus Fransen, John P. McCrae and Gaurav Negi
09:20 - 09:40ManNER & ManPOS: Pioneering NLP for Endangered Manchu Language
[Slides] [Video]
Sangah Lee, Sungjoo Byun, Jean Seo and Minha Kang
09:40 - 10:00Detecting Loanwords in Emakhuwa: An Extremely Low-Resource Bantu Language Exhibiting Significant Borrowing from Portuguese
[Slides] [Video]
Felermino Dario Mario Ali, Henrique Lopes Cardoso and Rui Sousa-Silva
10:00 - 10:20Empowering Low-Resource Regional Languages with Lexicons: A Comparative Study of NLP Tools for Morphosyntactic Analysis
[Video]
Cristina Garcia Holgado and Marianne Vergez-Couret
10:20 - 10:40Automatic Speech Recognition for Gascon and Languedocian Variants of Occitan
[Slides] [Video]
Iñigo Morcillo, Igor Leturia, Ander Corral, Xabier Sarasola, Michaël Barret, Aure Séguier and Benaset Dazéas
D3-S1-R5 - Evaluation and Validation Methodologies III (Chair: Deyi Xiong)
09:00 - 09:20Benchmarking the Performance of Machine Translation Evaluation Metrics with Chinese Multiword Expressions
[Video]
Huacheng Song and Hongzhi Xu
09:20 - 09:40CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
[Slides] [Video]
Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne and Alice Oh
09:40 - 10:00ShadowSense: A Multi-annotated Dataset for Evaluating Word Sense Induction
[Slides] [Video]
Ondřej Herman and Miloš Jakubíček
10:00 - 10:20Sequence-to-Sequence Spanish Pre-trained Language Models
[Slides] [Video]
Vladimir Araujo, Maria Mihaela Trusca, Rodrigo Tufiño and Marie-Francine Moens
10:20 - 10:40Vygotsky Distance: Measure for Benchmark Task Similarity
Maxim K. Surkov and Ivan P. Yamshchikov
D3-S1-R6 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics II (Chair: Alessandro Lenci)
09:00 - 09:20A Construction Grammar Corpus of Varying Schematicity: A Dataset for the Evaluation of Abstractions in Language Models
[Video]
Claire Bonial and Harish Tayyar Madabushi
09:20 - 09:40Approaches and Challenges for Resolving Different Representations of Fictional Characters for Chinese Novels
[Slides] [Video]
Li Song and Ying Liu
09:40 - 10:00Targeted Syntactic Evaluation on the Chomsky Hierarchy
[Video]
Taiga Someya, Ryo Yoshida and Yohei Oseki
10:00 - 10:20Learning Bidirectional Morphological Inflection like Humans
[Slides] [Video]
Akiyo Fukatsu, Yuto Harada and Yohei Oseki
10:20 - 10:40Efficiency and Effectiveness in Task-Oriented Dialogue: On Construction Repetition, Information Rate, and Task Success
[Slides] [Video]
Jun Sen Yee, Mario Giulianelli and Arabella J. Sinclair
09:00 - 10:40D3-S1-P8 - Corpora and Annotation VI (Chair: Andreas Witt)
SkOTaPA: A Dataset for Skepticism Detection in Online Text after Persuasion Attempt
[Slides] [Video]
Smitha Muthya Sudheendra, Maral Abdollahi, Dongyeop Kang, Jisu Huh and Jaideep Srivastava
CuRIAM: Corpus Re Interpretation and Metalanguage in U.S. Supreme Court Opinions
[Video]
Michael Kranzlein, Nathan Schneider and Kevin Tobia
UkraiNER: A New Corpus and Annotation Scheme towards Comprehensive Entity Recognition
[Slides] [Video]
Lauriane Aufrant and Lucie Chasseur
SM-FEEL-BG - the First Bulgarian Datasets and Classifiers for Detecting Feelings, Emotions, and Sentiments of Bulgarian Social Media Text
[Slides] [Video]
Irina Temnikova, Iva Marinova, Silvia Gargova, Ruslana Margova, Alexander Komarov, Tsvetelina Stefanova, Veneta Kireva, Dimana Vyatrova, Nevena Grigorova, Yordan Mandevski and Stefan Minkov
Humanitarian Corpora for English, French and Spanish
[Slides] [Video]
Loryn Isaacs, Santiago Chambó and Pilar León-Araúz
Domain Transferable Semantic Frames for Expert Interview Dialogues
[Video]
Taishi Chika, Taro Okahisa, Takashi Kodama, Yin Jou Huang, Yugo Murawaki and Sadao Kurohashi
The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments
[Slides] [Video]
Nailia Mirzakhmedova, Johannes Kiesel, Milad Alshomary, Maximilian Heinrich, Nicolas Handke, Xiaoni Cai, Valentin Barriere, Doratossadat Dastgheib, Omid Ghahroodi, MohammadAli SadraeiJavaheri, Ehsaneddin Asgari, Lea Kawaletz, Henning Wachsmuth and Benno Stein
InaGVAD: A Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation
[Video]
David Doukhan, Christine Maertens, William Le Personnic, Ludovic Speroni and Reda Dehak
KoCoSa: Korean Context-aware Sarcasm Detection Dataset
[Poster] [Slides] [Video]
Yumin Kim, Heejae Suh, Mingi Kim, Dongyeon Won and Hwanhee Lee
Annotation of Japanese Discourse Relations Focusing on Concessive Inferences
[Video]
Ai Kubota, Takuma Sato, Takayuki Amamoto, Ryota Akiyoshi and Koji Mineshima
CAMERA³: An Evaluation Dataset for Controllable Ad Text Generation in Japanese
[Slides] [Video]
Go Inoue, Akihiko Kato, Masato Mita, Ukyo Honda and Peinan Zhang
Cross-lingual Named Entity Corpus for Slavic Languages
Jakub Piskorski, Michał Marcińczuk and Roman Yangarber
From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization
[Poster] [Slides] [Video]
Botond Barta, Dorina Lakatos, Attila Nagy, Milán Konor Nyist and Judit Ács
PopAut: An Annotated Corpus for Populism Detection in Austrian News Comments
[Poster] [Slides] [Video]
Ahmadou Wagne, Julia Neidhardt and Thomas Elmar Kolb
IsraParlTweet: The Israeli Parliamentary and Twitter Resource
[Slides] [Video]
Guy Mor-Lan, Effi Levi, Tamir Sheafer and Shaul R. Shenhav
Towards a Corpus of Spoken Maltese: Korpus tal-Malti Mitkellem, KMM
[Slides] [Video]
Alexandra (Sandra) Vella, Sarah Agius, Aiden Williams and Claudia Borg
Annotate Chinese Aspect with UMR——a Case Study on the Little Prince
[Slides] [Video]
Sijia Ge, Zilong Li, Alvin Po-Chun Chen and Guanchao Wang
Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles
[Slides] [Video]
Andrea Zugarini, Kamyar Zeinalipour, Surya Sai Kadali, Marco Maggini, Marco Gori and Leonardo Rigutini
Towards Cost-effective Multi-style Conversations: A Pilot Study in Task-oriented Dialogue Generation
[Poster] [Slides] [Video]
Tiziano Labruna and Bernardo Magnini
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation
[Slides] [Video]
Yujie Shao, Xinrong Yao, Xingwei Qu, Chenghua Lin, Shi Wang, Wenhao Huang, Ge Zhang and Jie Fu
MUCH: A Multimodal Corpus Construction for Conversational Humor Recognition Based on Chinese Sitcom
[Poster] [Slides] [Video]
Hongyu Guo, Wenbo Shang, Xueyao Zhang and Binyang Li
SUK 1.0: A New Training Corpus for Linguistic Annotation of Modern Standard Slovene
[Slides] [Video]
Špela Arhar Holdt, Jaka Čibej, Kaja Dobrovoljc, Tomaž Erjavec, Polona Gantar, Simon Krek, Tina Munda, Nejc Robida, Luka Terčon and Slavko Zitnik
ReflectSumm: A Benchmark for Course Reflection Summarization
[Slides] [Video]
Mohamed Elaraby, Yang Zhong, Diane Litman, Ahmed Ashraf Butt and Muhsin Menekse
SciMRC: Multi-perspective Scientific Machine Reading Comprehension
[Slides] [Video]
Xiao Zhang, Heqi Zheng, Yuxiang Nie, Heyan Huang and Xian-Ling Mao
Beyond Words: Decoding Facial Expression Dynamics in Motivational Interviewing
[Slides] [Video]
Nezih Younsi, Catherine Pelachaud and Laurence Chaby
Creation and Analysis of an International Corpus of Privacy Laws
[Video]
Sonu Gupta, Geetika Gopi, Harish Balaji, Ellen Poplavska, Nora O’Toole, Siddhant Arora, Thomas Norton, Norman Sadeh and Shomir Wilson
AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies
[Poster] [Slides] [Video]
José-M. Acosta-Triana, David Gimeno-Gómez and Carlos-D. Martínez-Hinarejos
09:00 - 10:40D3-S1-P8 - Inference, Reasoning, Question Answering I (Chair: Andreas Witt)
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
[Poster] [Slides] [Video]
Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Tianxiang Sun, Cheng Chang, Qinyuan Cheng, Ding Wang, Xiaofeng Mou, Xipeng Qiu and Xuanjing Huang
TIGQA: An Expert-Annotated Question-Answering Dataset in Tigrinya
[Slides] [Video]
Hailay Kidu Teklehaymanot, Dren Fazlija, Niloy Ganguly, Gourab Kumar Patro and Wolfgang Nejdl
CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments
[Video]
Savitha Sam Abraham, Marjan Alirezaie and Luc de Raedt
PolQA: Polish Question Answering Dataset
[Slides] [Video]
Piotr Rybak, Piotr Przybyła and Maciej Ogrodniczuk
Select High-quality Synthetic QA Pairs to Augment Training Data in MRC under the Reward Guidance of Generative Language Models
[Video]
Jing Jin and Houfeng Wang
Improving Language Model Reasoning with Self-motivated Learning
[Poster] [Slides] [Video]
Yunlong Feng, Yang Xu, Libo Qin, Yasheng Wang and Wanxiang Che
Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation
[Poster] [Slides] [Video]
Zhouhao Sun, Xiao Ding, Li Du, Bibo Cai, Jinglong Gao, Ting Liu and Bing Qin
A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference
[Video]
Mokanarangan Thayaparan, Marco Valentino and André Freitas
Explainable Multi-hop Question Generation: An End-to-End Approach without Intermediate Question Labeling
[Poster] [Slides] [Video]
Seonjeong Hwang, Yunsu Kim and Gary Geunbae Lee
DRAMA: Dynamic Multi-Granularity Graph Estimate Retrieval over Tabular and Textual Question Answering
[Video]
Ruize Yuan, Xiang Ao, Li Zeng and Qing He
Eye-Tracking Features Masking Transformer Attention in Question-Answering Tasks
[Video]
Leran Zhang and Nora Hollenstein
SI-NLI: A Slovene Natural Language Inference Dataset and Its Evaluation
[Slides] [Video]
Matej Klemen, Aleš Žagar, Jaka Čibej and Marko Robnik-Šikonja
PECC: Problem Extraction and Coding Challenges
[Poster] [Slides] [Video]
Patrick Haller, Jonas Golde and Alan Akbik
Self-Improvement Programming for Temporal Knowledge Graph Question Answering
[Slides] [Video]
Zhuo Chen, Zhao Zhang, Zixuan Li, Fei Wang, Yutao Zeng, Xiaolong Jin and Yongjun Xu
Prompting-based Synthetic Data Generation for Few-Shot Question Answering
[Poster] [Slides] [Video]
Maximilian Schmidt, Andrea Bartezzaghi and Ngoc Thang Vu
09:00 - 10:40D3-S1-P8 - Knowledge Discovery / Representation (Chair: Andreas Witt)
L^2GC: Lorentzian Linear Graph Convolutional Networks for Node Classification
[Poster] [Slides] [Video]
Qiuyu Liang, Weihua Wang, Feilong Bao and Guanglai Gao
Ideological Knowledge Representation: Framing Climate Change in EcoLexicon
[Video]
Arianne Reimerink, Melania Cabezas-García, Pilar León-Araúz and Pamela Faber
Knowledge GeoGebra: Leveraging Geometry of Relation Embeddings in Knowledge Graph Completion
[Poster] [Video]
Kossi Amouzouvi, Bowen Song, Sahar Vahdati and Jens Lehmann
Dual Complex Number Knowledge Graph Embeddings
[Poster] [Slides] [Video]
Yao Dong, Qingchao Kong, Lei Wang and Yin Luo
BanglaAutoKG: Automatic Bangla Knowledge Graph Construction with Semantic Neural Graph Filtering
[Slides] [Video]
Azmine Toushik Wasi, Taki Hasan Rafi, Raima Islam and Dong-Kyu Chae
Related Work Is All You Need
[Slides] [Video]
Rodolfo Joel Zevallos, John E. Ortega and Benjamin Irving
Time-aware COMET: A Commonsense Knowledge Model with Temporal Knowledge
[Poster] [Video]
Eiki Murata and Daisuke Kawahara
09:00 - 10:40D3-S1-P8 - Machine Learning Models and Techniques for CL/NLP II (Chair: Andreas Witt)
Automatic Speech Interruption Detection: Analysis, Corpus, and System
[Video]
Martin Lebourdais, Marie Tahon, Antoine Laurent and Sylvain Meignier
Domain-Agnostic Adapter Architecture for Deception Detection: Extensive Evaluations with the DIFrauD Benchmark
[Video]
Dainis A. Boumber, Fatima Zahra Qachfar and Rakesh Verma
Exploring the Synergy of Dual-path Encoder and Alignment Module for Better Graph-to-Text Generation
[Poster] [Slides] [Video]
Tianxin Zhao, Yingxin Liu, Xiangdong Su, Jiang Li and Guanglai Gao
Unveiling Vulnerability of Self-Attention
[Slides] [Video]
Khai Jiet Liong, Hongqiu Wu and Hai Zhao
RISE: Robust Early-exiting Internal Classifiers for Suicide Risk Evaluation
[Slides] [Video]
Ritesh Singh Soun, Atula Tejaswi Neerkaje, Ramit Sawhney, Nikolaos Aletras and Preslav Nakov
Sparse Logistic Regression with High-order Features for Automatic Grammar Rule Extraction from Treebanks
[Slides] [Video]
Santiago Herrera, Caio Corro and Sylvain Kahane
Action-Concentrated Embedding Framework: This Is Your Captain Sign-tokening
[Slides] [Video]
Hyunwook Yu, Suhyeon Shin, Junku Heo, Hyuntaek Shin, Hyosu Kim and Mucheol Kim
Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models
[Video]
Sixing Yu, Juan Pablo Munoz and Ali Jannesari
Sub-Table Rescorer for Table Question Answering
[Slides] [Video]
Atsushi Kojima
Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations
[Video]
Gregor Donabauer and Udo Kruschwitz
Cross-type French Multiword Expression Identification with Pre-trained Masked Language Models
[Video]
Van-Tuan Bui and Agata Savary
Evaluating Unsupervised Dimensionality Reduction Methods for Pretrained Sentence Embeddings
[Poster] [Slides] [Video]
Gaifan Zhang, Yi Zhou and Danushka Bollegala
A Single Linear Layer Yields Task-Adapted Low-Rank Matrices
[Poster] [Video]
Hwichan Kim, Shota Sasaki, Sho Hoshino and Ukyo Honda
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
[Slides] [Video]
Gabriele Sarti and Malvina Nissim
AdaKron: An Adapter-based Parameter Efficient Model Tuning with Kronecker Product
[Poster] [Slides] [Video]
Marco Braga, Alessandro Raganato and Gabriella Pasi
Fisher Mask Nodes for Language Model Merging
[Poster] [Slides] [Video]
Thennal D K, Ganesh Nathan and Suchithra M S
A Multi-Label Dataset of French Fake News: Human and Machine Insights
[Slides] [Video]
Benjamin Icard, François Maine, Morgane Casanova, Géraud Faye, Julien Chanson, Guillaume Gadek, Ghislain Atemezing, François Bancilhon and Paul Égré
Automatic Punctuation Model for Spanish Live Transcriptions
[Slides] [Video]
Mario Perez-Enriquez, Jose Manuel Masiello-Ruiz, Jose Luis Lopez-Cuadrado, Israel Gonzalez-Carrasco, Paloma Martinez-Fernandez and Belen Ruiz-Mezcua
On the Way to Lossless Compression of Language Transformers: Exploring Cross-Domain Properties of Quantization
[Slides] [Video]
Nikita Martynov, Aleksei Goncharov, Gleb Kumichev, Evgeniy Egorov, Stanislav Vladimirovich Pavlov, Mikhail Sergeevich Durinov, Aleksandr Sergeevich Zuev and Egor Anatolievich Filimonov
Hyperbolic Representations for Prompt Learning
[Poster] [Slides] [Video]
Nan Chen, Xiangdong Su and Feilong Bao
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation
[Slides] [Video]
Mateusz Klimaszewski, Piotr Andruszkiewicz and Alexandra Birch
Cross-Lingual NLU: Mitigating Language-Specific Impact in Embeddings Leveraging Adversarial Learning
[Poster] [Slides] [Video]
Saedeh Tahery, Sahar Kianian and Saeed Farzi
09:00 - 10:40D3-S1-P8 - Natural Language Generation, Summarization and Simplification III (Chair: Andreas Witt)
Prompting for Numerical Sequences: A Case Study on Market Comment Generation
[Video]
Masayuki Kawarada, Tatsuya Ishigaki and Hiroya Takamura
An LLM-Enhanced Adversarial Editing System for Lexical Simplification
[Slides] [Video]
Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan and Jinlong Shu
A Generative Model for Lambek Categorial Sequents
[Slides] [Video]
Jinman Zhao and Gerald Penn
UrduMASD: A Multimodal Abstractive Summarization Dataset for Urdu
[Slides] [Video]
Ali Faheem, Faizad Ullah, Muhammad Sohaib Ayub and Asim Karim
Effective Integration of Text Diffusion and Pre-Trained Language Models with Linguistic Easy-First Schedule
[Poster] [Slides] [Video]
Yimin Ou and Ping Jian
Longform Multimodal Lay Summarization of Scientific Papers: Towards Automatically Generating Science Blogs from Research Articles
[Slides] [Video]
Sandeep Kumar, Guneet Singh Kohli, Tirthankar Ghosal and Asif Ekbal
Continual Reinforcement Learning for Controlled Text Generation
[Slides] [Video]
Velizar Shulev and Khalil Sima’an
German Also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
[Poster] [Slides] [Video]
Laura Mascarell, Ribin Chalumattu and Annette Rios
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
[Poster] [Slides] [Video]
Dongheng Li, Yongchang Hao and Lili Mou
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Dragomir Radev, Yejin Choi and Noah A. Smith
A Japanese News Simplification Corpus with Faithfulness
[Poster] [Video]
Toru Urakawa, Yuya Taguchi, Takuro Niitsuma and Hideaki Tamori
Contextualizing Generated Citation Texts
[Poster] [Slides] [Video]
Biswadip Mandal, Xiangci Li and Jessica Ouyang
Retrieval-Augmented Modular Prompt Tuning for Low-Resource Data-to-Text Generation
[Video]
Ruitao Feng, Xudong Hong, Mayank Jobanputra, Mattes Warning and Vera Demberg
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
[Slides] [Video]
Iakovos Evdaimon, Hadi Abdine, Christos Xypolopoulos, Stamatis Outsios, Michalis Vazirgiannis and Giorgos Stamou
SENTA: Sentence Simplification System for Slovene
[Poster] [Video]
Aleš Žagar, Matej Klemen, Marko Robnik-Šikonja and Iztok Kosem
SlovakSum: A Large Scale Slovak Summarization Dataset
[Slides] [Video]
Viktoria Ondrejova and Marek Suppa
DACL: Disfluency Augmented Curriculum Learning for Fluent Text Generation
[Poster] [Slides] [Video]
Rohan Chaudhury, Maria Teleki, Xiangjue Dong and James Caverlee
10:40 - 11:00Coffee break
D3-S2-R1 - Corpora and Annotation VII (Chair: Rémi Cardon)
11:00 - 11:20Constructing a Dependency Treebank for Second Language Learners of Korean
[Slides] [Video]
Hakyung Sung and Gyu-Ho Shin
11:20 - 11:40Building a Data Infrastructure for a Mid-Resource Language: The Case of Catalan
[Slides] [Video]
Aitor Gonzalez-Agirre, Montserrat Marimon, Carlos Rodriguez-Penagos, Javier Aula-Blasco, Irene Baucells, Carme Armentano-Oller, Jorge Palomar-Giner, Baybars Kulebi and Marta Villegas
11:40 - 12:00The SAMER Arabic Text Simplification Corpus
[Slides] [Video]
Bashar Alhafni, Reem Hazim, Juan David Pineros Liberato, Muhamed Al Khalil and Nizar Habash
12:00 - 12:20A Multi-layered Approach to Physical Commonsense Understanding: Creation and Evaluation of an Italian Dataset
[Slides] [Video]
Giulia Pensa, Begoña Altuna and Itziar Gonzalez-Dios
12:20 - 12:40QA-based Event Start-Points Ordering for Clinical Temporal Relation Annotation
[Slides] [Video]
Seiji Shimizu, Lis Pereira, Shuntaro Yada and Eiji Aramaki
D3-S2-R2 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation II (Chair: Elisabetta Fersini)
11:00 - 11:20A Challenge Dataset and Effective Models for Conversational Stance Detection
[Slides] [Video]
Fuqiang Niu, Min Yang, Ang Li, Baoquan Zhang, Xiaojiang Peng and Bowen Zhang
11:20 - 11:40A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction
[Slides] [Video]
Jian Zhang, Changlin Yang, Haiping Zhu, Qika Lin, Fangzhi Xu and Jun Liu
11:40 - 12:00DeFaktS: A German Dataset for Fine-Grained Disinformation Detection through Social Media Framing
[Slides] [Video]
Shaina Ashraf, Isabel Bezzaoui, Ionut Andone, Alexander Markowetz, Jonas Fegert and Lucie Flek
12:00 - 12:20OATS: A Challenge Dataset for Opinion Aspect Target Sentiment Joint Detection for Aspect-Based Sentiment Analysis
[Slides] [Video]
Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka and Thamar Solorio
12:20 - 12:40KazSAnDRA: Kazakh Sentiment Analysis Dataset of Reviews and Attitudes
[Slides] [Video]
Rustem Yeshpanov and Huseyin Atakan Varol
D3-S2-R3 - Multimodal Applications, Grounded Language Acquisition, and HRI III (Chair: Alessandro Bondielli)
11:00 - 11:20Labeling Comic Mischief Content in Online Videos with a Multimodal Hierarchical-Cross-Attention Model
[Slides] [Video]
Elaheh Baharlouei, Mahsa Shafaei, Yigeng Zhang, Hugo Jair Escalante and Thamar Solorio
11:20 - 11:40A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News
[Video]
Zhe Niu, Ronglai Zuo, Brian Mak and Fangyun Wei
11:40 - 12:00Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation
[Video]
Cong Ma, Yaping Zhang, Zhiyang Zhang, Yupu Liang, Yang Zhao, Yu Zhou and Chengqing Zong
12:00 - 12:20MULTICOLLAB: A Multimodal Corpus of Dialogues for Analyzing Collaboration and Frustration in Language
[Slides] [Video]
Michael Peechatt, Cecilia Ovesdotter Alm and Reynold Bailey
12:20 - 12:40Abstractive Multi-Video Captioning: Benchmark Dataset Construction and Extensive Evaluation
[Slides] [Video]
Rikito Takahashi, Hirokazu Kiyomaru, Chenhui Chu and Sadao Kurohashi
D3-S2-R4 - Natural Language Generation, Summarization and Simplification III (Chair: Marco Passarotti)
11:00 - 11:20Triples-to-isiXhosa (T2X): Addressing the Challenges of Low-Resource Agglutinative Data-to-Text Generation
[Video]
Francois Meyer and Jan Buys
11:20 - 11:40Scale-VAE: Preventing Posterior Collapse in Variational Autoencoder
[Slides] [Video]
Tianbao Song, Jingbo Sun, Xin Liu and Weiming Peng
11:40 - 12:00Alleviating Exposure Bias in Abstractive Summarization via Sequentially Generating and Revising
[Slides] [Video]
Jiaxin Duan, Fengyu Lu and Junfei Liu
12:00 - 12:20PACAR: Automated Fact-Checking with Planning and Customized Action Reasoning Using Large Language Models
[Slides] [Video]
Xiaoyan Zhao, Lingzhi Wang, Zhanghao Wang, Hong Cheng, Rui Zhang and Kam-Fai Wong
12:20 - 12:40Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics
[Slides] [Video]
Tyler A. Chang, Katrin Tomanek, Jessica Hoffmann, Nithum Thain, Erin MacMurray van Liemt, Kathleen Meier-Hellstern and Lucas Dixon
D3-S2-R5 - Trustworthy, Interpretability, and Explainability of Neural Models II (Chair: Yuki Arase)
11:00 - 11:20Interpretable Assessment of Speech Intelligibility Using Deep Learning: A Case Study on Speech Disorders Due to Head and Neck Cancers
[Slides] [Video]
Sondes Abderrazek, Corinne Fredouille, Alain Ghio, Muriel Lalain, Christine Meunier, Mathieu Balaguer and Virginie Woisard
11:20 - 11:40Analyzing Symptom-based Depression Level Estimation through the Prism of Psychiatric Expertise
[Slides] [Video]
Navneet Agarwal, Kirill Milintsevich, Lucie Metivier, Maud Rotharmel, Gaël Dias and Sonia Dollfus
11:40 - 12:00Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic
[Slides] [Video]
Xufeng Zhao, Mengdi Li, Wenhao Lu, Cornelius Weber, Jae Hee Lee, Kun Chu and Stefan Wermter
12:00 - 12:20Attack Named Entity Recognition by Entity Boundary Interference
[Video]
Yifei Yang, Hongqiu Wu and Hai Zhao
12:20 - 12:40How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study
[Slides] [Video]
Tianjie Ju, Weiwei Sun, Wei Du, Xinwei Yuan, Zhaochun Ren and Gongshen Liu
D3-S2-R6 - Parsing, Tagging, Chunking, Grammar, Syntax, Morphosyntax, Morphology II (Chair: Daniel Zeman)
11:00 - 11:20PaReNT (Parent Retrieval Neural Tool): A Deep Dive into Word Formation across Languages
[Slides] [Video]
Emil Svoboda and Magda Sevcikova
11:20 - 11:40Eesthetic: A Paralex Lexicon of Estonian Paradigms
[Slides] [Video]
Sacha Beniamine, Mari Aigro, Matthew Baerman, Jules Bouton and Maria Copot
11:40 - 12:00Joint Annotation of Morphology and Syntax in Dependency Treebanks
[Slides] [Video]
Bruno Guillaume, Kim Gerdes, Kirian Guiller, Sylvain Kahane and Yixuan Li
12:00 - 12:20UCxn: Typologically-Informed Annotation of Constructions Atop Universal Dependencies
[Slides] [Video]
Leonie Weissweiler, Nina Böbel, Kirian Guiller, Santiago Herrera, Wesley Samuel Scivetti, Arthur Lorenzi, Nurit Melnik, Archna Bhatia, Hinrich Schütze, Lori Levin, Amir Zeldes, Joakim Nivre, William Croft and Nathan Schneider
12:20 - 12:40LLMSegm: Surface-level Morphological Segmentation Using Large Language Model
[Slides] [Video]
Marko Pranjić, Marko Robnik-Šikonja and Senja Pollak
11:00 - 12:40D3-S2-P9 - Applications Involving LRs and Evaluation III (Chair: Samia Touileb)
CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective Models on French Biomedical Data
[Poster] [Slides] [Video]
Rian Touchent and Éric de la Clergerie
Enriching Word Usage Graphs with Cluster Definitions
[Slides] [Video]
Andrey Kutuzov, Mariia Fedorova, Dominik Schlechtweg and Nikolay Arefyev
BenLLM-Eval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
[Poster] [Slides] [Video]
Mohsinul Kabir, Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, M Saiful Bari and Enamul Hoque
Refining rtMRI Landmark-Based Vocal Tract Contour Labels with FCN-Based Smoothing and Point-to-Curve Projection
[Slides] [Video]
Mushaffa Rasyid Ridha and Sakriani Sakti
Assessing the Capabilities of Large Language Models in Coreference: An Evaluation
[Slides] [Video]
Yujian Gan, Massimo Poesio and Juntao Yu
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation
[Video]
Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu and Nikolaos Agadakos
GPT-3.5 for Grammatical Error Correction
[Video]
Anisia Katinskaia and Roman Yangarber
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models
[Video]
Zican Dong, Tianyi Tang, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen
CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models
[Poster] [Video]
Yufei Huang and Deyi Xiong
Term-Driven Forward-Looking Claim Synthesis in Earnings Calls
[Poster] [Slides] [Video]
Chung-Chi Chen and Hiroya Takamura
A Self-verified Method for Exploring Simile Knowledge from Pre-trained Language Models
[Poster] [Slides] [Video]
Longxuan Ma, Changxin Ke, Shuhan Zhou, Churui Sun, Wei-Nan Zhang and Ting Liu
KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models
[Poster] [Slides] [Video]
Dongjun Jang, Sungjoo Byun, Hyemi Jo and Hyopil Shin
Annotating Chinese Word Senses with English WordNet: A Practice on OntoNotes Chinese Sense Inventories
[Slides] [Video]
Hongzhi Xu, Jingxia Lin, Sameer Pradhan, Mitchell Marcus and Ming Liu
Locally Differentially Private In-Context Learning
[Slides] [Video]
Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixing Jiang, Shaoyang Song and Chunlai Zhou
Konidioms Corpus: A Dataset of Idioms in Konkani Language
[Poster] [Slides] [Video]
Naziya Mahamdul Shaikh, Jyoti D. Pawar and Mubarak Banu Sayed
11:00 - 12:40D3-S2-P9 - Corpora and Annotation IV (Chair: Samia Touileb)
HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
[Slides] [Video]
Qiwei Peng, Yekun Chai and Xuhong Li
Automatic Partitioning of a Code-Switched Speech Corpus Using Mixed-Integer Programming
[Video]
Joshua Miles Jansen van Vüren, Febe de Wet and Thomas Niesler
Eliciting Motivational Interviewing Skill Codes in Psychotherapy with LLMs: A Bilingual Dataset and Analytical Study
[Slides] [Video]
Xin Sun, Jiahuan Pei, Jan de Wit, Mohammad Aliannejadi, Emiel Krahmer, Jos T.P. Dobber and Jos A. Bosch
From Laughter to Inequality: Annotated Dataset for Misogyny Detection in Tamil and Malayalam Memes
[Video]
Rahul Ponnusamy, Kathiravan Pannerselvam, Saranya R, Prasanna Kumar Kumaresan, Sajeetha Thavareesan, Bhuvaneswari S, Anshid K.A, Susminu S Kumar, Paul Buitelaar and Bharathi Raja Chakravarthi
Dataset for Identification of Homophobia and Transphobia for Telugu, Kannada, and Gujarati
[Poster] [Slides] [Video]
Prasanna Kumar Kumaresan, Rahul Ponnusamy, Dhruv Sharma, Paul Buitelaar and Bharathi Raja Chakravarthi
RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian
[Slides] [Video]
Adrian Cosma, Ioan-Bogdan Iordache and Paolo Rosso
CASIMIR: A Corpus of Scientific Articles Enhanced with Multiple Author-Integrated Revisions
[Slides] [Video]
Léane Isabelle Jourdan, Florian Boudin, Nicolas Hernandez and Richard Dufour
Specifying Genericity through Inclusiveness and Abstractness Continuous Scales
[Video]
Claudia Collacciani, Andrea Amelio Ravelli and Marianna Bolognesi
PPORTAL_ner: An Annotated Corpus of Portuguese Literary Entities
[Poster] [Video]
Mariana O. Silva and Mirella M. Moro
Towards the WhAP Corpus: A Resource for the Study of Italian on WhatsApp
[Video]
Ilaria Fiorentini, Marco Forlano and Nicholas Nese
Can Factual Statements Be Deceptive? The DeFaBel Corpus of Belief-based Deception
[Poster] [Video]
Aswathy Velutharambath, Roman Klinger and Amelie Wührl
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
[Slides] [Video]
Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze and Barbara Plank
WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models
[Poster] [Slides] [Video]
Hichem Ammar Khodja, Frederic Bechet, Quentin Brabant, Alexis Nasr and Gwénolé Lecorvé
Biomedical Concept Normalization over Nested Entities with Partial UMLS Terminology in Russian
[Slides] [Video]
Natalia Loukachevitch, Andrey Sakhovskiy and Elena Tutubalina
Advancing Semi-Supervised Learning for Automatic Post-Editing: Data-Synthesis by Mask-Infilling with Erroneous Terms
[Poster] [Slides] [Video]
Wonkee Lee, Seong-Hwan Heo and Jong-Hyeok Lee
Annotation and Classification of Relevant Clauses in Terms-and-Conditions Contracts
[Slides] [Video]
Pietro Giovanni Bizzaro, Elena Della Valentina, Maurizio Napolitano, Nadia Mana and Massimo Zancanaro
Charting the Linguistic Landscape of Developing Writers: An Annotation Scheme for Enhancing Native Language Proficiency
[Video]
Miguel Da Corte and Jorge Baptista
ARBRES Kenstur: A Breton-French Parallel Corpus Rooted in Field Linguistics
[Slides] [Video]
Loïc Grobol and Mélanie Jouitteau
GMEG-EXP: A Dataset of Human- and LLM-Generated Explanations of Grammatical and Fluency Edits
[Poster] [Video]
S. Magalí López Cortez, Mark Josef Norris and Steve Duman
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
[Poster] [Slides] [Video]
Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu and Haizhou Li
11:00 - 12:40D3-S2-P9 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction II (Chair: Samia Touileb)
A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU
[Video]
Guanhua Chen, Yutong Yao, Derek F. Wong and Lidia S. Chao
Educational Dialogue Systems for Visually Impaired Students: Introducing a Task-Oriented User-Agent Corpus
[Poster] [Video]
Elisa Di Nuovo, Manuela Sanguinetti, Pier Felice Balestrucci, Luca Anselma, Cristian Bernareggi and Alessandro Mazzei
The Distracted Ear: How Listeners Shape Conversational Dynamics
[Slides] [Video]
Auriane Boudin, Stéphane Rauzy, Roxane Bertrand, Magalie Ochs and Philippe Blache
Linguistic Nudges and Verbal Interaction with Robots, Smart-Speakers, and Humans
[Video]
Natalia Kalashnikova, Ioana Vasilescu and Laurence Devillers
ChatGPT Role-play Dataset: Analysis of User Motives and Model Naturalness
[Poster] [Slides] [Video]
Yufei Tao, Ameeta Agrawal, Judit Dombi, Tetyana Sydorenko and Jung In Lee
IndirectQA: Understanding Indirect Answers to Implicit Polar Questions in French and Spanish
[Slides] [Video]
Christin Müller and Barbara Plank
Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks
[Poster] [Slides] [Video]
Abhinav Sukumar Rao, Atharva Roshan Naik, Sachin Vashistha, Somak Aditya and Monojit Choudhury
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation
[Slides] [Video]
Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Zijun Yao, Jing Zhang, Lei Hou and Juanzi Li
JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialogue Policy Learning
[Poster] [Slides] [Video]
Wai-Chung Kwan, Huimin Wang, Hongru Wang, Zezhong Wang, Bin Liang, Xian Wu, Yefeng Zheng and Kam-Fai Wong
Collecting and Analyzing Dialogues in a Tagline Co-Writing Task
[Video]
Xulin Zhou, Takuma Ichikawa and Ryuichiro Higashinaka
Would You Like to Make a Donation? A Dialogue System to Persuade You to Donate
[Slides] [Video]
Yuhan Song and Houfeng Wang
It’s Not under the Lamppost: Expanding the Reach of Conversational AI
[Slides] [Video]
Christy Doran and Deborah A. Dahl
M3TCM: Multi-modal Multi-task Context Model for Utterance Classification in Motivational Interviews
[Video]
Sayed Muddashir Hossain, Jan Alexandersson and Philipp Müller
Dialogue Systems Can Generate Appropriate Responses without the Use of Question Marks? – a Study of the Effects of “?” for Spoken Dialogue Systems –
[Poster] [Slides] [Video]
Tomoya Mizumoto, Takato Yamazaki, Katsumasa Yoshikawa, Masaya Ohagi, Toshiki Kawamoto and Toshinori Sato
TIGER: A Unified Generative Model Framework for Multimodal Dialogue Response Generation
[Slides] [Video]
Fanheng Kong, Peidong Wang, Shi Feng, Daling Wang and Yifei Zhang
SynTOD: Augmented Response Synthesis for Robust End-to-End Task-Oriented Dialogue System
[Slides] [Video]
Nguyen Quang Chieu, Quang-Minh Tran and Khac-Hoai Nam Bui
mForms: Multimodal Form Filling with Question Answering
[Slides] [Video]
Larry Heck, Simon Heck and Anirudh S. Sundar
Clarifying Completions: Evaluating How LLMs Respond to Incomplete Questions
[Video]
Angus Addlesee, Oliver Lemon and Arash Eshghi
Enhancing Knowledge Retrieval with Topic Modeling for Knowledge-Grounded Dialogue
[Slides] [Video]
Nhat Tran and Diane Litman
Comparison of the Intimacy Process between Real and Acting-based Long-term Text Chats
[Video]
Tsunehiro Arimoto, Hiroaki Sugiyama, Hiromi Narimatsu and Masahiro Mizukami
SCOUT: A Situated and Multi-Modal Human-Robot Dialogue Corpus
[Poster] [Slides] [Video]
Stephanie M. Lukin, Claire Bonial, Matthew Marge, Taylor A. Hudson, Cory J. Hayes, Kimberly Pollard, Anthony Baker, Ashley N. Foots, Ron Artstein, Felix Gervits, Mitchell Abrams, Cassidy Henry, Lucia Donatelli, Anton Leuski, Susan G. Hill, David Traum and Clare Voss
11:00 - 12:40D3-S2-P9 - Information Extraction, Knowledge Extraction, and Text Mining III (Chair: Samia Touileb)
Improving Continual Few-shot Relation Extraction through Relational Knowledge Distillation and Prototype Augmentation
[Slides] [Video]
Zhiheng Zhang, Daojian Zeng and Xue Bai
Improving Chinese Named Entity Recognition with Multi-grained Words and Part-of-Speech Tags via Joint Modeling
[Poster] [Slides] [Video]
Chenhui Dou, Chen Gong, Zhenghua Li, Zhefeng Wang, Baoxing Huai and Min Zhang
Generative Multimodal Entity Linking
[Poster] [Video]
Senbao Shi, Zhenran Xu, Baotian Hu and Min Zhang
Towards Realistic Few-Shot Relation Extraction: A New Meta Dataset and Evaluation
[Poster] [Slides] [Video]
Fahmida Alam, Md Asiful Islam, Robert Vacareanu and Mihai Surdeanu
ELLEN: Extremely Lightly Supervised Learning for Efficient Named Entity Recognition
[Slides] [Video]
Haris Riaz, Razvan Gabriel Dumitru and Mihai Surdeanu
Automatic Extraction of Language-Specific Biomarkers of Healthy Aging in Icelandic
[Slides] [Video]
Elena Callegari, Iris Edda Nowenstein, Ingunn Jóhanna Kristjánsdóttir and Anton Karl Ingason
Nested Event Extraction upon Pivot Element Recognition
[Video]
Weicheng Ren, Zixuan Li, Xiaolong Jin, Long Bai, Miao Su, Yantao Liu, Saiping Guan, Jiafeng Guo and Xueqi Cheng
Human in the Loop: How to Effectively Create Coherent Topics by Manually Labeling Only a Few Documents per Class
[Slides] [Video]
Anton F. Thielmann, Christoph Weisser and Benjamin Säfken
Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks
[Slides] [Video]
Keyaki Ohno, Hirotaka Kameko, Keisuke Shirai, Taichi Nishimura and Shinsuke Mori
Word-level Commonsense Knowledge Selection for Event Detection
[Video]
Shuai Yang, Yu Hong, Shiming He, Qingting Xu and Jianmin Yao
Improving Text Readability through Segmentation into Rheses
[Slides] [Video]
Antoine Jamelot, Solen Quiniou and Sophie Hamon
How to Encode Domain Information in Relation Classification
[Video]
Elisa Bassignana, Viggo Unmack Gascou, Frida Nøhr Laustsen, Gustav Kristensen, Marie Haahr Petersen, Rob van der Goot and Barbara Plank
SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation
[Video]
Andres Garcia-Silva, Cristian Berrio and Jose Manuel Gomez-Perez
GRIT: A Dataset of Group Reference Recognition in Italian
[Slides] [Video]
Sergio E. Zanotto, Qi Yu, Miriam Butt and Diego Frassinelli
Semantic Frame Extraction in Multilingual Olfactory Events
[Slides] [Video]
Stefano Menini
StructAM: Enhancing Address Matching through Semantic Understanding of Structure-aware Information
[Poster] [Slides] [Video]
Zhaoqi Zhang, Pasquale Balsebre, Siqiang Luo, Zhen Hai and Jiangping Huang
Grounded Multimodal Procedural Entity Recognition for Procedural Documents: A New Dataset and Baseline
[Poster] [Slides] [Video]
Haopeng Ren, Yushi Zeng, Yi Cai, Zhenqi Ye, Li Yuan and Pinli Zhu
Query-driven Relevant Paragraph Extraction from Legal Judgments
[Poster] [Slides] [Video]
Santosh T.Y.S.S., Elvin A. Quero Hernandez and Matthias Grabmair
11:00 - 12:40D3-S2-P9 - Multilinguality, Machine Translation, and Translation Aids I (Chair: Samia Touileb)
How Do Hyenas Deal with Human Speech? Speech Recognition and Translation with ConfHyena
[Poster] [Slides] [Video]
Marco Gaido, Sara Papi, Matteo Negri and Luisa Bentivogli
Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity
[Video]
Sho Hoshino, Akihiko Kato, Soichiro Murakami and Peinan Zhang
Multilingual Sentence-T5: Scalable Sentence Encoders for Multilingual Applications
[Slides] [Video]
Chihiro Yano, Akihiko Fukuchi, Shoko Fukasawa, Hideyuki Tachibana and Yotaro Watanabe
Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?
[Poster] [Slides] [Video]
Shaoxiong Ji, Timothee Mickus, Vincent Segonne and Jörg Tiedemann
Charles Translator: A Machine Translation System between Ukrainian and Czech
[Slides] [Video]
Martin Popel, Lucie Polakova, Michal Novák, Jindřich Helcl, Jindřich Libovický, Pavel Straňák, Tomas Krabac, Jaroslava Hlavacova, Mariia Anisimova and Tereza Chlanova
Continued Pre-training on Sentence Analogies for Translation with Small Data
[Video]
Liyan Wang, Haotong Wang and Yves Lepage
Evaluating Automatic Subtitling: Correlating Post-editing Effort and Automatic Metrics
[Slides] [Video]
Alina Karakanta, Mauro Cettolo, Matteo Negri and Luisa Bentivogli
Multilinguality or Back-translation? A Case Study with Estonian
[Slides] [Video]
Elizaveta Korotkova, Taido Purason, Agnes Luhtaru and Mark Fishel
Correlations between Multilingual Language Model Geometry and Crosslingual Transfer Performance
[Slides] [Video]
Cheril Shah, Yashashree Chandak, Atharv Mahesh Mane, Benjamin Bergen and Tyler A. Chang
Utilizing Longer Context than Speech Bubbles in Automated Manga Translation
[Video]
Hiroto Kaino, Soichiro Sugihara, Tomoyuki Kajiwara, Takashi Ninomiya, Joshua B. Tanner and Shonosuke Ishiwatari
Understanding How Positional Encodings Work in Transformer Model
[Video]
Taro Miyazaki, Hideya Mino and Hiroyuki Kaneko
Analysis on Unsupervised Acquisition Process of Bilingual Vocabulary through Iterative Back-Translation
[Slides] [Video]
Takuma Tanigawa, Tomoyosi Akiba and Hajime Tsukada
Massively Multilingual Token-Based Typology Using the Parallel Bible Corpus
[Slides] [Video]
Amanda Kann
Evaluating Word Expansion for Multilingual Sentiment Analysis of Parliamentary Speech
[Video]
Yana Nikolova and Costanza Navarretta
Revisiting Context Choices for Context-aware Machine Translation
[Poster] [Slides] [Video]
Matiss Rikters and Toshiaki Nakazawa
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
[Poster] [Slides] [Video]
Kartik Kartik, Sanjana Soni, Anoop Kunchukuttan, Tanmoy Chakraborty and Md. Shad Akhtar
12:40 - 13:20D3-S2-RE14 - Less-Resourced/Endangered/Less-studied Languages I
PrOnto: Language Model Evaluations for 859 Languages
[Slides] [Video]
Luke Gessler
Zero-shot Cross-lingual Automated Essay Scoring
[Slides] [Video]
Junyi He and Xia Li
Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning
[Slides] [Video]
Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun and Jun Zhao
WaCadie: Towards an Acadian French Corpus
[Video]
Jeremy Robichaud and Paul Cook
12:40 - 13:20D3-S2-RE14 - Less-Resourced/Endangered/Less-studied Languages II
Conjoin after Decompose: Improving Few-Shot Performance of Named Entity Recognition
[Slides] [Video]
Chengcheng Han, Renyu Zhu, Jun Kuang, Fengjiao Chen, Xiang Li, Ming Gao, Xuezhi Cao and Yunsen Xian
Still All Greeklish to Me: Greeklish to Greek Transliteration
[Slides] [Video]
Anastasios Toumazatos, John Pavlopoulos, Ion Androutsopoulos and Stavros Vassos
Exploring the Potential of Large Language Models (LLMs) for Low-resource Languages: A Study on Named-Entity Recognition (NER) and Part-Of-Speech (POS) Tagging for Nepali Language
[Slides] [Video]
Bipesh Subedi, Sunil Regmi, Bal Krishna Bal and Praveen Acharya
KoFREN: Comprehensive Korean Word Frequency Norms Derived from Large Scale Free Speech Corpora
[Slides] [Video]
Jin-seo Kim, Anna Seo Gyeong Choi and Sunghye Cho
12:40 - 13:20D3-S2-RE14 - Less-Resourced/Endangered/Less-studied Languages III
Transformers for Bridging Persian Dialects: Transliteration Model for Tajiki and Iranian Scripts
[Slides] [Video]
MohammadAli SadraeiJavaheri, Ehsaneddin Asgari and Hamid Reza Rabiee
Samayik: A Benchmark and Dataset for English-Sanskrit Translation
[Slides] [Video]
Ayush Maheshwari, Ashim Gupta, Amrith Krishna, Atul Kumar Singh, Ganesh Ramakrishnan, Anil Kumar Gourishetty and Jitin Singla
A Multilingual Parallel Corpus for Aromanian
[Slides] [Video]
Iulia Petrariu and Sergiu Nisioi
PDAMeta: Meta-Learning Framework with Progressive Data Augmentation for Few-Shot Text Classification
[Slides] [Video]
Xurui Li, Kaisong Song, Tianqianjing Lin, Yangyang Kang, Fubang Zhao, Changlong Sun and Xiaozhong Liu
12:40 - 13:20D3-S2-RE15 - Lexicon and Semantics I
ReCAP: Semantic Role Enhanced Caption Generation
[Slides] [Video]
Abhidip Bhattacharyya, Martha Palmer and Christoffer Heckman
Medical Entity Disambiguation with Medical Mention Relation and Fine-grained Entity Knowledge
[Slides] [Video]
Wenpeng Lu, Guobiao Zhang, Xueping Peng, Hongjiao Guan and Shoujin Wang
Multi-Granularity Fusion Text Semantic Matching Based on WoBERT
[Slides] [Video]
Hongchun Yu, Wei Pan, Xing Fan and Hanqi Li
Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics
[Video]
Zhihong Zhu, Yunyan Zhang, Xuxin Cheng, Zhiqi Huang, Derong Xu, Xian Wu and Yefeng Zheng
12:40 - 13:20D3-S2-RE15 - Lexicon and Semantics II
Modelling and Linking an Old Latin-Portuguese Dictionary to the LiLa Knowledge Base
[Slides] [Video]
Lucas Consolin Dezotti, Marco Passarotti and Francesco Mambrini
Few-Shot Semantic Dependency Parsing via Graph Contrastive Learning
[Slides] [Video]
Bin Li, Yunlong Fan, Yikemaiti Sataer, Chuanqi Shi, Miao Gao and Zhiqiang Gao
German SRL: Corpus Construction and Model Training
[Slides] [Video]
Maxim Konca, Andy Luecking and Alexander Mehler
SDA: Simple Discrete Augmentation for Contrastive Sentence Representation Learning
[Slides] [Video]
Dongsheng Zhu, Zhenyu Mao, Jinghui Lu, Rui Zhao and Fei Tan
12:40 - 13:20D3-S2-RE15 - Lexicon and Semantics III
GLAMR: Augmenting AMR with GL-VerbNet Event Structure
[Video]
Jingxuan Tu, Timothy Obiso, Bingyang Ye, Kyeongmin Rim, Keer Xu, Liulu Yue, Susan Windisch Brown, Martha Palmer and James Pustejovsky
Finding Educationally Supportive Contexts for Vocabulary Learning with Attention-Based Models
[Slides] [Video]
Sungjin Nam, Kevyn Collins-Thompson, David Jurgens and Xin Tong
12:40 - 13:20D3-S2-RE16 - Machine Learning Models and Techniques for CL/NLP I
Calibrating LLM-Based Evaluator
[Slides] [Video]
Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun and Qi Zhang
VI-OOD: A Unified Framework of Representation Learning for Textual Out-of-distribution Detection
[Slides] [Video]
Li-Ming Zhan, Bo Liu and Xiao-Ming Wu
Depth Aware Hierarchical Replay Continual Learning for Knowledge Based Question Answering
[Video]
Zhixiong Cao, Hai-Tao Zheng, Yangning Li, Jin Xu, Rongsheng Li and Hong-Gee Kim
Towards Robust Temporal Activity Localization Learning with Noisy Labels
[Slides] [Video]
Daizong Liu, Xiaoye Qu, Xiang Fang, Jianfeng Dong, Pan Zhou, Guoshun Nan, Keke Tang, Wanlong Fang and Yu Cheng
12:40 - 13:20D3-S2-RE16 - Machine Learning Models and Techniques for CL/NLP II
CoRelation: Boosting Automatic ICD Coding through Contextualized Code Relation Learning
[Slides] [Video]
Junyu Luo, Xiaochen Wang, Jiaqi Wang, Aofei Chang, Yaqing Wang and Fenglong Ma
Refining Idioms Semantics Comprehension via Contrastive Learning and Cross-Attention
[Slides] [Video]
Mingmin Wu, Guixin Su, Yongcheng Zhang, Zhongqiang Huang and Ying Sha
Low-Rank Prune-And-Factorize for Language Model Compression
[Slides] [Video]
Siyu Ren and Kenny Q. Zhu
FlattenQuant: Breaking through the Inference Compute-bound for Large Language Models with Per-tensor Quantization
[Slides] [Video]
Yi Zhang, Fei Yang, Shuang Peng, Fangyu Wang and Aimin Pan
12:40 - 13:20D3-S2-RE16 - Machine Learning Models and Techniques for CL/NLP III
MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models
[Slides] [Video]
Nathanael Carraz Rakotonirina and Marco Baroni
Layer-wise Regularized Dropout for Neural Language Models
[Video]
Shiwen Ni, Min Yang, Ruifeng Xu, Chengming Li and Xiping Hu
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
[Slides] [Video]
Do June Min, Veronica Perez-Rosas, Ken Resnicow and Rada Mihalcea
TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation
[Slides] [Video]
Jiang Li, Xiangdong Su, Fujun Zhang and Guanglai Gao
12:40 - 13:20D3-S2-RE16 - Machine Learning Models and Techniques for CL/NLP IV
DimA: A Parameter-efficient Fine-tuning Method with Knowledge Transfer Based on Transformer
[Slides] [Video]
Wenxuan Zhang, Min Huang, Zhuoyang Song and Qinghai Miao
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
[Slides] [Video]
Tianyu Zheng, Ge Zhang, Xingwei Qu, Ming Kuang, Wenhao Huang and Zhaofeng He
Beyond Full Fine-tuning: Harnessing the Power of LoRA for Multi-Task Instruction Tuning
[Slides] [Video]
Chunlei Xin, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han and Le Sun
How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding
[Slides] [Video]
Jiamin Luo, Jianing Zhao, Jingjing Wang and Guodong Zhou
12:40 - 13:20D3-S2-RE16 - Machine Learning Models and Techniques for CL/NLP V
FoTo: Targeted Visual Topic Modeling for Focused Analysis of Short Texts
[Slides] [Video]
Sanuj Kumar and Tuan Le
Improving Robustness of GNN-based Anomaly Detection by Graph Adversarial Training
[Slides] [Video]
Xiangping Zheng, Bo Wu, Alex X. Zhang and Wei Li
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
[Slides] [Video]
Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang and Zhongyu Wei
Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study
[Video]
Zhihong Zhu, Xuxin Cheng, Hao An, Zhichang Wang, Dongsheng Chen and Zhiqi Huang
12:40 - 13:20D3-S2-RE16 - Machine Learning Models and Techniques for CL/NLP VI
Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition
[Slides] [Video]
Cam-Van Thi Nguyen, Cao-Bach Nguyen, Duc-Trong Le and Quang-Thuy Ha
On the Adaptation of Unlimiformer for Decoder-Only Transformers
[Slides] [Video]
Kian Ahrabian, Alon Benhaim, Barun Patra, Jay Pujara, Saksham Singhal and Xia Song
Probing Multimodal Large Language Models for Global and Local Semantic Representations
[Slides] [Video]
Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng and Dongyan Zhao
12:40 - 13:20D3-S2-RE17 - Multilinguality, Machine Translation, and Translation Aids I
Rewiring the Transformer with Depth-Wise LSTMs
[Slides] [Video]
Hongfei Xu, Yang Song, Qiuhui Liu, Josef van Genabith and Deyi Xiong
K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling
[Slides] [Video]
Haven Kim, Jongmin Jung, Dasaem Jeong and Juhan Nam
Submodular-based In-context Example Selection for LLMs-based Machine Translation
[Video]
Baijun Ji, Xiangyu Duan, Zhenyu Qiu, Tong Zhang, Junhui Li, Hao Yang and Min Zhang
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
[Slides] [Video]
Jianhao Yan, Jin Xu, Fandong Meng, Jie Zhou and Yue Zhang
12:40 - 13:20D3-S2-RE17 - Multilinguality, Machine Translation, and Translation Aids II
The Emergence of Semantic Units in Massively Multilingual Models
[Slides] [Video]
Andrea Gregor de Varda and Marco Marelli
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
[Slides] [Video]
Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi and Thien Huu Nguyen
Esposito: An English-Persian Scientific Parallel Corpus for Machine Translation
[Slides] [Video]
Mersad Esalati, Mohammad Javad Dousti and Heshaam Faili
Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer
[Slides] [Video]
Jianyu Zheng, Fengfei Fan and Jianquan Li
12:40 - 13:20D3-S2-RE17 - Multilinguality, Machine Translation, and Translation Aids III
Towards Robust In-Context Learning for Machine Translation with Large Language Models
[Slides] [Video]
Shaolin Zhu, Menglong Cui and Deyi Xiong
A Lifelong Multilingual Multi-granularity Semantic Alignment Approach via Maximum Co-occurrence Probability
[Slides] [Video]
Xin Liu, Hongwei Sun, Shaojie Dai, Bo Lv, Youcheng Pan, Hui Wang and Yue Yu
A Reinforcement Learning Approach to Improve Low-Resource Machine Translation Leveraging Domain Monolingual Data
[Video]
Hongxiao Zhang, Mingtong Liu, Chunyou Li, Yufeng Chen, Jinan Xu and Ming Zhou
Enhancing Translation Ability of Large Language Models by Leveraging Task-Related Layers
[Slides] [Video]
Pei Cheng, Xiayang Shi and Yinlin Li
12:40 - 13:20D3-S2-RE17 - Multilinguality, Machine Translation, and Translation Aids IV
Context-Aware Non-Autoregressive Document-Level Translation with Sentence-Aligned Connectionist Temporal Classification
[Slides] [Video]
Hao Yu, Kaiyu Huang, Anqi Zhao, Junpeng Liu and Degen Huang
Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates
[Slides] [Video]
Alvin C. Grissom II, Jo Shoemaker, Benjamin Goldman, Ruikang Shi, Craig Stewart, C. Anton Rytting, Leah Findlater and Jordan Boyd-Graber
Improving Vietnamese-English Medical Machine Translation
[Video]
Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi and Wray Buntine
12:40 - 13:20D3-S2-RE18 - Multimodal Applications, Grounded Language Acquisition, and HRI I
Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification
[Slides] [Video]
Artem Abzaliev, Humberto Perez-Espinosa and Rada Mihalcea
Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection
[Video]
Huixuan Zhang and Xiaojun Wan
Visual-Linguistic Dependency Encoding for Image-Text Retrieval
[Slides] [Video]
Wenxin Guo, Lei Zhang, Kun Zhang, Yi Liu and Zhendong Mao
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
[Video]
Guangmin Zheng, Jin Wang, Xiaobing Zhou and Xuejie Zhang
12:40 - 13:20D3-S2-RE18 - Multimodal Applications, Grounded Language Acquisition, and HRI II
Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition
[Slides] [Video]
Lianyu Hu, Liqing Gao, Zekang Liu and Wei Feng
Modalities Should Be Appropriately Leveraged: Uncertainty Guidance for Multimodal Chinese Spelling Correction
[Slides] [Video]
Yongliang Lin, Zhen Zhang, Mengting Hu, Yufei Sun and Yuzhi Zhang
Uncertainty-Aware Cross-Modal Alignment for Hate Speech Detection
[Video]
Chuanpeng Yang, Fuqing Zhu, Yaxin Liu, Jizhong Han and Songlin Hu
MEVTR: A Multilingual Model Enhanced with Visual Text Representations
[Slides] [Video]
Xiaohua Wang, Wenlong Fei, Min Hu, Qingyu Zhang and Aoqiang Zhu
12:40 - 13:20D3-S2-RE18 - Multimodal Applications, Grounded Language Acquisition, and HRI III
Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
[Slides] [Video]
Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo and Yanqing Guo
UMTIT: Unifying Recognition, Translation, and Generation for Multimodal Text Image Translation
[Slides] [Video]
Liqiang Niu, Fandong Meng and Jie Zhou
MccSTN: Multi-Scale Contrast and Fine-Grained Feature Fusion Networks for Subject-driven Style Transfer
[Slides] [Video]
Honggang Zhao, Chunling Xiao, Jiayi Yang, Guozhu Jin and Mingyong Li
TMFN: A Target-oriented Multi-grained Fusion Network for End-to-end Aspect-based Multimodal Sentiment Analysis
[Slides] [Video]
Di Wang, Yuzheng He, Xiao Liang, Yumin Tian, Shaofeng Li and Lin Zhao
12:40 - 13:20D3-S2-RE18 - Multimodal Applications, Grounded Language Acquisition, and HRI IV
CM-Off-Meme: Code-Mixed Hindi-English Offensive Meme Detection with Multi-Task Learning by Leveraging Contextual Knowledge
[Slides] [Video]
Gitanjali Kumari, Dibyanayan Bandyopadhyay, Asif Ekbal and Vinutha B. NarayanaMurthy
WW-CSL: A New Dataset for Word-Based Wearable Chinese Sign Language Detection
[Slides] [Video]
Fan Xu, Kai Liu, Yifeng Yang and Keyu Yan
Multimodal and Multilingual Laughter Detection in Stand-Up Comedy Videos
[Slides] [Video]
Anna Kuznetsova and Carlo Strapparava
Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning
[Slides] [Video]
Jun Cheng Yang, Zuchao Li, Shuai Xie, Wei Yu, Shijun Li and Bo Du
12:40 - 13:20D3-S2-RE18 - Multimodal Applications, Grounded Language Acquisition, and HRI V
JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
[Video]
Eri Onami, Shuhei Kurita, Taiki Miyanishi and Taro Watanabe
Text360Nav: 360-Degree Image Captioning Dataset for Urban Pedestrians Navigation
[Video]
Chieko Nishimura, Shuhei Kurita and Yohei Seki
MMAD: Multi-modal Movie Audio Description
[Video]
Xiaojun Ye, Junhao Chen, Xiang Li, Haidong Xin, Chao Li, Sheng Zhou and Jiajun Bu
12:40 - 13:20D3-S2-RE19 - Natural Language Generation, Summarization and Simplification I
A Document-Level Text Simplification Dataset for Japanese
[Slides] [Video]
Yoshinari Nagai, Teruaki Oka and Mamoru Komachi
Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs
[Video]
Zhihong Sun, Chen Lyu, Bolun Li, Yao Wan, Hongyu Zhang, Ge Li and Zhi Jin
Dynamic Knowledge Prompt for Chest X-ray Report Generation
[Slides] [Video]
Shenshen Bu, Yujie Song, Taiji Li and Zhiming Dai
StyleFlow: Disentangle Latent Representations via Normalizing Flow for Unsupervised Text Style Transfer
[Slides] [Video]
Kangchen Zhu, Zhiliang Tian, Jingyu Wei, Ruifeng Luo, Yiping Song and Xiaoguang Mao
12:40 - 13:20D3-S2-RE19 - Natural Language Generation, Summarization and Simplification II
ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization
[Slides] [Video]
Mengsha Liu, Daoyuan Chen, Yaliang Li, Guian Fang and Ying Shen
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs
[Video]
Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai and Deyi Xiong
Opinions Are Not Always Positive: Debiasing Opinion Summarization with Model-Specific and Model-Agnostic Methods
[Slides] [Video]
Yanyue Zhang, Yilong Lai, Zhenglin Wang, Pengfei Li, Deyu Zhou and Yulan He
Enhancing Image-to-Text Generation in Radiology Reports through Cross-modal Multi-Task Learning
[Slides] [Video]
Nurbanu Aksoy, Nishant Ravikumar and Serge Sharoff
12:40 - 13:20D3-S2-RE19 - Natural Language Generation, Summarization and Simplification III
Analyzing the Performance of Large Language Models on Code Summarization
[Slides] [Video]
Rajarshi Haldar and Julia Hockenmaier
BengaliLCP: A Dataset for Lexical Complexity Prediction in the Bengali Texts
[Video]
Nabila Ayman, Md. Akram Hossain, Abdul Aziz, Rokan Uddin Faruqui and Abu Nowshed Chy
GECSum: Generative Evaluation-Driven Sequence Level Contrastive Learning for Abstractive Summarization
[Slides] [Video]
Jiawen Xie, Shaoting Zhang and Xiaofan Zhang
Improving Role-Oriented Dialogue Summarization with Interaction-Aware Contrastive Learning
[Slides] [Video]
Weihong Guan, Shi Feng, Daling Wang, Faliang Huang, Yifei Zhang and Yuan Cui
12:40 - 13:20D3-S2-RE19 - Natural Language Generation, Summarization and Simplification IV
Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack
[Slides] [Video]
Ying Zhou, Ben He and Le Sun
Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation
[Slides] [Video]
Bocheng Li, Zhujin Gao, Yongxin Zhu, Kun Yin, Haoyu Cao, Deqiang Jiang and Linli Xu
Multi-Objective Forward Reasoning and Multi-Reward Backward Refinement for Product Review Summarization
[Video]
Libo Sun, Siyuan Wang, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang and Zhongyu Wei
Can Large Language Models Automatically Score Proficiency of Written Essays?
[Video]
Watheq Ahmad Mansour, Salam Albatarni, Sohaila Eltanbouly and Tamer Elsayed
12:40 - 13:20D3-S2-RE19 - Natural Language Generation, Summarization and Simplification V
Beyond Code: Evaluate Thought Steps for Complex Code Generation
[Video]
Liuwen Cao, Yi Cai, Jiexin Wang, Hongkui He and Hailin Huang
Reduce Redundancy Then Rerank: Enhancing Code Summarization with a Novel Pipeline Framework
[Slides] [Video]
Xiaoyu Hu, Xu Zhang, Zexu Lin and Deyu Zhou
Explicit over Implicit: Explicit Diversity Conditions for Effective Question Answer Generation
[Slides] [Video]
Vikas Yadav, Hyuk joon Kwon, Vijay Srinivasan and Hongxia Jin
Improving Copy-oriented Text Generation via EDU Copy Mechanism
[Video]
Tianxiang Wu, Han Chen, Luozheng Qin, Ziqiang Cao and Chunhui Ai
A Frustratingly Simple Decoding Method for Neural Text Generation
[Slides] [Video]
Haoran Yang, Deng Cai, Huayang Li, Wei Bi, Wai Lam and Shuming Shi
12:40 - 13:20D3-S2-RE20 - Offensive and Harmful Language Detection and Analysis I
How to Solve Few-Shot Abusive Content Detection Using the Data We Actually Have
[Slides] [Video]
Viktor Hangya and Alexander Fraser
STAF: Pushing the Boundaries of Test-Time Adaptation towards Practical Noise Scenarios
[Video]
Haoyu Xiong, Xinchun Zhang, Leixin Yang, Yu Xiang and Gang Fang
Intent-Aware and Hate-Mitigating Counterspeech Generation via Dual-Discriminator Guided LLMs
[Slides] [Video]
Haiyang Wang, Zhiliang Tian, Xin Song, Yue Zhang, Yuchen Pan, Hongkui Tu, Minlie Huang and Bin Zhou
Take Its Essence, Discard Its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect
[Video]
Junyu Lu, Bo Xu, Xiaokun Zhang, Kaiyuan Liu, Dongyu Zhang, Liang Yang and Hongfei Lin
12:40 - 13:20D3-S2-RE20 - Offensive and Harmful Language Detection and Analysis II
MAGPIE: Multi-Task Analysis of Media-Bias Generalization with Pre-Trained Identification of Expressions
[Slides] [Video]
Tomáš Horych, Martin Paul Wessel, Jan Philip Wahle, Terry Ruas, Jerome Waßmuth, André Greiner-Petter, Akiko Aizawa, Bela Gipp and Timo Spinde
LI4: Label-Infused Iterative Information Interacting Based Fact Verification in Question-answering Dialogue
[Slides] [Video]
Xiaocheng Zhang, Chang Wang, Guoping Zhao and Xiaohong Su
HarmPot: An Annotation Framework for Evaluating Offline Harm Potential of Social Media Text
[Slides] [Video]
Ritesh Kumar, Ojaswee Bhalla, Madhu Vanthi, Shehlat Maknoon Wani and Siddharth Singh
InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks
[Video]
Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew and Animesh Mukherjee
12:40 - 13:20D3-S2-RE21 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation I
Let’s Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models
[Slides] [Video]
Shunyu Liu, Jie Zhou, Qunxi Zhu, Qin Chen, Qingchun Bai, Jun Xiao and Liang He
Structure-aware Generation Model for Cross-Domain Aspect-based Sentiment Classification
[Slides] [Video]
Shichen Li, Zhongqing Wang, Yanzhi Xu and Guodong Zhou
GCNet: Global-and-Context Collaborative Learning for Aspect-Based Sentiment Analysis
[Video]
Ting Zhou, Ying Shen and Yinghui Li
Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation
[Video]
Ge Gao, Jongin Kim, Sejin Paik, Ekaterina Novozhilova, Yi Liu, Sarah T. Bonna, Margrit Betke and Derry Tanti Wijaya
12:40 - 13:20D3-S2-RE21 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation II
A Hierarchical Sequence-to-Set Model with Coverage Mechanism for Aspect Category Sentiment Analysis
[Slides] [Video]
Siyu Wang, Jianhui Jiang, Shengran Dai and Jiangtao Qiu
Target-Adaptive Consistency Enhanced Prompt-Tuning for Multi-Domain Stance Detection
[Slides] [Video]
Shangkang Wang and Li Pan
SynPrompt: Syntax-aware Enhanced Prompt Engineering for Aspect-based Sentiment Analysis
[Slides] [Video]
Wen Yin, Cencen Liu, Yi Xu, Ahmad Raza Wahla, Huang Yiting and Dezhang Zheng
MLDSP-MA: Multidimensional Attention for Multi-Round Long Dialogue Sentiment Prediction
[Video]
Yunfei Yin, Congrui Zou, Zheng Yuan and Xianjian Bao
12:40 - 13:20D3-S2-RE21 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation III
Dual Encoder: Exploiting the Potential of Syntactic and Semantic for Aspect Sentiment Triplet Extraction
[Slides] [Video]
Xiaowei Zhao, Yong Zhou and Xiujuan Xu
ChatASU: Evoking LLM’s Reflexion to Truly Understand Aspect Sentiment in Dialogues
[Slides] [Video]
Yiding Liu, Jingjing Wang, Jiamin Luo, Tao Zeng and Guodong Zhou
Automatic Construction of a Chinese Review Dataset for Aspect Sentiment Triplet Extraction via Iterative Weak Supervision
[Slides] [Video]
Chia-Wen Lu, Ching-Wen Yang and Wei-Yun Ma
InfoEnh: Towards Multimodal Sentiment Analysis via Information Bottleneck Filter and Optimal Transport Alignment
[Video]
Yifeng Xie, Zhihong Zhu, Xuan Lu, Zhiqi Huang and Haoran Xiong
12:40 - 13:20D3-S2-RE21 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation IV
TopicDiff: A Topic-enriched Diffusion Approach for Multimodal Conversational Emotion Detection
[Slides] [Video]
Jiamin Luo, Jingjing Wang and Guodong Zhou
Integrating Representation Subspace Mapping with Unimodal Auxiliary Loss for Attention-based Multimodal Emotion Recognition
[Slides] [Video]
Xulong Du, Xingnan Zhang, Dandan Wang, Yingying Xu, Zhiyuan Wu, Shiqing Zhang, Xiaoming Zhao, Jun Yu and Liangliang Lou
Semantics-Aware Dual Graph Convolutional Networks for Argument Pair Extraction
[Slides] [Video]
Minzhao Guan, Zhixun Qiu, Fenghuan Li and Yun Xue
Impact of Task Adapting on Transformer Models for Targeted Sentiment Analysis in Croatian Headlines
[Video]
Sofia Lee and Jelke Bloem
12:40 - 13:20D3-S2-RE21 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation V
Multi-stream Information Fusion Framework for Emotional Support Conversation
[Slides] [Video]
Yinan Bao, Dou Hu, Lingwei Wei, Shuchong Wei, Wei Zhou and Songlin Hu
Feature Structure Matching for Multi-source Sentiment Analysis with Efficient Adaptive Tuning
[Slides] [Video]
Rui Li, Cheng Liu, Yu Tong and Jiang Dazhi
Emotion Recognition in Conversation via Dynamic Personality
[Slides] [Video]
Yan Wang, Bo Wang, Yachao Zhao, Dongming Zhao, Xiaojia Jin, Jijun Zhang, Ruifang He and Yuexian Hou
EDDA: An Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance Detection
[Video]
Daijun Ding, Li Dong, Zhichao Huang, Guangning Xu, Xu Huang, Bo Liu, Liwen Jing and Bowen Zhang
12:40 - 13:20D3-S2-RE21 - Opinion & Argument Mining, Sentiment Analysis, Emotion Recognition/Generation VI
Linking Adaptive Structure Induction and Neuron Filtering: A Spectral Perspective for Aspect-based Sentiment Analysis
[Slides] [Video]
Hao Niu, Maoyi Wang, Yun Xiong, Biao Yang, Xing Jia and Zhonglei Guo
The Impact of Stance Object Type on the Quality of Stance Detection
[Slides] [Video]
Maxwell A. Weinzierl and Sanda M. Harabagiu
Mitigating Linguistic Artifacts in Emotion Recognition for Conversations from TV Scripts to Daily Conversations
[Slides] [Video]
Donovan Ong, Shuo Sun, Jian Su and Bin Chen
Majority Rules Guided Aspect-Category Based Sentiment Analysis via Label Prior Knowledge
[Slides] [Video]
Lin Li, Shaopeng Tang and Renwei Wu
Debiasing Multi-Entity Aspect-Based Sentiment Analysis with Norm-Based Data Augmentation
[Slides] [Video]
Scott Friedman, Joan Zheng and Hillel Steinmetz
12:40 - 13:20D3-S2-RE22 - Parsing, Tagging, Chunking, Grammar, Syntax, Morphosyntax, Morphology I
Dependencies over Times and Tools (DoTT)
[Slides] [Video]
Andy Luecking, Giuseppe Abrami, Leon Hammerla, Marc Rahn, Daniel Baumartz, Steffen Eger and Alexander Mehler
Improving Grammatical Error Correction by Correction Acceptability Discrimination
[Video]
Bin Cao, Kai Jiang, Fayu Pan, Chenlei Bao and Jing Fan
TP-Link: Fine-grained Pre-Training for Text-to-SQL Parsing with Linking Information
[Slides] [Video]
Ziqiang Liu, Shujie Li, Zefeng Cai, Xiangyu Li, Yunshui Li, Chengming Li, Xiping Hu, Ruifeng Xu and Min Yang
Unicode Normalization and Grapheme Parsing of Indic Languages
[Slides] [Video]
Nazmuddoha Ansary, Quazi Adibur Rahman Adib, Tahsin Reasat, Asif Shahriyar Sushmit, Ahmed Imtiaz Humayun, Sazia Mehnaz, Kanij Fatema, Mohammad Mamun Or Rashid and Farig Sadeque
12:40 - 13:20D3-S2-RE22 - Parsing, Tagging, Chunking, Grammar, Syntax, Morphosyntax, Morphology II
Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training
[Video]
Longhui Zhang, Dingkun Long, Meishan Zhang, Yanzhao Zhang, Pengjun Xie and Min Zhang
Plots Made Quickly: An Efficient Approach for Generating Visualizations from Natural Language Queries
[Video]
Henrik Voigt, Kai Lawonn and Sina Zarrieß
Nested Noun Phrase Identification Using BERT
[Slides] [Video]
Shweta Misra and Johan Boye
Relation between Cross-Genre and Cross-Topic Transfer in Dependency Parsing
[Slides] [Video]
Vera Danilova and Sara Stymne
12:40 - 13:20D3-S2-RE23 - Policy issues, Ethics, Legal Issues, Bias Analysis I
Correcting Language Model Bias for Text Classification in True Zero-Shot Learning
[Video]
Feng Zhao, Wan Xianlin, Cheng Yan and Chu Kiong Loo
PolitiCause: An Annotation Scheme and Corpus for Causality in Political Texts
[Slides] [Video]
Paulina Garcia Corral, Hanna Bechara, Ran Zhang and Slava Jankin
A Comparative Study of Explicit and Implicit Gender Biases in Large Language Models via Self-evaluation
[Slides] [Video]
Yachao Zhao, Bo Wang, Yan Wang, Dongming Zhao, Xiaojia Jin, Jijun Zhang, Ruifang He and Yuexian Hou
WorldValuesBench: A Large-Scale Benchmark Dataset for Multi-Cultural Value Awareness of Language Models
[Slides] [Video]
Wenlong Zhao, Debanjan Mondal, Niket Tandon, Danica Dillion, Kurt Gray and Yuling Gu
12:40 - 13:20D3-S2-RE24 - Social Media Processing I
Interpretable Short Video Rumor Detection Based on Modality Tampering
[Slides] [Video]
Kaixuan Wu, Yanghao Lin, Donglin Cao and Dazhen Lin
Towards Robust Evidence-Aware Fake News Detection via Improving Semantic Perception
[Slides] [Video]
Yike Wu, Yang Xiao, Mengting Hu, Mengying Liu, Pengcheng Wang and Mingming Liu
Triple-R: Automatic Reasoning for Fact Verification Using Language Models
[Slides] [Video]
Mohammadamin Kanaani
Breakthrough from Nuance and Inconsistency: Enhancing Multimodal Sarcasm Detection with Context-Aware Self-Attention Fusion and Word Weight Calculation
[Video]
Hongfei Xue, Linyan Xu, Yu Tong, Rui Li, Jiali Lin and Dazhi Jiang
12:40 - 13:20D3-S2-RE24 - Social Media Processing II
CLFFRD: Curriculum Learning and Fine-grained Fusion for Multimodal Rumor Detection
[Video]
Fan Xu, Lei Zeng, Bowei Zou, Ai Ti Aw and Huan Rong
MRT: Multi-modal Short- and Long-range Temporal Convolutional Network for Time-sync Comment Video Behavior Prediction
[Video]
Weihao Zhao, Weidong He, Hao Wang, Haoyang Bi, Han Wu, Chen Zhu, Tong Xu and Enhong Chen
Exploring BERT-Based Classification Models for Detecting Phobia Subtypes: A Novel Tweet Dataset and Comparative Analysis
[Slides] [Video]
Anik Das, Milton King and James Alexander Hughes
LocalTweets to LocalHealth: A Mental Health Surveillance Framework Based on Twitter Data
[Slides] [Video]
Vijeta Deshpande, Minhwa Lee, Zonghai Yao, Zihao Zhang, Jason Brian Gibbons and Hong Yu
12:40 - 13:20D3-S2-RE24 - Social Media Processing III
SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-training
[Slides] [Video]
Jin Wang, Liang-Chih Yu and Xuejie Zhang
Recognizing Social Cues in Crisis Situations
[Slides] [Video]
Di Wang, Yuan Zhuang, Ellen Riloff and Marina Kogan
PASUM: A Pre-training Architecture for Social Media User Modeling Based on Text Graph
[Slides] [Video]
Kun Wu, Xinyi Mou, Lanqing Xue, Zhenzhe Ying, Weiqiang Wang, Qi Zhang, Xuanjing Huang and Zhongyu Wei
TweetTER: A Benchmark for Target Entity Retrieval on Twitter without Knowledge Bases
[Video]
Kiamehr Rezaee, Jose Camacho-Collados and Mohammad Taher Pilehvar
12:40 - 13:20D3-S2-RE24 - Social Media Processing IV
Claim-Centric and Sentiment Guided Graph Attention Network for Rumour Detection
[Slides] [Video]
Sajad Ramezani, Mauzama Firdaus and Lili Mou
MiDe22: An Annotated Multi-Event Tweet Dataset for Misinformation Detection
[Slides] [Video]
Cagri Toraman, Oguzhan Ozcelik, Furkan Sahinuc and Fazli Can
Improving Personalized Sentiment Representation with Knowledge-enhanced and Parameter-efficient Layer Normalization
[Slides] [Video]
You Zhang, Jin Wang, Liang-Chih Yu, Dan Xu and Xuejie Zhang
12:40 - 13:20D3-S2-RE25 - Speech Resources and Processing I
Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
[Video]
Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan and Erwei Yin
HYPERTTS: Parameter Efficient Adaptation in Text to Speech Using Hypernetworks
[Slides] [Video]
Yingting Li, Rishabh Bhardwaj, Ambuj Mehrish, Bo Cheng and Soujanya Poria
A Fast and High-quality Text-to-Speech Method with Compressed Auxiliary Corpus and Limited Target Speaker Corpus
[Video]
Ye Tao, Chaofeng Lu, Meng Liu, Kai Xu, Tianyu Liu, Yunlong Tian and Yongjie Du
New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
[Video]
Nadège Alavoine, Gaëlle Laperrière, Christophe Servan, Sahar Ghannay and Sophie Rosset
12:40 - 13:20D3-S2-RE25 - Speech Resources and Processing II
Extracting Biomedical Entities from Noisy Audio Transcripts
[Video]
Nima Ebadi, Kellen Morgan, Adrian Tan, Billy Linares, Sheri Osborn, Emma Majors, Jeremy Davis and Anthony Rios
CB-Whisper: Contextual Biasing Whisper Using Open-Vocabulary Keyword-Spotting
[Video]
Yuang Li, Yinglu Li, Min Zhang, Chang Su, Jiawei Yu, Mengyao Piao, Xiaosong Qiao, Miaomiao Ma, Yanqing Zhao and Hao Yang
VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain
[Slides] [Video]
Khai Le-Duc
Meta-Adapter for Self-Supervised Speech Models: A Solution to Low-Resource Speech Recognition Challenges
[Video]
Yaqi Chen, Hao Zhang, Xukui Yang, Wenlin Zhang and Dan Qu
12:40 - 13:20D3-S2-RE25 - Speech Resources and Processing III
RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot
[Slides] [Video]
Mohammad Mohammadamini, Driss Matrouf, Michael Rouvier, Jean-Francois Bonastre, Romain Serizel and Theophile Gonos
FFSTC: Fongbe to French Speech Translation Corpus
[Video]
D. Fortuné Kponou, Fréjus A. A. Laleye and Eugène Cokou Ezin
Parameter-Efficient Transfer Learning for End-to-end Speech Translation
[Slides] [Video]
Yunlong Zhao, Kexin Wang, Qianqian Dong and Tom Ko
DOC-RAG: ASR Language Model Personalization with Domain-Distributed Co-occurrence Retrieval Augmentation
[Slides] [Video]
Puneet Mathur, Zhe Liu, Ke Li, Yingyi Ma, Gil Karen, Zeeshan Ahmed, Dinesh Manocha and Xuedong Zhang
Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation
[Video]
Nivedita Sethiya, Saanvi Nair and Chandresh Maurya
12:40 - 13:20D3-S2-RE26 - Trustworthy, Interpretability, and Explainability of Neural Models I
XAI-Attack: Utilizing Explainable AI to Find Incorrectly Learned Patterns for Black-Box Adversarial Example Creation
[Slides] [Video]
Markus Bayer, Markus Neiczer, Maximilian Samsinger, Björn Buchhold and Christian Reuter
Analyzing Chain-of-thought Prompting in Black-Box Large Language Models via Estimated V-information
[Slides] [Video]
Zecheng Wang, Chunshan Li, Zhao Yang, Qingbin Liu, Yanchao Hao, Xi Chen, Dianhui Chu and Dianbo Sui
Unveiling Project-Specific Bias in Neural Code Models
[Slides] [Video]
Zhiming Li, Yanzhou Li, Tianlin Li, Mengnan Du, Bozhi Wu, Yushi Cao, Junzhe Jiang and Yang Liu
Trustworthiness and Self-awareness in Large Language Models: An Exploration through the Think-Solve-Verify Framework
[Video]
Zhendong Liu, Changhong Xia, Wei He and Chongjun Wang
12:40 - 13:20D3-S2-RE26 - Trustworthy, Interpretability, and Explainability of Neural Models II
Executing Natural Language-Described Algorithms with Large Language Models: An Investigation
[Slides] [Video]
Xin Zheng, Qiming Zhu, Hongyu Lin, Yaojie Lu, Xianpei Han and Le Sun
Rethinking Word-level Adversarial Attack: The Trade-off between Efficiency, Effectiveness, and Imperceptibility
[Video]
Pengwei Zhan, Jing Yang, He Wang, Chao Zheng and Liming Wang
Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations
[Slides] [Video]
Stephanie Brandl, Oliver Eberle, Tiago Ribeiro, Anders Søgaard and Nora Hollenstein
The Open-World Lottery Ticket Hypothesis for OOD Intent Classification
[Video]
Yunhua Zhou, Pengyu Wang, Peiju Liu, Yuxin Wang and Xipeng Qiu
12:40 - 13:20D3-S2-RE26 - Trustworthy, Interpretability, and Explainability of Neural Models III
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
[Slides] [Video]
Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang and Xuanjing Huang
Towards Algorithmic Fidelity: Mental Health Representation across Demographics in Synthetic vs. Human-generated Data
[Slides] [Video]
Shinka Mori, Oana Ignat, Andrew Lee and Rada Mihalcea
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
Rui Zheng, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang and Xuanjing Huang
ORTicket: Let One Robust BERT Ticket Transfer across Different Tasks
[Video]
Yuhao Zhou, Wenxiang Chen, Rui Zheng, Zhiheng Xi, Tao Gui, Qi Zhang and Xuanjing Huang
12:40 - 13:20D3-S2-RE26 - Trustworthy, Interpretability, and Explainability of Neural Models IV
Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings
[Slides] [Video]
Wei Zhou, Heike Adel, Hendrik Schuff and Ngoc Thang Vu
Mitigating Shortcuts in Language Models with Soft Label Encoding
[Video]
Zirui He, Huiqi Deng, Haiyan Zhao, Ninghao Liu and Mengnan Du
Revisiting the Self-Consistency Challenges in Multi-Choice Question Formats for Large Language Model Evaluation
[Video]
Wenjie Zhou, Qiang Wang, Mingzhou Xu, Ming Chen and Xiangyu Duan
13:20 - 14:40Lunch
14:40 - 15:40Keynote Speaker 2: Li Juanzi - Chair: Nianwen Xue
Knowledge in LLM Era: Actuality, Challenge, and Potentiality
[Video]
D3-S3-R1 - Dialogue, Conversational Systems, Chatbots, Human-Robot Interaction III (Chair: Gabriella Lapesa)
15:50 - 16:10Social Orientation: A New Feature for Dialogue Analysis
[Slides] [Video]
Todd Morrill, Zhaoyuan Deng, Yanda Chen, Amith Ananthram, Colin Wayne Leach and Kathleen McKeown
16:10 - 16:30Common Ground Tracking in Multimodal Dialogue
[Slides] [Video]
Ibrahim Khalil Khebour, Kenneth Lai, Mariah Bradford, Yifan Zhu, Richard A. Brutti, Christopher Tam, Jingxuan Tu, Benjamin A. Ibarra, Nathaniel Blanchard, Nikhil Krishnaswamy and James Pustejovsky
16:30 - 16:50PSYDIAL: Personality-based Synthetic Dialogue Generation Using Large Language Models
[Slides] [Video]
Ji-Eun Han, Jun-Seok Koh, Hyeon-Tae Seo, Du-Seong Chang and Kyung-Ah Sohn
16:50 - 17:10Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations
[Slides] [Video]
Yi-Pei Chen, Noriki Nishida, Hideki Nakayama and Yuji Matsumoto
D3-S3-R2 - Applications Involving LRs and Evaluation III (Chair: Sebastian Padó)
15:50 - 16:10A Multi-Task Transformer Model for Fine-grained Labelling of Chest X-Ray Reports
[Slides] [Video]
Yuanyi Zhu, Maria Liakata and Giovanni Montana
16:10 - 16:30A Survey on Natural Language Processing for Programming
[Slides] [Video]
Qingfu Zhu, Xianzhen Luo, Fang Liu, Cuiyun Gao and Wanxiang Che
16:30 - 16:50SLaCAD: A Spoken Language Corpus for Early Alzheimer’s Disease Detection
[Slides] [Video]
Shahla Farzana, Edoardo Stoppa, Alex Leow, Tamar Gollan, Raeanne Moore, David Salmon, Douglas Galasko, Erin Sundermann and Natalie Parde
16:50 - 17:10LexDrafter: Terminology Drafting for Legislative Documents Using Retrieval Augmented Generation
[Slides] [Video]
Ashish Chouhan and Michael Gertz
D3-S3-R3 - Information Extraction, Knowledge Extraction, and Text Mining III (Chair: Els Lefever)
15:50 - 16:10Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency
[Video]
Yuchen Shi, Deqing Yang, Jingping Liu, Yanghua Xiao, Zongyu Wang and Huimin Xu
16:10 - 16:30A Novel Three-stage Framework for Few-shot Named Entity Recognition
[Slides] [Video]
Shengjie Ji and Fang Kong
16:30 - 16:50Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction
[Slides] [Video]
Zepeng Ding, Wenhao Huang, Jiaqing Liang, Yanghua Xiao and Deqing Yang
16:50 - 17:10DP-CRE: Continual Relation Extraction via Decoupled Contrastive Learning and Memory Structure Preservation
[Slides] [Video]
Mengyi Huang, Meng Xiao, Ludi Wang and Yi Du
D3-S3-R4 - Speech Resources and Processing I (Chair: Sakriani Sakti)
15:50 - 16:10DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
[Slides] [Video]
Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin and Berlin Chen
16:10 - 16:30FalAI: A Dataset for End-to-end Spoken Language Understanding in a Low-Resource Scenario
[Slides] [Video]
Andres Pineiro-Martin, Carmen Garcia-Mateo, Laura Docio-Fernandez, Maria del Carmen Lopez-Perez and Jose Gandarela-Rodriguez
16:30 - 16:50KazEmoTTS: A Dataset for Kazakh Emotional Text-to-Speech Synthesis
[Slides] [Video]
Adal Abilbekov, Saida Mussakhojayeva, Rustem Yeshpanov and Huseyin Atakan Varol
16:50 - 17:10SpeechAlign: A Framework for Speech Translation Alignment Evaluation
[Slides] [Video]
Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale and Marta R. Costa-jussà
D3-S3-R5 - Knowledge Discovery / Representation I (Chair: Sunipa Dev)
15:50 - 16:10Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding
[Slides] [Video]
Zhiyu Fang, Jingyan Qin, Xiaobin Zhu, Chun Yang and Xu-Cheng Yin
16:10 - 16:30Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings
[Slides] [Video]
Albert Sawczyn, Jakub Binkowski, Piotr Bielak and Tomasz Kajdanowicz
16:30 - 16:50Improving Content Recommendation: Knowledge Graph-Based Semantic Contrastive Learning for Diversity and Cold-Start Users
[Slides] [Video]
Yejin Kim, Scott Rome, Kevin Foley, Mayur Nankani, Rimon Melamed, Javier Morales, Abhay K. Yadav, Maria Peifer, Sardar Hamidian and H. Howie Huang
16:50 - 17:10Inductive Knowledge Graph Completion with GNNs and Rules: An Analysis
[Slides] [Video]
Akash Anil, Victor Gutierrez-Basulto, Yazmin Ibanez-Garcia and Steven Schockaert
D3-S3-R6 - Social Media Processing (Chair: Veronique Hoste)
15:50 - 16:10PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
[Video]
Erxin Yu, Jing Li and Chunpu Xu
16:10 - 16:30ZenPropaganda: A Comprehensive Study on Identifying Propaganda Techniques in Russian Coronavirus-Related Media
[Slides] [Video]
Anton Chernyavskiy, Svetlana Shomova, Irina Dushakova, Ilya Kiriya and Dmitry Ilvovsky
16:30 - 16:50Leveraging Social Context for Humor Recognition and Sense of Humor Evaluation in Social Media with a New Chinese Humor Corpus - HumorWB
[Video]
Zeyuan Zeng, Zefeng Li, Liang Yang and Hongfei Lin
15:50 - 17:10D3-S3-P10 - CL and Linguistic Theories, Cognitive Modeling and Psycholinguistics II (Chair: Valentin Barriere)
So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset
[Poster] [Slides] [Video]
Wajdi Zaghouani, Hamdy Mubarak and Md. Rafiul Biswas
InteRead: An Eye Tracking Dataset of Interrupted Reading
[Slides] [Video]
Francesca Zermiani, Prajit Dhar, Ekta Sood, Fabian Kögel, Andreas Bulling and Maria Wirzberger
PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin
[Slides] [Video]
Stephen Bothwell, Brian DuSell, David Chiang and Brian Krostenko
Pater Incertus? There Is a Solution: Automatic Discrimination between Cognates and Borrowings for Romance Languages
[Poster] [Slides] [Video]
Liviu P. Dinu, Ana Sabina Uban, Ioan-Bogdan Iordache, Alina Maria Cristea, Simona Georgescu and Laurentiu Zoicas
Probing Large Language Models for Scalar Adjective Lexical Semantics and Scalar Diversity Pragmatics
[Slides] [Video]
Fangru Lin, Daniel Altshuler and Janet B. Pierrehumbert
Cognitive Information Bottleneck: Extracting Minimal Sufficient Cognitive Language Processing Signals
[Video]
Yuto Harada and Yohei Oseki
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs
[Slides] [Video]
David R. Mortensen, Valentina Izrailevitch, Yunze Xiao, Hinrich Schütze and Leonie Weissweiler
Reading Does Not Equal Reading: Comparing, Simulating and Exploiting Reading Behavior across Populations
[Video]
David R. Reich, Shuwen Deng, Marina Björnsdóttir, Lena Jäger and Nora Hollenstein
IT2ACL Learning Easy-to-Hard Instructions via 2-Phase Automated Curriculum Learning for Large Language Models
[Poster] [Video]
Yufei Huang and Deyi Xiong
Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons
[Video]
Shijia Zhou, Leonie Weissweiler, Taiqi He, Hinrich Schütze, David R. Mortensen and Lori Levin
LexiVault: A Repository for Psycholinguistic Lexicons of Lesser-studied Languages
[Video]
Hind Saddiki, Samantha Wray and Daisy Li
Representing Compounding with OntoLex. An Evaluation of Vocabularies for Word Formation Resources
[Video]
Elena Benzoni, Matteo Pellegrini, Francesco Dedè and Marco Passarotti
15:50 - 17:10D3-S3-P10 - Corpora and Annotation V (Chair: Valentin Barriere)
Building MUSCLE, a Dataset for MUltilingual Semantic Classification of Links between Entities
[Video]
Lucia Pitarch, Carlos Bobed Lisbona, David Abián, Jorge Gracia and Jordi Bernad
MARASTA: A Multi-dialectal Arabic Cross-domain Stance Corpus
[Slides] [Video]
Anis Charfi, Mabrouka Ben-Sghaier, Andria Samy Raouf Atalla, Raghda Akasheh, Sara Al-Emadi and Wajdi Zaghouani
Towards an Ideal Tool for Learner Error Annotation
[Slides] [Video]
Špela Arhar Holdt, Tomaž Erjavec, Iztok Kosem and Elena Volodina
LeadEmpathy: An Expert Annotated German Dataset of Empathy in Written Leadership Communication
[Slides] [Video]
Didem Sedefoglu, Allison Claire Lahnala, Jasmin Wagner, Lucie Flek and Sandra Ohly
There’s Something New about the Italian Parliament: The IPSA Corpus
[Slides] [Video]
Valentino Frasnelli and Alessio Palmero Aprosio
CLAUSE-ATLAS: A Corpus of Narrative Information to Scale up Computational Literary Analysis
[Poster] [Video]
Enrica Troiano and Piek T.J.M. Vossen
Spanless Event Annotation for Corpus-Wide Complex Event Understanding
[Poster] [Slides] [Video]
Ann Bies, Jennifer Tracey, Ann O’Brien, Song Chen and Stephanie Strassel
MultiLeg: Dataset for Text Sanitisation in Less-resourced Languages
[Slides] [Video]
Rinalds Vīksna and Inguna Skadiņa
FORECAST2023: A Forecast and Reasoning Corpus of Argumentation Structures
[Poster] [Slides] [Video]
Kamila Górska, John Lawrence and Chris Reed
Italian Word Embeddings for the Medical Domain
[Video]
Franco Alberto Cardillo and Franca Debole
GAATME: A Genetic Algorithm for Adversarial Translation Metrics Evaluation
[Slides] [Video]
Josef Jon and Ondřej Bojar
A Corpus of German Abstract Meaning Representation (DeAMR)
[Video]
Christoph Otto, Jonas Groschwitz, Alexander Koller, Xiulin Yang and Lucia Donatelli
Universal Dependencies: Extensions for Modern and Historical German
[Poster] [Video]
Stefanie Dipper, Cora Haiber, Anna Maria Schröter, Alexandra Wiemann and Maike Brinkschulte
CareCorpus: A Corpus of Real-World Solution-Focused Caregiver Strategies for Personalized Pediatric Rehabilitation Service Design
[Slides] [Video]
Mina Valizadeh, Vera C. Kaelin, Mary A. Khetani and Natalie Parde
Introducing CQuAE: A New French Contextualised Question-Answering Corpus for the Education Domain
[Slides] [Video]
Thomas Gerald, Anne Vilnat, Sofiane Ettayeb, Louis Tamames and Patrick Paroubek
The RIP Corpus of Collaborative Hypothesis-Making
[Slides] [Video]
Ella Schad, Jacky Visser and Chris Reed
Developing a Benchmark for Pronunciation Feedback: Creation of a Phonemically Annotated Speech Corpus of isiZulu Language Learner Speech
[Slides] [Video]
Alexandra O’Neil, Nils Hjortnaes, Francis Tyers, Zinhle Nkosi, Thulile Ndlovu, Zanele Mlondo and Ngami Phumzile Pewa
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline
[Poster] [Video]
Maliha Jahan, Helin Wang, Thomas Thebaud, Yinglun Sun, Giang Ha Le, Zsuzsanna Fagyal, Odette Scharenborg, Mark Hasegawa-Johnson, Laureano Moro Velazquez and Najim Dehak
Building a Broad Infrastructure for Uniform Meaning Representations
[Video]
Julia Bonn, Matthew J. Buchholz, Jayeol Chun, Andrew Cowell, William Croft, Lukas Denk, Sijia Ge, Jan Hajič, Kenneth Lai, James H. Martin, Skatje Myers, Alexis Palmer, Martha Palmer, Claire Benet Post, James Pustejovsky, Kristine Stenzel, Haibo Sun, Zdeňka Urešová, Rosa Vallejos, Jens E. L. Van Gysel, Meagan Vigus, Nianwen Xue and Jin Zhao
Revisiting Data Reconstruction Attacks on Real-world Dataset for Federated Natural Language Understanding
[Video]
Zhuo Zhang, Jintao Huang, Xiangjing Hu, Jingyuan Zhang, Yating Zhang, Hui Wang, Yue Yu, Qifan Wang, Lizhen Qu and Zenglin Xu
15:50 - 17:10D3-S3-P10 - Evaluation and Validation Methodologies III (Chair: Valentin Barriere)
Evaluating the Quality of a Corpus Annotation Scheme Using Pretrained Language Models
[Slides] [Video]
Furkan Akkurt, Onur Gungor, Büşra Marşan, Tunga Gungor, Balkiz Ozturk Basaran, Arzucan Özgür and Susan Uskudarli
Revisiting Three Text-to-Speech Synthesis Experiments with a Web-Based Audience Response System
[Video]
Christina Tånnander, Jens Edlund and Joakim Gustafson
Examining Temporalities on Stance Detection towards COVID-19 Vaccination
[Slides] [Video]
Yida Mu, Mali Jin, Kalina Bontcheva and Xingyi Song
How Far Is Too Far? Studying the Effects of Domain Discrepancy on Masked Language Models
[Slides] [Video]
Subhradeep Kayal, Alexander Rakhlin, Ali Dashti and Serguei Stepaniants
A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks
[Poster] [Slides] [Video]
Yanis Labrak, Mickael Rouvier and Richard Dufour
Unveiling Strengths and Weaknesses of NLP Systems Based on a Rich Evaluation Corpus: The Case of NER in French
[Slides] [Video]
Alice Millour, Yoann Dupont, Karen Fort and Liam Duignan
How Gender Interacts with Political Values: A Case Study on Czech BERT Models
[Video]
Adnan Al Ali and Jindřich Libovický
Comparative Analysis of Sign Language Interpreting Agents Perception: A Study of the Deaf
[Slides] [Video]
Alfarabi Imashev, Nurziya Oralbayeva, Gulmira Baizhanova and Anara Sandygulova
Kosmic: Korean Text Similarity Metric Reflecting Honorific Distinctions
[Slides] [Video]
Yerin Hwang, Yongil Kim, Hyunkyung Bae, Jeesoo Bang, Hwanhee Lee and Kyomin Jung
Evaluating Generative Language Models in Information Extraction as Subjective Question Correction
[Video]
Yuchen Fan, Yantao Liu, Zijun Yao, Jifan Yu, Lei Hou and Juanzi Li
Does the Language Matter? Curriculum Learning over Neo-Latin Languages
[Video]
Giulia Pucci and Leonardo Ranaldi
Automating Dataset Production Using Generative Text and Image Models
[Slides] [Video]
Christopher Thierauf, Mitchell Abrams and Matthias Scheutz
A Controlled Reevaluation of Coreference Resolution Models
[Slides] [Video]
Ian Porada, Xiyuan Zou and Jackie Chi Kit Cheung
Schroedinger’s Threshold: When the AUC Doesn’t Predict Accuracy
[Poster] [Video]
Juri Opitz
15:50 - 17:10D3-S3-P10 - Multimodal Applications, Grounded Language Acquisition, and HRI I (Chair: Valentin Barriere)
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
[Video]
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li and Weiming Hu
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval
[Video]
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li and Weiming Hu
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation
[Video]
Hongcheng Liu, Pingjie Wang, Zhiyuan Zhu, Yanfeng Wang and Yu Wang
An Effective Span-based Multimodal Named Entity Recognition with Consistent Cross-Modal Alignment
[Slides] [Video]
Yongxiu Xu, Hao Xu, Heyan Huang, Shiyao Cui, Minghao Tang, Longzheng Wang and Hongbo Xu
Korean Disaster Safety Information Sign Language Translation Benchmark Dataset
[Slides] [Video]
Wooyoung Kim, TaeYong Kim, Byeongjin Kim, Myeong Jin MJ Lee, Gitaek Lee, Kirok Kim, Jisoo Cha and Wooju Kim
RT-VQ2A2: Real Time Vector Quantized Question Answering with ASR
[Slides] [Video]
Kyungho Kim, Seongmin Park and Jihwa Lee
EVil-Probe - a Composite Benchmark for Extensive Visio-Linguistic Probing
[Poster] [Slides] [Video]
Marie Bexte, Andrea Horbach and Torsten Zesch
ReadLet: A Dataset for Oral, Visual and Tactile Text Reading Data of Early and Mature Readers
[Video]
Marcello Ferro, Claudia Marzi, Andrea Nadalini, Loukia Taxitari, Alessandro Lento and Vito Pirrelli
Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost
[Slides] [Video]
Oana Ignat, Longju Bai, Joan C. Nwatu and Rada Mihalcea
Generating Contextual Images for Long-Form Text
[Slides] [Video]
Avijit Mitra, Nalin Gupta, Chetan Naik, Abhinav Sethy, Kinsey Bice and Zeynab Raeesy
Decompose, Prioritize, and Eliminate: Dynamically Integrating Diverse Representations for Multimodal Named Entity Recognition
[Video]
Zihao Zheng, Zihan Zhang, Zexin Wang, Ruiji Fu, Ming Liu, Zhongyuan Wang and Bing Qin
Motion Generation from Fine-grained Textual Descriptions
[Slides] [Video]
Kunhang Li and Yansong Feng
Adaptive Simultaneous Sign Language Translation with Confident Translation Length Estimation
[Slides] [Video]
Tong Sun, Biao Fu, Cong Hu, Liang Zhang, Ruiquan Zhang, Xiaodong Shi, Jinsong Su and Yidong Chen
15:50 - 17:10D3-S3-P10 - Multimodal Applications, Grounded Language Acquisition, and HRI I (Chair: Valentin Barriere)
Prophecy Distillation for Boosting Abstractive Summarization
[Slides] [Video]
Jiaxin Duan, Fengyu Lu and Junfei Liu
Step-by-Step: Controlling Arbitrary Style in Text with Large Language Models
[Poster] [Slides] [Video]
Pusheng Liu, Lianwei Wu, Linyong Wang, Sensen Guo and Yang Liu
Enhancing Scientific Document Summarization with Research Community Perspective and Background Knowledge
[Poster] [Slides] [Video]
Sudipta Singha Roy and Robert E. Mercer
A Natural Approach for Synthetic Short-Form Text Analysis
[Slides] [Video]
Ruiting Shao, Ryan Schwarz, Christopher Clifton and Edward Delp
Little Red Riding Hood Goes around the Globe: Crosslingual Story Planning and Generation with Large Language Models
[Slides] [Video]
Evgeniia Razumovskaia, Joshua Maynez, Annie Louis, Mirella Lapata and Shashi Narayan
Benchmarking the Simplification of Dutch Municipal Text
[Slides] [Video]
Daniel Vlantis, Iva Gornishka and Shuai Wang
Scansion-based Lyrics Generation
[Slides] [Video]
Yiwen Chen and Simone Teufel
Quantifying the Impact of Disfluency on Spoken Content Summarization
[Poster] [Slides] [Video]
Maria Teleki, Xiangjue Dong and James Caverlee
Knowledge-Guided Cross-Topic Visual Question Generation
[Slides] [Video]
Hongfei Liu, Guohua Wang, Jiayuan Xie, Jiali Chen, Wenhao Fang and Yi Cai
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization
[Video]
Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li and Rongrong Ji
MCTS: A Multi-Reference Chinese Text Simplification Dataset
[Poster] [Slides] [Video]
Ruining Chong, Luming Lu, Liner Yang, Jinran Nie, Zhenghao Liu, Shuo Wang, Shuhan Zhou, Yaoxin Li and Erhong Yang
DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation
[Poster] [Slides] [Video]
Xinyu Ning, Yutong Zhao, Yitong Liu and Hongwen Yang
CALAMR: Component ALignment for Abstract Meaning Representation
[Slides] [Video]
Paul Landes and Barbara Di Eugenio
Reference-guided Style-Consistent Content Transfer
[Video]
Wei-Fan Chen, Milad Alshomary, Maja Stahl, Khalid Al Khatib, Benno Stein and Henning Wachsmuth
Title-based Extractive Summarization via MRC Framework
[Video]
Hongjin Kim, Jai-Eun Kim and Harksoo Kim
17:10 - 17:30Coffee break
17:30 - 18:50Closing Session
[Video]
20:00 - 23:00Gala Dinner
 End of Day 3