LREC COLING 2024 Proceedings


The Third Ukrainian Natural Language Processing Workshop



PROGRAM

Saturday, May 25, 2024

09:00–10:30 Morning session 1: New Datasets (Chair: Mariana Romanyshyn)
09:10–09:25 A Contemporary News Corpus of Ukrainian (CNC-UA): Compilation, Annotation, Publication
Stefan Fischer, Kateryna Haidarzhyi, Jörg Knappen, Olha Polishchuk, Yuliya Stodolinska and Elke Teich
09:25–09:40 Introducing the Djinni Recruitment Dataset: A Corpus of Anonymized CVs and Job Postings
Nazarii Drushchak and Mariana Romanyshyn
09:40–09:55 Creating Parallel Corpora for Ukrainian: A German-Ukrainian Parallel Corpus (ParaRook||DE-UK)
Maria Shvedova and Arsenii Lukashevskyi
09:55–10:10 Introducing NER-UK 2.0: A Rich Corpus of Named Entities for Ukrainian
Dmytro Chaplynskyi and Mariana Romanyshyn
10:30–11:00 Coffee break
11:00–13:00 Morning session 2: New Directions (Chair: Oleksii Ignatenko)
11:00–11:20 Instant Messaging Platforms News Multi-Task Classification for Stance, Sentiment, and Discrimination Detection
Taras Ustyianovych and Denilson Barbosa
11:20–11:35 Setting up the Data Printer with Improved English to Ukrainian Machine Translation
Yurii Paniv, Dmytro Chaplynskyi, Nikita Trynus and Volodymyr Kyrylov
11:35–11:55 Automated Extraction of Hypo-Hypernym Relations for the Ukrainian WordNet
Nataliia Romanyshyn, Dmytro Chaplynskyi and Mariana Romanyshyn
11:55–12:10 Ukrainian Visual Word Sense Disambiguation Benchmark
Yurii Laba, Yaryna Mohytych, Ivanna Rohulia, Halyna Kyryleyza, Hanna Dydyk-Meush, Oles Dobosevych and Rostyslav Hryniv
13:00–14:00 Lunch
14:00–16:00 Afternoon session: LLMs for Ukrainian (Chair: Mariana Romanyshyn)
14:00–14:15 The UNLP 2024 Shared Task on Fine-Tuning Large Language Models for Ukrainian
Mariana Romanyshyn, Oleksiy Syvokon and Roman Kyslyi
14:15–14:35 Fine-Tuning and Retrieval Augmented Generation for Question Answering Using Affordable Large Language Models
Tiberiu Boros, Radu Chivereanu, Stefan Dumitrescu and Octavian Purcaru
14:35–14:55 From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation
Artur Kiulian, Anton Polishko, Mykola Khandoga, Oryna Chubych, Jack Connor, Raghav Ravishankar and Adarsh Shirawalmath
14:55–15:15 Spivavtor: An Instruction Tuned Ukrainian Text Editing Model
Aman Saini, Artem Chernodub, Vipul Raheja and Vivek Kulkarni
15:15–15:35 Eval-UA-tion 1.0: Benchmark for Evaluating Ukrainian (Large) Language Models
Serhii Hamotskyi, Anna-Izabella Levbarg and Christian Hänig
15:35–15:55 LiBERTa: Advancing Ukrainian Language Modeling through Pre-training from Scratch
Mykola Haltiuk and Aleksander Smywiński-Pohl
16:00–16:30 Coffee break
16:30–18:00 Afternoon session: LLMs for Ukrainian (Chair: Oleksii Ignatenko)
16:30–16:45 Entity Embellishment Mitigation in LLMs Output with Noisy Synthetic Dataset for Alignment
Svitlana Galeshchuk
16:45–17:00 Language-Specific Pruning for Efficient Reduction of Large Language Models
Maksym Shamrai