LREC COLING 2024 Proceedings


The 6th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT) with Shared Tasks on Arabic LLMs Hallucination and Dialect to MSA Machine Translation


PROGRAM

Saturday 25 May 2024

 Session 1: Main Workshop
9:00–9:10 Workshop Opening
9:10–9:50 Keynote Talk: Towards Arab-Centric Large Language Models
Muhammad Abdul-Mageed
9:50–10:10 AraTar: A Corpus to Support the Fine-grained Detection of Hate Speech Targets in the Arabic Language
Seham Alghamdi, Youcef Benkhedda, Basma Alharbi and Riza Batista-Navarro
10:10–10:30 CLEANANERCorp: Identifying and Correcting Incorrect Labels in the ANERcorp Dataset
Mashael AlDuwais, Hend Al-Khalifa and Abdulmalik AlSalman
 Session 2: Main Workshop (Cont.)
11:00–11:20 Munazarat 1.0: A Corpus of Arabic Competitive Debates
Mohammad M. Khader, AbdulGabbar Al-Sharafi, Mohamad Hamza Al-Sioufy, Wajdi Zaghouani and Ali Al-Zawqari
11:20–11:40 Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition
Saied Alshahrani, Hesham Haroon Mohammed, Ali Elfilali, Mariama Njie and Jeanna Matthews
11:40–12:00 A Novel Approach for Root Selection in the Dependency Parsing
Sharefah Ahmed Al-Ghamdi, Hend Al-Khalifa and Abdulmalik AlSalman
12:00–12:20 AraMed: Arabic Medical Question Answering using Pretrained Transformer Language Models
Ashwag Alasmari, Sarah Alhumoud and Waad Alshammari
12:20–12:40 The Multilingual Corpus of World’s Constitutions (MCWC)
Mo El-Haj and Saad Ezzini
12:40–13:00 TafsirExtractor: Text Preprocessing Pipeline preparing Classical Arabic Literature for Machine Learning Applications
Carl Kruse and Sajawel Ahmed
 Session 3: Main Workshop (Cont.)
14:00–14:20 Advancing the Arabic WordNet: Elevating Content Quality
Abed Alhakim Freihat, Hadi Mahmoud Khalilia, Gábor Bella and Fausto Giunchiglia
14:20–14:40 Arabic Speech Recognition of zero-resourced Languages: A case of Shehri (Jibbali) Language
Norah A. Alrashoudi, Omar Said Alshahri and Hend Al-Khalifa
 Session 4: Shared Tasks
14:40–14:55 OSACT6 Dialect to MSA Translation Shared Task Overview
Ashraf Hatim Elneima, AhmedElmogtaba Abdelmoniem Ali Abdelaziz and Kareem Darwish
14:55–15:10 OSACT 2024 Task 2: Arabic Dialect to MSA Translation
Hanin Atwany, Nour Rabih, Ibrahim Mohammed, Abdul Waheed and Bhiksha Raj
15:10–15:25 ASOS at OSACT6 Shared Task: Investigation of Data Augmentation in Arabic Dialect-MSA Translation
Omer Nacar, Abdullah Alharbi, Serry Sibaee, Samar Ahmed, Lahouari Ghouti and Anis Koubaa
15:25–15:50 LLM-based MT Data Creation: Dialectal to MSA Translation Shared Task
AhmedElmogtaba Abdelmoniem Ali Abdelaziz, Ashraf Hatim Elneima and Kareem Darwish
15:50–16:00 Sirius_Translators at OSACT6 2024 Shared Task: Fin-tuning Ara-T5 Models for Translating Arabic Dialectal Text to Modern Standard Arabic
Salwa Saad Alahmari
 Session 5: Shared Tasks (Cont.)
16:30–16:45 AraT5-MSAizer: Translating Dialectal Arabic to MSA
Murhaf Fares
16:45–17:00 ASOS at Arabic LLMs Hallucinations 2024: Can LLMs detect their Hallucinations :)
Serry Taiseer Sibaee, Abdullah I. Alharbi, Samar Ahmed, Omer Nacar, Lahouari Ghouti and Anis Koubaa
17:00–17:05 Workshop Closing