LREC COLING 2024 Proceedings Home | Workshops | Tutorials | LREC Proceedings | ELRA Website | ICCL Website


The Fifth Workshop on Resources for African Indigenous Languages


Full proceedings volume (PDF) | Workshop Site | Home | Programme | Author index | Bibliography (BibTeX) | Editors

PROGRAM

Saturday, May 25, 2024

09:00–09:05Opening
09:05–09:30Doing Phonetics in the Rift Valley: Sound Systems of Maasai, Iraqw and Hadza
Alain Ghio, Didier Demolin, Michael Karani and Yohann Meynadier
09:30–09:55Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal
Elodie Gauthier, Aminata Ndiaye and Abdoulaye Guissé
09:55–10:20Long-Form Recordings to Study Children’s Language Input and Output in Under-Resourced Contexts
Joseph R. Coffey and Alejandrina Cristia
10:20–10:30Developing Bilingual English-Setswana Datasets for Space Domain
Tebatso G. Moape, Sunday Olusegun Ojo and Oludayo O. Olugbara
10:30–11:00Coffee break
11:00–11:25Compiling a List of Frequently Used Setswana Words for Developing Readability Measures
Johannes Sibeko
11:25–11:50A Qualitative Inquiry into the South African Language Identifier’s Performance on YouTube Comments.
Nkazimlo N. Ngcungca, Johannes Sibeko and Sharon Rudman
11:50–12:15The First Universal Dependency Treebank for Tswana: Tswana-Popapolelo
Tanja Gaustad, Ansu Berg, Rigardt Pretorius and Roald Eiselen
12:15–12:40Adapting Nine Traditional Text Readability Measures into Sesotho
Johannes Sibeko and Menno van Zaanen
12:40–13:05Bootstrapping Syntactic Resources from isiZulu to Siswati
Laurette Marais, Laurette Pretorius and Lionel Clive Posthumus
13:05–14:20Lunch break
14:20–14:45Early Child Language Resources and Corpora Developed in Nine African Languages by the SADiLaR Child Language Development Node
Michelle J. White, Frenette Southwood and Sefela Londiwe Yalala
14:45–15:10Morphological Synthesizer for Ge’ez Language: Addressing Morphological Complexity and Resource Limitations
Gebrearegawi Gebremariam Gidey, Hailay Kidu Teklehaymanot and Gebregewergs Mezgebe Atsbha
15:10–15:35EthioMT: Parallel Corpus for Low-resource Ethiopian Languages
Atnafu Lambebo Tonja, Olga Kolesnikova, Alexander Gelbukh and Jugal Kalita
15:35–16:00Resources for Annotating Hate Speech in Social Media Platforms Used in Ethiopia: A Novel Lexicon and Labelling Scheme
Nuhu Ibrahim, Felicity Mulford, Matt Lawrence and Riza Batista-Navarro
16:00–16:30Coffee break
16:30–16:55Low Resource Question Answering: An Amharic Benchmarking Dataset
Tilahun Abedissa Taffa, Ricardo Usbeck and Yaregal Assabie
16:55–17:05The Annotators Agree to Not Agree on the Fine-grained Annotation of Hate-speech against Women in Algerian Dialect Comments
Imane Guellil, Yousra Houichi, Sara Chennoufi, Mohamed Boubred, Anfal Yousra Boucetta and Faical Azouaou
17:05–17:30Advancing Language Diversity and Inclusion: Towards a Neural Network-based Spell Checker and Correction for Wolof
Thierno Ibrahima Cissé and Fatiha Sadat
17:30–17:55Lateral Inversions, Word Form/Order, Unnamed Grammatical Entities and Ambiguities in the Constituency Parsing and Annotation of the Igala Syntax through the English Language
Mahmud Mohammed Momoh
17:55–18:00Closing