LREC 2022 Proceedings Home | Workshops | LREC 2022 WEBSITE | ELRA WEBSITE

LREC 2022

Proceedings of the the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages

ISBN: 979-10-95546-91-7
EAN: 9791095546917

List of Papers


Full proceedings volume (PDF) | Workshop Site | Home | Programme | Author index | Bibliography (BibTeX) | Editors



pdf bib Papers pages
pdf bib Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings
Marcely Zanon Boito, Bolaji Yusuf, Lucas Ondel, Aline Villavicencio and Laurent Besacier
pp. 1‑9
pdf bib An Open Source Web Reader for Under-Resourced Languages
Judy Fong, Þorsteinn Daði Gunnarsson, Sunneva Þorsteinsdóttir, Gunnar Thor Örnólfsson and Jon Gudnason
pp. 10‑15
pdf bib Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning
Phat Do, Matt Coler, Jelske Dijkstra and Esther Klabbers
pp. 16‑22
pdf bib ReadAlong Studio: Practical Zero-Shot Text-Speech Alignment for Indigenous Language Audiobooks
Patrick Littell, Eric Joanis, Aidan Pine, Marc Tessier, David Huggins Daines and Delasie Torkornoo
pp. 23‑32
pdf bib Corpus Creation for Sentiment Analysis in Code-Mixed Tulu Text
Asha Hegde, Mudoor Devadas Anusha, Sharal Coelho, Hosahalli Lakshmaiah Shashirekha and Bharathi Raja Chakravarthi
pp. 33‑40
pdf bib Crowd-sourcing for Less-resourced Languages: Lingua Libre for Polish
Mathilde Hutin and Marc Allassonnière-Tang
pp. 41‑47
pdf bib Tupían Language Ressources: Data, Tools, Analyses
Lorena Martín Rodríguez, Tatiana Merzhevich, Wellington Silva, Tiago Tresoldi, Carolina Aragon and Fabrício F. Gerardi
pp. 48‑58
pdf bib Quality versus Quantity: Building Catalan-English MT Resources
Ona de Gibert Bonet, Ksenia Kharitonova, Blanca Calvo Figueras, Jordi Armengol-Estapé and Maite Melero
pp. 59‑69
pdf bib A Sentiment Corpus for South African Under-Resourced Languages in a Multilingual Context
Ronny Mabokela and Tim Schlippe
pp. 70‑77
pdf bib CUNI Submission to MT4All Shared Task
Ivana Kvapilíková and Ondrej Bojar
pp. 78‑82
pdf bib Resource: Indicators on the Presence of Languages in Internet
Daniel Pimienta
pp. 83‑91
pdf bib Language Technologies for Low Resource Languages: Sociolinguistic and Multilingual Insights
A. Seza Doğruöz and Sunayana Sitaram
pp. 92‑97
pdf bib Sentiment Analysis for Hausa: Classifying Students’ Comments
Ochilbek Rakhmanov and Tim Schlippe
pp. 98‑105
pdf bib Nepali Encoder Transformers: An Analysis of Auto Encoding Transformer Language Models for Nepali Text Classification
Utsav Maskey, Manish Bhatta, Shiva Bhatt, Sanket Dhungel and Bal Krishna Bal
pp. 106‑111
pdf bib CoSwID, a Code Switching Identification Method Suitable for Under-Resourced Languages
Laurent Kevers
pp. 112‑121
pdf bib A Neural Network Approach to Create Minangkabau-Indonesia Bilingual Dictionary
Kartika Resiandi, Yohei Murakami and Arbi Haza Nasution
pp. 122‑128
pdf bib Machine Translation from Standard German to Alemannic Dialects
Louisa Lambrecht, Felix Schneider and Alexander Waibel
pp. 129‑136
pdf bib Question Answering Classification for Amharic Social Media Community Based Questions
Tadesse Destaw, Seid Muhie Yimam, Abinew Ayele and Chris Biemann
pp. 137‑145
pdf bib Automatic Detection of Morphological Processes in the Yorùbá Language
Tunde Adegbola
pp. 146‑154
pdf bib Evaluating Unsupervised Approaches to Morphological Segmentation for Wolastoqey
Diego Bear and Paul Cook
pp. 155‑160
pdf bib Baseline English and Maltese-English Classification Models for Subjectivity Detection, Sentiment Analysis, Emotion Analysis, Sarcasm Detection, and Irony Detection
Keith Cortis and Brian Davis
pp. 161‑168
pdf bib Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments
Katri Hiovain-Asikainen and Sjur Moshagen
pp. 169‑175
pdf bib Investigating the Quality of Static Anchor Embeddings from Transformers for Under-Resourced Languages
Pranaydeep Singh, Orphee De Clercq and Els Lefever
pp. 176‑184
pdf bib Introducing YakuToolkit. Yakut Treebank and Morphological Analyzer.
Tatiana Merzhevich and Fabrício Ferraz Gerardi
pp. 185‑188
pdf bib A Language Model for Spell Checking of Educational Texts in Kurdish (Sorani)
Roshna Abdulrahman and Hossein Hassani
pp. 189‑198
pdf bib SimRelUz: Similarity and Relatedness Scores as a Semantic Evaluation Dataset for Uzbek Language
Ulugbek Salaev, Elmurod Kuriyozov and Carlos Gómez-Rodríguez
pp. 199‑206
pdf bib ENRICH4ALL: A First Luxembourgish BERT Model for a Multilingual Chatbot
Dimitra Anastasiou
pp. 207‑212

Powered by ELDA © 2022 ELDA/ELRA