Title |
BUCEADOR, a multi-language search engine for digital libraries |
Authors |
Jordi Adell, Antonio Bonafonte, Antonio Cardenal, Marta R. Costa-Jussà, José A. R. Fonollosa, Asunción Moreno, Eva Navas and Eduardo R. Banga |
Abstract |
This paper presents a web-based multimedia search engine built within the Buceador (www.buceador.org) research project. A proof-of-concept tool has been implemented that retrieves information from a digital library of multimedia documents in the four official languages of Spain (Spanish, Basque, Catalan and Galician). Retrieved documents are presented in the user's language (one of those four or English) after translation and dubbing. The paper describes the tool's functionality, its architecture and the digital library, and provides some information about the technology involved in the fields of automatic speech recognition, statistical machine translation, text-to-speech synthesis and information retrieval. Each technology has been adapted to the purposes of the tool and to interact with the other technologies involved. |
Topics |
Tools, systems, applications, Multimedia Document Processing, Multilinguality |
Full paper |
BUCEADOR, a multi-language search engine for digital libraries |
Bibtex |
@InProceedings{ADELL12.828,
  author = {Jordi Adell and Antonio Bonafonte and Antonio Cardenal and Marta R. Costa-Jussà and José A. R. Fonollosa and Asunción Moreno and Eva Navas and Eduardo R. Banga},
  title = {BUCEADOR, a multi-language search engine for digital libraries},
  booktitle = {Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
} |