Title |
A Multilingual Database of Idioms |
Author(s) |
Aline Villavicencio (1), Timothy Baldwin (2), Benjamin Waldron (1); (1) University of Cambridge Computer Laboratory; (2) CSLI, Stanford University |
Session |
P10-W |
Abstract |
This paper presents a possible architecture for a multilingual database of idioms. We discuss the challenges that idioms pose for the creation of such a database and propose an encoding that maximises the amount of information that can be stored for different languages. Such a resource provides important information for linguistic, computational linguistic and psycholinguistic use, and allows phenomena to be compared across languages. This comparison can provide the basis for a better understanding of regularities in idioms across languages. |
Keyword(s) |
Idiom, lexical database, multiword expression |
Language(s) | English, Portuguese |
Full Paper |