Title |
Creative Language Explorations through a high-Expressivity N-grams Query Language |
Authors |
Carlo Strapparava, Lorenzo Gatti, Marco Guerini and Oliviero Stock |
Abstract |
In computation linguistics a combination of syntagmatic and paradigmatic features is often exploited. While the first aspects are typically managed by information present in large n-gram databases, domain and ontological aspects are more properly modeled by lexical ontologies such as WordNet and semantic similarity spaces. This interconnection is even stricter when we are dealing with creative language phenomena, such as metaphors, prototypical properties, puns generation, hyperbolae and other rhetorical phenomena. This paper describes a way to focus on and accomplish some of these tasks by exploiting NgramQuery, a generalized query language on Google N-gram database. The expressiveness of this query language is boosted by plugging semantic similarity acquired both from corpora (e.g. LSA) and from WordNet, also integrating operators for phonetics and sentiment analysis. The paper reports a number of examples of usage in some creative language tasks. |
Topics |
Lexicon, Lexical Database, Language Modelling |
Full paper |
Creative Language Explorations through a high-Expressivity N-grams Query Language |
Bibtex |
@InProceedings{STRAPPARAVA14.486,
author = {Carlo Strapparava and Lorenzo Gatti and Marco Guerini and Oliviero Stock}, title = {Creative Language Explorations through a high-Expressivity N-grams Query Language}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |