Summary of the paper

Title Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Authors Horacio Rodríquez, David Farwell, Javi Ferreres, Manuel Bertran, Musa Alkhalifa and M. Antonia Martí
Abstract This presentation focuses on the semi-automatic extension of Arabic WordNet (AWN) using lexical and morphological rules and applying Bayesian inference. We briefly report on the current status of AWN and propose a way of extending its coverage by taking advantage of a limited set of highly productive Arabic morphological rules for deriving a range of semantically related word forms from verb entries. The application of this set of rules, combined with the use of bilingual Arabic-English resources and Princeton’s WordNet, allows the generation of a graph representing the semantic neighbourhood of the original word. In previous work, a set of associations between the hypothesized Arabic words and English synsets was proposed on the basis of this graph. Here, a novel approach to extending AWN is presented whereby a Bayesian Network is automatically built from the graph and then the net is used as an inferencing mechanism for scoring the set of candidate associations. Both on its own and in combination with the previous technique, this new approach has led to improved results.
Language Single language
Topics Lexicon, lexical database, Acquisition, Machine Learning, Other
Full paper Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Slides Arabic WordNet: Semi-automatic Extensions using Bayesian Inference
Bibtex @InProceedings{RODRQUEZ08.434,
  author = {Horacio Rodríquez, David Farwell, Javi Ferreres, Manuel Bertran, Musa Alkhalifa and M. Antonia Martí},
  title = {Arabic WordNet: Semi-automatic Extensions using Bayesian Inference},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }

Powered by ELDA © 2008 ELDA/ELRA