Summary of the paper

Title Ping-pong Document Clustering using NMF and Linkage-Based Refinement
Authors Hiroyuki Shinnou and Minoru Sasaki
Abstract This paper proposes a ping-pong document clustering method that alternates NMF and linkage-based refinement in order to improve the clustering result of NMF. Using NMF in a ping-pong strategy can be expected to be effective for document clustering. In practice, however, it often worsens performance, because NMF often fails to improve on the clustering result it is given as initial values. Our method handles this problem with a stop condition on the ping-pong process. In the experiment, we compared our method with k-means and NMF on 16 document data sets. Our method significantly improved the clustering result of NMF.
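The alternating control flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not spell out the stop condition or the refinement procedure, so the rule used here (stop as soon as a step fails to improve a quality score, and keep the last improving result) is an assumption. The names `ping_pong`, `centroid_step`, `neighbour_step`, and `score` are hypothetical, and the two steps are toy stand-ins on 1-D points, not NMF or the paper's linkage-based refinement.

```python
from typing import Callable, Dict, List, Sequence

Labels = List[int]


def ping_pong(
    labels: Labels,
    steps: Sequence[Callable[[Labels], Labels]],
    score: Callable[[Labels], float],
    max_rounds: int = 20,
) -> Labels:
    """Alternate the given steps; stop when a step fails to improve the score.

    Assumed stop rule (not taken from the paper): a non-improving result
    is discarded and the best labeling seen so far is returned.
    """
    best, best_score = list(labels), score(labels)
    for _ in range(max_rounds):
        for step in steps:
            candidate = step(list(best))
            candidate_score = score(candidate)
            if candidate_score <= best_score:
                return best
            best, best_score = candidate, candidate_score
    return best


# Toy data: two well-separated groups of 1-D points.
POINTS = [0.0, 0.2, 0.4, 5.0, 5.2, 5.4]


def centroids(labels: Labels) -> Dict[int, float]:
    """Mean of the points in each non-empty cluster."""
    groups: Dict[int, List[float]] = {}
    for p, c in zip(POINTS, labels):
        groups.setdefault(c, []).append(p)
    return {c: sum(ps) / len(ps) for c, ps in groups.items()}


def score(labels: Labels) -> float:
    """Negative within-cluster sum of squares (higher is better)."""
    cents = centroids(labels)
    return -sum((p - cents[c]) ** 2 for p, c in zip(POINTS, labels))


def centroid_step(labels: Labels) -> Labels:
    """Stand-in for the NMF step: reassign each point to its nearest centroid."""
    cents = centroids(labels)
    return [min(cents, key=lambda c: abs(p - cents[c])) for p in POINTS]


def neighbour_step(labels: Labels) -> Labels:
    """Stand-in for the refinement step: copy the label of the nearest other point."""
    out = []
    for i, p in enumerate(POINTS):
        others = (j for j in range(len(POINTS)) if j != i)
        nearest = min(others, key=lambda j: abs(POINTS[j] - p))
        out.append(labels[nearest])
    return out


result = ping_pong([0, 0, 1, 0, 1, 1], [centroid_step, neighbour_step], score)
print(result)
```

On this toy input the first step repairs the noisy initial labeling, the second step leaves it unchanged, and the loop stops there because the score no longer improves.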
Language
Topics Document Classification, Text categorisation, Text mining, Information Extraction, Information Retrieval
Full paper Ping-pong Document Clustering using NMF and Linkage-Based Refinement
Slides -
Bibtex @InProceedings{SHINNOU08.38,
  author = {Hiroyuki Shinnou and Minoru Sasaki},
  title = {Ping-pong Document Clustering using NMF and Linkage-Based Refinement},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {{Nicoletta Calzolari (Conference Chair)} and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }
