Title |
Ping-pong Document Clustering using NMF and Linkage-Based Refinement |
Authors |
Hiroyuki Shinnou and Minoru Sasaki |
Abstract |
This paper proposes a ping-pong document clustering method that alternates between NMF and linkage-based refinement in order to improve the clustering result of NMF. Using NMF in a ping-pong strategy can be expected to be effective for document clustering. However, NMF in the ping-pong strategy often worsens performance, because NMF often fails to improve on the clustering result it is given as initial values. Our method handles this problem through the stop condition of the ping-pong process. In the experiment, we compared our method with k-means and NMF on 16 document data sets. Our method improved the clustering result of NMF significantly. |
Language |
|
Topics |
Document Classification, Text categorisation, Text mining, Information Extraction, Information Retrieval |
Full paper |
Ping-pong Document Clustering using NMF and Linkage-Based Refinement |
Slides |
- |
Bibtex |
@InProceedings{SHINNOU08.38,
author = {Hiroyuki Shinnou and Minoru Sasaki},
title = {Ping-pong Document Clustering using NMF and Linkage-Based Refinement},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = may,
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |