A Novel Center Point Initialization Technique for K-means Clustering Algorithm

Dauda Usman, Ismail Bin Mohamad

Abstract


Clustering is a major data analysis tool utilized in numerous domains. The basic K-means method has been widely discussed and applied in many applications. But unfortunately failed to offer good clustering result due to the initial center points are chosen randomly. In this article, we present a new method of centre points initialization and we prove that the distance of the new method follows a Chi-square distribution. The new method overcomes the drawbacks of the basic K-means. Experimental analysis shows that the new method performs well on infectious diseases dataset when compare with the basic K-means clustering method and a histogram measures the quality of the new method.


Full Text: PDF DOI: 10.5539/mas.v7n9p10

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.

Modern Applied Science   ISSN 1913-1844 (Print)   ISSN 1913-1852 (Online)

Copyright © Canadian Center of Science and Education

To make sure that you can receive messages from us, please add the 'ccsenet.org' domain to your e-mail 'safe list'. If you do not receive e-mail in your 'inbox', check your 'bulk mail' or 'junk mail' folders.