Parallel Framework for Efficient k-means Clustering

dc.contributor.authorGoyal, Navneet
dc.contributor.authorGoyal, Poonam
dc.date.accessioned2022-12-27T07:08:38Z
dc.date.available2022-12-27T07:08:38Z
dc.date.issued2015-10
dc.description.abstractHandling and processing of larger volume of data requires efficient data mining algorithms. k-means is a very popular clustering algorithm for data mining, but its performance suffers because of initial seeding problem. The computation time of k-means algorithm is directly proportional to the number of data-points, number of dimensions, and number of iterations, therefore, it is very expensive to process large data-points sequentially. We proposed an efficient parallel framework which includes dimensionality-reduction as well as data-size reduction techniques to improve k-means processing time and initial seeding problem. Our proposed parallel framework leverages the multi-node and multi-core architectures of a typical commodity cluster. We have validated our proposed approaches with real and synthetic datasets in parallel environment setup. The experimental results clearly show the significant improvements in k-means performance.en_US
dc.identifier.urihttps://dl.acm.org/doi/10.1145/2835043.2835060
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8154
dc.language.isoenen_US
dc.publisherACM Digital Libraryen_US
dc.subjectComputer Scienceen_US
dc.subjectClusteringen_US
dc.subjectClustering frameworken_US
dc.subjectParallel frameworken_US
dc.titleParallel Framework for Efficient k-means Clusteringen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: