AnyStreamKM: Anytime k-medoids Clustering for Streaming Data

dc.contributor.authorChalla, Jagat Sesh
dc.contributor.authorGoyal, Navneet
dc.contributor.authorGoyal, Poonam
dc.date.accessioned2024-10-24T04:33:55Z
dc.date.available2024-10-24T04:33:55Z
dc.date.issued2022
dc.description.abstractStream Clustering algorithms have gained a lot of importance in the recent past due to rapid rising utilities of IoT systems and applications. Anytime algorithms and frameworks play a key role in handling streams that have data arriving/generating at variable rates. They are capable of handling both slow and fast stream speeds, at the same time generate the result with highest possible accuracy. In this paper, we present AnyStreamKM, which is a framework for anytime k-medoids clustering of data streams. It uses a proposed hierarchical data indexing structure known as AnyKMTree that stores the incoming data from the stream in the form of hierarchy of micro-clusters. AnyKMTree is an adaptation of R-tree with its splitting strategy inspired from the design principles of k-medoids clustering. AnyKMTree not only supports anytime features but is also capable of filtering out noise and outliers. Our experimental analysis establishes that AnyKMTree produces micro-clusters that are more compact and purer than the state-of-the-art methods. Also, when offline k-medoids clustering such as PAM (Partitioning Around Medoids) is applied on the micro-clusters produced by AnyKMTree, the resultant clustering has been found to be of higher quality than the state-of-the-art methods.en_US
dc.identifier.urihttps://www.computer.org/csdl/proceedings-article/big-data/2022/10020901/1KfRe6nyKvC
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16157
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectComputer Scienceen_US
dc.subjectStream Clusteringen_US
dc.subjectAlgorithmsen_US
dc.subjectIoT systemsen_US
dc.titleAnyStreamKM: Anytime k-medoids Clustering for Streaming Dataen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: