A Parallel Framework for Grid-Based Bottom-Up Subspace Clustering
| dc.contributor.author | Goyal, Navneet | |
| dc.contributor.author | Goyal, Poonam | |
| dc.date.accessioned | 2022-12-26T07:07:35Z | |
| dc.date.available | 2022-12-26T07:07:35Z | |
| dc.date.issued | 2016 | |
| dc.description.abstract | Clustering is a popular data mining and machine learning technique which discovers interesting patterns from unlabeled data by grouping similar objects together. Clustering high-dimensional data is a challenging task as points in high dimensional space are nearly equidistant from each other, rendering commonly used similarity measures ineffective. Subspace clustering has emerged as a possible solution to the problem of clustering high-dimensional data. In subspace clustering, we try to find clusters in different subspaces within a dataset. Many subspace clustering algorithms have been proposed in the last two decades to find clusters in multiple overlapping subspaces of high-dimensional data. Subspace clustering algorithms iteratively find the best subset of dimensions for a cluster from 2d-1 possible combinations in d-dimensional data. Subspace clustering is extremely compute intensive because of exhaustive search of subspaces, especially in the bottom-up subspace clustering algorithms. To address this issue, an efficient parallel framework for grid-based bottom-up subspace clustering algorithms is developed, considering popular algorithms belonging to this category. The framework is implemented for shared memory, distributed memory, and hybrid systems and is tested for three grid-based bottom-up subspace clustering algorithms: CLIQUE, MAFIA, and ENCLUS. All parallel implementations exhibit impressive speedup and scalability on real datasets. | en_US |
| dc.identifier.uri | https://ieeexplore.ieee.org/document/7796919 | |
| dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8124 | |
| dc.language.iso | en | en_US |
| dc.publisher | IEEE | en_US |
| dc.subject | Computer Science | en_US |
| dc.subject | Subspace clustering | en_US |
| dc.subject | Bottom-up | en_US |
| dc.subject | Clique | en_US |
| dc.subject | Parallel framework | en_US |
| dc.title | A Parallel Framework for Grid-Based Bottom-Up Subspace Clustering | en_US |
| dc.type | Article | en_US |
Files
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: