Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/8122
Full metadata record
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Goyal, Navneet | - |
| dc.contributor.author | Goyal, Poonam | - |
| dc.contributor.author | Challa, Jagat Sesh | - |
| dc.date.accessioned | 2022-12-26T07:00:41Z | - |
| dc.date.available | 2022-12-26T07:00:41Z | - |
| dc.date.issued | 2016 | - |
| dc.identifier.uri | https://ieeexplore.ieee.org/document/7840586 | - |
| dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8122 | - |
| dc.description.abstract | Parallelizing data mining algorithms has become a necessity as we try to mine ever-increasing volumes of data. Spatial data mining algorithms such as DBSCAN, OPTICS, and SLINK have been parallelized to exploit a cluster infrastructure. The efficiency achieved by existing algorithms can be attributed to spatial locality preservation, using spatial indexing structures such as the k-d-tree, quad-tree, and grid files to distribute data among cluster nodes. However, these indexing structures are static in nature, i.e., they need to scan the entire dataset to determine the partitioning coordinates. This results in a high data distribution cost when the data size is large. In this paper, we propose a dynamic distributed data structure, DD-Rtree, which preserves spatial locality while distributing data across compute nodes in a shared-nothing environment. Moreover, DD-Rtree is dynamic, i.e., it can be constructed incrementally, making it useful for handling big data. We compare the quality of data distribution achieved by DD-Rtree with that of a recent distributed indexing structure, SD-Rtree. We also compare the efficiency of the queries supported by these indexing structures, along with the overall efficiency of the DBSCAN algorithm. Our experimental results show that DD-Rtree achieves better data distribution, thereby improving overall efficiency. | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | IEEE | en_US |
| dc.subject | Computer Science | en_US |
| dc.subject | Data Mining | en_US |
| dc.subject | Data distribution | en_US |
| dc.subject | Spatial locality | en_US |
| dc.subject | Neighborhood queries | en_US |
| dc.subject | k-NN queries | en_US |
| dc.title | DD-Rtree: A dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithms | en_US |
| dc.type | Article | en_US |
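
Illustrative note (not part of the record): the abstract contrasts static partitioning, which must scan the whole dataset to find its split coordinates, with incremental construction. The sketch below is not the DD-Rtree algorithm, which this record does not describe. It only illustrates the contrast under stated assumptions: a k-d-tree-style median split stands in for the static case, and the classic R-tree "least enlargement" heuristic stands in for incremental routing. All names (`static_median_split`, `NodeRegion`, `route`) are hypothetical.

```python
from dataclasses import dataclass
from statistics import median
from typing import List, Tuple

Point = Tuple[float, float]


def static_median_split(points: List[Point], axis: int = 0):
    """Static partitioning: the split coordinate is a median, so the
    whole dataset must be available (scanned) before any point is placed."""
    split = median(p[axis] for p in points)
    left = [p for p in points if p[axis] <= split]
    right = [p for p in points if p[axis] > split]
    return split, left, right


@dataclass
class NodeRegion:
    """Bounding rectangle of the points sent to one (hypothetical) compute node."""
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    @classmethod
    def from_point(cls, p: Point) -> "NodeRegion":
        return cls(p[0], p[1], p[0], p[1])

    def area(self) -> float:
        return (self.xmax - self.xmin) * (self.ymax - self.ymin)

    def enlargement(self, p: Point) -> float:
        """Extra area needed for this rectangle to also cover p."""
        w = max(self.xmax, p[0]) - min(self.xmin, p[0])
        h = max(self.ymax, p[1]) - min(self.ymin, p[1])
        return w * h - self.area()

    def add(self, p: Point) -> None:
        self.xmin, self.xmax = min(self.xmin, p[0]), max(self.xmax, p[0])
        self.ymin, self.ymax = min(self.ymin, p[1]), max(self.ymax, p[1])


def route(regions: List[NodeRegion], p: Point) -> int:
    """Incremental routing: send p to the region that grows least
    (ties broken by smaller area). No global scan is needed."""
    i = min(range(len(regions)),
            key=lambda k: (regions[k].enlargement(p), regions[k].area()))
    regions[i].add(p)
    return i


# Assumed toy data: two nodes seeded with one point each, then a stream.
regions = [NodeRegion.from_point((0.0, 0.0)), NodeRegion.from_point((10.0, 10.0))]
stream = [(1.0, 1.0), (9.0, 9.0), (2.0, 0.0), (8.0, 10.0)]
assignments = [route(regions, p) for p in stream]
print(assignments)
```

In this toy stream, nearby points end up on the same node as they arrive, whereas `static_median_split` cannot return a split until it has seen every point.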
Appears in Collections: Department of Computer Science and Information Systems
