A Rapid Prototyping Approach for High Performance Density-Based Clustering

Goyal, Navneet; Goyal, Poonam

A Rapid Prototyping Approach for High Performance Density-Based Clustering

Date

2019-10

Authors

Goyal, Navneet

Goyal, Poonam

Publisher

IEEE

Abstract

Big Data has significantly increased the dependence of data analytics community on High Performance Computing (HPC) systems. However, efficiently programming an HPC system is still a tedious task requiring specialized skills in parallelization and the use of platform-specific languages as well as mechanisms. We present a framework for quickly prototyping new/existing density-based clustering algorithms while obtaining low running times and high speedups via automatic parallelization. The user is required only to specify the sequential algorithm in a Domain Specific Language (DSL) for clustering at a very high level of abstraction. The parallelizing compiler for the DSL does the rest to leverage distributed systems - in particular, typical scale-out clusters made of commodity hardware. Our approach is based on recurring, parallelizable programming patterns known as Kernels, which are identified and parallelized by the compiler. We demonstrate the ease of programming and scalable performance for DBSCAN, SNN, and RECOME algorithms. We also establish that the proposed approach can achieve performance comparable to state-of-the-art manually parallelized implementations while requiring minimal programming effort that is several orders of magnitude smaller than those required on other parallel platforms like MPI/Spark.

Keywords

Computer Science, Big Data, Prototyping Approach, Density-Based Clustering, HPC

URI

https://ieeexplore.ieee.org/document/8964223
http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8113

Collections

Department of Computer Science and Information Systems

Full item page

A Rapid Prototyping Approach for High Performance Density-Based Clustering

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By