Twitter Data Modelling and Provenance Support for Key-Value Pair Databases

Goyal, Navneet

Twitter Data Modelling and Provenance Support for Key-Value Pair Databases

dc.contributor.author	Goyal, Navneet
dc.date.accessioned	2024-10-24T04:55:29Z
dc.date.available	2024-10-24T04:55:29Z
dc.date.issued	2021-02
dc.description.abstract	In Big Data environments, reliability of data plays an important role to determine trustworthiness of the outcomes of an analysis. Big data provenance ensures the reliability of data by providing details about the origin and historical paths of data. In recent years, the preponderance of big data and its applications are increasingly using Apache Cassandra due to its high availability and linear scalability. In this paper, we present a data provenance framework for Key-Value Pair Databases using the concept of Zero-Information Loss Database (ZILD). A large volume of real-time social media data is fetched from the Twitter’s network through live streaming with the help of Twitter Streaming APIs, and then modelled in Apache Cassandra based on a Query-Driven approach. This framework provides efficient provenance capturing support for select, aggregate, update, and historical queries. We evaluate the performance of proposed framework in terms of provenance capturing and querying capabilities using appropriate query sets.	en_US
dc.identifier.uri	https://link.springer.com/chapter/10.1007/978-3-030-69377-0_8
dc.identifier.uri	http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16160
dc.language.iso	en	en_US
dc.publisher	Springer	en_US
dc.subject	Computer Science	en_US
dc.subject	Big Data Analytics	en_US
dc.subject	Zero-Information Loss Database (ZILD)	en_US
dc.subject	Twitter Streaming	en_US
dc.title	Twitter Data Modelling and Provenance Support for Key-Value Pair Databases	en_US
dc.type	Article	en_US

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Department of Computer Science and Information Systems