DSpace logo

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16160
Title: Twitter Data Modelling and Provenance Support for Key-Value Pair Databases
Authors: Goyal, Navneet
Keywords: Computer Science
Big Data Analytics
Zero-Information Loss Database (ZILD)
Twitter Streaming
Issue Date: Feb-2021
Publisher: Springer
Abstract: In Big Data environments, reliability of data plays an important role to determine trustworthiness of the outcomes of an analysis. Big data provenance ensures the reliability of data by providing details about the origin and historical paths of data. In recent years, the preponderance of big data and its applications are increasingly using Apache Cassandra due to its high availability and linear scalability. In this paper, we present a data provenance framework for Key-Value Pair Databases using the concept of Zero-Information Loss Database (ZILD). A large volume of real-time social media data is fetched from the Twitter’s network through live streaming with the help of Twitter Streaming APIs, and then modelled in Apache Cassandra based on a Query-Driven approach. This framework provides efficient provenance capturing support for select, aggregate, update, and historical queries. We evaluate the performance of proposed framework in terms of provenance capturing and querying capabilities using appropriate query sets.
URI: https://link.springer.com/chapter/10.1007/978-3-030-69377-0_8
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16160
Appears in Collections:Department of Computer Science and Information Systems

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.