MapReduce Frame Work: Investigating Suitability for Faster Data Analytics

dc.contributor.authorMavani, Monali
dc.date.accessioned2024-05-08T09:13:51Z
dc.date.available2024-05-08T09:13:51Z
dc.date.issued2013
dc.description.abstractFaster data analytics is the ability to generate the desired report in near real time. Any application that looks at an aggregated view of a stream of data can be considered as an analytic application. The demand to process vast amounts of data to produce various market trends, user behavior, fraud behavior etc. becomes not just useful, but critical to the success of the business. In the past few years, fast data, i.e., high-speed data streams, has also exploded in volume and availability. Prime examples include sensor data streams, real-time stock market data, and social-media feeds such as Twitter, Facebook etc. New models for distributed stream processing have been evolved over a time. This research investigates the suitability of Google’s MapReduce (MR) parallel programming frame work for faster data processing. Originally MapReduce systems are geared towards batch processing. This paper proposes some optimizations to original MR framework for faster distributed data processing applications using distributed shared memory to store intermediate data and use of Remote Direct Access (RDMA) technology for faster data transfer across network.en_US
dc.identifier.urihttps://link.springer.com/chapter/10.1007/978-3-642-36321-4_11
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/14769
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.subjectComputer Scienceen_US
dc.subjectData Analyticsen_US
dc.subjectGoogle’s MapReduce (MR)en_US
dc.titleMapReduce Frame Work: Investigating Suitability for Faster Data Analyticsen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: