ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Efficient Streaming Data Analysis using Big Data tools

Journal: International Journal of Computer Techniques (Vol.4, No. 1)

Publication Date:

Authors : ;

Page : 15-22

Keywords : Keywords — BigData; Unstructured; Hadoop; Flume; Spark Streaming; Twitter; Apache Cassandra; Zeppelin; Analysis; JSON.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Big data is a popular term used to describe the large volume of data which includes structured, semi-structured and unstructured data. Now-a-days, unstructured data is growing in an explosive speed with the development of Internet and social networks like Twitter,Facebook & Yahoo etc., In order to process such colossal of data a software is required that does this efficiently and this is where Hadoop steps in. Hadoop has become one of the most used frameworks when dealing with big data. It is used to analyze and process big data. In this paper, Apache Flume is configured and integrated with spark streaming for streaming the data from twitter application. The streamed data is stored into Apache Cassandra. After retrieving the data, the data is going to be analyzed by using the concept of Apache Zeppelin. The result will be displayed on Dashboard and the dashboard result is also going to be analyzed and validating using JSON.

Last modified: 2017-12-12 12:45:58