ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Survey Paper on Data Lake

Journal: International Journal of Science and Research (IJSR) (Vol.5, No. 7)

Publication Date:

Authors : ; ;

Page : 1718-1720

Keywords : Big Data; Big Data analytics; Data Warehouse; Data Lake;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

One of the key driving forces behind the problem of Big Data is the rapid growth of unstructured data, which constitutes huge percentage of overall data [1]. The Big Data is not only about massive data capture and storage, but intelligently combining the past data that already exists inside an organization with the unstructured data. For an organization to be really successful to meet the latent benefits of Big Data, it needs the perfect technology in place to acquire the data, store it, combine it and enrich huge volumes of unstructured data in raw format. It should also have the ability to perform analytics, real-time, near-real-time analysis, batch processing on these huge volumes of data. To address these businesses needs efficiently, the concept of Data Lake is proposed. It is one of the empowering data capture and processing capability for Big Data analysis. Data Lake makes it possible to store all types of data irrespective of their schema and the formats. Data Lake is a massive, easily accessible, flexible enough and scalable large data repository.

Last modified: 2021-07-01 14:40:32