Survey Paper on Data Lake
Journal: International Journal of Science and Research (IJSR) (Vol.5, No. 7)Publication Date: 2016-07-05
Authors : Surabhi D Hegde; Ravinarayana B;
Page : 1718-1720
Keywords : Big Data; Big Data analytics; Data Warehouse; Data Lake;
Abstract
One of the key driving forces behind the problem of Big Data is the rapid growth of unstructured data, which constitutes huge percentage of overall data [1]. The Big Data is not only about massive data capture and storage, but intelligently combining the past data that already exists inside an organization with the unstructured data. For an organization to be really successful to meet the latent benefits of Big Data, it needs the perfect technology in place to acquire the data, store it, combine it and enrich huge volumes of unstructured data in raw format. It should also have the ability to perform analytics, real-time, near-real-time analysis, batch processing on these huge volumes of data. To address these businesses needs efficiently, the concept of Data Lake is proposed. It is one of the empowering data capture and processing capability for Big Data analysis. Data Lake makes it possible to store all types of data irrespective of their schema and the formats. Data Lake is a massive, easily accessible, flexible enough and scalable large data repository.
Other Latest Articles
Last modified: 2021-07-01 14:40:32