Lightning Fast Distributed Machine Learning Framework
Journal: International Journal of Science and Research (IJSR) (Vol. 3, No. 11)
Publication Date: 2014-11-05
Authors : Madhu Sudhan H V;
Page : 2913-2915
Keywords : Big Data; Distributed Machine Learning; Apache Spark; Python;
Abstract
According to IBM, Big Data can be characterized by four Vs: Volume, Velocity, Variety, and Veracity. Many companies are incorporating Big Data into their business models to derive insights from unstructured data. Big Data is analyzed using statistical methods and machine learning. The limiting factor with traditional technologies is their inability to train algorithms on huge amounts of data within a practical time. This problem can be addressed with in-memory, distributed machine learning techniques that operate on distributed data sets and spread the learning workload across multiple workstations. A distributed machine learning framework is developed with Spark, Hadoop, and Python to scale machine learning algorithms and reduce the computational load.
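The idea of spreading learning across distributed data sets can be illustrated with a minimal sketch. This is not the author's framework and it does not use Spark or Hadoop; it is a plain-Python stand-in, assuming a simple linear model trained by gradient descent. The data is split into chunks (standing in for partitions held on different workstations), each chunk computes a partial gradient (the map step), and the partial results are summed before the model is updated (the reduce step). All function names here (`partition`, `partial_gradient`, `add_gradients`, `train`) are hypothetical.

```python
from functools import reduce


def partition(data, n_parts):
    """Split a list into n_parts chunks (a stand-in for a distributed data set)."""
    return [data[i::n_parts] for i in range(n_parts)]


def partial_gradient(chunk, w, b):
    """Map step: summed squared-error gradient over one chunk for y = w*x + b."""
    gw = gb = 0.0
    for x, y in chunk:
        err = (w * x + b) - y
        gw += err * x
        gb += err
    return gw, gb, len(chunk)


def add_gradients(a, b):
    """Reduce step: combine two partial results (gradient sums and counts)."""
    return a[0] + b[0], a[1] + b[1], a[2] + b[2]


def train(data, n_parts=4, lr=0.1, epochs=1000):
    """Gradient descent where each epoch is one map/reduce pass over the chunks."""
    chunks = partition(data, n_parts)
    w = b = 0.0
    for _ in range(epochs):
        # In a real cluster, each chunk's gradient would be computed on its own worker.
        partials = [partial_gradient(c, w, b) for c in chunks]
        gw, gb, n = reduce(add_gradients, partials)
        w -= lr * gw / n
        b -= lr * gb / n
    return w, b


# Synthetic, noise-free data drawn from y = 2x + 1 (illustrative only).
data = [(x / 10.0, 2.0 * (x / 10.0) + 1.0) for x in range(20)]
w, b = train(data)
print(w, b)
```

Because the partial gradients are summed before each update, the result does not depend on how many chunks the data is split into; only the amount of work per worker changes. In a framework such as Spark, the chunks would typically be held in memory across the cluster rather than in a local list.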