Survey on Different Techniques in SQL to Prepare Dataset for Data Mining
Journal: International Journal of Emerging Trends & Technology in Computer Science (IJETTCS) (Vol.4, No. 2)Publication Date: 2015-05-20
Authors : Shaila Eksambekar; Suhasini Itkar;
Page : 158-162
Keywords : Data Preprocessing; Dataset; Aggregation; SQL; relational DBMS; SPJ; CASE; PIVOT; K-means;
Abstract
One of the basic tasks in data mining activity is data preprocessing and preparing dataset. Efficient data analysis can be made easier with datasets having columns in horizontal tabular layout. This paper presents an overview of data preprocessing and dataset preparation techniques using SQL. To prepare dataset if we use SQL aggregations they return one column per aggregated group. This is the limitation of SQL aggregation. In this paper we have proposed need of effective and optimized usage of SQL to build dataset using horizontal aggregations. Also if the result of horizontal aggregation i.e. horizontal layout is integrated with K-means clustering algorithm we can get proper clusters.
Other Latest Articles
- Cut Detection and Re-routing in wireless sensor networks
- Hindi-English Translation Memory System
- Data Cleaning Using Clustering Based Data Mining Technique
- FAC-MACS: Fortified Access Control for Multi-Authority Cloud Storage Using CPABE
- Exploring Different Aspects of Social Network Analysis Using Web Mining Techniques
Last modified: 2015-05-15 20:33:28