EFFICIENT CLASSIFICATION METHOD FOR LARGE DATASET BY ASSIGNING THE KEY VALUE IN CLUSTERING?
Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.3, No. 1)Publication Date: 2014-01-30
Authors : B Rosiline Jeetha;
Page : 319-324
Keywords : Clustering; Canberra Distance; Classification; Nearest Neighbor Classification;
Abstract
Clustering analysis is used to explore the classification for large dataset and Canberra distance is generalized so that it can process the data with categorical attributes. Based on the generalized Canberra distance definition, an instance of constraint-based clustering is introduced. Meanwhile, the nearest neighbor classification is improved. Class-labeled clusters are regarded as classifying models used for classifying data. The proposed classification method can discover the data of big difference from the instances in training data, which may mean a new data type. The generalize Canberra distance for continuous numerical attributes data to mixed attributes data, and use clustering analysis technique to squash existing instances, improve the classical nearest neighbor classification method.
Other Latest Articles
- Fuzzy Mining Approach for Gene Clustering and Gene Function Prediction
- The board of directors and the financial performance of the Tunisian listed companies
- Bangladesh Garment Market Diversification: China, the Major Destination for Bangladesh Apparel
- Personality Dynamics on Perseverance Attitude of Individuals in Job Influencing Variables Assessment
- Leadership in Management: A Universal Leadership Model for the 21st Century
Last modified: 2014-01-25 02:37:33