Wordnet Based Document Clustering
Journal: International Journal of Science and Research (IJSR) (Vol.3, No. 5)Publication Date: 2014-05-15
Authors : Madhavi Katamaneni; Ashok Cheerala;
Page : 1610-1618
Keywords : Document clustering; Ontology; BOW; POS Tagging; Stemming; Labeling;
- Thermal Characterization and Mineral Composition of the Egyptian Alabaster “Carbonate Rocks”
- DEPENDENCE OF LOW VELOCITY ZONES PARAMETERS ON MINERAL COMPOSITION OF UKRAINIAN SHIELD ROCKS IN DIFFERENT РТ-CONDITIONS OF EXPERIMENT
- MATHEMATICAL MODELLING OF INFLUENCE OF THE MINERAL COMPOSITION AND POROSITY ON ELASTIC ANISOTROPIC PARAMETERS OF COMPLEX SEDIMENTARY ROCKS OF VOLYN-PODOLIA AREA
- Karst cavity detection in carbonate rocks by integration of high resolution geophysical methods.
- A study on the chemical and mineral composition of the protein-mineral paste from poultry and cattle bone raw materials
Abstract
Document clustering is considered as an important tool in the fast developing information explosion era. It is the process of grouping text documents into category groups and has found applications in various domains like information retrieval, web information systems. Ontology based computing is emerging as a natural evolution of existing technologies to design with the information onslaught. In current dissertation work, background knowledge derived from WordNet as ontology is applied during preprocessing of documents for document clustering. Document vectors constructed from WordNet synsets is used as input for clustering. Comparative analysis is done between clustering using k-means and clustering using bi-secting k-means. A document Categorization tool is developed which summarizes the hierarchy of concepts obtained from WordNet during clustering phase. GUI tool contains the association between WordNet concepts and documents belonging to the concept.
Other Latest Articles
Last modified: 2014-07-03 17:23:42