Topical Clustering Techniques of Twitter Documents Using Korean Wikipedia
Journal: The Journal of the Institute of Internet, Broadcasting and Communication (Vol.14, No. 5)Publication Date: 2014-10-31
Authors : Jae-Young Chang;
Page : 189-196
Keywords : SNS; Twitter; Clustering; Wikipedia; Feature Vector;
Abstract
Recently, the need for retrieving documents is growing in SNS environment such as twitter. For supporting the twitter search, a clustering technique classifying the massively retrieved documents in terms of topics is required. However, due to the nature of twitter, there is a limit in applying previous simple techniques to clustering the twitter documents. To overcome such problem, we propose in this paper a new clustering technique suitable to twitter environment. In proposed method, we augment new terms to feature vectors representing the twitter documents, and recalculate the weights of features using Korean Wikipedia. In addition, we performed the experiments with Korean twitter documents, and proved the usability of proposed method through performance comparison with the previous techniques.
Other Latest Articles
- A Reliable Study on the Accident Reconstruction using Accident Data Recorder
- Design and Development of Simulation Framework for Processing Window Query in Wireless Spatial Data Broadcasting Environment
- A Study on Smart Energy Management System using Information and Communication Technology
- Development of a Horse Robot for Indoor Leisure Sports
- Distributed Power Saving Control System Using Mobile Agent Based Active Rules
Last modified: 2016-01-19 13:47:59