ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Topical Clustering Techniques of Twitter Documents Using Korean Wikipedia

Journal: The Journal of the Institute of Internet, Broadcasting and Communication (Vol.14, No. 5)

Publication Date:

Authors : ;

Page : 189-196

Keywords : SNS; Twitter; Clustering; Wikipedia; Feature Vector;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Recently, the need for retrieving documents is growing in SNS environment such as twitter. For supporting the twitter search, a clustering technique classifying the massively retrieved documents in terms of topics is required. However, due to the nature of twitter, there is a limit in applying previous simple techniques to clustering the twitter documents. To overcome such problem, we propose in this paper a new clustering technique suitable to twitter environment. In proposed method, we augment new terms to feature vectors representing the twitter documents, and recalculate the weights of features using Korean Wikipedia. In addition, we performed the experiments with Korean twitter documents, and proved the usability of proposed method through performance comparison with the previous techniques.

Last modified: 2016-01-19 13:47:59