A Clustering Algorithm towards Microblogs based on Vector Space Model
Conference: 2012 National Conference on Information Technology and Computer Science
Conference date: 2012
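Subject classification: 12 [Management] 1201 [Management: Management Science and Engineering (degrees in management or engineering may be conferred)]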
Funding: Supported by the Natural Science Foundation of Jiangsu Province (BK2010131) and the Foundation of PLA University of Science and Technology (20110208)
Keywords: microblogs; clustering; k-means; vector space model
Abstract: Weibos have become wildly popular in China in recent years, and state media reports that there are more than 300 million registered *** Real Name Policy [1] requires all users of Chinese weibo websites to register under the name that matches their government-issued ID *** the rapid development of the web, research on consensus encounters new problems and *** a practical method for large-scale text clustering that analyzes the features of instant-message text content and finds or tracks social hot *** the text, for which very common clustering algorithms are not suitable. A new method named MVSM is proposed, which not only merges microblog conversations but also enriches the vector with words that are not included in the microblog text but are closely *** vector space that MVSM builds for the conversations, k-means *** on public datasets show that MVSM performs better than traditional k-means and bisecting k-means.
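
The abstract compares MVSM against k-means over a vector space model, so a minimal sketch of that baseline may help fix ideas. It is not the MVSM method: the conversation merging and vector enrichment steps are not specified in the abstract and are not reproduced here. The function names (`tfidf_vectors`, `kmeans`), the smoothed TF-IDF weighting, the use of cosine similarity (a spherical k-means variant), the toy posts, and the fixed seeds are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn tokenized documents into unit-length sparse TF-IDF vectors (dicts)."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an assumed weighting, not taken from the paper).
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def dot(a, b):
    """Dot product of two sparse vectors; equals cosine similarity for unit vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def mean_vector(members):
    """Sum the member vectors and rescale the result to unit length."""
    total = Counter()
    for vec in members:
        for t, w in vec.items():
            total[t] += w
    norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
    return {t: w / norm for t, w in total.items()}


def kmeans(vectors, k, seeds, iters=20):
    """k-means with cosine similarity; `seeds` are indices of the initial centroids."""
    centroids = [dict(vectors[i]) for i in seeds]
    labels = [None] * len(vectors)
    for _ in range(iters):
        # Assignment step: each vector goes to its most similar centroid.
        new_labels = [max(range(k), key=lambda c: dot(v, centroids[c])) for v in vectors]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: recompute each non-empty cluster's centroid.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = mean_vector(members)
    return labels


# Hypothetical toy posts: two about the real-name policy, two about clustering.
docs = [
    "weibo users register real name".split(),
    "real name policy weibo users".split(),
    "kmeans clustering text vectors".split(),
    "text clustering with kmeans".split(),
]
labels = kmeans(tfidf_vectors(docs), k=2, seeds=[0, 2])
print(labels)
```

Fixed seed indices keep the sketch deterministic; a practical run would more likely use random or k-means++ style initialization with several restarts. Short posts share few terms, which is the sparsity problem the abstract says motivates enriching the vectors.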