Time series and semantics-based chinese microblog topic detection and tracking method

As a widely used tool in social networks,microblog is definitely with short document,quick broadcasting and topic changeable,which results in big challenging for social topic detection and tracking.A new systematic framework for micro-blog topic detection and tracking was proposed based on the micro...

Full description

Saved in:
Bibliographic Details
Main Authors: Tie-ming CHEN, Xiao-hao WANG, Wei-wei PANG, Jie JIANG
Format: Article
Language:English
Published: POSTS&TELECOM PRESS Co., LTD 2016-05-01
Series:网络与信息安全学报
Subjects:
Online Access:http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00048
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841530373052301312
author Tie-ming CHEN
Xiao-hao WANG
Wei-wei PANG
Jie JIANG
author_facet Tie-ming CHEN
Xiao-hao WANG
Wei-wei PANG
Jie JIANG
author_sort Tie-ming CHEN
collection DOAJ
description As a widely used tool in social networks,microblog is definitely with short document,quick broadcasting and topic changeable,which results in big challenging for social topic detection and tracking.A new systematic framework for micro-blog topic detection and tracking was proposed based on the microblog clustering using temporal trend and semantic similarity.Firstly,a feature words selection method for hot topics was presented by defining the temporal frequent words set.Secondly,an initially clustering was conducted depending on the selected temporal frequent words set.As far as the overlaps between initial clusters concerned,an effective overlap elimination algorithm was proposed,by introducing the extended short document semantic membership,to separate any possible overlapped initial clusters.Finally,an aggregated topic clustering method was employed using the cluster semantic similarity matrix.The experiments were at last done on some real-world dataset from Sina microblog.It show that the method for chinese microblog topic detection and tracking can obtain excellent performance and results.
format Article
id doaj-art-5deb6fac038c4f1d86adf49631078466
institution Kabale University
issn 2096-109X
language English
publishDate 2016-05-01
publisher POSTS&TELECOM PRESS Co., LTD
record_format Article
series 网络与信息安全学报
spelling doaj-art-5deb6fac038c4f1d86adf496310784662025-01-15T03:04:33ZengPOSTS&TELECOM PRESS Co., LTD网络与信息安全学报2096-109X2016-05-012212959545360Time series and semantics-based chinese microblog topic detection and tracking methodTie-ming CHENXiao-hao WANGWei-wei PANGJie JIANGAs a widely used tool in social networks,microblog is definitely with short document,quick broadcasting and topic changeable,which results in big challenging for social topic detection and tracking.A new systematic framework for micro-blog topic detection and tracking was proposed based on the microblog clustering using temporal trend and semantic similarity.Firstly,a feature words selection method for hot topics was presented by defining the temporal frequent words set.Secondly,an initially clustering was conducted depending on the selected temporal frequent words set.As far as the overlaps between initial clusters concerned,an effective overlap elimination algorithm was proposed,by introducing the extended short document semantic membership,to separate any possible overlapped initial clusters.Finally,an aggregated topic clustering method was employed using the cluster semantic similarity matrix.The experiments were at last done on some real-world dataset from Sina microblog.It show that the method for chinese microblog topic detection and tracking can obtain excellent performance and results.http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00048microblog textfrequent wordsfeature selectionclustering,topic detectiontime seriessemantics
spellingShingle Tie-ming CHEN
Xiao-hao WANG
Wei-wei PANG
Jie JIANG
Time series and semantics-based chinese microblog topic detection and tracking method
网络与信息安全学报
microblog text
frequent words
feature selection
clustering,topic detection
time series
semantics
title Time series and semantics-based chinese microblog topic detection and tracking method
title_full Time series and semantics-based chinese microblog topic detection and tracking method
title_fullStr Time series and semantics-based chinese microblog topic detection and tracking method
title_full_unstemmed Time series and semantics-based chinese microblog topic detection and tracking method
title_short Time series and semantics-based chinese microblog topic detection and tracking method
title_sort time series and semantics based chinese microblog topic detection and tracking method
topic microblog text
frequent words
feature selection
clustering,topic detection
time series
semantics
url http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00048
work_keys_str_mv AT tiemingchen timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod
AT xiaohaowang timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod
AT weiweipang timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod
AT jiejiang timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod