Time series and semantics-based chinese microblog topic detection and tracking method
As a widely used tool in social networks,microblog is definitely with short document,quick broadcasting and topic changeable,which results in big challenging for social topic detection and tracking.A new systematic framework for micro-blog topic detection and tracking was proposed based on the micro...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
POSTS&TELECOM PRESS Co., LTD
2016-05-01
|
Series: | 网络与信息安全学报 |
Subjects: | |
Online Access: | http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00048 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841530373052301312 |
---|---|
author | Tie-ming CHEN Xiao-hao WANG Wei-wei PANG Jie JIANG |
author_facet | Tie-ming CHEN Xiao-hao WANG Wei-wei PANG Jie JIANG |
author_sort | Tie-ming CHEN |
collection | DOAJ |
description | As a widely used tool in social networks,microblog is definitely with short document,quick broadcasting and topic changeable,which results in big challenging for social topic detection and tracking.A new systematic framework for micro-blog topic detection and tracking was proposed based on the microblog clustering using temporal trend and semantic similarity.Firstly,a feature words selection method for hot topics was presented by defining the temporal frequent words set.Secondly,an initially clustering was conducted depending on the selected temporal frequent words set.As far as the overlaps between initial clusters concerned,an effective overlap elimination algorithm was proposed,by introducing the extended short document semantic membership,to separate any possible overlapped initial clusters.Finally,an aggregated topic clustering method was employed using the cluster semantic similarity matrix.The experiments were at last done on some real-world dataset from Sina microblog.It show that the method for chinese microblog topic detection and tracking can obtain excellent performance and results. |
format | Article |
id | doaj-art-5deb6fac038c4f1d86adf49631078466 |
institution | Kabale University |
issn | 2096-109X |
language | English |
publishDate | 2016-05-01 |
publisher | POSTS&TELECOM PRESS Co., LTD |
record_format | Article |
series | 网络与信息安全学报 |
spelling | doaj-art-5deb6fac038c4f1d86adf496310784662025-01-15T03:04:33ZengPOSTS&TELECOM PRESS Co., LTD网络与信息安全学报2096-109X2016-05-012212959545360Time series and semantics-based chinese microblog topic detection and tracking methodTie-ming CHENXiao-hao WANGWei-wei PANGJie JIANGAs a widely used tool in social networks,microblog is definitely with short document,quick broadcasting and topic changeable,which results in big challenging for social topic detection and tracking.A new systematic framework for micro-blog topic detection and tracking was proposed based on the microblog clustering using temporal trend and semantic similarity.Firstly,a feature words selection method for hot topics was presented by defining the temporal frequent words set.Secondly,an initially clustering was conducted depending on the selected temporal frequent words set.As far as the overlaps between initial clusters concerned,an effective overlap elimination algorithm was proposed,by introducing the extended short document semantic membership,to separate any possible overlapped initial clusters.Finally,an aggregated topic clustering method was employed using the cluster semantic similarity matrix.The experiments were at last done on some real-world dataset from Sina microblog.It show that the method for chinese microblog topic detection and tracking can obtain excellent performance and results.http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00048microblog textfrequent wordsfeature selectionclustering,topic detectiontime seriessemantics |
spellingShingle | Tie-ming CHEN Xiao-hao WANG Wei-wei PANG Jie JIANG Time series and semantics-based chinese microblog topic detection and tracking method 网络与信息安全学报 microblog text frequent words feature selection clustering,topic detection time series semantics |
title | Time series and semantics-based chinese microblog topic detection and tracking method |
title_full | Time series and semantics-based chinese microblog topic detection and tracking method |
title_fullStr | Time series and semantics-based chinese microblog topic detection and tracking method |
title_full_unstemmed | Time series and semantics-based chinese microblog topic detection and tracking method |
title_short | Time series and semantics-based chinese microblog topic detection and tracking method |
title_sort | time series and semantics based chinese microblog topic detection and tracking method |
topic | microblog text frequent words feature selection clustering,topic detection time series semantics |
url | http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00048 |
work_keys_str_mv | AT tiemingchen timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod AT xiaohaowang timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod AT weiweipang timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod AT jiejiang timeseriesandsemanticsbasedchinesemicroblogtopicdetectionandtrackingmethod |