Text similarity detection method based on NLP

Current text similarity detection methods ignore document structure information and lack semantic relevance. To solve these problems, a text-oriented similarity detection method is proposed. First, the analytic hierarchy process (AHP) is used to calculate word position weights to extract feature wor...

Bibliographic Details
Main Authors: Xiaoli DAI, Shifeng LIU, Daqing GONG
Format: Article
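Language: Chinese (zho)
Published: Editorial Department of Journal on Communications 2021-10-01
Series: Tongxin xuebao
Subjects: text similarity; word position weight; analytic hierarchy process; feature word extraction; Pearson correlation coefficient
Online Access: http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2021192/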
author Xiaoli DAI
Shifeng LIU
Daqing GONG
collection DOAJ
description Current text similarity detection methods ignore document structure information and lack semantic relevance. To solve these problems, a text-oriented similarity detection method is proposed. First, the analytic hierarchy process (AHP) is used to calculate word position weights, which are then used to extract feature words. Second, the Pearson correlation coefficient is used to measure the semantic correlation between words, and this correlation serves as the weight in a generalized Dice coefficient that calculates similarity. Experimental results show that the proposed method can improve the precision of feature word extraction and the accuracy of the similarity calculation results.
format Article
id doaj-art-c5af08ce28ea4aac903af94ee27f40f6
institution Kabale University
issn 1000-436X
language zho
publishDate 2021-10-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
title Text similarity detection method based on NLP
topic text similarity
word position weight
analytic hierarchy process
feature word extraction
Pearson correlation coefficient
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2021192/
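
The description names three building blocks: AHP-derived position weights, Pearson correlation between words, and a generalized Dice coefficient weighted by that correlation. The record gives no formulas, so the following Python sketch is only an illustration under stated assumptions, not the authors' implementation. Assumptions not found in the record: the three positions (title, abstract, body) and their pairwise comparison values, the row geometric-mean approximation of AHP priorities, the co-occurrence-style word vectors, and the particular "soft" form of the Dice coefficient in which every word pair contributes in proportion to its Pearson correlation. All function names are hypothetical.

```python
import math


def ahp_weights(pairwise):
    """Approximate AHP priority weights with the row geometric-mean method."""
    geo_means = [math.prod(row) ** (1.0 / len(row)) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]


def feature_weights(counts_by_position, position_weights):
    """Score each word as the sum of its counts, weighted by position."""
    scores = {}
    for position, counts in counts_by_position.items():
        for word, count in counts.items():
            scores[word] = scores.get(word, 0.0) + position_weights[position] * count
    return scores


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if one is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def correlation(u, v, word_vectors):
    """Semantic correlation between two words; a word correlates fully with itself."""
    if u == v:
        return 1.0
    return pearson(word_vectors[u], word_vectors[v])


def soft_product(doc_x, doc_y, word_vectors):
    """Sum of weight products over all word pairs, scaled by their correlation."""
    return sum(
        wx * wy * correlation(u, v, word_vectors)
        for u, wx in doc_x.items()
        for v, wy in doc_y.items()
    )


def weighted_dice(doc_a, doc_b, word_vectors):
    """Generalized Dice coefficient with correlation-weighted terms (assumed form)."""
    denom = soft_product(doc_a, doc_a, word_vectors) + soft_product(doc_b, doc_b, word_vectors)
    if denom == 0:
        return 0.0
    return 2.0 * soft_product(doc_a, doc_b, word_vectors) / denom


if __name__ == "__main__":
    # Hypothetical pairwise judgments: title vs. abstract vs. body.
    positions = ["title", "abstract", "body"]
    pairwise = [[1, 3, 5], [1 / 3, 1, 3], [1 / 5, 1 / 3, 1]]
    position_weights = dict(zip(positions, ahp_weights(pairwise)))

    # Hypothetical word vectors (e.g. co-occurrence counts over four contexts).
    word_vectors = {
        "car": [4, 0, 1, 3],
        "automobile": [5, 0, 1, 4],
        "banana": [0, 5, 3, 0],
    }
    doc_a = feature_weights({"title": {"car": 1}, "body": {"car": 2}}, position_weights)
    doc_b = feature_weights({"abstract": {"automobile": 1}}, position_weights)
    print(weighted_dice(doc_a, doc_b, word_vectors))
```

With no clipping of negative correlations, this assumed form can return negative values for documents whose words are anti-correlated; whether the published method handles that case differently is not stated in the record.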