Building domain lexicon oriented to behavioral features in depression
Behavioral representations of the patients with depression reflect the clinical features and condition of the patients, therefore it is beneficial for disease diagnosis. However, in the construction of current depression lexicon, the correlation between the behavioral features and the condition of p...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | zho |
| Published: |
China InfoCom Media Group
2024-09-01
|
| Series: | 大数据 |
| Subjects: | |
| Online Access: | http://www.j-bigdataresearch.com.cn/thesisDetails#10.11959/j.issn.2096-0271.2024009 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850212280491311104 |
|---|---|
| author | ZHOU Ruotong ZHU Guangli LI Shuyu DUAN Wenjie LI Jiawei |
| author_facet | ZHOU Ruotong ZHU Guangli LI Shuyu DUAN Wenjie LI Jiawei |
| author_sort | ZHOU Ruotong |
| collection | DOAJ |
| description | Behavioral representations of the patients with depression reflect the clinical features and condition of the patients, therefore it is beneficial for disease diagnosis. However, in the construction of current depression lexicon, the correlation between the behavioral features and the condition of patients in depression texts is overlooked, resulting in incompleteness of the lexicon information. To address this problem, a domain lexicon construction, oriented to behavioral features in depression. was proposed which aimed to extend the domain lexicon's coverage of emotional expressions. Firstly, the seed word sets of sentiment and behavior were constructed by the TF-IDF algorithm respectively, the word set of sentiment was obtained by calculating PMI similarity between the seed word set of sentiment and the existing sentiment lexicon Secondly, the seed words of behavioral were labeled based on correspondence between behavioral features and the condition of patients, and further inputted into WoBERT with depression texts to separately generate dynamic word vectors. In addition, the candidate word set was acquired by calculating the similarity between the seed word set of behavioral and depression texts In addition,based on the similarity between words, the semantic graph was constructed to obtain the word set of behavioral features by label propagation algorithm. Finally, the emoticons with negative emotions on Weibo were collected to build the word set of emoticons. The word set of sentiment, the word set of behavioral features and the word set of emoticons were integrated into the Chinese Depression Domain Lexicon. Experimental results show that the constructed lexicon can improve the effect of depression text classification. |
| format | Article |
| id | doaj-art-0547b7dbeb404d40bd29f77839ff7784 |
| institution | OA Journals |
| issn | 2096-0271 |
| language | zho |
| publishDate | 2024-09-01 |
| publisher | China InfoCom Media Group |
| record_format | Article |
| series | 大数据 |
| spelling | doaj-art-0547b7dbeb404d40bd29f77839ff77842025-08-20T02:09:22ZzhoChina InfoCom Media Group大数据2096-02712024-09-01109610871199229Building domain lexicon oriented to behavioral features in depressionZHOU RuotongZHU GuangliLI ShuyuDUAN WenjieLI JiaweiBehavioral representations of the patients with depression reflect the clinical features and condition of the patients, therefore it is beneficial for disease diagnosis. However, in the construction of current depression lexicon, the correlation between the behavioral features and the condition of patients in depression texts is overlooked, resulting in incompleteness of the lexicon information. To address this problem, a domain lexicon construction, oriented to behavioral features in depression. was proposed which aimed to extend the domain lexicon's coverage of emotional expressions. Firstly, the seed word sets of sentiment and behavior were constructed by the TF-IDF algorithm respectively, the word set of sentiment was obtained by calculating PMI similarity between the seed word set of sentiment and the existing sentiment lexicon Secondly, the seed words of behavioral were labeled based on correspondence between behavioral features and the condition of patients, and further inputted into WoBERT with depression texts to separately generate dynamic word vectors. In addition, the candidate word set was acquired by calculating the similarity between the seed word set of behavioral and depression texts In addition,based on the similarity between words, the semantic graph was constructed to obtain the word set of behavioral features by label propagation algorithm. Finally, the emoticons with negative emotions on Weibo were collected to build the word set of emoticons. The word set of sentiment, the word set of behavioral features and the word set of emoticons were integrated into the Chinese Depression Domain Lexicon. Experimental results show that the constructed lexicon can improve the effect of depression text classification.http://www.j-bigdataresearch.com.cn/thesisDetails#10.11959/j.issn.2096-0271.2024009depression;domain lexicon;behavioral feature;WoBERT;label propagation algorithm |
| spellingShingle | ZHOU Ruotong ZHU Guangli LI Shuyu DUAN Wenjie LI Jiawei Building domain lexicon oriented to behavioral features in depression 大数据 depression;domain lexicon;behavioral feature;WoBERT;label propagation algorithm |
| title | Building domain lexicon oriented to behavioral features in depression |
| title_full | Building domain lexicon oriented to behavioral features in depression |
| title_fullStr | Building domain lexicon oriented to behavioral features in depression |
| title_full_unstemmed | Building domain lexicon oriented to behavioral features in depression |
| title_short | Building domain lexicon oriented to behavioral features in depression |
| title_sort | building domain lexicon oriented to behavioral features in depression |
| topic | depression;domain lexicon;behavioral feature;WoBERT;label propagation algorithm |
| url | http://www.j-bigdataresearch.com.cn/thesisDetails#10.11959/j.issn.2096-0271.2024009 |
| work_keys_str_mv | AT zhouruotong buildingdomainlexiconorientedtobehavioralfeaturesindepression AT zhuguangli buildingdomainlexiconorientedtobehavioralfeaturesindepression AT lishuyu buildingdomainlexiconorientedtobehavioralfeaturesindepression AT duanwenjie buildingdomainlexiconorientedtobehavioralfeaturesindepression AT lijiawei buildingdomainlexiconorientedtobehavioralfeaturesindepression |