A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.

<h4>Background</h4>AIDS is a worrying public health issue in China and lacks timely and effective surveillance. With the diffusion and adoption of the Internet, the 'big data' aggregated from Internet search engines, which contain users' information on the concern or reali...

Full description

Saved in:
Bibliographic Details
Main Authors: Yongqing Nan, Yanyan Gao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0199697&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850076927584370688
author Yongqing Nan
Yanyan Gao
author_facet Yongqing Nan
Yanyan Gao
author_sort Yongqing Nan
collection DOAJ
description <h4>Background</h4>AIDS is a worrying public health issue in China and lacks timely and effective surveillance. With the diffusion and adoption of the Internet, the 'big data' aggregated from Internet search engines, which contain users' information on the concern or reality of their health status, provide a new opportunity for AIDS surveillance. This paper uses search engine data to monitor and forecast AIDS in China.<h4>Methods</h4>A machine learning method, artificial neural networks (ANNs), is used to forecast AIDS incidences and deaths. Search trend data related to AIDS from the largest Chinese search engine, Baidu.com, are collected and selected as the input variables of ANNs, and officially reported actual AIDS incidences and deaths are used as the output variable. Three criteria, the mean absolute percentage error, the root mean squared percentage error, and the index of agreement, are used to test the forecasting performance of the ANN method.<h4>Results</h4>Based on the monthly time series data from January 2011 to June 2017, this article finds that, under the three criteria, the ANN method can lead to satisfactory forecasting of AIDS incidences and deaths, regardless of the change in the number of search queries.<h4>Conclusions</h4>Despite the inability to self-detect HIV/AIDS through online searching, Internet-based data should be adopted as a timely, cost-effective complement to a traditional AIDS surveillance system.
format Article
id doaj-art-64fb77d506f24a7fb9531737a4e8900f
institution DOAJ
issn 1932-6203
language English
publishDate 2018-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-64fb77d506f24a7fb9531737a4e8900f2025-08-20T02:45:55ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01137e019969710.1371/journal.pone.0199697A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.Yongqing NanYanyan Gao<h4>Background</h4>AIDS is a worrying public health issue in China and lacks timely and effective surveillance. With the diffusion and adoption of the Internet, the 'big data' aggregated from Internet search engines, which contain users' information on the concern or reality of their health status, provide a new opportunity for AIDS surveillance. This paper uses search engine data to monitor and forecast AIDS in China.<h4>Methods</h4>A machine learning method, artificial neural networks (ANNs), is used to forecast AIDS incidences and deaths. Search trend data related to AIDS from the largest Chinese search engine, Baidu.com, are collected and selected as the input variables of ANNs, and officially reported actual AIDS incidences and deaths are used as the output variable. Three criteria, the mean absolute percentage error, the root mean squared percentage error, and the index of agreement, are used to test the forecasting performance of the ANN method.<h4>Results</h4>Based on the monthly time series data from January 2011 to June 2017, this article finds that, under the three criteria, the ANN method can lead to satisfactory forecasting of AIDS incidences and deaths, regardless of the change in the number of search queries.<h4>Conclusions</h4>Despite the inability to self-detect HIV/AIDS through online searching, Internet-based data should be adopted as a timely, cost-effective complement to a traditional AIDS surveillance system.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0199697&type=printable
spellingShingle Yongqing Nan
Yanyan Gao
A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.
PLoS ONE
title A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.
title_full A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.
title_fullStr A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.
title_full_unstemmed A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.
title_short A machine learning method to monitor China's AIDS epidemics with data from Baidu trends.
title_sort machine learning method to monitor china s aids epidemics with data from baidu trends
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0199697&type=printable
work_keys_str_mv AT yongqingnan amachinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends
AT yanyangao amachinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends
AT yongqingnan machinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends
AT yanyangao machinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends