Influenza surveillance and early warning system based on search engine data
-
摘要: 随着互联网的不断发展和网络覆盖率的提高,搜索引擎日益成为人们查询生活信息的主要渠道,搜索关键词直接地反映了查询人的意图,且搜索数据可实时统计,因此网络搜索数据成了流感监测的理想数据源。在实际应用中,结合疾病预防控制中心等官方监测数据,可实现流感等重大传染病的早期预警,及早采取相关预防措施,降低疾病传播风险,减少国家及人民的财政负担。与传统的疾病监测相比网络搜索数据具有响应快、易获取、低成本等特点,但数字化疾病监测仍然面临着如网络用户行为的不确定性、搜索关键词获取的不准确性,网络覆盖的不全面性等诸多挑战。因此,未来的相关研究应着眼于如何校正提高互联网搜索数据的准确性,探讨如何将搜索引擎数据与地理信息系统相结合,利用时空大数据进行传染病预测预警。解除数据模块间的壁垒,一面多点,多面多点的采集数据,更好的实现大数据的共享,提高数据的利用率及监测预警的准确性。Abstract: With the continuous development of the Internet and improvement of the network coverage, search engine has become the main channel for information query. As query keywords directly reflect people´s intent and search data can be counted in real time, the network search data is considered the ideal data source for influenza surveillance. In the practical application, a surveillance and early warning system utilizing the search engine data and the official monitoring data, such as CDC, can perform early warning of major infectious diseases in time so as to take preventive measures, thus reducing the risk of disease transmission and the financial burden of the country and the people. Compared with the traditional disease monitor system, that based on the network search data has the characteristics of fast response, easy access, low cost and so on; however, it still faces many challenges, such as the uncertainty of the netizen’s behavior, the inaccuracy of the search keywords and the incomplete network coverage, etc. Therefore, future research should focus on improving the accuracy of Internet search data and explore the ways to use spatial and temporal data for infectious disease prediction and early warning through combining search engine data with geographic information system (GIS). Furthermore, they need to remove the barriers between data modules and collect data collectively and comprehensively, so as to realize the sharing of big data and improve the utilization of data and the accuracy of the monitoring and early warning system.
-
Key words:
- search engine /
- big data /
- influenza /
- monitoring /
- early warning
-
[1] 孙锦峰, 耿云亮, 郭奕瑞, 等. Elman神经网络与ARIMA模型对流感发病率预测效果的比较[J]. 郑州大学学报:医学版, 2013, 48(5): 584-7. http://industry.wanfangdata.com.cn/yj/Detail/Periodical?id=Periodical_henanykdx201305003 [2] Cox N. Influenza seasonality: timing and formulation of vaccines[J]. Bulletin World Health Organization, 2014, 92(5):311-3. [3] 陈国胜, 王靖飞, 李静, 等. 地理信息系统(GIS)进行高致病性禽流感控制中的应用[J]. 中国预防兽医学报, 2004, 15(06): 72-5. [4] 洪荣涛, 许龙善, 严延生, 等. 试述突发公共卫生事件的监测与预警[J]. 中国公共卫生管理, 2005, 21(2): 106-8. http://med.wanfangdata.com.cn/Paper/Detail?id=PeriodicalPaper_zgggwsgl200502008 [5] 鹿凤苓, 王昵, 杨其法, 等. 基层疾病监测预警模式的建立与应用[J]. 疾病监测, 2007, 22(5): 344-5, 358. http://d.wanfangdata.com.cn/Periodical_jbjc200705023.aspx [6] 霍飞, 高浩宇, 刘长娜, 等. 舆情监测在重大传染病预防控制工作中的应用[J]. 职业与健康, 2013, 29(23): 3205-9. http://www.cqvip.com/QK/97016X/201323/48152894.html [7] 马晓薇, 刘艳慧, 袁俊, 等. 微博与传统网络媒体对一起人禽流感疫情舆情监测特点分析[J]. 疾病监测, 2013, 28(1): 61-4. doi: 10.3784/j.issn.1003-9961.2013.1.019 [8] 刘诗洋, 王倩, 郑秋莹. 我国突发公共卫生事件网络舆情研究现状分析[J]. 管理观察, 2014,25(10): 98-101. http://edu.wanfangdata.com.cn/Periodical/Detail/glkxwz201410050 [9] 李莉, 陈蕾, 江涛. 舆情监测在人感染H7N9禽流感防控中的应用及效能分析[J]. 中国公共卫生管理, 2013, 29(06): 775-6. http://www.cqvip.com/QK/97016X/201323/48152894.html [10] 高云华, 王敏. 国外流感预报系统研究进展[J]. 人民军医, 2015, 06(6): 627-8. https://www.wenkuxiazai.com/doc/4d473a3e04a1b0717ed5dd69-2.html [11] 相海泉. 网络直报分秒即达[J]. 中国信息界:e医疗, 2014,7(10): 61-2. https://www.researchgate.net/profile/Shengjie_Lai/publication/268983746_The_application_of_signals_of_China_Infectious_Disease_Automated-alert_and_Response_System_CIDARS_in_Zhejiang_Province_2013/links/547d542b0cf2cfe203c20365/The-application-of-signals-of-China-Infectious-Disease-Automated-alert-and-Response-System-CIDARS-in-Zhejiang-Province-2013.pdf [12] 于颖洁, 罗小飞. 疾病预防控制信息系统的发展[J]. 中国药物与临床, 2015,14(09): 1361-2. http://d.wanfangdata.com.cn/Periodical_zgywylc201509065.aspx [13] 康敏, 钟豪杰, 杨芬, 等. 广东省2009年甲型H1N1流感流行与网络搜索情况的相关性分析[J]. 热带医学杂志, 2011, 11(6): 629-32. http://www.cqvip.com/QK/84226X/201106/38614732.html [14] 闫城榛, 韩志国. 应变大数据[J]. 中国传媒科技, 2013,6(07): 28-33. doi: 10.3969/j.issn.1671-0134.2013.07.007 [15] 黄达沧. 基于搜索引擎数据的手足口病监测[D]. 长春: 东北师范大学, 2015. [16] Wilson K, Brownstein JS. Early detection of disease outbreaks using the Internet[J]. CMAJ, 2009, 180(8): 829-31. doi: 10.1503/cmaj.1090215 [17] Ginsberg J, Mohebbi MH, Patel RS, et al. Detecting influenza epidemics using search engine query data[J]. Nature, 2009, 457(7232): 1012-4. doi: 10.1038/nature07634 [18] Eysenbach G. Infodemiology:tracking flu-related searches on the web for syndromic surveillance[C]//AMIA Annual Symposium Proceedings, 2006: 244-8. [19] Hulth A, Rydevik G, Linde A. Web queries as a source for syndromic surveillance[J]. PLoS One, 2009, 4(2): e4378-82. doi: 10.1371/journal.pone.0004378 [20] 李秀婷, 刘凡, 董纪昌, 等. 基于互联网搜索数据的中国流感监测[J]. 系统工程理论与实践, 2013, 33(12): 3028-34. doi: 10.12011/1000-6788(2013)12-3028 [21] Pew Research Center's Pew Internet & American Life Project[R]. Online Health Search, 2006. [22] Polgreen PM, Chen Y, Pennock DM, et al. Using Internet searches for influenza surveillance[J]. Clin Infect Dis, 2008, 47(11): 1443-8. doi: 10.1086/596019 [23] 肖辉, 周征奇, 肖革新, 等. 公共卫生领域中的数据挖掘[J]. 医学信息学杂志, 2013,18(12): 2-5. https://www.wenkuxiazai.com/doc/21634af91711cc7931b716f0-4.html [24] Doornik J. Improving the timeliness of data on influenza-like illnesses using Google search data[C]//8th Oxmetrics User Conference, 2010. [25] Milinovich GJ, Magalhães RJ, Hu W. Role of big data in the early detection of Ebola and other emerging infectious diseases[J]. Lancet Glob Health, 2015, 3(1): e20-1. doi: 10.1016/S2214-109X(14)70356-0 [26] Althouse BM, Ng YY, Cummings DA. Prediction of dengue incidence using search query surveillance[J]. PLoS Negl Trop Dis, 2011, 5(8): e1258-63. doi: 10.1371/journal.pntd.0001258 [27] 聪颖. 大数据能帮我们避免下一场瘟疫[J]. 吉林医学信息, 2014, 07(5): 30-1. http://www.cnki.com.cn/Article/CJFDTotal-JLYC201407027.htm [28] 李锐, 孙利谦, 熊成龙, 等. 基于互联网搜索数据研究全球高致病性禽流感病毒H5N1的暴发监测[J]. 中华疾病控制杂志, 2015, 19(8): 773-7. http://industry.wanfangdata.com.cn/dl/Magazine?magazineId=jbkzzz&yearIssue=2015_8 [29] Kang M, Zhong H, He J, et al. Using google trends for influenza surveillance in South China[J]. PLoS One, 2013, 8(1): e55205-8. doi: 10.1371/journal.pone.0055205 [30] Centers for Disease Control and Prevention. Percentage of adults aged ≥18 years who looked up health information on the Internet,by age group and Sex-National health interview survey,United States,January September,2009[J]. Morbid Mortal Weekly Rep, 2010, 59(15): 461-6. https://www.cdc.gov/mmwr/preview/mmwrhtml/mm5915a7.htm [31] Pelat C, Turbelin C, Bar-Hen A, et al. More diseases tracked by using Google Trends[J]. Emerg Infect Dis, 2009, 15(8): 1327-8. doi: 10.3201/eid1508.090299 [32] Wilson N, Mason K, Tobias M, et al. Interpreting google flu trends data for pandemic H1N1 influenza: the New Zealand experience[J]. Euro Surveill, 2009, 14(44): 19386-93. doi: 10.2807/ese.14.44.19386-en [33] Kelly H, Grant K. Interim analysis of pandemic influenza (H1N1) 2009 in Australia: surveillance trends, age of infection and effectiveness of seasonal vaccination[J]. Euro Surveill, 2009, 14(31): 19288-94. doi: 10.2807/ese.14.31.19288-en [34] 郭浩岩, 王林, 苏婧, 等. 2013年人感染H7N9禽流感舆情监测结果分析[J]. 中国健康教育, 2015,13(05): 479-81. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=zgjkjy201505012 [35] Yuan Q, Nsoesie EO, Lv B, et al. Monitoring influenza epidemics in China with search query from baidu[J]. PLoS One, 2013, 8(5): e64323-8. doi: 10.1371/journal.pone.0064323 [36] Xie T, Yang Z, Yang S, et al. Correlation between reported human infection with avian influenza A H7N9 virus and cyber user awareness: what can we learn from digital epidemiology[J]. Int J Infect Dis, 2014, 22(1): 1-3. [37] Valdivia A, Lopez-Alcalde J, Vicente M, et al. Monitoring influenza activity in Europe with Google Flu Trends: comparison with the findings of sentinel physician networks - results for 2009-10[J]. Euro Surveill, 2010, 15(29): 19621-8. http://www.eurosurveillance.org/images/dynamic/EE/V15N29/art19621.pdf [38] Butler D. When google got flu wrong[J]. Nature, 2013, 494(7436): 155-6. doi: 10.1038/494155a [39] Dugas AF, Hsieh YH, Levin SR, et al. Google flu trends: correlation with emergency department influenza rates and crowding metrics[J]. Clin Infect Dis, 2012, 54(4): 463-9. doi: 10.1093/cid/cir883 [40] Hulth A, Rydevik G. Web query-based surveillance in Sweden during the influenza A(H1N1)2009 pandemic, April 2009 to February 2010[J]. Euro Surveill, 2011, 16(18): 19856-62. doi: 10.2807/ese.16.18.19856-en [41] Lazer D, Kennedy R, King G, et al. Big data. The parable of Google Flu: traps in big data analysis[J]. Science, 2014, 343(6176): 1203-5. doi: 10.1126/science.1248506 [42] 陈新保, 朱建军, 陈建群. 时空数据模型综述[J]. 地理科学进展, 2009, 28(01): 9-17. http://edu.wanfangdata.com.cn/Periodical/Detail/dlkxjz200901002 [43] 林玫, 李永红, 董柏青. 传染病预测预警方法在我国的应用现状[J]. 中国热带医学, 2010, 10(3): 308-9, 348. http://www.cqvip.com/QK/84113X/201003/33001891.html [44] 屈晓晖, 袁武, 袁文, 等. 时空大数据分析技术在传染病预测预警中的应用[J]. 中国数字医学, 2015(08): 36-9. doi: 10.3969/j.issn.1673-7571.2015.08.012
点击查看大图
计量
- 文章访问数: 1852
- HTML全文浏览量: 592
- PDF下载量: 9
- 被引次数: 0