Predicting Prevalence of Influenza-Like Illness From Geo-Tagged - PDF document

Predicting Prevalence of Influenza-Like Illness From Geo-Tagged Tweets

Kewei Zhang*† (kewei.zhang@uqconnect.edu.au), Reza Arablouei† (reza.arablouei@csiro.au), Raja Jurdak†* (raja.jurdak@csiro.au)

* School of Information Technology and Electrical Engineering, University of Queensland, St. Lucia QLD, Australia
† CSIRO Data61, Pullenvale QLD, Australia

ABSTRACT

Modeling disease spread and distribution using social media data has become an increasingly popular research area. While Twitter data has recently been investigated for estimating disease spread, the extent to which it is representative of disease spread and distribution in a macro perspective is still an open question. In this paper, we focus on macro-scale modeling of influenza-like illnesses (ILI) using a large dataset containing 8,961,932 tweets from Australia collected in 2015. We first propose modifications of the state-of-the-art ILI-related tweet detection approaches to acquire a more refined dataset. We normalize the number of detected ILI-related tweets with Internet access and Twitter penetration rates in each state. Then, we establish a state-level linear regression model between the number of ILI-related tweets and the number of real influenza notifications. The Pearson correlation coefficient of the model is 0.93. Our results indicate that: 1) a strong positive linear correlation exists between the number of ILI-related tweets and the number of recorded influenza notifications at state scale; 2) Twitter data has promising ability in helping detect influenza outbreaks; 3) taking into account the population, Internet access and Twitter penetration rates in each state enhances the prevalence modeling analysis.

Keywords

Classification; data mining; disease modeling; public health monitoring; regression analysis; Twitter

1. INTRODUCTION

Public health surveillance is an essential mission of every government. In the current era of big data, data-driven epidemic modeling and surveillance systems have drawn unprecedented attention.

In Australia, epidemics of seasonal influenza are one of the major public health concerns. Seasonal influenza strains peak in circulation each winter. During the first half of 2015, more than 30,000 influenza cases were notified [5], the highest number of flu notifications on record for that period. Moreover, public health data are traditionally collected via surveys and by aggregating statistics obtained from healthcare institutions. Such data collection processes are usually costly, slow, and retrospective.

Recently, analyzing data collected from Twitter, a micro-blogging social network, has shown promise in assessing the prevalence of flu [9]. However, modeling disease spread and distribution with Twitter data involves several challenging tasks. First of all, detecting tweets that contain expressions of disease symptoms requires natural language processing (NLP), which is an active research field with plenty of open challenges [12]. Moreover, health-related tweets are relatively scarce [9], making their detection within a large corpus of tweets a highly unbalanced classification problem. Zuccon et al. [21] investigated the suitability of statistical machine learning approaches in detecting ILI-related tweets automatically. Their results show that the optimal F-score, which is the harmonic mean of precision and recall, is only up to 0.736 among most of the state-of-the-art approaches. Considering the limited likelihood of users mentioning their health condition on Twitter, relying only on classification techniques for obtaining ILI-related tweets can induce large errors and lead to a biased epidemic model.

In this paper, we analyze a large database of 8,961,932 tweets from Australia collected in 2015 to study the spread and distribution of influenza-like illness epidemics. We propose modifications to the algorithm proposed in [16] to improve the classification performance for ILI-related tweets. We also take into account the Internet and Twitter penetration rates in each state to normalize the results. Afterwards, we establish a state-level model between the Twitter data and the true influenza notification data and also perform temporal and spatial analysis to explore how well Twitter data can capture the features of disease spread and distribution. Furthermore, we identify the limitations of our study as well as the opportunities for further study on utilizing Twitter data for public health surveillance.

The remainder of the paper is organized as follows. Section 2 presents related work. Section 3 gives some general statistics about the dataset we use and provides the methodology of the experiment design. Section 4 presents the experiment results and discussions. Section 5 elaborates on the limitations of the work. Section 6 provides conclusions and ideas for future work.

© 2017 International World Wide Web Conference Committee (IW3C2), published under Creative Commons CC BY 4.0 License. WWW 2017, April 3–7, 2017, Perth, Australia. ACM 978-1-4503-4914-7/17/04. http://dx.doi.org/10.1145/3041021.3051150
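
The quantities named in the abstract and introduction (the F-score as the harmonic mean of precision and recall, normalization of tweet counts by Internet access and Twitter penetration rates, and a state-level linear fit summarized by a Pearson correlation coefficient) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names are hypothetical, and the normalization formula (tweets divided by population × Internet access rate × Twitter penetration rate) is an assumption, since this page does not state the exact formula. The values in the usage section are placeholders, not data from the paper.

```python
from math import sqrt


def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def normalize_count(tweets: float, population: float,
                    internet_rate: float, twitter_rate: float) -> float:
    """ILI-related tweets per estimated Twitter user in a state.

    ASSUMPTION: the estimated number of Twitter users is taken as
    population * internet_rate * twitter_rate; the paper's exact
    normalization is not given on this page.
    """
    estimated_users = population * internet_rate * twitter_rate
    return tweets / estimated_users


def pearson_r(xs, ys) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # Placeholder values for illustration only; NOT taken from the paper.
    tweet_signal = [1.0, 2.0, 3.0, 4.0]      # e.g. normalized ILI tweet counts
    notifications = [2.1, 3.9, 6.2, 7.8]     # e.g. recorded notifications
    slope, intercept = fit_line(tweet_signal, notifications)
    print("slope:", slope, "intercept:", intercept)
    print("Pearson r:", pearson_r(tweet_signal, notifications))
```

With one (tweet signal, notification count) pair per state, `fit_line` gives the state-level regression line and `pearson_r` gives the kind of correlation coefficient the abstract reports.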
