Correlation between national influenza surveillance data and google trends in South Korea - PubMed (original) (raw)

Sungjin Cho et al. PLoS One. 2013.

Abstract

Background: In South Korea, there is currently no syndromic surveillance system using internet search data, including Google Flu Trends. The purpose of this study was to investigate the correlation between national influenza surveillance data and Google Trends in South Korea.

Methods: Our study was based on a publicly available search engine database, Google Trends, using 12 influenza-related queries, from September 9, 2007 to September 8, 2012. National surveillance data were obtained from the Korea Centers for Disease Control and Prevention (KCDC) influenza-like illness (ILI) and virologic surveillance system. Pearson's correlation coefficients were calculated to compare the national surveillance and the Google Trends data for the overall period and for 5 influenza seasons.

Results: The correlation coefficient between the KCDC ILI and virologic surveillance data was 0.72 (p<0.05). The highest correlation was between the Google Trends query of H1N1 and the ILI data, with a correlation coefficient of 0.53 (p<0.05), for the overall study period. When compared with the KCDC virologic data, the Google Trends query of bird flu had the highest correlation with a correlation coefficient of 0.93 (p<0.05) in the 2010-11 season. The following queries showed a statistically significant correlation coefficient compared with ILI data for three consecutive seasons: Tamiflu (r = 0.59, 0.86, 0.90, p<0.05), new flu (r = 0.64, 0.43, 0.70, p<0.05) and flu (r = 0.68, 0.43, 0.77, p<0.05).

Conclusions: In our study, we found that the Google Trends for certain queries using the survey on influenza correlated with national surveillance data in South Korea. The results of this study showed that Google Trends in the Korean language can be used as complementary data for influenza surveillance but was insufficient for the use of predictive models, such as Google Flu Trends.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1

Figure 1. Time series plots of KCDC surveillance data and Google Trends data.

Figure 2

Figure 2. Time series plot of queries that consecutively show significant correlation coefficients (p<0.05).

Strong correlation is defined as a correlation coefficient r-value of >0.7. Tamiflu is the only query to show a strong correlation for two consecutive years.

References

    1. Varney SM, Hirshon JM (2006) Update on public health surveillance in emergency departments. Emerg Med Clin North Am 24: 1035–1052. - PubMed
    1. Henning KJ (2004) What is syndromic surveillance? MMWR Morb Mortal Wkly Rep 53 Suppl: 5–11 - PubMed
    1. Frenk J, Gomez-Dantes O (2002) Globalization And The Challenges To Health Systems. Health Affairs 21: 160–165. - PubMed
    1. Irvin CB, Nouhan PP, Rice K (2003) Syndromic analysis of computerized emergency department patients' chief complaints: An opportunity for bioterrorism and influenza surveillance. Annals of Emergency Medicine 41: 447–452. - PubMed
    1. Hirshon JM (2000) The rationale for developing public health surveillance systems based on emergency department data. Acad Emerg Med 7: 1428–1432. - PubMed

MeSH terms

Grants and funding

The authors have no support or funding to report.

LinkOut - more resources