Citation needed? Wikipedia bibliometrics during the first wave of the COVID-19 pandemic - PubMed (original) (raw)
Citation needed? Wikipedia bibliometrics during the first wave of the COVID-19 pandemic
Omer Benjakob et al. Gigascience. 2022.
Abstract
Background: With the COVID-19 pandemic's outbreak, millions flocked to Wikipedia for updated information. Amid growing concerns regarding an "infodemic," ensuring the quality of information is a crucial vector of public health. Investigating whether and how Wikipedia remained up to date and in line with science is key to formulating strategies to counter misinformation. Using citation analyses, we asked which sources informed Wikipedia's COVID-19-related articles before and during the pandemic's first wave (January-May 2020).
Results: We found that coronavirus-related articles referenced trusted media outlets and high-quality academic sources. Regarding academic sources, Wikipedia was found to be highly selective in terms of what science was cited. Moreover, despite a surge in COVID-19 preprints, Wikipedia had a clear preference for open-access studies published in respected journals and made little use of preprints. Building a timeline of English-language COVID-19 articles from 2001-2020 revealed a nuanced trade-off between quality and timeliness. It further showed how pre-existing articles on key topics related to the virus created a framework for integrating new knowledge. Supported by a rigid sourcing policy, this "scientific infrastructure" facilitated contextualization and regulated the influx of new information. Last, we constructed a network of DOI-Wikipedia articles, which showed the landscape of pandemic-related knowledge on Wikipedia and how academic citations create a web of shared knowledge supporting topics like COVID-19 drug development.
Conclusions: Understanding how scientific research interacts with the digital knowledge-sphere during the pandemic provides insight into how Wikipedia can facilitate access to science. It also reveals how, aided by what we term its "citizen encyclopedists," it successfully fended off COVID-19 disinformation and how this unique model may be deployed in other contexts.
Keywords: COVID-19; Wikipedia; bibliometrics; citizen science; infodemic; open science; sources.
© The Author(s) 2022. Published by Oxford University Press GigaScience.
Conflict of interest statement
Omer Benjakob is a journalist for Haaretz and has written about Wikipedia in the past.
Figures
Figure 1:
Characterization of scientific sources of the Wikipedia COVID-19 corpus. (A) Bar plot of the most cited academic sources. Top journals are highlighted in green and preprints are represented in red. Bottom right: Box plot of Altmetrics score of the 3 sets: the Wikipedia COVID-19 corpus, the EuroPMC COVID-19 search, and the full Wikipedia dump as of May 2020. Comparison of the occurrence of (B) open-access sources and (C) preprints (medRxiv and bioRxiv) in the 3 sets. Boxplots center indicates the median, and the bottom and top edges indicate the 25th and 75th percentiles; the wiskers extend 1.5 times the interquartile range.
Figure 2:
Top sources used in the Wikipedia COVID-19 corpus: A) source types, B) news agencies, C) websites, and D) publishers form the COVID-19 corpus sources (per Wikipedia’s citation template terminology). Several denominations for the same institution are present in the raw data which is highlighted here with the example of WHO and World Health Organization
Figure 3:
Historical perspective of the Wikipedia COVID-19 corpus. (A) COVID-19 article creation per year; inset: number of articles created before and after 2020. (B) Scientific citations added per year to the COVID-19 corpus and globally in Wikipedia (inset). Latency distribution of scientific papers (C) in the COVID-19 corpus and (D) the Wikipedia dump. See Supplementary Fig. S3 and in the GigaDB repository [54]. for an interactive version of the timeline.
Figure 4:
Network of articles–scientific papers (DOI) in the Wikipedia COVID-19 corpus. A network mapping scientific papers (with DOIs) cited in >1 article in the Wikipedia COVID-19 corpus was constructed. This network is composed of 454 edges, 179 DOIs (blue), and 136 Wikipedia articles (yellow). Nodes represent articles and their size is proportional to the number of connections. A zoom in on the cluster of Wikipedia articles dealing with COVID-19 drug development is depicted here for illustrative purposes. For clarity, edges marked in red indicate those connecting the DOIs cited directly in the “COVID-19 drug development” article and edges marked in blue indicate those connecting these DOIs to other articles citing them. See the GigaDB repository [54] for an interactive version of the network (see Supplementary Dataset S2).
Similar articles
- Assessing Public Interest Based on Wikipedia's Most Visited Medical Articles During the SARS-CoV-2 Outbreak: Search Trends Analysis.
Chrzanowski J, Sołek J, Fendler W, Jemielniak D. Chrzanowski J, et al. J Med Internet Res. 2021 Apr 12;23(4):e26331. doi: 10.2196/26331. J Med Internet Res. 2021. PMID: 33667176 Free PMC article. - The Most Influential Medical Journals According to Wikipedia: Quantitative Analysis.
Jemielniak D, Masukume G, Wilamowski M. Jemielniak D, et al. J Med Internet Res. 2019 Jan 18;21(1):e11429. doi: 10.2196/11429. J Med Internet Res. 2019. PMID: 30664451 Free PMC article. - The Role of Social Media in Health Misinformation and Disinformation During the COVID-19 Pandemic: Bibliometric Analysis.
Adebesin F, Smuts H, Mawela T, Maramba G, Hattingh M. Adebesin F, et al. JMIR Infodemiology. 2023 Sep 20;3:e48620. doi: 10.2196/48620. JMIR Infodemiology. 2023. PMID: 37728981 Free PMC article. - COVID-19 Study on Scientific Articles in Health Communication: A Science Mapping Analysis in Web of Science.
de Las Heras-Pedrosa C, Jambrino-Maldonado C, Rando-Cueto D, Iglesias-Sánchez PP. de Las Heras-Pedrosa C, et al. Int J Environ Res Public Health. 2022 Feb 2;19(3):1705. doi: 10.3390/ijerph19031705. Int J Environ Res Public Health. 2022. PMID: 35162726 Free PMC article. Review. - Defining Misinformation and Related Terms in Health-Related Literature: Scoping Review.
El Mikati IK, Hoteit R, Harb T, El Zein O, Piggott T, Melki J, Mustafa RA, Akl EA. El Mikati IK, et al. J Med Internet Res. 2023 Aug 9;25:e45731. doi: 10.2196/45731. J Med Internet Res. 2023. PMID: 37556184 Free PMC article. Review.
Cited by
- Strategies for crowdsourcing hearing health information: a comparative study of educational programs and volunteer-based campaigns on Wikimedia.
Morata TC, Zucki F, Arrigo AJ, Cruz PC, Gong W, Matos HGC, Montilha AAP, Peschanski JA, Cardoso MJ, Lacerda ABM, Berberian AP, Araujo ES, Luders D, Duarte JL, Jacob RTS, Chadha S, Mietchen D, Rasberry L, Alvarenga KF, Jacob LCB. Morata TC, et al. BMC Public Health. 2024 Sep 30;24(1):2646. doi: 10.1186/s12889-024-20105-8. BMC Public Health. 2024. PMID: 39343916 Free PMC article. - Co-designing a wiki-based community knowledge management system for personal science.
Kloppenborg K, Price Ball M, Jonas S, Wolf GI, Greshake Tzovaras B. Kloppenborg K, et al. R Soc Open Sci. 2024 Jul 10;11(7):240275. doi: 10.1098/rsos.240275. eCollection 2024 Jul. R Soc Open Sci. 2024. PMID: 39076354 Free PMC article. - Crowdsourcing Knowledge Production of COVID-19 Information on Japanese Wikipedia in the Face of Uncertainty: Empirical Analysis.
Yang K, Tanaka M. Yang K, et al. J Med Internet Res. 2023 Jun 29;25:e45024. doi: 10.2196/45024. J Med Internet Res. 2023. PMID: 37384371 Free PMC article. - Developing a scalable framework for partnerships between health agencies and the Wikimedia ecosystem.
Mietchen D, Rasberry L, Morata T, Sadowski JP, Novakovich J, Heilman JM. Mietchen D, et al. Res Ideas Outcomes. 2021 Jun 6;7:10.3897/rio.7.e68121. doi: 10.3897/rio.7.e68121. Res Ideas Outcomes. 2021. PMID: 39421138 Free PMC article. - Wikipedia as a tool for contemporary history of science: A case study on CRISPR.
Benjakob O, Guley O, Sevin JM, Blondel L, Augustoni A, Collet M, Jouveshomme L, Amit R, Linder A, Aviram R. Benjakob O, et al. PLoS One. 2023 Sep 13;18(9):e0290827. doi: 10.1371/journal.pone.0290827. eCollection 2023. PLoS One. 2023. PMID: 37703244 Free PMC article.
References
- Lavsa SM, Corman SL, Culley CM, et al. Reliability of Wikipedia as a medication information source for pharmacy students. Curr Pharm Teach Learn. 2011;3(2):154–8.
- Allahwala UK, Nadkarni A, Sebaratnam DF. Wikipedia use amongst medical students–new insights into the digital revolution. Med Teach. 2013;35(4):337–7. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical