Crowd-sourced BioGames: managing the big data problem for next-generation lab-on-a-chip platforms - PubMed (original) (raw)
Crowd-sourced BioGames: managing the big data problem for next-generation lab-on-a-chip platforms
Sam Mavandadi et al. Lab Chip. 2012.
Abstract
We describe a crowd-sourcing based solution for handling large quantities of data that are created by e.g., emerging digital imaging and sensing devices, including next generation lab-on-a-chip platforms. We show that in cases where the diagnosis is a binary decision (e.g., positive vs. negative, or infected vs. uninfected), it is possible to make accurate diagnosis by crowd-sourcing the raw data (e.g., microscopic images of specimens/cells) using entertaining digital games (i.e., ) that are played on PCs, tablets or mobile phones. We report the results and the analysis of a large-scale public experiment toward diagnosis of malaria infected human red blood cells (RBCs), where binary responses from approximately 1000 untrained individuals from more than 60 different countries are combined together (corresponding to more than 1 million cell diagnoses), resulting in an accuracy level that is comparable to those of expert medical professionals. This platform holds promise toward cost-effective and accurate tele-pathology, improved training of medical personnel, and can also be used to manage the "Big Data" problem that is emerging through next generation digital lab-on-a-chip devices.
Figures
Fig. 1
Left: The designed game can be played on multiple platforms. The user is asked to kill or bank infected and healthy cells, respectively. Right: Geographic locations of gamers that have generated the diagnoses so far; each balloon indicates an individual gamer. A blue balloon indicates a gamer with more than 100 submitted cell diagnoses, and a red balloon indicates a gamer with less than 100 submitted cell diagnoses. Since its public launch in May 2012, we have had more than 2,150 gamers from 77 countries, who registered on our servers generating more than 1.5 million individual cell diagnoses.
Fig. 2
Histogram of the overall accuracy levels of the 989 individual gamers who managed to label more than 100 cell images.
Fig. 3
Distribution of the deviation from the true accuracies of those estimated using the control images; the mean is at −0.014 and the variance is 0.026. The green curve is that of a Normal distribution with the same variance as the deviations and a mean of 0.
Fig. 4
The performance results obtained through combining responses of 989 gamers. The “effective crowd size” is the minimum number of times that every cell in the dataset has been diagnosed. Accuracy ≜ (TP + TN)/(TP + TN + FP + FN), Sensitivity ≜ TP/(TP + FN), Specificity ≜ TN/(TN + FP), PPV ≜ TP/(TP + FP), NPV ≜ TN/(TN + FN), FPR ≜ FP/(TN + FP), where TP, TN, FP, and FN correspond to the number of true positive, true negative, false positive, and false negative labels respectively.
References
- Nokia. Nokia 808 Pureview. 2012 http://europe.nokia.com/find-products/devices/nokia-808-pureview.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous