Diversity in Faces

Abstract: Face recognition is a long-standing challenge in the field of Artificial Intelligence (AI). The goal is to create systems that accurately detect, recognize, verify, and understand human faces. There are significant technical hurdles in making these systems accurate, particularly in unconstrained settings, due to confounding factors related to pose, resolution, illumination, occlusion, and viewpoint. However, with recent advances in neural networks, face recognition has achieved unprecedented accuracy, largely built on data-driven deep learning methods. While this is encouraging, a critical aspect that limits facial recognition accuracy and fairness is inherent facial diversity. Every face is different. Every face reflects something unique about us. Aspects of our heritage (including race, ethnicity, culture, and geography) and our individual identity (age, gender, and other visible manifestations of self-expression) are reflected in our faces. We expect face recognition to work equally accurately for every face. Face recognition needs to be fair. As we rely on data-driven methods to create face recognition technology, we need to ensure the necessary balance and coverage in training data. However, there are still scientific questions about how to represent and extract pertinent facial features and quantitatively measure facial diversity. Towards this goal, Diversity in Faces (DiF) provides a data set of one million annotated human face images for advancing the study of facial diversity. The annotations are generated using ten well-established facial coding schemes from the scientific literature. The facial coding schemes provide human-interpretable quantitative measures of facial features. We believe that by making the extracted coding schemes available on a large set of faces, we can accelerate research and development towards creating fairer and more accurate facial recognition systems.

Submission history

From: Michele Merler
[v1] Tue, 29 Jan 2019 18:24:50 UTC (6,415 KB)
[v2] Wed, 30 Jan 2019 15:38:35 UTC (6,415 KB)
[v3] Mon, 11 Feb 2019 15:26:51 UTC (6,416 KB)
[v4] Sat, 16 Feb 2019 17:17:08 UTC (6,496 KB)
[v5] Wed, 20 Feb 2019 22:51:46 UTC (6,496 KB)
[v6] Mon, 8 Apr 2019 21:27:14 UTC (6,497 KB)