Datasheets for datasets

T Gebru, J Morgenstern, B Vecchione… - Communications of the …, 2021 - dl.acm.org
Communications of the ACM, 2021dl.acm.org
Datasheets for datasets Page 1 86 COMMUNICATIONS OF THE ACM | DECEMBER 2021 |
VOL. 64 | NO. 12 review articles DATA PLAYS A critical role in machine learning. Every
machine learning model is trained and evaluated using data, quite often in the form of static
datasets. The characteristics of these datasets fundamentally influence a model’s behavior: a
model is unlikely to perform well in the wild if its deployment context does not match its training
or evaluation datasets, or if these datasets reflect unwanted societal biases. Mismatches like …
Documentation to facilitate communication between dataset creators and consumers.
ACM Digital Library