Semantics derived automatically from language corpora contain human-like biases

Aylin Caliskan¹, Joanna J Bryson^{1

2}, Arvind Narayanan¹

Affiliations

¹ Center for Information Technology Policy, Princeton University, Princeton, NJ, USA. aylinc@princeton.edu jjb@alum.mit.edu arvindn@cs.princeton.edu.
² Department of Computer Science, University of Bath, Bath BA2 7AY, UK.

PMID: 28408601
DOI: 10.1126/science.aal4230

Free article

Semantics derived automatically from language corpora contain human-like biases

Aylin Caliskan et al. Science. 2017.

Free article

. 2017 Apr 14;356(6334):183-186.

doi: 10.1126/science.aal4230.

Authors

Aylin Caliskan¹, Joanna J Bryson^{1

2}, Arvind Narayanan¹

Affiliations

¹ Center for Information Technology Policy, Princeton University, Princeton, NJ, USA. aylinc@princeton.edu jjb@alum.mit.edu arvindn@cs.princeton.edu.
² Department of Computer Science, University of Bath, Bath BA2 7AY, UK.

PMID: 28408601
DOI: 10.1126/science.aal4230

Abstract

Machine learning is a means to derive artificial intelligence by discovering patterns in existing data. Here, we show that applying machine learning to ordinary human language results in human-like semantic biases. We replicated a spectrum of known biases, as measured by the Implicit Association Test, using a widely used, purely statistical machine-learning model trained on a standard corpus of text from the World Wide Web. Our results indicate that text corpora contain recoverable and accurate imprints of our historic biases, whether morally neutral as toward insects or flowers, problematic as toward race or gender, or even simply veridical, reflecting the status quo distribution of gender with respect to careers or first names. Our methods hold promise for identifying and addressing sources of bias in culture, including technology.

PubMed Disclaimer

Comment in

An AI stereotype catcher.
Greenwald AG. Greenwald AG. Science. 2017 Apr 14;356(6334):133-134. doi: 10.1126/science.aan0649. Epub 2017 Apr 13. Science. 2017. PMID: 28408558 No abstract available.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Semantics derived automatically from language corpora contain human-like biases

Affiliations

Semantics derived automatically from language corpora contain human-like biases

Authors

Affiliations

Abstract

Comment in

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources