Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug 28:2:29.
doi: 10.3389/fdata.2019.00029. eCollection 2019.

Reflections on Gender Analyses of Bibliographic Corpora

Affiliations

Reflections on Gender Analyses of Bibliographic Corpora

Helena Mihaljević et al. Front Big Data. .

Abstract

The interplay between an academic's gender and their scholarly output is a riveting topic at the intersection of scientometrics, data science, gender studies, and sociology. Its effects can be studied to analyze the role of gender in research productivity, tenure and promotion standards, collaboration and networks, or scientific impact, among others. The typical methodology in this field of research is based on a number of assumptions that are customarily not discussed in detail in the relevant literature, but undoubtedly merit a critical examination. Presumably the most confronting aspect is the categorization of gender. An author's gender is typically inferred from their name, further reduced to a binary feature by an algorithmic procedure. This and subsequent data processing steps introduce biases whose effects are hard to estimate. In this report we describe said problems and discuss the reception and interplay of this line of research within the field. We also outline the effect of obstacles, such as non-availability of data and code for transparent communication. Building on our research on gender effects on scientific publications, we challenge the prevailing methodology in the field and offer a critical reflection on some of its flaws and pitfalls. Our observations are meant to open up the discussion around the need and feasibility of more elaborated approaches to tackle gender in conjunction with analyses of bibliographic sources.

Keywords: automatic gender recognition; bias; data science; gender; reproducibility; science studies; societal issues.

PubMed Disclaimer

References

    1. Angermuller J., van Leeuwen T. (2019). On the social uses of scientometrics: the quantification of academic evaluation and the rise of numerocracy in higher education, in Quantifying Approaches to Discourse for Social Scientists, Postdisciplinary Studies in Discourse, ed Scholz R. (Cham: Springer International Publishing; ), 89–119.
    1. Blei D. M., Smyth P. (2017). Science and data science. Proc. Natl. Acad. Sci. U.S.A. 114, 8689–8692. 10.1073/pnas.1702076114 - DOI - PMC - PubMed
    1. Bonham K. S., Stefan M. I. (2017). Women are underrepresented in computational biology: an analysis of the scholarly literature in biology, computer science and computational biology. PLoS Comput. Biol. 13, 1–12. 10.1371/journal.pcbi.1005134 - DOI - PMC - PubMed
    1. Buolamwini J., Gebru T. (2018). Gender shades: intersectional accuracy disparities in commercial gender classification, in Proceedings of the 1st Conference on Fairness, Accountability and Transparency, Volume 81 of Proceedings of Machine Learning Research, eds Friedler S. A., Wilson C. (New York, NY: PMLR; ), 77–91.
    1. Butler J. (1988). Performative acts and gender constitution: an essay in phenomenology and feminist theory. Theatre J. 40, 519–531. 10.2307/3207893 - DOI

LinkOut - more resources