Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Oct 29;8(1):e182.
doi: 10.1017/cts.2024.632. eCollection 2024.

Race, ethnicity, and considerations for data collection and analysis in research studies

Affiliations
Review

Race, ethnicity, and considerations for data collection and analysis in research studies

Sima Sharghi et al. J Clin Transl Sci. .

Abstract

Research studies involving human subjects require collection of and reporting on demographic data related to race and ethnicity. However, existing practices lack standardized guidelines, leading to misrepresentation and biased inferences and conclusions for underrepresented populations in research studies. For instance, sometimes there is a misconception that self-reported racial or ethnic identity may be treated as a biological variable with underlying genetic implications, overlooking its role as a social construct reflecting lived experiences of specific populations. In this manuscript, we use the We All Count data equity framework, which organizes data projects across seven stages: Funding, Motivation, Project Design, Data Collection, Analysis, Reporting, and Communication. Focusing on data collection and analysis, we use examples - both real and hypothetical - to review common practice and provide critiques and alternative recommendations. Through these examples and recommendations, we hope to provide the reader with some ideas and a starting point as they consider embedding a lens of justice, equity, diversity, and inclusivity from research conception to dissemination of findings.

Keywords: Race; analysis; data collection; ethnicity; generalizability.

PubMed Disclaimer

Conflict of interest statement

None.

Figures

Figure 1.
Figure 1.
Examples of “Table 1” from several publications.
Figure 2.
Figure 2.
Examples of data collection form segments to solicit race and ethnicity information from a study participant.
Figure 3.
Figure 3.
Visual of hypothetical example involving controlling for ethnicity versus heterogeneity of treatment effect.

References

    1. What is Data Equity and Why Does it Matter?, https://data.org/resources/what-is-data-equity-and-why-does-it-matter/. Accessed March 01, 2023.
    1. THE DATA EQUITY FRAMEWORK, https://weallcount.com/the-data-process/. Accessed March 01,2023.
    1. Collins FS. What we do and don’t know about ‘race’, ‘ethnicity’, genetics and health at the dawn of the genome era. Nat Genet. 2004;36(S11):S13–5. doi: 10.1038/ng1436. - DOI - PubMed
    1. Mersha TB, Beck AF. The social, economic, political, and genetic value of race and ethnicity in 2020. Hum Genomics. 2020;14(1):37. doi: 10.1186/s40246-020-00284-2. - DOI - PMC - PubMed
    1. Cascino TM, Colvin M, Lanfear DE, et al. Racial inequities in access to VAD and transplant persist after consideration for preferences for care: a report from the registry evaluation for vital information for VADs in ambulatory life (REVIVAL). Circ Heart Fail. 2023;16(1):e009745. - PMC - PubMed

LinkOut - more resources