Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Feb 28:2023.09.15.558001.
doi: 10.1101/2023.09.15.558001.

Evaluating batch correction methods for image-based cell profiling

Affiliations
Free PMC article

Evaluating batch correction methods for image-based cell profiling

John Arevalo et al. bioRxiv. .
Free PMC article

Update in

Abstract

High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmarked seven high-performing scRNA-seq batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different scenarios with varying complexity, and we found that Harmony, a mixture-model based method, consistently outperformed the other tested methods. Our proposed framework, benchmark, and metrics can additionally be used to assess new batch correction methods in the future. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery.

Keywords: Batch correction; Cell Painting; Harmony; high-throughput phenotypic screening; image-based profiling; machine learning; morphological profiling; pseudo-bulk analysis.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests The Authors declare the following competing interests: S.S. and A.E.C. serve as scientific advisors for companies that use image-based profiling and Cell Painting (A.E.C: Recursion, SyzOnc; S.S.: Waypoint Bio, Dewpoint Therapeutics, Deepcell) and receive research funding and occasional talk honoraria from various pharmaceutical and biotechnology companies. All other authors declare no competing interests.

References

    1. Nat Methods. 2022 Jan;19(1):41-50 - PubMed
    1. Nat Methods. 2022 Dec;19(12):1550-1557 - PubMed
    1. Biostatistics. 2007 Jan;8(1):118-27 - PubMed
    1. Nat Methods. 2019 Dec;16(12):1289-1296 - PubMed
    1. Cell. 2017 Nov 30;171(6):1437-1452.e17 - PubMed

Publication types