Evaluating batch correction methods for image-based cell profiling
- PMID: 39095341
- PMCID: PMC11297288
- DOI: 10.1038/s41467-024-50613-5
Evaluating batch correction methods for image-based cell profiling
Abstract
High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects severely limit community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmark ten high-performing single-cell RNA sequencing (scRNA-seq) batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, JUMP. We focus on five scenarios with varying complexity, ranging from batches prepared in a single lab over time to batches imaged using different microscopes in multiple labs. We find that Harmony and Seurat RPCA are noteworthy, consistently ranking among the top three methods for all tested scenarios while maintaining computational efficiency. Our proposed framework, benchmark, and metrics can be used to assess new batch correction methods in the future. This work paves the way for improvements that enable the community to make the best use of public Cell Painting data for scientific discovery.
© 2024. The Author(s).
Conflict of interest statement
The Authors declare the following competing interests: S.S. and A.E.C. serve as scientific advisors for companies that use image-based profiling and Cell Painting (A.E.C: Recursion, SyzOnc, Quiver Bioscience, S.S.: Waypoint Bio, Dewpoint Therapeutics, Deepcell) and receive honoraria for occasional scientific visits to pharmaceutical and biotechnology companies. All other authors declare no competing interests.
Figures
Update of
-
Evaluating batch correction methods for image-based cell profiling.bioRxiv [Preprint]. 2024 Feb 28:2023.09.15.558001. doi: 10.1101/2023.09.15.558001. bioRxiv. 2024. Update in: Nat Commun. 2024 Aug 2;15(1):6516. doi: 10.1038/s41467-024-50613-5. PMID: 37745478 Free PMC article. Updated. Preprint.
References
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
