Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Jul 2:arXiv:2311.13417v2.

Reproducible image-based profiling with Pycytominer

Affiliations

Reproducible image-based profiling with Pycytominer

Erik Serrano et al. ArXiv. .

Update in

  • Reproducible image-based profiling with Pycytominer.
    Serrano E, Chandrasekaran SN, Bunten D, Brewer KI, Tomkinson J, Kern R, Bornholdt M, Fleming SJ, Pei R, Arevalo J, Tsang H, Rubinetti V, Tromans-Coia C, Becker T, Weisbart E, Bunne C, Kalinin AA, Senft R, Taylor SJ, Jamali N, Adeboye A, Abbasi HS, Goodman A, Caicedo JC, Carpenter AE, Cimini BA, Singh S, Way GP. Serrano E, et al. Nat Methods. 2025 Apr;22(4):677-680. doi: 10.1038/s41592-025-02611-8. Epub 2025 Mar 3. Nat Methods. 2025. PMID: 40032995 Free PMC article.

Abstract

Advances in high-throughput microscopy have enabled the rapid acquisition of large numbers of high-content microscopy images. Whether by deep learning or classical algorithms, image analysis pipelines then produce single-cell features. To process these single-cells for downstream applications, we present Pycytominer, a user-friendly, open-source python package that implements the bioinformatics steps, known as "image-based profiling". We demonstrate Pycytominer's usefulness in a machine learning project to predict nuisance compounds that cause undesirable cell injuries.

PubMed Disclaimer

Conflict of interest statement

Disclosure of competing interests The authors declare they have no competing interests relevant to this work.

Figures

Figure 1.
Figure 1.. The standard image-based profiling experiment and the role of Pycytominer.
(A) In the experimental phase, a scientist plates cells, often perturbing them with chemical or genetic agents and performs microscopy imaging. In image analysis, using CellProfiler for example, a scientist applies several data processing steps to generate image-based profiles. In addition, scientists can apply a more flexible approach by using deep learning models, such as DeepProfiler, to generate image-based profiles. (B) Pycytominer performs image-based profiling to process morphology features and make them ready for downstream analyses.
Figure 2.
Figure 2.. Model performance and evaluation with JUMP data.
(A) Our pycytominer-based workflow to process publicly-available data and to train a machine learning model to predict cellular injury. (B) Precision and recall scores for predicting various cellular injuries, comparing the not shuffled model (solid lines) with the shuffled model (dashed lines) across distinct injury types and data splits, with blue indicating the training set and red indicating the test set. The F1 scores for each injury represent only the testing and training datasets with the not shuffled model. (C) Confusion matrices assessing the model’s predictive performance across training, testing, and holdout data. (D) Cytoskeletal injury probability distribution generated by the shuffled (red) and non-shuffled (blue) models. The three groups represent the ground truth wells (top), wells predicted to have cytoskeletal injury (middle), and wells not predicted to have cytoskeletal injury (bottom).

References

    1. Way G. P., Sailem H., Shave S., Kasprowicz R. & Carragher N. O. Evolution and impact of high content imaging. SLAS Discov (2023) doi: 10.1016/j.slasd.2023.08.009. - DOI - PubMed
    1. Schindelin J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012). - PMC - PubMed
    1. Stirling D. R. et al. CellProfiler 4: improvements in speed, utility and usability. BMC Bioinformatics 22, 433 (2021). - PMC - PubMed
    1. Scheeder C., Heigwer F. & Boutros M. Machine learning and image-based profiling in drug discovery. Curr Opin Syst Biol 10, 43–52 (2018). - PMC - PubMed
    1. Caicedo J. C. et al. Data-analysis strategies for image-based cell profiling. Nat. Methods 14, 849–863 (2017). - PMC - PubMed

Publication types

LinkOut - more resources