Genome Res. 2004 Jun;14(6):1130-6.
doi: 10.1101/gr.2383804.

Automatic identification of subcellular phenotypes on human cell arrays


Christian Conrad et al. Genome Res. 2004 Jun.

Abstract

Light microscopic analysis of cell morphology provides a high-content readout of cell function and protein localization. Cell arrays and microwell transfection assays on cultured cells have made cell phenotype analysis accessible to high-throughput experiments. Both the localization of each protein in the proteome and the effect of RNAi knock-down of individual genes on cell morphology can be assayed by manual inspection of microscopic images. However, the use of morphological readouts for functional genomics requires fast and automatic identification of complex cellular phenotypes. Here, we present a fully automated platform for high-throughput cell phenotype screening combining human live cell arrays, screening microscopy, and machine-learning-based classification methods. Efficiency of this platform is demonstrated by classification of eleven subcellular patterns marked by GFP-tagged proteins. Our classification method can be adapted to virtually any microscopic assay based on cell morphology, opening a wide range of applications including large-scale RNAi screening in human cells.
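The classification step described in the abstract assigns each cell's extracted image-feature vector to one of the subcellular localization classes. As an illustration only (not the paper's actual classifiers, which include SVMs and Bayesian neural networks), a minimal nearest-centroid classifier on synthetic two-feature data sketches the idea; the class names and feature values here are hypothetical:

```python
import math
import random

def train_centroids(samples):
    """Compute one mean feature vector (centroid) per class label."""
    sums, counts = {}, {}
    for features, label in samples:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, v in enumerate(features):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc]
            for label, acc in sums.items()}

def classify(features, centroids):
    """Assign the class whose centroid is nearest in Euclidean distance."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return min(centroids, key=lambda label: dist(features, centroids[label]))

# Synthetic two-feature training set for two hypothetical localization classes.
random.seed(0)
training = ([([random.gauss(1.0, 0.2), random.gauss(0.0, 0.2)], "nuclear")
              for _ in range(50)] +
            [([random.gauss(0.0, 0.2), random.gauss(1.0, 0.2)], "cytoplasm")
              for _ in range(50)])
centroids = train_centroids(training)
print(classify([0.9, 0.1], centroids))  # near the "nuclear" centroid
```

The real system replaces these two synthetic features with hundreds of morphological features per cell and a more powerful classifier, but the mapping from feature vector to class label is the same.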


Figures

Figure 1
Workflow of the phenotype classification system. The mix of targeted DNA (e.g., GFP fusions of subcellular clones), gelatin, and transfection reagent is prepared in microwells and spotted in arrays into chamber slides by a DNA-spotting robot. Cultured cells are added to the chamber slides and transfected with the DNA on the arrayed spots. After 10 h of incubation at 37°C, the fluorescence of the expressed GFP fusions can be visualized. On the automatic scanning platform, a motorized stage and a motorized z-stepper perform the scanning of the live cells. Control of the illumination and automatic calculation of the integration time allow automatic acquisition of cell images. Cells with GFP signals are captured and image features are extracted. These features serve as input for training of the automated classification system.
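The feature-extraction step in this workflow reduces each single-cell image to a numeric vector. The features below (mean intensity, variance, bright-pixel fraction, and a crude horizontal gradient-energy texture measure) are illustrative stand-ins of my own choosing, not the paper's actual feature set:

```python
def extract_features(image):
    """Compute simple intensity and texture features from a 2D grayscale
    image, given as a list of rows of pixel values in [0, 1]."""
    pixels = [p for row in image for p in row]
    n = len(pixels)
    mean = sum(pixels) / n
    var = sum((p - mean) ** 2 for p in pixels) / n
    bright = sum(1 for p in pixels if p > 0.5) / n  # fraction of bright pixels
    # Horizontal gradient energy: a crude texture measure.
    grad = sum((row[i + 1] - row[i]) ** 2
               for row in image for i in range(len(row) - 1))
    return [mean, var, bright, grad]

# A tiny 3x3 test image with one uniformly bright row.
img = [[0.0, 0.0, 0.0],
       [1.0, 1.0, 1.0],
       [0.0, 0.0, 0.0]]
print(extract_features(img))
```

In practice, systems like the one described compute hundreds of such descriptors (intensity, texture, and shape statistics) per segmented cell before classification.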
Figure 2
Automatically captured images of live MCF7 cells representing the localization classes. (A) Cells were counterstained with Hoechst (blue), and different GFP-tagged cDNAs were expressed in cells transfected on cell arrays (green). Image acquisition is designed so that each image contains only one cell. From top left to bottom right: GFP-tagged NES→cytoplasm (96 images), EGFP→cyto-nuclear (170 images), GFP-tagged cDNA #488→mitochondria (108 images), YFP-tagged LB1→nuclear lamina (108 images), GFP-tagged ErbB1→plasma membrane (70 images), YFP-tagged SRb→endoplasmic reticulum (84 images), GFP-tagged cDNA #351→nucleoli (116 images), YFP-tagged cDNA #447→peroxisomes (119 images), YFP-tagged cDNA #22f21→microtubules (78 images), YFP-tagged H2B→nuclear (111 images), YFP-tagged GalT→Golgi (93 images) imaged from two different spatial directions. For cDNA references see Supplemental Table 1 (see also http://www.dkfz.de/LIFEdb/ and http://harvester.embl.de/; Wiemann et al. 2001). (B) Artifacts of the corresponding subcellular localization classes are shown in the right column: mitochondria artifacts (116 images), endoplasmic reticulum artifacts (99 images), microtubule artifacts (67 images), Golgi artifacts (77 images). In total, 1,035 images were labeled as artifacts and assigned to this class. Note that artifact cells show expression levels both below and above the level of expression typically observed for valid cells (left). Hence, the level of expression is not sufficient to discriminate artifact cells from valid cells (see also Supplemental Fig. 1).
Figure 3
Accuracy of phenotype classification. (A) Classification accuracy determined on the test set for three different classification algorithms using the first 25 ranked features obtained by three different feature selection methods. In contrast to the BayesANN approach, the ANN/GA runs with nonconvergent fivefold cross-validation as regularization. Because the result of the BayesANN classifier depends on the random initialization of network weights, the averaged classification accuracies and corresponding standard deviations of 10 classification runs on the same test set are shown, whereas the ANN/GA generates many similar networks during the evolutionary search process. Only the SVM was trained with the entire set of 323 features. (B) While BayesANN performed best on average in combination with SAM for feature selection, the overall best classifier was obtained by combining BayesANN with STEPWISE (82.6% accuracy). The cone diagram shows the single-class accuracies obtained by this overall best classifier for all subcellular classes and the artifact class.
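The feature selection referred to in panel A ranks the 323 features and keeps only the top 25 before training. As a hedged two-class sketch (a simple mean-separation score standing in for the SAM and STEPWISE procedures the paper actually uses; the data are invented), features can be ranked like this:

```python
import math

def separation_score(values_a, values_b):
    """Absolute difference of class means divided by the pooled standard
    deviation: a simple univariate class-separation score."""
    def mean(v): return sum(v) / len(v)
    def var(v, m): return sum((x - m) ** 2 for x in v) / len(v)
    ma, mb = mean(values_a), mean(values_b)
    pooled = math.sqrt((var(values_a, ma) + var(values_b, mb)) / 2) or 1e-12
    return abs(ma - mb) / pooled

def rank_features(class_a, class_b, k):
    """Return indices of the k features that best separate the two classes.
    class_a / class_b: lists of per-cell feature vectors."""
    n_features = len(class_a[0])
    scores = [separation_score([s[i] for s in class_a],
                               [s[i] for s in class_b])
              for i in range(n_features)]
    return sorted(range(n_features), key=lambda i: scores[i], reverse=True)[:k]

# Toy data: feature 1 separates the classes strongly, feature 2 weakly,
# feature 0 not at all.
class_a = [[0.5, 1.0, 3.0], [0.5, 1.1, 3.2]]
class_b = [[0.5, 5.0, 3.1], [0.5, 5.1, 2.9]]
print(rank_features(class_a, class_b, 2))  # → [1, 2]
```

Ranking features before training, as in panel A, both speeds up the classifiers and discards uninformative descriptors; the paper compares three such ranking methods against training the SVM on all 323 features.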


References

    1. Bishop, C.M. 2000. Neural networks for pattern recognition. Oxford University Press, New York.
    2. Bulinski, J.C., Odde, D.J., Howell, B.J., Salmon, T.D., and Waterman-Storer, C.M. 2001. Rapid dynamics of the microtubule binding of ensconsin in vivo. J. Cell Sci. 114: 3885–3897. - PubMed
    3. Burkhardt, H. and Siggelkow, S. 2001. Invariant features in pattern recognition—fundamentals and applications. In Nonlinear model-based image/video processing and analysis (eds. C. Kotropoulos and I. Pitas), pp. 269–307. John Wiley & Sons, New York.
    4. Chang, T. and Kuo, C.C. 1993. Texture analysis and classification with tree-structured wavelet transform. IEEE Transactions on Image Processing 2: 429–441. - PubMed
    5. Chen, X., Velliste, M., Weinstein, S., Jarvik, J.W., and Murphy, R.F. 2003. Location proteomics—Building subcellular location trees from high resolution 3D fluorescence microscope images of randomly-tagged proteins. Manipulation and Analysis of Biomolecules, Cells, and Tissues, Proceedings of SPIE 4962: 298–306.

WEB SITE REFERENCES

    1. http://www.dkfz.de/LIFEdb/; cDNA database.
    2. http://harvester.embl.de/; database cross-linker.
    3. http://www.csie.ntu.edu.tw/~cjlin/libsvm/; Support Vector Machine implementation.
    4. http://www.ncrg.aston.ac.uk/netlab/; neural network toolbox for Matlab.
    5. http://brain.unr.edu; source code of ANN (NevProp3) by Philip Goodman.
