Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 23;9(1):96.
doi: 10.1038/s41597-022-01236-2.

R code and downstream analysis objects for the scRNA-seq atlas of normal and tumorigenic human breast tissue

Affiliations

R code and downstream analysis objects for the scRNA-seq atlas of normal and tumorigenic human breast tissue

Yunshun Chen et al. Sci Data. .

Abstract

Breast cancer is a common and highly heterogeneous disease. Understanding cellular diversity in the mammary gland and its surrounding micro-environment across different states can provide insight into cancer development in the human breast. Recently, we published a large-scale single-cell RNA expression atlas of the human breast spanning normal, preneoplastic and tumorigenic states. Single-cell expression profiles of nearly 430,000 cells were obtained from 69 distinct surgical tissue specimens from 55 patients. This article extends the study by providing quality filtering thresholds, downstream processed R data objects, complete cell annotation and R code to reproduce all the analyses. Data quality assessment measures are presented and details are provided for all the bioinformatic analyses that produced results described in the study.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Dataset overview. (a) Diagram showing the data processing pipeline from sample collection to downstream bioinformatics analyses. (b) Schematic overview of the all the integration analyses and the samples involved in each integration analysis. Under each category, the names of the samples are listed and the total number of samples is shown in the bracket.
Fig. 2
Fig. 2
Quality control and cell filtering. Box plots of (a) the library sizes and (b) the numbers of detected genes for all the cells in each of the 69 samples before filtering. Boxes show median and quartiles and whiskers show minimum and maximum. Boxes are colored by tumor type. (c) Number of cells in each of the 69 samples. Blue segments show the number of cells that are kept after the cell filtering while red segments show filtered cells. The proportion of filtered cells is labelled on top of each bar.
Fig. 3
Fig. 3
Bulk RNA-seq. (a) Number of mapped and unmapped read pairs for each sample in the human mammary gland bulk RNA-seq. (b) MDS plot showing that the bulk RNA-seq samples cluster by cell type. Distances on the plot correspond to root-mean-square log2-fold-change for the top 500 differential genes between each pair of samples. Percentage variance explained is also shown. (c) Biological coefficient of variation for each gene in the bulk RNA-seq data.

References

    1. Bray F, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians. 2018;68:394–424. - PubMed
    1. Visvader JE. Keeping abreast of the mammary epithelial hierarchy and breast tumorigenesis. Genes & Development. 2009;23:2563–2577. doi: 10.1101/gad.1849509. - DOI - PMC - PubMed
    1. Sotiriou C, et al. Breast cancer classification and prognosis based on gene expression profiles from a population-based study. Proceedings of the National Academy of Sciences. 2003;100:10393–10398. doi: 10.1073/pnas.1732912100. - DOI - PMC - PubMed
    1. Turner NC, Reis-Filho JS. Basal-like breast cancer and the BRCA1 phenotype. Oncogene. 2006;25:5846–5853. doi: 10.1038/sj.onc.1209876. - DOI - PubMed
    1. Pal B, et al. A single-cell RNA expression atlas of normal, preneoplastic and tumorigenic states in the human breast. EMBO Journal. 2021;40:e107333. doi: 10.15252/embj.2020107333. - DOI - PMC - PubMed

Publication types