The haplotype-resolved genome and epigenome of the aneuploid HeLa cancer cell line
- PMID: 23925245
- PMCID: PMC3740412
- DOI: 10.1038/nature12064
Abstract
The HeLa cell line was established in 1951 from cervical cancer cells taken from a patient, Henrietta Lacks. This was the first successful attempt to immortalize human-derived cells in vitro. The robust growth and unrestricted distribution of HeLa cells resulted in their broad adoption, both intentionally and through widespread cross-contamination, and for the past 60 years the cell line has served a role analogous to that of a model organism. The cumulative impact of the HeLa cell line on research is demonstrated by its occurrence in more than 74,000 PubMed abstracts (approximately 0.3%). The genomic architecture of HeLa remains largely unexplored beyond its karyotype, partly because, like many cancers, its extensive aneuploidy renders such analyses challenging. We carried out haplotype-resolved whole-genome sequencing of the HeLa CCL-2 strain, examined point- and indel-mutation variations, mapped copy-number variations and loss of heterozygosity regions, and phased variants across full chromosome arms. We also investigated variation and copy-number profiles for HeLa S3 and eight additional strains. We find that HeLa is relatively stable in terms of point variation, with few new mutations accumulating after early passaging. Haplotype resolution facilitated reconstruction of an amplified, highly rearranged region of chromosome 8q24.21 at which the human papillomavirus type 18 (HPV-18) genome integrated, an event likely to have initiated tumorigenesis. We combined these maps with RNA-seq and ENCODE Project data sets to phase the HeLa epigenome. This revealed strong, haplotype-specific activation of the proto-oncogene MYC by the integrated HPV-18 genome approximately 500 kilobases upstream, and enabled global analyses of the relationship between gene dosage and expression. These data provide an extensively phased, high-quality reference genome for past and future experiments relying on HeLa, and demonstrate the value of haplotype resolution for characterizing cancer genomes and epigenomes.