Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jul 25:9:75.
doi: 10.1186/s13148-017-0370-2. eCollection 2017.

Maternal blood contamination of collected cord blood can be identified using DNA methylation at three CpGs

Affiliations

Maternal blood contamination of collected cord blood can be identified using DNA methylation at three CpGs

Alexander M Morin et al. Clin Epigenetics. .

Abstract

Background: Cord blood is a commonly used tissue in environmental, genetic, and epigenetic population studies due to its ready availability and potential to inform on a sensitive period of human development. However, the introduction of maternal blood during labor or cross-contamination during sample collection may complicate downstream analyses. After discovering maternal contamination of cord blood in a cohort study of 150 neonates using Illumina 450K DNA methylation (DNAm) data, we used a combination of linear regression and random forest machine learning to create a DNAm-based screening method. We identified a panel of DNAm sites that could discriminate between contaminated and non-contaminated samples, then designed pyrosequencing assays to pre-screen DNA prior to being assayed on an array.

Results: Maternal contamination of cord blood was initially identified by unusual X chromosome DNA methylation patterns in 17 males. We utilized our DNAm panel to detect contaminated male samples and a proportional amount of female samples in the same cohort. We validated our DNAm screening method on an additional 189 sample cohort using both pyrosequencing and DNAm arrays, as well as 9 publically available cord blood 450K data sets. The rate of contamination varied from 0 to 10% within these studies, likely related to collection specific methods.

Conclusions: Maternal blood can contaminate cord blood during sample collection at appreciable levels across multiple studies. We have identified a panel of markers that can be used to identify this contamination, either post hoc after DNAm arrays have been completed, or in advance using a targeted technique like pyrosequencing.

Keywords: 450K; Blood banking; Contamination; Cord blood; DNA methylation; Genotyping; Maternal blood.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

All Drakenstein study participants gave informed consent to participate and the study was approved by University of Cape Town IRB. All other data was publically available.

Consent for publication

Not applicable.

Competing interests

MJJ, MSK, AMM, EGG, JLM, and LMM are in the process of applying for a patent relating to the work presented.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Principal component and X chromosome DNA methylation (DNAm) patterns revealed maternal blood contamination in cord blood. a Plotting the first two principal components of 450K DNAm data identified a number of male samples with DNAm patterns similar to female participants or intermediate between male and female. b Examining the distribution of X chromosome DNAm beta values in these samples revealed that the intermediate male samples clearly showed patterns indicative of a mixture of male (top) and female (bottom) distributions
Fig. 2
Fig. 2
DNA methylation at 10 autosomal CpGs was sufficient to correctly identify all known contaminated male samples, and found 13 contaminated female samples. a The 10 CpGs selected by the random forest method clearly separate cord and adult samples, and also clearly discriminate non-contaminated (N) from contaminated (C) male samples, and divide unknown (U) samples into two groups. b Counting the number of sites over thresholds per sample (x axis), contamination was called if at least 5 of the 10 CpGs were above the threshold. Unclear males were all non-contaminated, and 13 females were identified as being contaminated. c A subset of 3 out of the 10 CpGs can be used for pyrosequencing screening. Two thresholds are shown—one requiring two of the three CpGs to be above the threshold to be called contaminated (yellow), and one requiring all three (red)
Fig. 3
Fig. 3
Summary of performance of all methods used to predict cord blood contamination. Each column represents the same participant across each method. The 10 CpG method using 450K array data was the most reliable, but using a subset of three CpGs was sufficient to identify at least 82% of contaminated samples
Fig. 4
Fig. 4
Pre-screening using the pyrosequencing method correctly identified contaminated male samples. a Applying a cut-off of 2 CpGs above the threshold (yellow line) to the 3 CpG pyrosequencing method on validation data, 18 males and 15 females were identified as contaminated. b Principal component plot of EPIC DNA methylation data on all non-contaminated samples with two male samples that had been called contaminated by pyrosequencing showed that contaminated male samples had been correctly identified. c Using the 10 CpG method from EPIC data, only the 2 male samples known to be contaminated had more than 5 CpGs above the threshold (red line)
Fig. 5
Fig. 5
Identification of studies with significant contamination levels in public data. Using available data, we examined the 10 CpGs chosen to identify contamination, though some studies had previously filtered their data and some CpGs were not available. We called maternal contamination of samples if more than 50% of the available CpGs were above our contamination thresholds, and identified two studies (GSE54399 and PREDO) with contaminated samples

Similar articles

Cited by

References

    1. Hodyl NA, Roberts CT, Bianco-Miotto T. Cord blood DNA methylation biomarkers for predicting neurodevelopmental outcomes. Genes (Basel). 2016;7. - PMC - PubMed
    1. Küpers LK, Xu X, Jankipersadsing SA, Vaez A, la Bastide-van Gemert S, Scholtens S, et al. DNA methylation mediates the effect of maternal smoking during pregnancy on birthweight of the offspring. Int J Epidemiol. 2015;44:1224–1237. doi: 10.1093/ije/dyv048. - DOI - PMC - PubMed
    1. Hermansen MC. Nucleated red blood cells in the fetus and newborn. Arch Dis Child Fetal Neonatal Ed. 2001;84:211F–2215. doi: 10.1136/fn.84.3.F211. - DOI - PMC - PubMed
    1. Armson BA. Maternal/Fetal Medicine Committee, Society of Obstetricians and Gynaecologists of Canada. Umbilical cord blood banking: implications for perinatal care providers. J Obstet Gynaecol Can. 2005;27:263–290. doi: 10.1016/S1701-2163(16)30520-5. - DOI - PubMed
    1. Lo YM, Lau TK, Chan LY, Leung TN, Chang AM. Quantitative analysis of the bidirectional fetomaternal transfer of nucleated cells and plasma DNA. Clin Chem. 2000;46:1301–1309. - PubMed

Publication types