Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;7(11):e49575.
doi: 10.1371/journal.pone.0049575. Epub 2012 Nov 19.

Unexpected relationships and inbreeding in HapMap phase III populations

Affiliations

Unexpected relationships and inbreeding in HapMap phase III populations

Eric L Stevens et al. PLoS One. 2012.

Abstract

Correct annotation of the genetic relationships between samples is essential for population genomic studies, which could be biased by errors or omissions. To this end, we used identity-by-state (IBS) and identity-by-descent (IBD) methods to assess genetic relatedness of individuals within HapMap phase III data. We analyzed data from 1,397 individuals across 11 ethnic populations. Our results support previous studies (Pemberton et al., 2010; Kyriazopoulou-Panagiotopoulou et al., 2011) assessing unknown relatedness present within this population. Additionally, we present evidence for 1,657 novel pairwise relationships across 9 populations. Surprisingly, significant Cotterman's coefficients of relatedness K1 (IBD1) values were detected between pairs of known parents. Furthermore, significant K2 (IBD2) values were detected in 32 previously annotated parent-child relationships. Consistent with a hypothesis of inbreeding, regions of homozygosity (ROH) were identified in the offspring of related parents, of which a subset overlapped those reported in previous studies (Gibson et al. 2010; Johnson et al. 2011). In total, we inferred 28 inbred individuals with ROH that overlapped areas of relatedness between the parents and/or IBD2 sharing at a different genomic locus between a child and a parent. Finally, 8 previously annotated parent-child relationships had unexpected K0 (IBD0) values (resulting from a chromosomal abnormality or genotype error), and 10 previously annotated second-degree relationships along with 38 other novel pairwise relationships had unexpected IBD2 (indicating two separate paths of recent ancestry). These newly described types of relatedness may impact the outcome of previous studies and should inform the design of future studies relying on the HapMap Phase III resource.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. IBD estimates of previously annotated and novel relatedness in phase III HapMap.
Each circle represents a pair of individuals with estimated Cotterman coefficients of relatedness K0, K1, and K2 (percent of the genome shared IBD0, IBD1, and IBD2). (A) Previously annotated relationships given by the International HapMap Consortium , Pemberton et al. , and Kyriazopoulou-Panagiotopoulou et al. were plotted by group (x-axis) and K1 values (y-axis) and labeled by their degree of relationship. Arrow 1 corresponds to identical samples NA21737/NA21344. (B) Unexpected K2 values (y-axis) in previously annotated parent-child and second-degree relatedness for each group (x-axis). Only K2 values greater than 0.001 are shown. Arrow 2 corresponds to NA21362/NA21438. (C) Estimated IBD0 (y-axis) in previously annotated parent-child relationships for each group (x-axis). Only K0 values greater than 0.001 are shown. Arrows 3–5 highlight NA12874/NA12865, NA12889/NA12877, and NA10863/NA12234, respectively. Only K2 values greater than 0.001 are shown. (D) Novel relatedness between pairs of individuals separated by group (x-axis) and estimated K1 (y-axis). Only K1 values greater than 0.025 are shown. (E) Novel relatedness between pairs of individuals previously identified in Panel B for MKK and MXL (x-axis) with unexpected K2 (y-axis). (F) Inferred degrees of relationship (including those unable to be called; x-axis) plotted as a function of K1. All 2260 pairwise comparisons inferred to be related from any study (including this one) are shown, excluding identical samples. Note the overlap between percent of genome shared IBD1 and degree of relationship. Abbreviation: NC, no relationship called; r value, relatedness value.
Figure 2
Figure 2. Evidence for consanguinity in HapMap Phase III individuals.
Pairwise comparisons of IBS were plotted across a chromosome by position for pairs of individuals that had unexpected IBD1 and IBD2 for their relationship type. (A) IBS observations for two parents (YRI father/mother NA18504/NA18505) are shown for chromosome 4. Note region 1 which indicates an absence of IBS0 calls and inferred IBD1 status. (B) IBS measurements between father and son (NA18504/NA18503) are plotted for chromosome 4. Note region 2 in which there are few IBS0 and IBS1 calls thus implying IBD2 status. (C). Genotypes of the son (NA18503) are shown for chromosome 4. Note region 3 in which there is a lack of AB calls, aligning with region 1, thus indicating autozygosity. (D) Ideogram for chromosome 4. (E) IBS observations between two YRI parents (father/mother NA19121/NA19122) are plotted along chromosome 20. Note region 1 in which there is a lack of IBS0 calls indicating an IBD1 region. (F) Genotypes of the son (NA19123) are shown for chromosome 20. Note region 1 in which there are zero AB calls in the same region of IBD1 between the parents implying autozygosity in the child. (G) Ideogram for chromosome 20.
Figure 3
Figure 3. Reconstruction of a partial pedigree from the MKK group.
We analyzed MKK genotype data using IBD analysis and inferred the familial relationships of 61 individuals with 46 being related to at least 1 other person. This graph contains relationships constructed from second-degree, full-sibling, parent-child, and identical relationships (with the exception of NA21352 and NA21351 who are inferred to be first-cousins based on their second-degree relationship to NA21414; see top left of figure). All indicated relationships are based on previous analysis (siblings: thick green lines), previous annotation (family trios; family ID), and inferred analyses (sibling relationships, thick blue lines; corrected parent-child orientation, thick red lines; corrections made to annotated relationships, thick yellow lines; other familial relationships; thin black lines). Dashed rectangles indicate family units annotated by the HapMap project at the Coriell website. F indicates family identifier (e.g. F2654). Individual identifiers are shown as the last three digits of NA21xxx (e.g. 353 at the upper left of the figure corresponds to individual NA21353). All IBD information is given in Table S1. Note that several individuals who are part of MKK (e.g. NA12310 in family 2566) and for whom cell lines were created did not have SNP data as part of the HapMap Phase III release.

Similar articles

Cited by

References

    1. Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, et al. (2010) Integrating common and rare genetic variation in diverse human populations. Nature 467: 52–58. - PMC - PubMed
    1. International HapMap Consortium (2005) A haplotype map of the human genome. Nature 437: 1299–1320. - PMC - PubMed
    1. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, et al. (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449: 851–861. - PMC - PubMed
    1. Donnelly P (2008) Progress and challenges in genome-wide association studies in humans. Nature 456: 728–731. - PubMed
    1. Cheung VG, Spielman RS, Ewens KG, Weber TM, Morley M, et al. (2005) Mapping determinants of human gene expression by regional and genome-wide association. Nature 437: 1365–1369. - PMC - PubMed