A fast and accurate method for detection of IBD shared haplotypes in genome-wide SNP data
- PMID: 28176766
- PMCID: PMC5437913
- DOI: 10.1038/ejhg.2017.6
A fast and accurate method for detection of IBD shared haplotypes in genome-wide SNP data
Abstract
Identical by descent (IBD) segments are used to understand a number of fundamental issues in genetics. IBD segments are typically detected using long stretches of identical alleles between haplotypes in phased, whole-genome SNP data. Phase or SNP call errors in genomic data can degrade accuracy of IBD detection and lead to false-positive/negative calls and to under/overextension of true IBD segments. Furthermore, the number of comparisons increases quadratically with sample size, requiring high computational efficiency. We developed a new IBD segment detection program, FISHR (Find IBD Shared Haplotypes Rapidly), in an attempt to accurately detect IBD segments and to better estimate their endpoints using an algorithm that is fast enough to be deployed on very large whole-genome SNP data sets. We compared the performance of FISHR to three leading IBD segment detection programs: GERMLINE, refined IBD, and HaploScore. Using simulated and real genomic sequence data, we show that FISHR is slightly more accurate than all programs at detecting long (>3 cm) IBD segments but slightly less accurate than refined IBD at detecting short (~1 cm) IBD segments. More centrally, FISHR outperforms all programs in determining the true endpoints of IBD segments, which is crucial for several applications of IBD information. FISHR takes two to three times longer than GERMLINE to run, whereas both GERMLINE and FISHR were orders of magnitude faster than refined IBD and HaploScore. Overall, FISHR provides accurate IBD detection in unrelated individuals and is computationally efficient enough to be utilized on large SNP data sets >60 000 individuals.
Conflict of interest statement
The authors declare no conflict of interest.
Figures




References
-
- Setty MN, Gusev A, Pe'er I: HLA type inference via haplotypes identical by descent. J Comput Biol 2011; 18: 483–493. - PubMed
-
- Soi S, Scheinfeldt L, Lambert C et al Demographic histories of African hunting-gathering populations inferred from genome-wide SNP variation. International Congress of Human Genetics/American Society of Human Genetics meeting, Montreal, Canada 2011; (abstract 100).
Publication types
MeSH terms
Grants and funding
- HHSN268201100012C/HL/NHLBI NIH HHS/United States
- HHSN268201100009I/HL/NHLBI NIH HHS/United States
- R01 DK058845/DK/NIDDK NIH HHS/United States
- HHSN268201100010C/HL/NHLBI NIH HHS/United States
- HHSN268201100011I/HL/NHLBI NIH HHS/United States
- HHSN268201100011C/HL/NHLBI NIH HHS/United States
- HHSN268201100006C/HL/NHLBI NIH HHS/United States
- HHSN268201100005I/HL/NHLBI NIH HHS/United States
- T32 MH016880/MH/NIMH NIH HHS/United States
- HHSN268201100007I/HL/NHLBI NIH HHS/United States
- R01 MH100141/MH/NIMH NIH HHS/United States
- U01 HG004399/HG/NHGRI NIH HHS/United States
- P01 CA087969/CA/NCI NIH HHS/United States
- HHSN268201100008C/HL/NHLBI NIH HHS/United States
- HHSN268201100005G/HL/NHLBI NIH HHS/United States
- HHSN268201100008I/HL/NHLBI NIH HHS/United States
- HHSN268201100007C/HL/NHLBI NIH HHS/United States
- P01 CA055075/CA/NCI NIH HHS/United States
- U01 HG004402/HG/NHGRI NIH HHS/United States
- U01 HG004424/HG/NHGRI NIH HHS/United States
- HHSN268201100009C/HL/NHLBI NIH HHS/United States
- HHSN268201100005C/HL/NHLBI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous