Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014;11(5):523-544.
doi: 10.2217/pme.14.34.

The road from next-generation sequencing to personalized medicine

Affiliations

The road from next-generation sequencing to personalized medicine

Manuel L Gonzalez-Garay. Per Med. 2014.

Abstract

Moving from a traditional medical model of treating pathologies to an individualized predictive and preventive model of personalized medicine promises to reduce the healthcare cost on an overburdened and overwhelmed system. Next-generation sequencing (NGS) has the potential to accelerate the early detection of disorders and the identification of pharmacogenetics markers to customize treatments. This review explains the historical facts that led to the development of NGS along with the strengths and weakness of NGS, with a special emphasis on the analytical aspects used to process NGS data. There are solutions to all the steps necessary for performing NGS in the clinical context where the majority of them are very efficient, but there are some crucial steps in the process that need immediate attention.

Keywords: CADD; GWAVA; NGS; functional prediction program; genomics; personalized medicine; workflow management system.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Timeline: the major events in next-generation sequencing. On the left is the year of the event
EST: Expressed sequence tag; IHGSC: International human genome sequencing consortium; ENCODE: Encyclopedia of DNA elements; NCBI: National Center for Biotechnology Information; WGS: Whole-exome sequencing.
Figure 2
Figure 2. Generic pipeline for the analysis of next-generation sequencing
Multiple steps involved in the analysis of data from the next-generation sequencing. The paired-end short reads, from the sequencing machine, are submitted to a quality control process. The adaptors are removed from the reads, and then the reads are mapped to the human reference by using short-read sequencing mapping tools. The alignments in the sequence alignment/map format are cleaned with tools like Pickard and transformed into a binary version of the sequence alignment/map format BAM. The BAM file is processed with tools like the Genome Analysis Toolkit to clean up the alignments. Quality control reports are generated, and variants are extracted by the use of variant callers. The document containing the variants or variant call format is annotated and filtered. Low-frequency variants that are known or predicted to be damaging are validated and used to generate a final report to the physicians or genetic counselors. BAM: Binary Sequence Alignment/Map format; dbNSFP: Lightweight database of human nonsynonymous SNPs and their functional predictions; GATK: Genome Analysis Toolkit; HGMD: Human gene mutation database; HPG: High performance genomics; MAF: Minor allele frequency; QC: Quality control; SAM: Sequence Alignment/Map format; SRSMT: Short read sequencing mapping tools; VEF: Variant effect predictor; VCF: Variant call format.
Figure 3
Figure 3. The road from next-generation sequencing to personalized medicine
An overall view of how next-generation sequencing will be incorporated into the medical healthcare system. At the time of birth, a small sample of blood is taken from the patient and submitted to whole genome sequencing. The physicians and genetic counselors will provide a detailed family and medical history to an entity that will store and analyze the next-generation sequencing data. This entity will receive additional information such as metabolomics, proteomics and transcriptomes, among others, as well as new bioinformatics interpretation will be performed in collaboration with molecular biologist, physicians and genetic counselors. The physicians will review the reports and formulate recommendations and treatments for the patient. The process will be interactive with constant communication between the doctor, patient and entity in charge of the data interpretation.

References

    1. Bloom DE, Cafiero ET, Jané-Llopis E, et al. The global economic burden of noncommunicable diseases. Geneva: World Economic Forum; www3.weforum.org/docs/WEF_Harvard_HE_GlobalEconomicBurdenNonCommunicable....
    1. Caskey CT, Gonzalez-Garay ML, Pereira S, Mcguire AL. Adult genetic risk screening. Annu Rev Med. 2014;65:1–17. - PubMed
    1. Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA. 1977;74(12):5463–5467. - PMC - PubMed
    1. Wetterstrand KA. DNA sequencing costs: data from the NHGRI Genome Sequencing Program (GSP) www.genome.gov/sequencingcosts.
    1. NHGRI: all about the Human Genome Project (HGP) www.genome.gov/10001772.