Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jul 3;45(W1):W567-W572.
doi: 10.1093/nar/gkx425.

VCF.Filter: interactive prioritization of disease-linked genetic variants from sequencing data

Affiliations

VCF.Filter: interactive prioritization of disease-linked genetic variants from sequencing data

Heiko Müller et al. Nucleic Acids Res. .

Abstract

Next generation sequencing is widely used to link genetic variants to diseases, and it has massively accelerated the diagnosis and characterization of rare genetic diseases. After initial bioinformatic data processing, the interactive analysis of genome, exome, and panel sequencing data typically starts from lists of genetic variants in VCF format. Medical geneticists filter and annotate these lists to identify variants that may be relevant for the disease under investigation, or to select variants that are reported in a clinical diagnostics setting. We developed VCF.Filter to facilitate the search for disease-linked variants, providing a standalone Java program with a user-friendly interface for interactive variant filtering and annotation. VCF.Filter allows the user to define a broad range of filtering criteria through a graphical interface. Common workflows such as trio analysis and cohort-based filtering are pre-configured, and more complex analyses can be performed using VCF.Filter's support for custom annotations and filtering criteria. All filtering is documented in the results file, thus providing traceability of the interactive variant prioritization. VCF.Filter is an open source tool that is freely and openly available at http://vcffilter.rarediseases.at.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
VCF.Filter overview and workflow for interactive prioritization of genetic variants. VCF.filter takes a VCF file as input, implements several filtering and analysis methods, and produces a filtered VCF file as output. The user can optionally provide additional input files for customized filtering, such as lists of genomic regions to include or exclude, and a table of allele frequencies in a reference cohort. VCF.Filter also produces a visualization of genetic variants using the Hilbert curve and tables of cohort allele frequencies for variant filtering.
Figure 2.
Figure 2.
Analysis example using the ‘Filter variants’ module of VCF.Filter. Once a VCF file has been loaded into VCF.Filter, the user can interactively define filtering rules, for example to identify disease-linked variants. Filtering can be based on annotations in the VCF file (e.g. variant type, location, or variant effect predictions). In addition, the user can upload a table of variant allele frequencies in a suitable reference cohort and filter potentially disease-linked variants by cohort allele frequencies. Furthermore, lists of genomic regions for inclusion (pass) and exclusion (non-pass) can be configured to restrict the analysis to certain genomic regions and/or to specifically exclude other regions. This screenshot shows an example with filtering on one standard VCF field (CHROM) and on three custom VCF annotations. The output is written to the text area in the bottom part of the user interface and can also be saved to a VCF file.

References

    1. Danecek P., Auton A., Abecasis G., Albers C.A., Banks E., DePristo M.A., Handsaker R.E., Lunter G., Marth G.T., Sherry S.T. et al. . The variant call format and VCFtools. Bioinformatics. 2011; 27:2156–2158. - PMC - PubMed
    1. Sherry S.T., Ward M., Sirotkin K.. dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation. Genome Res. 1999; 9:677–679. - PubMed
    1. Landrum M.J., Lee J.M., Riley G.R., Jang W., Rubinstein W.S., Church D.M., Maglott D.R.. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 2014; 42:D980–D985. - PMC - PubMed
    1. Lek M., Karczewski K.J., Minikel E.V., Samocha K.E., Banks E., Fennell T., O’Donnell-Luria A.H., Ware J.S., Hill A.J., Cummings B.B. et al. . Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016; 536:285–291. - PMC - PubMed
    1. Cingolani P., Platts A., Wang le L., Coon M., Nguyen T., Wang L., Land S.J., Lu X., Ruden D.M.. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012; 6:80–92. - PMC - PubMed

Publication types