Using RNA-Seq to Discover Genetic Polymorphisms That Produce Hidden Splice Variants
- PMID: 28766294
- DOI: 10.1007/978-1-4939-7204-3_10
Using RNA-Seq to Discover Genetic Polymorphisms That Produce Hidden Splice Variants
Abstract
RNA-seq is a powerful and popular technology for studying posttranscriptional regulation of gene expression, such as alternative splicing. The first step in analyzing RNA-seq data is to map the sequenced reads back to the genome. However, commonly used RNA-seq aligners use the consensus splice site dinucleotide motifs to map reads across splice junctions. This can be deceiving due to genomic variants that create novel splice site dinucleotides, leaving the personal splice junction reads un-mapped to the reference genome. We developed and evaluated a method called RNA Personal Genome Alignment Analyzer (rPGA) to identify "hidden" splicing variations in personal transcriptomes, by mapping personal RNA-seq data to personal genomes. Our work demonstrates that the personal genome approach to RNA-seq read alignment enables the discovery of a large but previously unknown catalog of splicing variations in human populations.
Keywords: Alternative splicing; RNA-seq; Single-nucleotide polymorphisms (SNPs).
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources