Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 27;14(1):436.
doi: 10.1186/s13104-021-05851-x.

Power calculator for detecting allelic imbalance using hierarchical Bayesian model

Affiliations

Power calculator for detecting allelic imbalance using hierarchical Bayesian model

Katrina Sherbina et al. BMC Res Notes. .

Abstract

Objective: Allelic imbalance (AI) is the differential expression of the two alleles in a diploid. AI can vary between tissues, treatments, and environments. Methods for testing AI exist, but methods are needed to estimate type I error and power for detecting AI and difference of AI between conditions. As the costs of the technology plummet, what is more important: reads or replicates?

Results: We find that a minimum of 2400, 480, and 240 allele specific reads divided equally among 12, 5, and 3 replicates is needed to detect a 10, 20, and 30%, respectively, deviation from allelic balance in a condition with power > 80%. A minimum of 960 and 240 allele specific reads divided equally among 8 replicates is needed to detect a 20 or 30% difference in AI between conditions with comparable power. Higher numbers of replicates increase power more than adding coverage without affecting type I error. We provide a Python package that enables simulation of AI scenarios and enables individuals to estimate type I error and power in detecting AI and differences in AI between conditions.

Keywords: Allele specific reads; Allelic imbalance; Biological replicates; Power; Simulation; Type I error.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Read counts are simulated for different scenarios in two conditions. A scenario is defined as a specific number of simulations, number of allele specific reads, number of biological replicates (bioreps), level of allelic imbalance (AI) θ, and the probability of mapping an allele g1 (g2) specific read. Without loss of generality, let allele g1 be allele A and g2 be allele C (blue boxes). The number of allele specific reads (yellow reads) is the sum of unambiguously mapped reads in the experiment. Grey reads are reads that map equally well, i.e. ambiguously, to both alleles. Biological replicates in an experiment are samples from the same genotype and condition. In this example, there are k biological replicates, 12×k allele specific reads, and the probability of an allele specific read is ri,g1=ri,g2=0.8. The null H1 and H2 hypotheses are allelic balance θ1 = 0.5 in condition 1 (ex. liver) and θ2 = 0.5 in condition 2 (ex. kidney), respectively. These cases are used to estimate the Type I error in rejecting allelic balance in conditions 1 (H1) and 2 (H2). In this example, θ1 = 0.55 under the alternative (alt) H1 hypothesis and θ2 = 0.55 under the alternative (alt) H2 hypothesis. These cases are used to estimate the power in rejecting allelic balance in conditions 1 (H1) and 2 (H2). θ1 = 0.5 and θ2 = 0.55 under the alternative (alt) H3 hypothesis, which allows estimation of the power rejecting equal levels of AI between the two conditions (H3). The null H3 hypothesis is simulated in both the complete null case: θ1 = θ2 = 0.5 and in the scenario where there is allelic imbalance in both conditions θ1 = θ2 = 0.55. These cases can be used to estimate the Type I error in rejecting equal levels of AI between the two conditions (H3)
Fig. 2
Fig. 2
Variations in type I error (y-axis) are shown as a function of the number of biological replicates, or bioreps (x-axis) assuming different numbers of allele specific reads. H1 and H3 refer to the null hypothesis of allelic balance within a condition (H1) and the null hypothesis of equal levels of AI between the two conditions (H3). The Type I Error (y-axis) is computed as the proportion of simulations for which the Bayesian evidence against allelic balance within a condition or equal AI between conditions is < 0.05. Plots a and b show eight simulated values of the number (#) of allele specific reads, which is the sum of the reads that map unambiguously to an allele in the experiment. Plots c and d show four simulated values of the number (#) of allele specific reads per bioreps, which is the number of allele specific reads divided by the number of bioreps. ΔAI is deviation from the null, i.e. deviation from allelic balance in condition (ΔAI1) or the relative difference in the levels of allelic imbalance between the two conditions (ΔAI3)The probability of an allele specific read is ri,g1 = ri,g2 = 0.8 and there are 1000 simulations
Fig. 3
Fig. 3
H1 refers to simulations under the alternative hypothesis of allelic imbalance within a condition and H3 refers to unequal levels of AI between the two conditions. For H1, the x-axis is the effect size, which is the relative deviation from allelic balance ΔAI1= θ-θ0θ0, where θ0=0.5. For H3, the x-axis is the relative difference in levels of AI between the two conditions ΔAI3= θ2-θ1θ1 where the first condition is simulated under the null hypothesis and the second under the alternative hypothesis θ0.5. The power (y-axis) is computed as the proportion of simulations for which the Bayesian evidence against allelic balance within a condition or against equal levels of AI between conditions is < 0.05. There are 1000 simulations and the probability of an allele specific read is ri,g1 = ri,g2 = 0.8. Simulations for 3, 4, 5, 6, 8, and 12 biological replicates (bioreps, x-axis) for varying numbers (#) of allele specific reads are reported

Similar articles

Cited by

References

    1. Wittkopp PJ, Haerum BK, Clark AG. Evolutionary changes in cis and trans gene regulation. Nature. 2004;430:85–88. doi: 10.1038/nature02698. - DOI - PubMed
    1. Genissel A, McIntyre LM, Wayne ML, Nuzhdin SV. Cis and trans regulatory effects contribute to natural variation in transcriptome of drosophila melanogaster. Mol Biol Evol. 2007;25:101–110. doi: 10.1093/molbev/msm247. - DOI - PubMed
    1. Graze RM, McIntyre LM, Main BJ, Wayne ML, Nuzhdin SV. Regulatory divergence in Drosophila melanogaster and D. simulans, a genomewide analysis of allele-specific expression. Genetics. 2009;183:547–561. doi: 10.1534/genetics.109.105957. - DOI - PMC - PubMed
    1. Graze RM, Novelo LL, Amin V, Fear JM, Casella G, Nuzhdin SV, et al. Allelic imbalance in drosophila hybrid heads: exons, isoforms, and evolution. Mol Biol Evol. 2012;29:1521–1532. doi: 10.1093/molbev/msr318. - DOI - PMC - PubMed
    1. Zou F, Sun W, Crowley JJ, Zhabotynsky V, Sullivan PF, de Pardo-Manuel Villena F. A novel statistical approach for jointly analyzing RNA-Seq data from F1 reciprocal crosses and inbred lines. Genetics. 2014;197:389–399. doi: 10.1534/genetics.113.160119. - DOI - PMC - PubMed