BMC Res Notes. 2021 Nov 27;14(1):436.

doi: 10.1186/s13104-021-05851-x.

Power calculator for detecting allelic imbalance using hierarchical Bayesian model

Katrina Sherbina¹, Luis G León-Novelo², Sergey V Nuzhdin³, Lauren M McIntyre⁴, Fabio Marroni⁵

Affiliations

¹ Quantitative and Computational Biology Section, University of Southern California, Los Angeles, CA, 90046, USA.
² Department of Biostatistics and Data Science, The University of Texas Health Science Center at Houston-School of Public Health, Houston, TX, 77030, USA.
³ Molecular and Computational Biology Section, University of Southern California, Los Angeles, CA, 90046, USA.
⁴ Genetics Institute and Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32603, USA.
⁵ Dipartimento di Scienze Agroalimentari, Ambientali e Animali, Università di Udine, 33100, Udine, Italy. fabio.marroni@uniud.it.

PMID: 34838135
PMCID: PMC8626927
DOI: 10.1186/s13104-021-05851-x

Katrina Sherbina et al. BMC Res Notes. 2021.

Abstract

Objective: Allelic imbalance (AI) is the differential expression of the two alleles in a diploid. AI can vary between tissues, treatments, and environments. Methods for testing AI exist, but methods are still needed to estimate type I error and power for detecting AI and differences in AI between conditions. As the cost of the technology plummets, which matters more: reads or replicates?

Results: We find that a minimum of 2400, 480, and 240 allele specific reads, divided equally among 12, 5, and 3 replicates, is needed to detect a 10, 20, and 30% deviation from allelic balance, respectively, in a condition with power > 80%. A minimum of 960 and 240 allele specific reads divided equally among 8 replicates is needed to detect a 20 and 30% difference in AI between conditions, respectively, with comparable power. Adding replicates increases power more than adding coverage, without affecting type I error. We provide a Python package that simulates AI scenarios and lets users estimate type I error and power for detecting AI and differences in AI between conditions.
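
The totals quoted in the Results translate into per-replicate read depths by simple division. The short sketch below only does that arithmetic; the variable and function names are illustrative and are not taken from the authors' package.

```python
# Minimum designs quoted in the Results, as
# (effect size in %, total allele specific reads, number of replicates).
within_condition = [(10, 2400, 12), (20, 480, 5), (30, 240, 3)]
between_conditions = [(20, 960, 8), (30, 240, 8)]


def reads_per_replicate(total_reads, n_replicates):
    """Allele specific reads per replicate when reads are divided equally."""
    if total_reads % n_replicates != 0:
        raise ValueError("reads cannot be divided equally among replicates")
    return total_reads // n_replicates


for label, designs in (("within", within_condition),
                       ("between", between_conditions)):
    for effect, total, reps in designs:
        # Columns: comparison, effect size (%), total reads, replicates, reads per replicate
        print(label, effect, total, reps, reads_per_replicate(total, reps))
```

By this arithmetic, the 10% design corresponds to 200 allele specific reads in each of 12 replicates, while the 30% design corresponds to 80 reads in each of 3 replicates.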

Keywords: Allele specific reads; Allelic imbalance; Biological replicates; Power; Simulation; Type I error.

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

**Fig. 1**
Read counts are simulated for different scenarios in two conditions. A scenario is defined by a specific number of simulations, number of allele specific reads, number of biological replicates (bioreps), level of allelic imbalance (AI) θ, and probability of mapping an allele g1 (g2) specific read. Without loss of generality, let allele g1 be allele A and g2 be allele C (blue boxes). The number of allele specific reads (yellow reads) is the sum of unambiguously mapped reads in the experiment. Grey reads are reads that map equally well, *i.e.* ambiguously, to both alleles. Biological replicates in an experiment are samples from the same genotype and condition. In this example, there are $k$ biological replicates, $12 \times k$ allele specific reads, and the probability of an allele specific read is $r_{i,g1} = r_{i,g2} = 0.8$. The null H1 and H2 hypotheses are allelic balance, θ₁ = 0.5 in condition 1 (e.g., liver) and θ₂ = 0.5 in condition 2 (e.g., kidney), respectively. These cases are used to estimate the type I error in rejecting allelic balance in conditions 1 (H1) and 2 (H2). In this example, θ₁ = 0.55 under the alternative (alt) H1 hypothesis and θ₂ = 0.55 under the alternative (alt) H2 hypothesis. These cases are used to estimate the power to reject allelic balance in conditions 1 (H1) and 2 (H2). Under the alternative (alt) H3 hypothesis, θ₁ = 0.5 and θ₂ = 0.55, which allows estimation of the power to reject equal levels of AI between the two conditions (H3). The null H3 hypothesis is simulated both in the complete null case, θ₁ = θ₂ = 0.5, and in the scenario where there is allelic imbalance in both conditions, θ₁ = θ₂ = 0.55. These cases can be used to estimate the type I error in rejecting equal levels of AI between the two conditions (H3).
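
A scenario like the one in Fig. 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' simulator: it draws each allele specific read independently (plain binomial sampling), ignores biological variation between replicates, and assumes that unequal mapping probabilities shift the expected g1 fraction to θ·r_g1 / (θ·r_g1 + (1 − θ)·r_g2). The function name `simulate_scenario` and its parameters are hypothetical.

```python
import random


def simulate_scenario(n_allele_specific_reads, n_bioreps, theta,
                      r_g1=0.8, r_g2=0.8, seed=None):
    """Return a list of (g1_count, g2_count), one pair per biological replicate.

    Assumption (not from the paper): reads are independent draws, so the
    g1 count in each replicate is binomial.
    """
    if n_allele_specific_reads % n_bioreps != 0:
        raise ValueError("reads must divide equally among bioreps")
    rng = random.Random(seed)
    reads_per_biorep = n_allele_specific_reads // n_bioreps
    # Expected fraction of allele specific reads assigned to g1 after
    # accounting for the mapping probabilities (equals theta if r_g1 == r_g2).
    p_g1 = theta * r_g1 / (theta * r_g1 + (1.0 - theta) * r_g2)
    counts = []
    for _ in range(n_bioreps):
        g1 = sum(1 for _ in range(reads_per_biorep) if rng.random() < p_g1)
        counts.append((g1, reads_per_biorep - g1))
    return counts


# Fig. 1 example with k = 3 bioreps, 12 * k reads, and theta = 0.55.
print(simulate_scenario(12 * 3, 3, theta=0.55, seed=1))
```

Each returned pair sums to the per-replicate read depth (12 here). A more faithful simulator would add replicate-level variation, which this sketch leaves out.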

**Fig. 2**
Variations in type I error (y-axis) are shown as a function of the number of biological replicates, or bioreps (x-axis), assuming different numbers of allele specific reads. H1 and H3 refer to the null hypothesis of allelic balance within a condition (H1) and the null hypothesis of equal levels of AI between the two conditions (H3). The type I error (y-axis) is computed as the proportion of simulations for which the Bayesian evidence against allelic balance within a condition or equal AI between conditions is $< 0.05$. Plots a and b show eight simulated values of the number (#) of allele specific reads, which is the sum of the reads that map unambiguously to an allele in the experiment. Plots c and d show four simulated values of the number (#) of allele specific reads per biorep, which is the number of allele specific reads divided by the number of bioreps. $\Delta AI$ is the deviation from the null, *i.e.* the deviation from allelic balance in a condition (${\Delta AI}_{1}$) or the relative difference in the levels of allelic imbalance between the two conditions (${\Delta AI}_{3}$). The probability of an allele specific read is $r_{i,g1} = r_{i,g2} = 0.8$ and there are 1000 simulations.
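
The caption defines type I error as the proportion of simulations whose evidence value falls below 0.05. A minimal sketch of that bookkeeping follows. The function names are hypothetical, the evidence values are made up for illustration, and the standard error is the usual binomial formula for a simulated proportion rather than anything reported in the paper.

```python
import math


def rejection_rate(evidence_values, threshold=0.05):
    """Proportion of simulations whose evidence value is below the threshold."""
    if not evidence_values:
        raise ValueError("need at least one simulation")
    rejected = sum(1 for value in evidence_values if value < threshold)
    return rejected / len(evidence_values)


def monte_carlo_se(rate, n_simulations):
    """Binomial standard error of a proportion estimated from n_simulations."""
    return math.sqrt(rate * (1.0 - rate) / n_simulations)


# Made-up evidence values for five simulations, purely for illustration.
example = [0.01, 0.20, 0.04, 0.50, 0.93]
print(rejection_rate(example))      # 2 of the 5 values are below 0.05
print(monte_carlo_se(0.05, 1000))   # uncertainty of a 0.05 rate from 1000 runs
```

Applied to simulations generated under a null hypothesis, the rate estimates type I error; applied to simulations under an alternative, the same calculation estimates power. With 1000 simulations, a rate near 0.05 carries a standard error of roughly 0.007.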

**Fig. 3**
H1 refers to simulations under the alternative hypothesis of allelic imbalance within a condition and H3 refers to unequal levels of AI between the two conditions. For H1, the x-axis is the effect size, which is the relative deviation from allelic balance ${\Delta AI}_{1} = \frac{|\theta - \theta_{0}|}{\theta_{0}}$, where $\theta_{0} = 0.5$. For H3, the x-axis is the relative difference in levels of AI between the two conditions ${\Delta AI}_{3} = \frac{|\theta_{2} - \theta_{1}|}{\theta_{1}}$, where the first condition is simulated under the null hypothesis and the second under the alternative hypothesis $\theta \neq 0.5$. The power (y-axis) is computed as the proportion of simulations for which the Bayesian evidence against allelic balance within a condition or against equal levels of AI between conditions is $< 0.05$. There are 1000 simulations and the probability of an allele specific read is $r_{i,g1} = r_{i,g2} = 0.8$. Simulations for 3, 4, 5, 6, 8, and 12 biological replicates (bioreps, x-axis) for varying numbers (#) of allele specific reads are reported.
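
The two effect sizes defined in the caption are easy to check numerically. The sketch below implements the formulas as written; the function names are illustrative and not part of the authors' package.

```python
def delta_ai_1(theta, theta_0=0.5):
    """Relative deviation from allelic balance within one condition."""
    return abs(theta - theta_0) / theta_0


def delta_ai_3(theta_1, theta_2):
    """Relative difference in AI between two conditions."""
    return abs(theta_2 - theta_1) / theta_1


# theta values that give 10, 20, and 30% deviations from balance.
for theta in (0.55, 0.60, 0.65):
    print(theta, round(delta_ai_1(theta), 2))

# Condition 1 under the null (0.5), condition 2 under the alternative (0.55).
print(round(delta_ai_3(0.5, 0.55), 2))
```

Under these definitions, θ = 0.55 is a 10% deviation from allelic balance, and θ₁ = 0.5 with θ₂ = 0.55 is a 10% difference between conditions.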

Cited by

Denervation alters the secretome of myofibers and thereby affects muscle stem cell lineage progression and functionality.
Henze H, Hüttner SS, Koch P, Schüler SC, Groth M, von Eyss B, von Maltzahn J. NPJ Regen Med. 2024 Mar 1;9(1):10. doi: 10.1038/s41536-024-00353-3. PMID: 38424446. Free PMC article.
