Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 30;42(9):1445-1460.
doi: 10.1002/sim.9680. Epub 2023 Mar 5.

A Bayesian hierarchical model for signal extraction from protein microarrays

Affiliations

A Bayesian hierarchical model for signal extraction from protein microarrays

Sophie Bérubé et al. Stat Med. .

Abstract

Protein microarrays are a promising technology that measure protein levels in serum or plasma samples. Due to their high technical variability and high variation in protein levels across serum samples in any population, directly answering biological questions of interest using protein microarray measurements is challenging. Analyzing preprocessed data and within-sample ranks of protein levels can mitigate the impact of between-sample variation. As for any analysis, ranks are sensitive to preprocessing, but loss function based ranks that accommodate major structural relations and components of uncertainty are very effective. Bayesian modeling with full posterior distributions for quantities of interest produce the most effective ranks. Such Bayesian models have been developed for other assays, for example, DNA microarrays, but modeling assumptions for these assays are not appropriate for protein microarrays. Consequently, we develop and evaluate a Bayesian model to extract the full posterior distribution of normalized protein levels and associated ranks for protein microarrays, and show that it fits well to data from two studies that use protein microarrays produced by different manufacturing processes. We validate the model via simulation and demonstrate the downstream impact of using estimates from this model to obtain optimal ranks.

Keywords: Bayesian models; bioinformatics; optimal ranks; preprocessing; protein microarrays.

PubMed Disclaimer

Conflict of interest statement

CONFLICT OF INTEREST STATEMENT

The authors declare that there is no conflict of interest.

Figures

FIGURE 1
FIGURE 1
Posterior distributions for four different active spots (t=a) of S on four malaria arrays (A,B,C,D) and four HuProt arrays (E,F,G,H) fit with Model 1. Vertical dashed lines are at mean of values Yi,t=a,p,j corresponding to the protein p.
FIGURE 2
FIGURE 2
Posterior distributions for means, μi,t of four different control spots on four malaria arrays (A, B, C, D) and four HuProt arrays (E, F, G, H) fit with Model 1. Vertical dashed lines are at mean values of Yi,t,p,j for a particular type of control protein, t1,,T.
FIGURE 3
FIGURE 3
P for all 503 malaria arrays and 5 of the 100 HuProt arrays fit with Models 1, 2, and 3. The red line is y=x.
FIGURE 4
FIGURE 4
Percentile residuals (P) for 10 simulated arrays under generating models A, B, C, D, E, and F (Section 3.3), analyzed with Models 1 and 2 respectively.
FIGURE 5
FIGURE 5
Each color is a protein simulated under generating mechanism A, B, C, D, E, or F each column shows one of the true ranks of S, the Squared error loss ranks T¨, obtained from posterior distributions under Model 1 or Model 2. Lines represent how the protein ranks change from the top 30 true values to those under the two analysis models.
FIGURE 6
FIGURE 6
Each color is a protein on the original malaria or HuProt array, each column shows one of: the published ranks from Kobayashi et al, the ranks of fluorescent intensity preprocessed according to the procedure described in Bérubé et al, the squared error loss ranks, T¨, and the gamma (γ=3) ranks, Q¨. Lines represent how ranks of proteins change across each method of preprocessing and ranking.

References

    1. Huang Y, Zhu H. Protein array-based approaches for biomarker discovery in cancer. Genomics Proteomics Bioinform. 2017;15:73–81. - PMC - PubMed
    1. Zhu H, Hu S, Jona G, et al. Severe acute respiratory syndrome diagnostics using a coronavirus protein microarray. Proc Natl Acad Sci. 2006;103:4011–4016. - PMC - PubMed
    1. Nagele E, Han M, Demarshall C, Belinka B, Nagele R. Diagnosis of Alzheimer’s disease based on disease-specific autoantibody profiles in human sera. PloS One. 2011;6:e23112. - PMC - PubMed
    1. Ramachandran N, Srivastava S, LaBaer J. Applications of protein microarrays for biomarker discovery. Proteomics Clin Appl. 2008;2:1444–1459. - PMC - PubMed
    1. Hartmann M, Roeraade J, Stoll D, Templin M, Joos T. Protein microarrays for diagnostic assays. Anal Bioanal Chem. 2009;393:1407–1416. - PubMed

Publication types

LinkOut - more resources