Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 19;5(9):101023.
doi: 10.1016/j.patter.2024.101023. eCollection 2024 Sep 13.

A privacy-preserving approach for cloud-based protein fold recognition

Affiliations

A privacy-preserving approach for cloud-based protein fold recognition

Ali Burak Ünal et al. Patterns (N Y). .

Abstract

The complexity and cost of training machine learning models have made cloud-based machine learning as a service (MLaaS) attractive for businesses and researchers. MLaaS eliminates the need for in-house expertise by providing pre-built models and infrastructure. However, it raises data privacy and model security concerns, especially in medical fields like protein fold recognition. We propose a secure three-party computation-based MLaaS solution for privacy-preserving protein fold recognition, protecting both sequence and model privacy. Our efficient private building blocks enable complex operations privately, including addition, multiplication, multiplexer with a different methodology, most-significant bit, modulus conversion, and exact exponential operations. We demonstrate our privacy-preserving recurrent kernel network (RKN) solution, showing that it matches the performance of non-private models. Our scalability analysis indicates linear scalability with RKN parameters, making it viable for real-world deployment. This solution holds promise for converting other medical domain machine learning algorithms to privacy-preserving MLaaS using our building blocks.

Keywords: cloud-based machine learning; data privacy; machine learning as a service; multi-party computation; privacy preserving machine learning; protein fold recognition; recurrent kernel networks.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The overview of our privacy-preserving RKN as a service via MPC (1) At first, the data owner, i.e., Alice, secret shares her data and sends the shares to the proxies. Similarly, the model owner, i.e., Bob, does the same with the parameters of the model. (2) Then, by using the outsourced model and the data, the proxies, P0 and P1, perform the operations required for the inference of the data on the model with the help of P2, which is the helper. (3) Finally, the proxies send the shares of the prediction of the given data to the data owner.
Figure 2
Figure 2
The architecture of RKN and internal computations of its layers (A and B) The arithmetic circuits of (A) a single neuron of RKN at position t and k-mer level u and (B) the linear classifier layer of RKN after the last position of the input sequence x are depicted. zij represents the i-th character of the j-th anchor point, and ci[j] represents the initial mapping of the sequence up to its j-th character into a q-dimensional vector based on anchor points of length i where i{1,,k}, j{1,,s}, and q is the number of anchor points. (C) For k=3 and |x|=5, RKN is shown where the green nodes are the single neurons and the pink one is the linear classifier.
Figure 3
Figure 3
The results of the execution time analysis of our RKN as a service The results of the execution time analysis of our RKN as a service on both WAN and LAN settings for varying (A) numbers of anchor points for a fixed k-mer length and sequence length, (B) lengths of k-mers for a fixed number of anchor points and sequence length, and (C) lengths of sequences for a fixed number of anchor points and k-mer length.

References

    1. Kallel A., Rekik M., Khemakhem M. Hybrid-based framework for covid-19 prediction via federated machine learning models. J. Supercomput. 2022;78:7078–7105. - PMC - PubMed
    1. Qin H., Zawad S., Zhou Y., Padhi S., Yang L., Yan F. Reinforcement-learning-empowered mlaas scheduling for serving intelligent internet of things. IEEE Internet Things J. 2020;7:6325–6337.
    1. Alabbadi M.M. Mobile learning (mlearning) based on cloud computing: mlearning as a service (mlaas) Proc. UBICOMM. 2011:296–302.
    1. Anfinsen C.B. Principles that govern the folding of protein chains. Science. 1973;181:223–230. - PubMed
    1. Orengo C.A., Todd A.E., Thornton J.M. From protein structure to function. Curr. Opin. Struct. Biol. 1999;9:374–382. - PubMed

LinkOut - more resources