Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 21;26(10):4922.
doi: 10.3390/ijms26104922.

Structural Homology Fails to Predict Secretion Efficiency in Pichia pastoris: Divergent Responses of Architecturally Similar scFvs to Multi-Parametric Genetic Engineering

Affiliations

Structural Homology Fails to Predict Secretion Efficiency in Pichia pastoris: Divergent Responses of Architecturally Similar scFvs to Multi-Parametric Genetic Engineering

Ningning Wang et al. Int J Mol Sci. .

Abstract

AI-driven biologics manufacturing demands an efficient protein production platform. In this study, we optimized scFv secretion in Pichia pastoris through three strategies: gene dosage optimization, expression cassette design, and endoplasmic reticulum (ER) secretory pathway reprogramming. Using two structurally homologous scFv variants-PR961 and PR953-with divergent basal secretion levels (12.35:1 ratio), we demonstrate that protein-specific thresholds govern optimization efficacy. While increasing gene copy numbers yielded limited improvements (PR961: 1.25-fold at four copies; PR953: 2.37-fold at six copies), reconfiguring the expression cassette to a VH-linker-VL orientation significantly enhanced secretion (11.18-fold for PR961; 5.09-fold for PR953). Twenty-one genes in three functional modules of the secretory pathway were knocked out or overexpressed. The pathway reprogramming results revealed distinct regulatory dependencies: PR961 secretion relied on ER-to-Golgi trafficking (SEC23 overexpression: 1.20-fold), whereas PR953 depended more on upstream translocation (SEC62: 1.66-fold) and oxidative folding (ERO1: 1.81-fold) enhancements. Notably, both variants exhibited a glycosylation-dependent regulation through CNE1. Our findings challenge the assumption that structural homology (63% amino acid identity; RMSD 0.47 Å) ensures consistent optimization outcomes, highlighting the imperative for protein-tailored engineering strategies in synthetic biology.

Keywords: Pichia pastoris; expression cassette design; gene dosage optimization; scFv; secretory pathway engineering.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Characteristic comparison of PR961 and PR953. (A) Amino acid sequences of the two scFvs were compared using ESPript 3.0. Red boxes highlight homologous regions, and arrows indicate the conserved cysteine positions. (B) Basic information for PR961 and PR953, including potential N- and O-glycosylation sites, was predicted using the DTU Health Tech website. (C) Overlay of protein crystal structures of PR961 (yellow, PDB ID: 7DET) and PR953 (orange, PDB ID: 7DEU) obtained from RCSB PDB. (D) Growth curves of PR961 and PR953-harboring yeast strains incubated for 24 h in MGY medium. (E) SDS-PAGE analysis of PR961 and PR953 (VH-linker-VL, 2 copies) secretion in P. pastoris. (F) PR961 and PR953 secretion levels were quantified in three independent experiments using ImageJ 1.53a gel analysis software. Statistical significance is denoted as p < 0.0001 (****).
Figure 2
Figure 2
Secretion of PR961 and PR953 in their respective recombinant strains with varying gene dosages. (A) Growth curves of PR961 and PR953 strains with different copy numbers incubated for 24 h in MGY medium. (B) SDS-PAGE analysis of PR961 and PR953 with different copy numbers. (C) PR961 and PR953 secretion levels were quantified in three independent experiments using ImageJ 1.53a gel analysis software. Statistical significance levels are denoted as follows: p < 0.05 (*) and p < 0.001 (***).
Figure 3
Figure 3
Secretion of PR961 and PR953 in their respective recombinant strains carrying constructs with both VH-linker-VL and VL-linker-VH orientations. (A) Schematic representation of VH-linker-VL and VL-linker-VH construction orientations for both PR961 and PR953. (B) Growth curves of PR961 and PR953 strains transformed with VH-linker-VL and VL-linker-VH constructs incubated for 24 h in MGY medium. (C) SDS-PAGE analysis of PR961 and PR953 in their respective expression strains, carrying constructs with different scFv domain orientations. (D) Secretion levels of PR961 and PR953 were quantified using ImageJ 1.53a software, and statistical significance was determined based on three independent experiments. Statistical significance is denoted as p < 0.0001 (****).
Figure 4
Figure 4
Schematic illustration of the three ER functional modules and the twenty-one specific genes targeted within these modules in this study. The illustration was generated using the BioRender mapping website (https://www.biorender.com/, accessed on 11 March 2025).
Figure 5
Figure 5
Effects on PR961 and PR953 secretion by disrupting ER-related genes. (A) Growth curves of nine distinct single-gene knockout strains of PR961 and PR953 incubated for 24 h in MGY medium. (B) SDS-PAGE analysis of PR961 and PR953 secretion levels. The gels were then stained with SYPRO dye. (C) Quantification of PR961 and PR953 secretion using ImageJ 1.53a software. Significant differences were determined from three independent experiments. (D) Heat map of the fold changes in PR961 and PR953 secretion relative to their respective controls. Statistical significance levels are denoted as follows: p < 0.05 (*), p < 0.001 (***), and p < 0.0001 (****).
Figure 6
Figure 6
Effects on PR961 and PR953 secretion by overexpressing ER-related genes. (A) Growth curves of twenty-one distinct single-gene overexpression strains of PR961 and PR953 incubated 24 h in MGY medium. (B) SDS-PAGE analysis of PR961 and PR953 secretion levels. The gels were stained using the SYPRO dye method. (C) Quantification of PR961 and PR953 secretion using the ImageJ 1.53a software. Significant differences were determined from three independent experiments. (D) Heat map of the fold changes in PR961 and PR953 secretion relative to their respective controls. Statistical significance levels are denoted as follows: p < 0.05 (*), p < 0.01 (**), and p < 0.0001 (****).

References

    1. Radziwon K., Weeks A.M. Protein engineering for selective proteomics. Curr. Opin. Chem. Biol. 2021;60:10–19. doi: 10.1016/j.cbpa.2020.07.003. - DOI - PubMed
    1. Lakhawat S.S., Chandel S., Jaswal S.K., Sharma P.K. Protein Engineering, a Robust Tool to Engineer Novel Functions in Protein. Protein Pept. Lett. 2023;30:541–551. doi: 10.2174/0929866530666230519122612. - DOI - PubMed
    1. Meinen B.A., Bahl C.D. Breakthroughs in computational design methods open up new frontiers for de novo protein engineering. Protein Eng. Des. Sel. 2021;34:gzab007. doi: 10.1093/protein/gzab007. - DOI - PubMed
    1. Linsky T.W., Noble K., Tobin A.R., Crow R., Carter L., Urbauer J.L., Baker D., Strauch E.-M. Sampling of structure and sequence space of small protein folds. Nat. Commun. 2022;13:7151. doi: 10.1038/s41467-022-34937-8. - DOI - PMC - PubMed
    1. Orsi E., Schada von Borzyskowski L., Noack S., Nikel P.I., Lindner S.N. Automated in vivo enzyme engineering accelerates biocatalyst optimization. Nat. Commun. 2024;15:3447. doi: 10.1038/s41467-024-46574-4. - DOI - PMC - PubMed

MeSH terms

Substances

Supplementary concepts

LinkOut - more resources