Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 12:2024:10.17912/micropub.biology.001107.
doi: 10.17912/micropub.biology.001107. eCollection 2024.

Remote homology identification of the Drosophila melanogaster ortholog of the RNA Polymerase I subunit Rpa34/POLR1G

Affiliations

Remote homology identification of the Drosophila melanogaster ortholog of the RNA Polymerase I subunit Rpa34/POLR1G

Ryan Palumbo et al. MicroPubl Biol. .

Abstract

Highly conserved orthologous proteins are easily identified by sequence homology alone, whereas poorly conserved orthologs require additional structural information to be identified. All Drosophila orthologs of RNA polymerase I, II, and III subunits-except one-have been identified by sequence homology. Here, we identified CG11076 as the missing Rpa34/POLR1G ortholog in Drosophila . Remote homology detection and secondary structure analysis showed that CG11076 is predicted to have high structural conservation with Rpa34/POLR1G, and phylogenetic analysis demonstrated that these proteins are closely related. Our work underscores the importance of utilizing both sequence and structure to identify highly divergent orthologous proteins in different species.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflicts of interest present.

Figures

Figure 1.
<b>
CG11076 is the
<i>Drosophila</i>
ortholog of yeast Rpa34 and human POLR1G
</b>
Figure 1. CG11076 is the Drosophila ortholog of yeast Rpa34 and human POLR1G
(A-B) HHpred results for S. cerevisiae Rpa34 (ScRpa34) against the Drosophila proteome (A) and Drosophila CG11076-PA (DmCG11076-PA) against the PDB database (B). Topmost bar represents the domain structure of the query protein. CTD: carboxy-terminal domain. The bottom three bars represent the target proteins. Target bars overlap the region of the query that was aligned by HHpred. P: probability (the likelihood of a positive match between the query and target), E: E-value (the average number of nonhomologous hits expected), I: percent identity (percent of identical residues between the query and target), F: percent fold (the percent of equivalent query and target residues predicted to have a similar secondary structure). (C) Multi Sequence Alignment (MSA) of ScRpa34, human POLR1G (HsPOLR1G), and DmCG11076-PA using HHpred. Residues highlighted in blue are identical in at least two sequences. Residues that are identical in all three sequences are marked by an asterisk. Note that the long, disordered C-terminal domain of HsPOLR1G was not included in the MSA to prevent erroneous gaps in the MSA. (D) FELLS secondary structure analysis and disorder prediction of DmCG11076-PA. (E-F) DmCG11076-PA modeled onto the Cryo-EM structures of ScRpa34 (E) and HsPOLR1G (F) using MODELLER. Alignment and RMSD calculations were made in PyMol. (G) AlphaFold predicted structure of DmCG11076-PA, colored by confidence (b-factor) in PyMol. (H) Alignment of the beta sheet segments of the AlphaFold predicted structures of ScRpa34, DmCG11076-PA, and HsPOLR1G. Alignment and RMSD calculations were made in PyMol. (I) Phylogenetic analysis of Rpa34 orthologs and paralogs from S. cerevisiae , Drosophila , and human.

References

    1. Gabler F, Nam SZ, Till S, Mirdita M, Steinegger M, Söding J, Lupas AN, Alva V. Protein Sequence Analysis Using the MPI Bioinformatics Toolkit. Curr Protoc Bioinformatics. 2020 Dec 1;72(1):e108–e108. doi: 10.1002/cpbi.108. - DOI - PubMed
    1. Geiger SR, Lorenzen K, Schreieck A, Hanecker P, Kostrewa D, Heck AJ, Cramer P. RNA polymerase I contains a TFIIF-related DNA-binding subcomplex. Mol Cell. 2010 Aug 27;39(4):583–594. doi: 10.1016/j.molcel.2010.07.028. - DOI - PubMed
    1. Hildebrand A, Remmert M, Biegert A, Söding J. Fast and accurate automatic structure prediction with HHpred. Proteins. 2009;77 Suppl 9:128–132. doi: 10.1002/prot.22499. - DOI - PubMed
    1. Knutson BA, McNamar R, Rothblum LI. Dynamics of the RNA polymerase I TFIIF/TFIIE-like subcomplex: a mini-review. Biochem Soc Trans. 2020 Oct 30;48(5):1917–1927. doi: 10.1042/BST20190848. - DOI - PMC - PubMed
    1. Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Res. 2022 Jul 5;50(W1):W276–W279. doi: 10.1093/nar/gkac240. - DOI - PMC - PubMed

LinkOut - more resources