Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 8;25(1):604.
doi: 10.1186/s12870-025-06595-7.

Prediction of new candidate proteins and analysis of sub-modules and protein hubs associated with seed development in rice (Oryza sativa) using an ensemble network-based systems biology approach

Affiliations

Prediction of new candidate proteins and analysis of sub-modules and protein hubs associated with seed development in rice (Oryza sativa) using an ensemble network-based systems biology approach

M R P De Silva et al. BMC Plant Biol. .

Abstract

Background: Rice is a critical global food source, but it faces challenges due to nutritional deficiencies and the pressures of a growing population. Understanding the molecular mechanisms and protein functions in rice seed development is essential to improve yield and grain quality. However, there is still a significant knowledge gap regarding the key proteins and their interactions that govern rice seed development. Protein-protein interaction (PPI) analysis is a powerful tool for studying developmental processes like seed development, though its potential in rice research is yet to be fully realized. With the aim of unraveling the protein interaction landscape associated with rice seed development, this systems biology study conducted a PPI network-based analysis. Using a list of known seed development proteins from the Gene Ontology (GO) knowledgebase and literature, novel candidate proteins for seed development were predicted using an ensemble of network-based algorithms, including Majority Voting, Hishigaki Algorithm, Functional Flow, and Random Walk with Restart, which were selected based on their popularity and usability. The predictions were validated using enrichment analysis and cross-checked with independent transcriptomic analysis results. The rice seed development sub-network was further analyzed for community and hub detection.

Results: The study predicted 196 new proteins linked to rice seed development and identified 14 sub-modules within the network, each representing different developmental pathways, such as endosperm development and seed growth regulation. Of these, 17 proteins were identified as intra-modular hubs and 6 as inter-modular hubs. Notably, the protein SDH1 emerged as a dual hub, acting as both an intra-modular and inter-modular hub, highlighting its importance in seed development PPI network stability.

Conclusions: These findings, including the identified hub proteins and sub-modules, provide a better understanding of the PPI interaction landscape governing seed development in rice. This information is useful for achieving a systems biology understanding of seed development. This study implements an ensemble of algorithms for the analysis and showcases how systems biology techniques can be applied in developmental biology.

Keywords: Agronomics; Hub proteins; Network-based algorithms; PPI network; Rice; Sub-modules.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not Applicable. Consent for publication: All authors have read and approved the final manuscript for publication. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The bioinformatics pipeline used for prediction and validation of protein candidates for seed development in rice. The hub protein and sub-pathway analysis procedures are also represented
Fig. 2
Fig. 2
Precision-Recall Curves (A) and Receiver Operator Characteristic Curves (B) for the five algorithms, including Majority Voting (MV), Hishigaki Algorithm (HA), Random Walk with Restart (RWR), and Functional flow (FF), and the ensemble model. The ensemble model outperforms all the other algorithms on both metrics based on its higher area under the curve values
Fig. 3
Fig. 3
Modified precision top-N curve for protein predictions related to rice seed development, obtained from the rice protein–protein interaction (PPI) network. The x-axis represents the number of top-ranked predicted proteins (N), while the y-axis shows the modified precision, calculated as the percentage of known differentially expressed proteins (DEPs) among the top N predictions. The curve peaks at N = 454, where a maximum precision of 43.17% was achieved
Fig. 4
Fig. 4
A visual representation of the seed development protein–protein interaction (PPI) sub-network of rice, with different colors assigned to the detected sub-modules, based on the Louvain community detection algorithm. The sub-modules are numbered and the most relevant Gene Ontology-Biological Process (GO-BP) term related to seed development according to enrichment analysis results are listed in the legend
Fig. 5
Fig. 5
Network visualization of hub proteins (highly connected proteins within the network) in the seed development protein–protein interaction (PPI) sub-network of rice. Node size and color indicate the type of hub classification: larger green nodes, such as Ehd2, represent intra-modular hubs (proteins highly connected within a single module), identified using the Z-score method; larger blue nodes, such as SODCP, represent inter-modular hubs (proteins with high connectivity between modules), identified using the partition coefficient (PC). The SDH1 protein, which functions as both an intra- and inter-modular hub, is shown as a larger orange node. Smaller white nodes represent non-hub proteins (proteins with low connectivity). Sub-modules, identified through the Louvain community detection algorithm, are numbered, and the most significantly enriched Gene Ontology-Biological Process (GO-BP) terms derived from functional enrichment analysis are listed in the legend

Similar articles

Cited by

References

    1. Huang R, Jiang L, Zheng J, Wang T, Wang H, Huang Y. Genetic bases of rice grain shape: So many genes, so little known. Trends Plant Sci. 2013;18:218–26. - PubMed
    1. Waterworth WM, Bray CM, West CE. Seeds and the art of genome maintenance. Front Plant Sci. Frontiers Media S.A.; 2019. - PMC - PubMed
    1. King T, Cole M, Farber JM, Eisenbrand G, Zabaras D, Fox EM. Food safety for food security: Relationship between global megatrends and developments in food safety. Trends Food Sci Technol. 2017;68:160–75.
    1. Sadigov R. Rapid Growth of the World Population and Its Socioeconomic Results. Sci World J. 2022;2022(1):8110229. - PMC - PubMed
    1. Deng ZY, Gong CY, Wang T. Use of proteomics to understand seed development in rice. Proteomics. 2013;13(12–13):1784–800. - PubMed

LinkOut - more resources