Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 5;11(1):268.
doi: 10.1038/s41597-024-03099-1.

AI and the democratization of knowledge

Affiliations

AI and the democratization of knowledge

Christophe Dessimoz et al. Sci Data. .

Abstract

The solution of the longstanding “protein folding problem” in 2021 showcased the transformative capabilities of AI in advancing the biomedical sciences. AI was characterized as successfully learning from protein structure data, which then spurred a more general call for AI-ready datasets to drive forward medical research. Here, we argue that it is the broad availability of knowledge, not just data, that is required to fuel further advances in AI in the scientific domain. This represents a quantum leap in a trend toward knowledge democratization that had already been developing in the biomedical sciences: knowledge is no longer primarily applied by specialists in a sub-field of biomedicine, but rather multidisciplinary teams, diverse biomedical research programs, and now machine learning. The development and application of explicit knowledge representations underpinning democratization is becoming a core scientific activity, and more investment in this activity is required if we are to achieve the promise of AI.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The need for democratization of data and knowledge. For data to become broadly valuable, it must be transformed into a form that can be correctly interpreted and used by a broad community comprising both non-experts and AI.
Fig. 2
Fig. 2
The data-information-knowledge hierarchy in empirical sciences and the path to democratization. Increasing democratization requires additional effort to transform toward consistent, explicit knowledge models, which for complex models requires extensive curation of training sets sufficient for AI.

Similar articles

Cited by

  • Closing the Gap Between Vision and Victory in Type 1 Diabetes: The NIDDK Human Islet Research Network (HIRN) Initiative.
    Parent AV, Agarwal A, Alvarez-Dominguez JR, Arrojo E Drigo R, Atkinson MA, Brehm MA, Ferreira LMR, Kaestner KH, Kravets V, Niland JC, O'Meally D, Rouse L, Russ HA, Stabler CL, Kaddis JS. Parent AV, et al. Diabetes. 2025 Jul 1;74(7):1057-1067. doi: 10.2337/db25-0097. Diabetes. 2025. PMID: 40272263 Free PMC article. Review.
  • The Future of a Myriad of Accelerated Biodiscoveries Lies in AI-Powered Mass Spectrometry and Multiomics Integration.
    Bilbao A. Bilbao A. J Mass Spectrom. 2025 Aug;60(8):e5157. doi: 10.1002/jms.5157. J Mass Spectrom. 2025. PMID: 40616377 Free PMC article.
  • The need to implement FAIR principles in biomolecular simulations.
    Amaro RE, Åqvist J, Bahar I, Battistini F, Bellaiche A, Beltran D, Biggin PC, Bonomi M, Bowman GR, Bryce RA, Bussi G, Carloni P, Case DA, Cavalli A, Chang CA, Cheatham TE 3rd, Cheung MS, Chipot C, Chong LT, Choudhary P, Cisneros GA, Clementi C, Collepardo-Guevara R, Coveney P, Covino R, Crawford TD, Dal Peraro M, de Groot BL, Delemotte L, De Vivo M, Essex JW, Fraternali F, Gao J, Gelpí JL, Gervasio FL, González-Nilo FD, Grubmüller H, Guenza MG, Guzman HV, Harris S, Head-Gordon T, Hernandez R, Hospital A, Huang N, Huang X, Hummer G, Iglesias-Fernández J, Jensen JH, Jha S, Jiao W, Jorgensen WL, Kamerlin SCL, Khalid S, Laughton C, Levitt M, Limongelli V, Lindahl E, Lindorff-Larsen K, Loverde S, Lundborg M, Luo YL, Luque FJ, Lynch CI, MacKerell AD Jr, Magistrato A, Marrink SJ, Martin H, McCammon JA, Merz K, Moliner V, Mulholland AJ, Murad S, Naganathan AN, Nangia S, Noe F, Noy A, Oláh J, O'Mara ML, Ondrechen MJ, Onuchic JN, Onufriev A, Osuna S, Palermo G, Panchenko AR, Pantano S, Parish C, Parrinello M, Perez A, Perez-Acle T, Perilla JR, Pettitt BM, Pietropaolo A, Piquemal JP, Poma AB, Praprotnik M, Ramos MJ, Ren P, Reuter N, Roitberg A, Rosta E, Rovira C, Roux B, Rothlisberger U, San… See abstract for full author list ➔ Amaro RE, et al. Nat Methods. 2025 Apr;22(4):641-645. doi: 10.1038/s41592-025-02635-0. Nat Methods. 2025. PMID: 40175561 No abstract available.

References

    1. Maxson Jones K, Ankeny RA, Cook-Deegan R. The Bermuda Triangle: The Pragmatics, Policies, and Principles for Data Sharing in the History of the Human Genome Project. J. Hist. Biol. 2018;51:693–805. doi: 10.1007/s10739-018-9538-7. - DOI - PMC - PubMed
    1. Wilkinson MD, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018. doi: 10.1038/sdata.2016.18. - DOI - PMC - PubMed
    1. Kumar, A. Automation of data prep, ML, and data science. in Proceedings of the 2021 International Conference on Management of Data. 10.1145/3448016.3457537 (ACM, 2021).
    1. Jumper J, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. - DOI - PMC - PubMed
    1. Baek M, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021;373:871–876. doi: 10.1126/science.abj8754. - DOI - PMC - PubMed