Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 5;11(1):268.
doi: 10.1038/s41597-024-03099-1.

AI and the democratization of knowledge

Affiliations

AI and the democratization of knowledge

Christophe Dessimoz et al. Sci Data. .

Abstract

The solution of the longstanding “protein folding problem” in 2021 showcased the transformative capabilities of AI in advancing the biomedical sciences. AI was characterized as successfully learning from protein structure data, which then spurred a more general call for AI-ready datasets to drive forward medical research. Here, we argue that it is the broad availability of knowledge, not just data, that is required to fuel further advances in AI in the scientific domain. This represents a quantum leap in a trend toward knowledge democratization that had already been developing in the biomedical sciences: knowledge is no longer primarily applied by specialists in a sub-field of biomedicine, but rather multidisciplinary teams, diverse biomedical research programs, and now machine learning. The development and application of explicit knowledge representations underpinning democratization is becoming a core scientific activity, and more investment in this activity is required if we are to achieve the promise of AI.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The need for democratization of data and knowledge. For data to become broadly valuable, it must be transformed into a form that can be correctly interpreted and used by a broad community comprising both non-experts and AI.
Fig. 2
Fig. 2
The data-information-knowledge hierarchy in empirical sciences and the path to democratization. Increasing democratization requires additional effort to transform toward consistent, explicit knowledge models, which for complex models requires extensive curation of training sets sufficient for AI.

References

    1. Maxson Jones K, Ankeny RA, Cook-Deegan R. The Bermuda Triangle: The Pragmatics, Policies, and Principles for Data Sharing in the History of the Human Genome Project. J. Hist. Biol. 2018;51:693–805. doi: 10.1007/s10739-018-9538-7. - DOI - PMC - PubMed
    1. Wilkinson MD, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018. doi: 10.1038/sdata.2016.18. - DOI - PMC - PubMed
    1. Kumar, A. Automation of data prep, ML, and data science. in Proceedings of the 2021 International Conference on Management of Data. 10.1145/3448016.3457537 (ACM, 2021).
    1. Jumper J, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. - DOI - PMC - PubMed
    1. Baek M, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021;373:871–876. doi: 10.1126/science.abj8754. - DOI - PMC - PubMed