BIGCHEM: Challenges and Opportunities for Big Data Analysis in Chemistry
- PMID: 27464907
- PMCID: PMC5129546
- DOI: 10.1002/minf.201600073
BIGCHEM: Challenges and Opportunities for Big Data Analysis in Chemistry
Abstract
The increasing volume of biomedical data in chemistry and life sciences requires the development of new methods and approaches for their handling. Here, we briefly discuss some challenges and opportunities of this fast growing area of research with a focus on those to be addressed within the BIGCHEM project. The article starts with a brief description of some available resources for "Big Data" in chemistry and a discussion of the importance of data quality. We then discuss challenges with visualization of millions of compounds by combining chemical and biological data, the expectations from mining the "Big Data" using advanced machine-learning methods, and their applications in polypharmacology prediction and target de-convolution in phenotypic screening. We show that the efficient exploration of billions of molecules requires the development of smart strategies. We also address the issue of secure information sharing without disclosing chemical structures, which is critical to enable bi-party or multi-party data sharing. Data sharing is important in the context of the recent trend of "open innovation" in pharmaceutical industry, which has led to not only more information sharing among academics and pharma industries but also the so-called "precompetitive" collaboration between pharma companies. At the end we highlight the importance of education in "Big Data" for further progress of this area.
© 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Similar articles
-
Equipment and analytical companies meeting continuous challenges. May 20-21, 2014 Continuous Manufacturing Symposium.J Pharm Sci. 2015 Mar;104(3):821-31. doi: 10.1002/jps.24282. Epub 2014 Dec 1. J Pharm Sci. 2015. PMID: 25448273 Review.
-
Clinical Trial Data as Public Goods: Fair Trade and the Virtual Knowledge Bank as a Solution to the Free Rider Problem - A Framework for the Promotion of Innovation by Facilitation of Clinical Trial Data Sharing among Biopharmaceutical Companies in the Era of Omics and Big Data.Public Health Genomics. 2016;19(4):211-9. doi: 10.1159/000446101. Epub 2016 Jun 1. Public Health Genomics. 2016. PMID: 27241319
-
Sprinkling the pixie dust: reflections on innovation and innovators in medicinal chemistry and drug discovery.Drug Discov Today. 2020 Mar;25(3):599-609. doi: 10.1016/j.drudis.2020.01.006. Epub 2020 Jan 22. Drug Discov Today. 2020. PMID: 31981481 Review.
-
Machine Learning in Chemoinformatics and Medicinal Chemistry.Annu Rev Biomed Data Sci. 2022 Aug 10;5:43-65. doi: 10.1146/annurev-biodatasci-122120-124216. Epub 2022 Apr 19. Annu Rev Biomed Data Sci. 2022. PMID: 35440144 Review.
-
The project data sphere initiative: accelerating cancer research by sharing data.Oncologist. 2015 May;20(5):464-e20. doi: 10.1634/theoncologist.2014-0431. Epub 2015 Apr 15. Oncologist. 2015. PMID: 25876994 Free PMC article.
Cited by
-
GenUI: interactive and extensible open source software platform for de novo molecular generation and cheminformatics.J Cheminform. 2021 Sep 25;13(1):73. doi: 10.1186/s13321-021-00550-y. J Cheminform. 2021. PMID: 34563271 Free PMC article.
-
Intuition-Enabled Machine Learning Beats the Competition When Joint Human-Robot Teams Perform Inorganic Chemical Experiments.J Chem Inf Model. 2019 Jun 24;59(6):2664-2671. doi: 10.1021/acs.jcim.9b00304. Epub 2019 May 22. J Chem Inf Model. 2019. PMID: 31025861 Free PMC article.
-
Discovery of TIGIT inhibitors based on DEL and machine learning.Front Chem. 2022 Jul 26;10:982539. doi: 10.3389/fchem.2022.982539. eCollection 2022. Front Chem. 2022. PMID: 35958238 Free PMC article.
-
Annotation of Peptide Structures Using SMILES and Other Chemical Codes-Practical Solutions.Molecules. 2017 Nov 27;22(12):2075. doi: 10.3390/molecules22122075. Molecules. 2017. PMID: 29186902 Free PMC article. Review.
-
Open Innovation in Medical and Pharmaceutical Research: A Literature Landscape Analysis.Front Pharmacol. 2021 Jan 14;11:587526. doi: 10.3389/fphar.2020.587526. eCollection 2020. Front Pharmacol. 2021. PMID: 33519448 Free PMC article.
References
-
- Big Data. https://en.wikipedia.org/wiki/Big_data (10 June 2016).
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources