Lessons learnt on the analysis of large sequence data in animal genomics
- PMID: 29624711
- DOI: 10.1111/age.12655
Lessons learnt on the analysis of large sequence data in animal genomics
Abstract
The 'omics revolution has made a large amount of sequence data available to researchers and the industry. This has had a profound impact in the field of bioinformatics, stimulating unprecedented advancements in this discipline. Mostly, this is usually looked at from the perspective of human 'omics, in particular human genomics. Plant and animal genomics, however, have also been deeply influenced by next-generation sequencing technologies, with several genomics applications now popular among researchers and the breeding industry. Genomics tends to generate huge amounts of data, and genomic sequence data account for an increasing proportion of big data in biological sciences, due largely to decreasing sequencing and genotyping costs and to large-scale sequencing and resequencing projects. The analysis of big data poses a challenge to scientists, as data gathering currently takes place at a faster pace than does data processing and analysis, and the associated computational burden is increasingly taxing, making even simple manipulation, visualization and transferring of data a cumbersome operation. The time consumed by the processing and analysing of huge data sets may be at the expense of data quality assessment and critical interpretation. Additionally, when analysing lots of data, something is likely to go awry-the software may crash or stop-and it can be very frustrating to track the error. We herein review the most relevant issues related to tackling these challenges and problems, from the perspective of animal genomics, and provide researchers that lack extensive computing experience with guidelines that will help when processing large genomic data sets.
Keywords: animal genetics; big data; computational biology; data analysis; genome sequence; next-generation sequencing; ’omics.
© 2018 Stichting International Foundation for Animal Genetics.
Similar articles
-
Critical role of bioinformatics in translating huge amounts of next-generation sequencing data into personalized medicine.Sci China Life Sci. 2013 Feb;56(2):110-8. doi: 10.1007/s11427-013-4439-7. Epub 2013 Feb 8. Sci China Life Sci. 2013. PMID: 23393026 Review.
-
Bacterial Genomic Data Analysis in the Next-Generation Sequencing Era.Methods Mol Biol. 2016;1415:407-22. doi: 10.1007/978-1-4939-3572-7_21. Methods Mol Biol. 2016. PMID: 27115645
-
Trends in IT Innovation to Build a Next Generation Bioinformatics Solution to Manage and Analyse Biological Big Data Produced by NGS Technologies.Biomed Res Int. 2015;2015:904541. doi: 10.1155/2015/904541. Epub 2015 Jun 1. Biomed Res Int. 2015. PMID: 26125026 Free PMC article. Review.
-
'Big data', Hadoop and cloud computing in genomics.J Biomed Inform. 2013 Oct;46(5):774-81. doi: 10.1016/j.jbi.2013.07.001. Epub 2013 Jul 18. J Biomed Inform. 2013. PMID: 23872175
-
BioVLAB-mCpG-SNP-EXPRESS: A system for multi-level and multi-perspective analysis and exploration of DNA methylation, sequence variation (SNPs), and gene expression from multi-omics data.Methods. 2016 Dec 1;111:64-71. doi: 10.1016/j.ymeth.2016.07.019. Epub 2016 Jul 28. Methods. 2016. PMID: 27477210
Cited by
-
The effects of Thymus capitatus essential oil topical application on milk quality: a systems biology approach.Sci Rep. 2025 Feb 7;15(1):4627. doi: 10.1038/s41598-025-88168-0. Sci Rep. 2025. PMID: 39920235 Free PMC article.
-
Dual Analysis of Virus-Host Interactions: The Case of Ostreid herpesvirus 1 and the Cupped Oyster Crassostrea gigas.Evol Bioinform Online. 2019 Feb 22;15:1176934319831305. doi: 10.1177/1176934319831305. eCollection 2019. Evol Bioinform Online. 2019. PMID: 30828244 Free PMC article. Review.
-
Feeding Pre-weaned Calves With Waste Milk Containing Antibiotic Residues Is Related to a Higher Incidence of Diarrhea and Alterations in the Fecal Microbiota.Front Vet Sci. 2021 Jul 8;8:650150. doi: 10.3389/fvets.2021.650150. eCollection 2021. Front Vet Sci. 2021. PMID: 34307516 Free PMC article.
-
Analysis of Genetic Diversity in Romanian Carpatina Goats Using SNP Genotyping Data.Animals (Basel). 2024 Feb 7;14(4):560. doi: 10.3390/ani14040560. Animals (Basel). 2024. PMID: 38396528 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources