Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep;19(5):1355-1365.
doi: 10.1111/1755-0998.13044. Epub 2019 Jun 24.

Rapid identification and interpretation of gene-environment associations using the new R.SamBada landscape genomics pipeline

Affiliations

Rapid identification and interpretation of gene-environment associations using the new R.SamBada landscape genomics pipeline

Solange Duruz et al. Mol Ecol Resour. 2019 Sep.

Abstract

samβada is a genome-environment association software, designed to search for signatures of local adaptation. However, pre- and postprocessing of data can be labour-intensive, preventing wider uptake of the method. We have now developed R.SamBada, an r-package providing a pipeline for landscape genomic analysis based on samβada, spanning from the retrieval of environmental conditions at sampling locations to gene annotation using the Ensembl genome browser. As a result, R.SamBada standardizes the landscape genomics pipeline and eases the search for candidate genes of local adaptation, enhancing reproducibility of landscape genomic studies. The efficiency and power of the pipeline is illustrated using two examples: sheep populations from Morocco with no evident population structure and Lidia cattle from Spain displaying population substructuring. In both cases, R.SamBada enabled rapid identification and interpretation of candidate genes, which are further discussed in the light of local adaptation. The package is available in the r CRAN package repository and on GitHub (github.com/SolangeD/R.SamBada).

Keywords: Lidia cattle breed; Moroccan sheep; gene-environment association; landscape genomics; local adaptation; r-package.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overall functionalities and process in R.SamBada. Grey boxes with italic names indicate functions included in the package. The process starts with a genomic file and a file with sample locations or list of IDs. The preprocessing will format the genomic file and prepare the environmental file; samβada is then run parallelly on multiple cores; after computing of p‐, q‐values, Manhattan plots and maps can be drawn and Ensembl database can be queried
Figure 2
Figure 2
Manhattan plot showing the q‐values for each marker (with G‐ or Wald‐Score > 6) on chromosome 23 of Moroccan sheep associated with annual precipitation as calculated in samβada in a univariate mode. Points in red correspond to models involving two nonsynonymous SNPs (ss1208941124 and ss1208941157) in the MC5R gene (ss1208941124 having the lowest q‐value of the two). The red horizontal bar shows a significance threshold of 0.05 [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 3
Figure 3
Spatial occurrence of the CC genotype for SNP ss1208941124. In the background, the shaded topography with mean annual precipitation (given in [mm/year]) is displayed [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 4
Figure 4
Spatial distribution of the Lidia cattle population structure according to the scores of the first principal component, with a shaded relief and mean annual temperature [°C * 10] as background, as provided in the WorldClim database. Due to overlaps, close points are scattered around the farm [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 5
Figure 5
Manhattan plot of the Lidia cattle study, showing the p‐values with Bonferroni correction as derived from the samβada models involving mean annual temperature and one population variable. The red point corresponds to SNP ARS‐BFGL‐NGS‐106879, located 30,000 base pairs apart from the HSPB8 gene [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 6
Figure 6
Presence–absence of the AA genotype of SNP ARS‐BFGL‐NGS‐106879 reported with shaded relief and mean annual temperature [°C * 10]) as background. Due to overlaps, close points are scattered around the farm [Colour figure can be viewed at wileyonlinelibrary.com]

Comment in

References

    1. Alberto, F. J. , Boyer, F. , Orozco‐terWengel, P. , Streeter, I. , Servin, B. , de Villemereuil, P. , … Pompanon, F. (2018). Convergent genomic signatures of domestication in sheep and goats. Nature Communications, 9(1), 813 10.1038/s41467-018-03206-y - DOI - PMC - PubMed
    1. Alexander, D. H. , Novembre, J. , & Lange, K. (2009). Fast model‐based estimation of ancestry in unrelated individuals. Genome Research, 19(9), 1655–1664. 10.1101/gr.094052.109 - DOI - PMC - PubMed
    1. Balkenhol, N. , Dudaniec, R. Y. , Krutovsky, K. V. , Johnson, J. S. , Cairns, D. M. , Segelbacher, G. … Joost, S. (2019). Landscape genomics: Understanding relationships between environmental heterogeneity and genomic characteristics of populations In Rajora O. P. (Ed.), Population genomics: Concepts, approaches and applications, population genomics (pp. 261–322). Cham, Switzerland: Springer International Publishing; 10.1007/13836_2017_2 - DOI
    1. Bedward, M. , Eppstein, D. , & Menzel, P. (2018). packcircles: Circle packing. Retrieved from https://CRAN.R-project.org/package=packcircles
    1. Boletin Oficial del Estado (2001). Boletin Oficial del Estado. REAL DECRETO 60/2001, de 26 de enero, sobre prototipo racial de la raza bovina de lidia. pp. 5255–5261.