Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009;10(4):R42.
doi: 10.1186/gb-2009-10-4-r42. Epub 2009 Apr 24.

A whole-genome assembly of the domestic cow, Bos taurus

Affiliations

A whole-genome assembly of the domestic cow, Bos taurus

Aleksey V Zimin et al. Genome Biol. 2009.

Abstract

Background: The genome of the domestic cow, Bos taurus, was sequenced using a mixture of hierarchical and whole-genome shotgun sequencing methods.

Results: We have assembled the 35 million sequence reads and applied a variety of assembly improvement techniques, creating an assembly of 2.86 billion base pairs that has multiple improvements over previous assemblies: it is more complete, covering more of the genome; thousands of gaps have been closed; many erroneous inversions, deletions, and translocations have been corrected; and thousands of single-nucleotide errors have been corrected. Our evaluation using independent metrics demonstrates that the resulting assembly is substantially more accurate and complete than alternative versions.

Conclusions: By using independent mapping data and conserved synteny between the cow and human genomes, we were able to construct an assembly with excellent large-scale contiguity in which a large majority (approximately 91%) of the genome has been placed onto the 30 B. taurus chromosomes. We constructed a new cow-human synteny map that expands upon previous maps. We also identified for the first time a portion of the B. taurus Y chromosome.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Chromosome (Chr) lengths (in base pairs) based on amount of sequence in the B. taurus assembly placed on each chromosome.
Figure 2
Figure 2
Cumulative plot of the N statistic for both the UMD2 (blue) and BCM4 (red) assemblies. Each point (X, Y) in the plot shows the contig size Y such that X% of the genome is contained in contigs of length Y or larger, for a genome of size 2.5 Gbp. For example, the N50 size for each assembly corresponds to the value of Y at X = 50; for UMD2 this value is 93,156 and for BCM4 it is 81,627.
Figure 3
Figure 3
Examples of large-scale disagreements between UMD2 and BCM4. (a) Dot-plot alignment of the region between 15 Mbp and 25 Mbp of chromosome 26 showing a large inversion in BCM4 compared to UMD2; (b) positions of Cmap markers for the same region of chromosome 26, plotted against their positions in UMD2 (blue) and BCM4 (red), showing that Cmap supports the UMD2 assembly. (c) Alignment of 7 Mbp of chromosome 27, showing a large inversion in BCM4 compared to UMD2; (d) positions of Cmap markers for the same region of chromosome 27, showing as in (b) that Cmap is in much closer agreement with the UMD2 assembly.
Figure 4
Figure 4
Assembly comparison by gene mapping. (a) Number of RefSeq mRNA sequences (out of 8,689) that can be aligned to each genome assembly at varying coverage cutoffs (horizontal axis) with at least 95% sequence identity. (b) Difference in the number of mRNAs mapping to the two assemblies at different levels of coverage, plotted as UMD2 minus BCM4. Negative values indicate that BCM4 has more genes at a given level, while positive values indicate that UMD2 has more. For example, at 0-5% coverage, 104 more mRNAs map to BCM4 than to UMD2. At 95-100% coverage, 275 more mRNAs map to UMD2. Blue, UMD2 assembly; red, BCM4 assembly.
Figure 5
Figure 5
Aligment of B. taurus chromosome X to human chromosome X, showing regions of large-scale synteny. Most of the two chromosomes is shared in the five large blocks evident in the figure. Red: sequences are aligned in the same orientation; blue: sequences are aligned, but one is in the reverse complement orientation. The inverted (blue) block at approximately 25 Mbp in B. taurus, although small at this scale, spans over 800 Kbp.

References

    1. Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang J, Gabor Miklos GL, Nelson C, Broder S, Clark AG, Nadeau J, McKusick VA, Zinder N, et al. The sequence of the human genome. Science. 2001;291:1304–1351. doi: 10.1126/science.1058040. - DOI - PubMed
    1. Celniker SE, Wheeler DA, Kronmiller B, Carlson JW, Halpern A, Patel S, Adams M, Champe M, Dugan SP, Frise E, Hodgson A, George RA, Hoskins RA, Laverty T, Muzny DM, Nelson CR, Pacleb JM, Park S, Pfeiffer BD, Richards S, Sodergren EJ, Svirskas R, Tabor PE, Wan K, Stapleton M, Sutton GG, Venter C, Weinstock G, Scherer SE, Myers EW, et al. Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence. Genome Biol. 2002;3:RESEARCH0079. doi: 10.1186/gb-2002-3-12-research0079. - DOI - PMC - PubMed
    1. The International Human Genome Sequencing Consortium Finishing the euchromatic sequence of the human genome. Nature. 2004;431:931–945. doi: 10.1038/nature03001. - DOI - PubMed
    1. Snelling WM, Chiu R, Schein JE, Hobbs M, Abbey CA, Adelson DL, Aerts J, Bennett GL, Bosdet IE, Boussaha M, Brauning R, Caetano AR, Costa MM, Crawford AM, Dalrymple BP, Eggen A, Wind A Everts-van der, Floriot S, Gautier M, Gill CA, Green RD, Holt R, Jann O, Jones SJ, Kappes SM, Keele JW, de Jong PJ, Larkin DM, Lewin HA, McEwan JC, et al. A physical map of the bovine genome. Genome Biol. 2007;8:R165. doi: 10.1186/gb-2007-8-8-r165. - DOI - PMC - PubMed
    1. Havlak P, Chen R, Durbin KJ, Egan A, Ren Y, Song XZ, Weinstock GM, Gibbs RA. The Atlas genome assembly system. Genome Res. 2004;14:721–732. doi: 10.1101/gr.2264004. - DOI - PMC - PubMed

Publication types