BMC Plant Biol. 2007 Jun 11;7:29.
doi: 10.1186/1471-2229-7-29.

A BAC end view of the Musa acuminata genome

Foo Cheung et al. BMC Plant Biol. 2007.

Abstract

Background: Musa species include the fourth most important crop in developing countries. Here, we report the analysis of 6,252 BAC end-sequences to view the sequence composition of the Musa acuminata genome in a cost-effective and efficient manner.

Results: BAC end sequencing generated 6,252 reads representing 4,420,944 bp, including 2,979 clone pairs, with an average read length of 707 bp after cleaning and filtering. All sequences have been submitted to GenBank under the accession numbers DX451975-DX458350. The BAC end-sequences were searched against several databases, and significant homology was found to mitochondrial and chloroplast sequences (2.6%), transposons and repetitive sequences (36%), and proteins (11%). Functional interpretation of the protein matches was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of categories. From the protein-matching regions of the Musa BAC end-sequences, the GC content of coding regions was determined to be 47%. Where protein matches encompassed a start codon, GC content as a function of position (5' to 3') across 129 bp sliding windows generated a "rice-like" gradient. A total of 352 potential SSR markers were discovered. The most abundant simple sequence repeats in four size categories were AT-rich. After filtering out mitochondrial and chloroplast matches, thousands of BAC end-sequences had a significant BLASTN match to the Oryza sativa and Arabidopsis genome sequences. Of these, a small number of BAC end-sequence pairs were shown to map to neighboring regions of the Oryza sativa genome, representing regions of potential microsynteny.
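As an illustration of the sliding-window GC profile mentioned above, the following is a minimal sketch and not the authors' pipeline. The 129 bp window comes from the abstract; the step size, the function names `gc_fraction` and `sliding_gc`, and the handling of ambiguous bases are assumptions made for this example.

```python
def gc_fraction(seq):
    """Return the fraction of G/C among unambiguous bases (A, C, G, T)."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


def sliding_gc(seq, window=129, step=1):
    """Return (start, GC fraction) for each full window along seq, 5' to 3'.

    window=129 follows the abstract; step=1 is an assumed default.
    Sequences shorter than one window yield an empty list.
    """
    return [
        (start, gc_fraction(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Toy sequence: GC-rich 5' half followed by an AT-rich 3' half.
    toy = "GC" * 100 + "AT" * 100
    profile = sliding_gc(toy)
    print(profile[0], profile[-1])
```

On a real data set, the profile would be computed per sequence starting at the matched start codon and then averaged by position, which is one plausible way to obtain a mean gradient of the kind described.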

Conclusion: Database searches with the BAC end-sequences and ab initio analysis identified those reads likely to contain transposons, repeat sequences, proteins and simple sequence repeats. Approximately 600 BAC end-sequences contained protein sequences that were not found in the existing available Musa expressed sequence tag, repeat or transposon databases. In addition, gene statistics, GC content and GC profile could be estimated from the region matching the top protein hit. A small number of BAC end-sequence pairs can be mapped to neighboring regions of the Oryza sativa genome, representing regions of potential microsynteny. These results suggest that a large-scale BAC end sequencing strategy has the potential to anchor a small proportion of the Musa acuminata genome to the genomes of Oryza sativa and possibly Arabidopsis.
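The ab initio identification of simple sequence repeats can be sketched with a regular-expression scan for perfect tandem repeats. This is only an illustrative example: the abstract does not state which tool or thresholds were used, so the function name `find_ssrs`, the motif length and the minimum repeat count below are all hypothetical.

```python
import re


def find_ssrs(seq, motif_len=2, min_repeats=6):
    """Find perfect tandem repeats of a fixed motif length.

    Returns a list of (start, motif, repeat_count) tuples.
    motif_len and min_repeats are illustrative thresholds, not values
    taken from the paper.
    """
    seq = seq.upper()
    # One captured motif followed by at least (min_repeats - 1) more copies.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq):
        motif = match.group(1)
        if motif_len > 1 and len(set(motif)) == 1:
            # Skip homopolymer runs (e.g. "AAAA...") reported as "AA" repeats.
            continue
        hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return hits


if __name__ == "__main__":
    read = "GGCC" + "AT" * 8 + "CGTA"
    print(find_ssrs(read))
```

Running the scan once per motif length (for example di- through penta-nucleotide) and tallying motifs would give counts per size category, comparable in spirit to the AT-rich repeat classes reported above.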

Figures

Figure 1. Number of Musa BES containing hits to The TIGR Plant Gene Indices using blat.
Figure 2. Gene Ontology assignments for Musa BES.
Figure 3. Mean GC content as a function of position (5' to 3') across 129 bp sliding windows.
