Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Sep;13(9):2152-63.
doi: 10.1101/gr.1068603. Epub 2003 Aug 12.

Locating sequence on FPC maps and selecting a minimal tiling path

Affiliations

Locating sequence on FPC maps and selecting a minimal tiling path

Friedrich W Engler et al. Genome Res. 2003 Sep.

Abstract

This study discusses three software tools, the first two aid in integrating sequence with an FPC physical map and the third automatically selects a minimal tiling path given genomic draft sequence and BAC end sequences. The first tool, FSD (FPC Simulated Digest), takes a sequenced clone and adds it back to the map based on a fingerprint generated by an in silico digest of the clone. This allows verification of sequenced clone positions and the integration of sequenced clones that were not originally part of the FPC map. The second tool, BSS (Blast Some Sequence), takes a query sequence and positions it on the map based on sequence associated with the clones in the map. BSS has multiple uses as follows: (1) When the query is a file of marker sequences, they can be added as electronic markers. (2) When the query is draft sequence, the results of BSS can be used to close gaps in a sequenced clone or the physical map. (3) When the query is a sequenced clone and the target is BAC end sequences, one may select the next clone for sequencing using both sequence comparison results and map location. (4) When the query is whole-genome draft sequence and the target is BAC end sequences, the results can be used to select many clones for a minimal tiling path at once. The third tool, pickMTP, automates the majority of this last usage of BSS. Results are presented using the rice FPC map, BAC end sequences, and whole-genome shotgun from Syngenta.

PubMed Disclaimer

Figures

Figure 1
Figure 1
SD clones and electronic markers. The clones in blue are simulated digest clones from the Japanese minimal tiling path; most of the original clones are not in FPC. The markers in blue are electronic markers supplied by Gramene. As described in the text, these markers could have been added by BSS. The contig display is from FPC V7. Tracks can be added, resized, and moved around.
Figure 2
Figure 2
The windows that would be shown for a Marker→Sequence search using MegaBLAST in the Batch mode, which runs the search on all contigs. Values are entered in the Setup and Batch BSS windows and the search started. The results can be viewed in the BSS Report window. If the results are to be automatically added as markers using the Add as FPC Markers function, the Marker Add Conditions may first be altered.
Figure 3
Figure 3
Selecting the next clone for sequencing. Clone AP005522 has been sequenced. It was blasted against all of the BESs in the contig. The clone was added as a marker attached to all of the clones that had a BES hit. Selecting the AP005522 marker highlights these clones. Some clones are obviously contained in the sequenced clone, so can be ignored. The BSS report file can be viewed to see details of the hits.
Figure 4
Figure 4
The distribution of expressway lengths.
Figure 5
Figure 5
The five different types of pairs. Case 1 pairs consist of two overlapping clones, with the exact amount of overlap identified by the seqCtg. Case 2 clones have a gap between them; they are bridged by a seqCtg. In Case 3, the two clones cover a similar area. Case 4 pairs result from false positive hits. Case 5 pairs result from two small clones hitting on opposite ends of a long seqCtg; note this is similar to Case 1, but a large seqCtg may result in a large overlap.
Figure 6
Figure 6
(A) S1 and S2 are sequenced contigs, and C1, C2, and C3 are clones. The underlined sequence represents the BESs and where they hit on the seqCtg. Note, all lengths of sequences are tremendously reduced to fit on the page. (B) There are four possible pairs, shown in the four rows in the table. A + indicates the BES was not reverse complemented to match the seqCtg. An rc indicates that it was reverse complemented. The seqCtg-BES hits for the first two pairs both have the same orientations, whereas the orientations for the next two are different. The first two are rejected as candidate MTP pairs, wheras the second two are retained.
Figure 7
Figure 7
Construction of DAGs from overlapping clone pairs. (A) The amount of overlap or distance is recorded for all good clone pairs, shown as w, x, y, and z. (B) Clones define vertices, and pairs define edges, with edge weight determined by overlap or distance. If there is a gap between the clones, then the value is multiplied by 10. An MTP, shown in gray, is picked from the expressways. Note the junction from C to F, in which there is no sequence evidence that the clones overlap.

References

    1. Aho, A., Hopcroft, J., and Ullman, J. 1983. Data structures and algorithms. pp. 203–208. Addison-Wesley, Reading, MA.
    1. Altschul, S., Madden, T., Schaffer, A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389–3402. - PMC - PubMed
    1. The Arabidopsis Genome Initiative. 2000. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 769–815. - PubMed
    1. Barry, G. 2001. The use of the Monsanto draft rice genome sequence in research. Plant Phys. 125: 1164–1165. - PMC - PubMed
    1. Bentley, D.R., Deloukas, P., Dunham, A., French, L., Gregory, S.G., Humphray, S.J., Mungall, A.J., Ross, M.T., Carter, N.P., Dunham, I., et al. 2001. The physical maps for sequencing human chromosomes 1, 6, 9, 10, 13, 20 and X. Nature 409: 942–943. - PubMed

WEB SITE REFERENCES

    1. http://ftp.genome.washington.edu/RM/RepeatMasker.html; Smit, A.F.A. and Green, P., RepeatMasker.
    1. http://rgp.dna.affrc.go.jp/Publicdata.html; Japanese Rice Genome Research Program site for genetic markers and sequence.
    1. http://www.ensembl.org/Mus_musculus/; Ensembl Mouse Genome Server.
    1. http://www.genome.arizona.edu/fpc/rice; Rice Physical Mapping Home Page.
    1. http://www.genome.arizona.edu/fpc/rice/bss.html; Web-based BSS for rice.

Publication types