Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1996;6(2):109-17.
doi: 10.3109/10425179609010197.

Experiment files and their application during large-scale sequencing projects

Experiment files and their application during large-scale sequencing projects

J K Bonfield et al. DNA Seq. 1996.

Abstract

The data for large scale sequencing projects are passed through several processing steps prior to assembly, and post-assembly processing generally requires knowledge of more than just the sequence of each reading. We address here the problem of providing data to individual programs and of combining all the tasks into a single process. The solution comprises two components: a file format (experiment file format) that stores information about readings, and a script (PREGAP) that controls the creation and use of experiment files by the processing programs. PREGAP can take a batch of data from a variety of sequencing instruments, gather information about each reading, and then scan the reading to select the 3' end of the good quality data, mark sequencing vector, other cloning vector sequences, and Alu segments. The results of all these operations are added to the experiment file for each reading, ready for processing by the assembly program. Experiment files also provide a mechanism for using alternative assembly engines with our package.

PubMed Disclaimer

Publication types

LinkOut - more resources