Bioinformatics. 2001 Mar;17(3):282-3.
doi: 10.1093/bioinformatics/17.3.282.

Clustering of highly homologous sequences to reduce the size of large protein databases

W Li et al. Bioinformatics. 2001 Mar.

Abstract

We present a fast and flexible program for clustering large protein databases at different sequence identity levels. It takes less than 2 h for the all-against-all sequence comparison and clustering of the non-redundant protein database of over 560,000 sequences on a high-end PC. The output database, including only the representative sequences, can be used for more efficient and sensitive database searches.
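The abstract does not describe how the clustering is carried out, so the following is only a minimal sketch of one common way to reduce a sequence set to representatives at a chosen identity level: a longest-first greedy pass. It is not presented as the authors' program. The function names `identity` and `greedy_cluster`, the toy sequences, and the use of Python's `difflib` as a crude stand-in for an alignment-based identity score are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def identity(a: str, b: str) -> float:
    """Crude identity proxy (assumption): matched characters / shorter length.

    A real tool would compute identity from a sequence alignment instead.
    """
    if not a or not b:
        return 0.0
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return matched / min(len(a), len(b))


def greedy_cluster(seqs: dict[str, str], threshold: float) -> dict[str, list[str]]:
    """Return a mapping of representative id -> ids of all cluster members."""
    clusters: dict[str, list[str]] = {}
    # Visit sequences longest first (ties broken by id) so that each
    # representative is the longest member of its cluster.
    for name in sorted(seqs, key=lambda n: (-len(seqs[n]), n)):
        for rep in clusters:
            # Join the first existing cluster whose representative is similar enough.
            if identity(seqs[name], seqs[rep]) >= threshold:
                clusters[rep].append(name)
                break
        else:
            # No representative matched: this sequence starts a new cluster.
            clusters[name] = [name]
    return clusters


if __name__ == "__main__":
    # Hypothetical toy input: "b" is a one-residue truncation of "a".
    toy = {
        "a": "MKTAYIAKQRQISFVKSHFSRQ",
        "b": "MKTAYIAKQRQISFVKSHFSR",
        "c": "GGGGGGGGGGWWWWWWWWWW",
    }
    print(greedy_cluster(toy, threshold=0.9))
```

The keys of the returned mapping play the role of the "representative sequences" mentioned in the abstract. Lowering `threshold` merges more sequences into each cluster and yields a smaller output set. This sketch compares every sequence against every representative, so it would not scale to a database of the size reported above without additional filtering.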
