PseqIP: a nonredundant and exhaustive protein sequence data bank generated from 4 major existing collections

J M Claverie¹, L Bricault

PMID: 3449852
DOI: 10.1002/prot.340010110

Proteins. 1986 Sep;1(1):60-5.

Affiliation

¹ Computer Science Unit, Institut Pasteur, Paris, France.

Abstract

Four major protein sequence data collections (NBRF-PIR, PSD-Kyoto, PGtrans, and NEWAT) have been merged into a single nonredundant data bank called PseqIP. The data bank entries were automatically matched by a heuristic computer program relying on the fast computation of the number of tetrapeptides shared by two sequences. PseqIP 1.0 includes 6,068 different protein sequences for a total of 1,357,067 residues, representing most of the available sequence information to date. During the course of this work, we found about 600 occurrences of a protein sequence recorded with a one-amino-acid variation in at least two different data banks. A flat file (ASCII computer-readable format) version of PseqIP 1.0, well-suited for exhaustive homology searches and statistical sequence analysis, is available from our laboratory.
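The tetrapeptide-sharing idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' program: the function names, the multiset counting convention, and the 0.9 threshold are assumptions chosen for illustration only, since the abstract does not specify how the count is normalized or thresholded.

```python
from collections import Counter


def tetrapeptides(seq, k=4):
    """Count every overlapping k-residue window in seq (k=4 gives tetrapeptides)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def shared_tetrapeptides(a, b, k=4):
    """Number of k-residue windows common to both sequences (multiset intersection)."""
    return sum((tetrapeptides(a, k) & tetrapeptides(b, k)).values())


def likely_same_entry(a, b, threshold=0.9, k=4):
    """Hypothetical matching rule: flag two entries as candidate duplicates when
    the shared-window count covers at least `threshold` of the shorter sequence's
    windows. The threshold value is an assumption, not taken from the paper."""
    windows_in_shorter = min(len(a), len(b)) - k + 1
    if windows_in_shorter <= 0:
        return False  # sequences shorter than k residues have no windows to compare
    return shared_tetrapeptides(a, b, k) / windows_in_shorter >= threshold


if __name__ == "__main__":
    original = "MKTAYIAKQR"
    variant = "MKTAGIAKQR"  # one substitution (Y -> G) at the fifth residue
    print(shared_tetrapeptides(original, original))
    print(shared_tetrapeptides(original, variant))
```

A single substitution removes at most four overlapping windows, so on sequences of realistic length two entries differing by one amino acid still share nearly all of their tetrapeptides. That property is what would let a count of this kind surface the one-residue variants described above, although the exact criterion used for PseqIP is not given in the abstract.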
