. 2006 Jul 1;34(Web Server issue):W350-5.

doi: 10.1093/nar/gkl159.

DILIMOT: discovery of linear motifs in proteins

Victor Neduva¹, Robert B Russell

Affiliations

PMID: 16845024
PMCID: PMC1538856
DOI: 10.1093/nar/gkl159

DILIMOT: discovery of linear motifs in proteins

Victor Neduva et al. Nucleic Acids Res. 2006.

. 2006 Jul 1;34(Web Server issue):W350-5.

doi: 10.1093/nar/gkl159.

Authors

Victor Neduva¹, Robert B Russell

Affiliation

¹ EMBL, Meyerhofstrasse 1, 69117 Heidelberg, Germany.

PMID: 16845024
PMCID: PMC1538856
DOI: 10.1093/nar/gkl159

Abstract

Discovery of protein functional motifs is critical in modern biology. Small segments of 3-10 residues play critical roles in protein interactions, post-translational modifications and trafficking. DILIMOT (DIscovery of LInear MOTifs) is a server for the prediction of these short linear motifs within a set of proteins. Given a set of sequences sharing a common functional feature (e.g. interaction partner or localization) the method finds statistically over-represented motifs likely to be responsible for it. The input sequences are first passed through a set of filters to remove regions unlikely to contain instances of linear motifs. Motifs are then found in the remaining sequence and ranked according to a statistic that measure over-representation and conservation across homologues in related species. The results are displayed via a visual interface for easy perusal. The server is available at http://dilimot.embl.de.

PubMed Disclaimer

Figures

**Figure 1**
The server process and output. (A) Schematic showing how submitted sequences are filtered, motifs found and arranged into a ranked list sorted by P (left). When the species is provided, sequences are assigned to the orthologous groups, species–specific probabilities for over-represented motifs are calculated (coloured box) the list resorted by S_CONS (right). (B) Example of server output. A list of putative motifs is reported in an interactive table (left), which gives general details for each of them. Clicking on each motif launches an additional page (right) showing sequences containing the motif, where the motif is found in them and the degree to which the motif is conserved in related species. Motif locations (red bars) and other features found in the sequences, such as domains, are shown graphically and detailed below each image.

**Figure 2**
The EB1 motif SxIP detected by the server. (A) A sequence logo (27) for the EB1 binding motif, generated using all instances of the motif in the input set. (B) Examples of EB1 binding proteins from the input set (represented as boxes) and multiple alignments of putative motif containing regions. Dark blue regions in the boxes denote those removed by the domain and redundancy filters. A known EB1 binding region (in APC) lies at the C-terminus of a Pfam domain. To avoid its removal, we simply cut the sequence down to this region alone (switching the Pfam filter off will have similar effect). Sequences for the motif-containing region are shown aligned to the best homologues in closely related species. Amino acids in the alignments are coloured according to residue type: blue, positive; red, negative; light-blue, small; yellow, hydrophobic; green, aromatic; magenta, polar; Proline, orange. Positions within the predicted motif are denoted by red triangles. Species abbreviations: Hsa, *H.sapiens*; Mmu, *M.musculus*; Rno, *R.norwegicus*; Gga, *G.gallus*; Fru, *F.rubripes*; Cgi, *Candida glabrata*; Kla, *Kluyveromyces lactis*; Kwa, *Kluyveromyces waltii*; Ego, *Eremothecium gossypii*; Sce, *Saccharomyces cerevisiae*; Dha, *Debaryomyces hansenii*.

**Figure 3**
Features of known linear motifs. (A) Distributions of length (red), number of specified (i.e. non-‘x’; green) and invariant (i.e. a single specific residue; blue) positions for 120 known linear motifs extracted from the ELM database (7). Note that four motifs with lengths of 13–18 are not shown in the first (red) plot for clarity. (B) Degree to which residues are over-represented in known motifs. Numbers show the ratio of the abundance of the residue within the 120 motifs from ELM to the abundance in globular domains as computed from the protein databank [PDB; (28)]. ‘ALL’ includes all 120, ‘LIG’ are the 66 ligand binding, ‘TRG’ the 16 targeting and ‘MOD’ the 30 modification site motifs. For 7 of 40 residues in the latter two categories there were too few counts to obtain a confident measurement (i.e. <5); these are denoted by an asterix. Note that we have not included a fourth ELM category CLV, which includes protein cleavage sites, as there were too few examples to compute meaningful numbers. Colour scheme: red, strongly favoured in linear motifs compared to globular proteins; orange, moderately favoured; light-blue moderately disfavoured; blue strongly disfavoured.

See this image and copyright information in PMC

Cited by

+TIPs: SxIPping along microtubule ends.
Kumar P, Wittmann T. Kumar P, et al. Trends Cell Biol. 2012 Aug;22(8):418-28. doi: 10.1016/j.tcb.2012.05.005. Epub 2012 Jun 28. Trends Cell Biol. 2012. PMID: 22748381 Free PMC article. Review.
The SLiMDisc server: short, linear motif discovery in proteins.
Davey NE, Edwards RJ, Shields DC. Davey NE, et al. Nucleic Acids Res. 2007 Jul;35(Web Server issue):W455-9. doi: 10.1093/nar/gkm400. Epub 2007 Jun 18. Nucleic Acids Res. 2007. PMID: 17576682 Free PMC article.
Finding motif pairs in the interactions between heterogeneous proteins via bootstrapping and boosting.
Kim J, Huang DS, Han K. Kim J, et al. BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S57. doi: 10.1186/1471-2105-10-S1-S57. BMC Bioinformatics. 2009. PMID: 19208160 Free PMC article.
Sequence- and interactome-based prediction of viral protein hotspots targeting host proteins: a case study for HIV Nef.
Sarmady M, Dampier W, Tozeren A. Sarmady M, et al. PLoS One. 2011;6(6):e20735. doi: 10.1371/journal.pone.0020735. Epub 2011 Jun 28. PLoS One. 2011. PMID: 21738584 Free PMC article.
Experimental detection of short regulatory motifs in eukaryotic proteins: tips for good practice as well as for bad.
Gibson TJ, Dinkel H, Van Roey K, Diella F. Gibson TJ, et al. Cell Commun Signal. 2015 Nov 18;13:42. doi: 10.1186/s12964-015-0121-y. Cell Commun Signal. 2015. PMID: 26581338 Free PMC article. Review.

See all "Cited by" articles

References

1. Letunic I., Copley R.R., Schmidt S., Ciccarelli F.D., Doerks T., Schultz J., Ponting C.P., Bork P. SMART 4.0: towards genomic data integration. Nucleic Acids Res. 2004;32:D142–D144. - PMC - PubMed
1. Bateman A., Birney E., Cerruti L., Durbin R., Etwiller L., Eddy S.R., Griffiths-Jones S., Howe K.L., Marshall M., Sonnhammer E.L. The Pfam protein families database. Nucleic Acids Res. 2002;30:276–280. - PMC - PubMed
1. Eddy S.R. Profile hidden Markov models. Bioinformatics. 1998;14:755–763. - PubMed
1. Madera M., Gough J. A comparison of profile hidden Markov model procedures for remote homology detection. Nucleic Acids Res. 2002;30:4321–4328. - PMC - PubMed
1. Bork P., Gibson T.J. Applying motif and profile searches. Meth. Enzymol. 1996;266:162–184. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

DILIMOT: discovery of linear motifs in proteins

Affiliation

DILIMOT: discovery of linear motifs in proteins

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources