Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jan;38(Database issue):D167-80.
doi: 10.1093/nar/gkp1016. Epub 2009 Nov 17.

ELM: the status of the 2010 eukaryotic linear motif resource

Affiliations

ELM: the status of the 2010 eukaryotic linear motif resource

Cathryn M Gould et al. Nucleic Acids Res. 2010 Jan.

Abstract

Linear motifs are short segments of multidomain proteins that provide regulatory functions independently of protein tertiary structure. Much of intracellular signalling passes through protein modifications at linear motifs. Many thousands of linear motif instances, most notably phosphorylation sites, have now been reported. Although clearly very abundant, linear motifs are difficult to predict de novo in protein sequences due to the difficulty of obtaining robust statistical assessments. The ELM resource at http://elm.eu.org/ provides an expanding knowledge base, currently covering 146 known motifs, with annotation that includes >1300 experimentally reported instances. ELM is also an exploratory tool for suggesting new candidates of known linear motifs in proteins of interest. Information about protein domains, protein structure and native disorder, cellular and taxonomic contexts is used to reduce or deprecate false positive matches. Results are graphically displayed in a 'Bar Code' format, which also displays known instances from homologous proteins through a novel 'Instance Mapper' protocol based on PHI-BLAST. ELM server output provides links to the ELM annotation as well as to a number of remote resources. Using the links, researchers can explore the motifs, proteins, complex structures and associated literature to evaluate whether candidate motifs might be worth experimental investigation.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The ELM Resource hierarchy represented as a pyramid. ‘Functional Site’ provides a general description of the biology, for example, MAP Kinases have a docking motif in their substrates. There are more than one class of MAPK docking motifs and ELM currently provides two ‘ELM Motif’ entries. These contain the motif regular expression and are annotated with more specific information as well as linking out to remote resources including PubMed, NCBI Taxonomy and GO. At the base of the pyramid are the ‘ELM Instances’ that belong to a given ‘ELM Motif’ entry. The instances are annotated with information about experimental methods and instance quality and link to external resources including UniProt, PDB and PubMed.
Figure 2.
Figure 2.
Details from browse pages for the entry LIG_CAP-Gly_1 (http://elm.eu.org/elmPages/LIG_CAP-Gly_1.html). The upper window shows the description and the regular expression for the motif. Scrolling down past the references and the GO terms (not shown) leads to the table of known instances (middle window). Key information in the table includes whether an instance is a true positive, a link to the UniProt sequence entry and, if available, links to PDB structure entries (49). Clicking on the linked sequence for the instance in the EB1 protein (MARE1_HUMAN) opens a new page summarizing the annotated experimental evidence for the given instance. In this case, the motif has been exhaustively analysed and the supporting evidence is solid.
Figure 3.
Figure 3.
Graphic from the output page of the ELM server queried with Epsin-1 sequence from the UniProt entry EPN1_HUMAN. The key indicates the content of the various coloured bars, e.g. the three connected by dotted arrows. Thirteen true LM instances are annotated either in this sequence or an orthologue from another species (magenta and red bar codes, respectively). Mouseover provides panels with different information depending on context, three examples of which are shown. One indicates an ENTH domain retrieved from SMART. A second points at an annotated DPW motif. The third mouseover provides the most detail: a structure for the ENTH domain (PDB entry d1h0) was used by the SF (41) to report that a cyclin motif candidate is too buried to be significant. Clicking on any object in the graphic will link to further details.
Figure 4.
Figure 4.
Representative results from the CS web interface, displayed with the annotated sequence alignment editor JalView (86). The alignment shows the set of sequences obtained by the CS filter with the human Epsin1 query sequence at top: the sequences belong to several paralogous families of Epsins. Four motif matches are highlighted in the reference sequence (magenta, annotated in this sequence; red, assigned by the instance mapper; blue, unannotated match) and in other sequences that align to the reference motif (green). The left-most match is a known instance of TRG_AP2beta_CARGO_1 and gives a top score of 1.00 despite only being present in sequences belonging to two of the Epsin paralogues. This is because most sequences that lack the motif have gaps aligned to it that do not affect the CS score. The second motif is a candidate instance for MOD_PKA_2 but is poorly conserved, scoring 0.05. This candidate would probably not be worth investigating unless there was prior evidence of phosphorylation at the site. The remaining two motifs are known instances of LIG_Clathr_ClatBox_1 and LIG_EH_1, which obtain the maximum CS score since they are conserved in all Epsin paralogues.
Figure 5.
Figure 5.
Flow scheme for the ELM Instance Mapper. For each predicted LM from an ELM database search, a PHI-BLAST search is performed against a database containing all sequences with known instances of the predicted LM. Input to PHI-BLAST is the query sequence and the ELM Regular Expression (which is adapted for use with PHI-BLAST). Each of the aligned motifs, between query and ELM instance sequence, are evaluated and scored (see main text). If the motif in the ELM instance sequence is a known instance, and the calculated score is above a threshold (Sei ≥ 0.3), it is reported as a mapped instance. Both the ELM instance mapper and the underlying PHI-BLAST results are returned to the ELM server, for the user to inspect.
Figure 6.
Figure 6.
Workflow diagram illustrating how a user might explore LM candidates with ELM. The pipeline proceeds through three main phases utilizing the ELM resource (beige background) ELM associated tools (green) and more general bioinformatics resources (pink). Candidate LMs can be rejected by ELM filters if in unsuitable contexts. Sequence conservation and enrichment in interaction data using DiLiMot or SLiMFinder can provide additional scores to rank motifs. In the final phase any potentially relevant bioinformatics resources should be examined to provide further context to motif candidates. If promising candidates survive this process, the end point of the bioinformatics pipeline has been reached and laboratory validation is now required.

Similar articles

  • ELM server: A new resource for investigating short functional sites in modular eukaryotic proteins.
    Puntervoll P, Linding R, Gemünd C, Chabanis-Davidson S, Mattingsdal M, Cameron S, Martin DM, Ausiello G, Brannetti B, Costantini A, Ferrè F, Maselli V, Via A, Cesareni G, Diella F, Superti-Furga G, Wyrwicz L, Ramu C, McGuigan C, Gudavalli R, Letunic I, Bork P, Rychlewski L, Küster B, Helmer-Citterich M, Hunter WN, Aasland R, Gibson TJ. Puntervoll P, et al. Nucleic Acids Res. 2003 Jul 1;31(13):3625-30. doi: 10.1093/nar/gkg545. Nucleic Acids Res. 2003. PMID: 12824381 Free PMC article.
  • A structure filter for the Eukaryotic Linear Motif Resource.
    Via A, Gould CM, Gemünd C, Gibson TJ, Helmer-Citterich M. Via A, et al. BMC Bioinformatics. 2009 Oct 24;10:351. doi: 10.1186/1471-2105-10-351. BMC Bioinformatics. 2009. PMID: 19852836 Free PMC article.
  • ELM-the Eukaryotic Linear Motif resource-2024 update.
    Kumar M, Michael S, Alvarado-Valverde J, Zeke A, Lazar T, Glavina J, Nagy-Kanta E, Donagh JM, Kalman ZE, Pascarelli S, Palopoli N, Dobson L, Suarez CF, Van Roey K, Krystkowiak I, Griffin JE, Nagpal A, Bhardwaj R, Diella F, Mészáros B, Dean K, Davey NE, Pancsa R, Chemes LB, Gibson TJ. Kumar M, et al. Nucleic Acids Res. 2024 Jan 5;52(D1):D442-D455. doi: 10.1093/nar/gkad1058. Nucleic Acids Res. 2024. PMID: 37962385 Free PMC article.
  • Computational prediction of protein-protein interactions.
    Obenauer JC, Yaffe MB. Obenauer JC, et al. Methods Mol Biol. 2004;261:445-68. doi: 10.1385/1-59259-762-9:445. Methods Mol Biol. 2004. PMID: 15064475 Review.
  • Identification of motifs in protein sequences.
    Sonnhammer EL, Wolfsberg TG. Sonnhammer EL, et al. Curr Protoc Cell Biol. 2001 May;Appendix 1:Appendix 1C. doi: 10.1002/0471143030.cba01cs00. Curr Protoc Cell Biol. 2001. PMID: 18228275 Review.

Cited by

References

    1. Diella F, Haslam N, Chica C, Budd A, Michael S, Brown NP, Trave G, Gibson TJ. Understanding eukaryotic linear motifs and their role in cell signaling and regulation. Front. Biosci. 2008;13:6580–6603. - PubMed
    1. Neduva V, Russell RB. Peptides mediating interaction networks: new leads at last. Curr. Opin. Biotechnol. 2006;17:465–471. - PubMed
    1. Kadaveru K, Vyas J, Schiller MR. Viral infection and human disease—insights from minimotifs. Front. Biosci. 2008;13:6455–6471. - PMC - PubMed
    1. Fox-Erlich S, Schiller MR, Gryk MR. Structural conservation of a short, functional, peptide-sequence motif. Front. Biosci. 2009;14:1143–1151. - PMC - PubMed
    1. Petsalaki E, Russell RB. Peptide-mediated interactions in biological systems: new discoveries and applications. Curr. Opin. Biotechnol. 2008;19:344–350. - PubMed

Publication types