PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions
- PMID: 39720898
- PMCID: PMC11669117
- DOI: 10.1002/pro.70004
Abstract
Protein-protein interactions are often mediated by a modular peptide recognition domain binding to a short linear motif (SLiM) in the disordered region of another protein. To understand the features of SLiMs that are important for binding and to identify motif instances that are important for biological function, it is useful to examine the evolutionary conservation of motifs across homologous proteins. However, the intrinsically disordered regions (IDRs) in which SLiMs reside evolve rapidly. Consequently, multiple sequence alignment (MSA) of IDRs often misaligns SLiMs and underestimates their conservation. We present PairK (pairwise k-mer alignment), an MSA-free method to align and quantify the relative local conservation of subsequences within an IDR. Lacking a ground truth for conservation, we tested PairK on the task of distinguishing biologically important motif instances from background motifs, under the assumption that biologically important motifs are more conserved. The method outperforms both standard MSA-based conservation scores and a modern LLM-based conservation score predictor. PairK can quantify conservation over wider phylogenetic distances than MSAs, indicating that some SLiMs are more conserved than MSA-based metrics imply. PairK is available as an open-source Python package at https://github.com/jacksonh1/pairk. It is designed to be easily adapted for use with other SLiM tools and for diverse applications.
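The general idea of MSA-free, pairwise k-mer comparison can be illustrated with a minimal sketch. This is not the PairK package or its API: the function names, the identity-based scoring, and the z-score normalization below are all assumptions chosen for illustration, and the published method may differ in each of these respects. The sketch takes every k-mer in a query IDR, finds its best gapless match in each homolog IDR independently, and then expresses each k-mer's average match score relative to the other k-mers in the same query.

```python
from statistics import mean, pstdev


def kmers(seq, k):
    """Return every overlapping k-mer in seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def identity_score(a, b):
    """Toy scoring (an assumption): count positions where two k-mers agree."""
    return sum(x == y for x, y in zip(a, b))


def best_match(query_kmer, homolog_idr):
    """Gapless search for the best-scoring k-mer in a single homolog IDR."""
    candidates = kmers(homolog_idr, len(query_kmer))
    if not candidates:
        return None, 0
    best = max(candidates, key=lambda c: identity_score(query_kmer, c))
    return best, identity_score(query_kmer, best)


def relative_conservation(query_idr, homolog_idrs, k):
    """Mean best-match score per query k-mer, as a z-score over the query."""
    raw = []
    for q in kmers(query_idr, k):
        # Each homolog is searched on its own; no multiple alignment is built.
        raw.append(mean(best_match(q, h)[1] for h in homolog_idrs))
    mu, sigma = mean(raw), pstdev(raw)
    if sigma == 0:
        return [0.0] * len(raw)
    return [(r - mu) / sigma for r in raw]


# Hypothetical sequences: the PPLP k-mer sits at different offsets in each IDR.
query = "GSTPPLPAKE"
homologs = ["MMPPLPQQ", "DPPLPWW"]
z_scores = relative_conservation(query, homologs, k=4)
top_kmer = kmers(query, 4)[z_scores.index(max(z_scores))]
```

In this toy input the k-mer `PPLP` receives the highest relative score even though it occupies a different position in each homolog, which is the situation where a column-based MSA score can underestimate conservation.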
Keywords: conservation; intrinsically disordered proteins; multiple sequence alignment; short linear motif.
© 2024 The Protein Society.
Update of
- PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions. bioRxiv [Preprint]. 2024 Jul 24:2024.07.23.604860. doi: 10.1101/2024.07.23.604860. Update in: Protein Sci. 2025 Jan;34(1):e70004. doi: 10.1002/pro.70004. PMID: 39091826.