Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 26:3:900573.
doi: 10.3389/falgy.2022.900573. eCollection 2022.

Development of a Sequence Searchable Database of Celiac Disease-Associated Peptides and Proteins for Risk Assessment of Novel Food Proteins

Affiliations

Development of a Sequence Searchable Database of Celiac Disease-Associated Peptides and Proteins for Risk Assessment of Novel Food Proteins

Plaimein Amnuaycheewa et al. Front Allergy. .

Abstract

Celiac disease (CeD) is an autoimmune enteropathy induced by prolamin and glutelin proteins in wheat, barley, rye, and triticale recognized by genetically restricted major histocompatibility (MHC) receptors. Patients with CeD must avoid consuming these proteins. Regulators in Europe and the United States expect an evaluation of CeD risks from proteins in genetically modified (GM) crops or novel foods for wheat-related proteins. Our database includes evidence-based causative peptides and proteins and two amino acid sequence comparison tools for CeD risk assessment. Sequence entries are based on the review of published studies of specific gluten-reactive T cell activation or intestinal epithelial toxicity. The initial database in 2012 was updated in 2018 and 2022. The current database holds 1,041 causative peptides and 76 representative proteins. The FASTA sequence comparison of 76 representative CeD proteins provides an insurance for possible unreported epitopes. Validation was conducted using protein homologs from Pooideae and non-Pooideae monocots, dicots, and non-plant proteins. Criteria for minimum percent identity and maximum E-scores are guidelines. Exact matches to any of the 1,041 peptides suggest risks, while FASTA alignment to the 76 CeD proteins suggests possible risks. Matched proteins should be tested further by CeD-specific CD4/8+ T cell assays or in vivo challenges before their use in foods.

Keywords: Pooideae; T-cell epitopes; celiac disease; gluten; peptide database; prolamin; risk assessment; sequence comparison.

PubMed Disclaimer

Conflict of interest statement

RG declares limited funding from six biotechnology companies from 2009 to 2012 for support of the database management. Unilever SEAC and NuSeed Americas provided limited funding to the AllergenOnline.org database from 2018 to 2021. These companies did not contribute to or see the article prior to submission. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Taxonomic tree of cereal and dicotyledonous plants based on NCBI taxonomy. Published evidence of CeD safe foods show reactions only to grains of the Pooideae subfamily of grasses.
Figure 2
Figure 2
(A) Amino acid sequence alignments of an α-gliadin (NCBI accession number: CAB76964) with 53 overlapping CeD-associated peptides identified with the exact sequence match tool; (B) full FASTA sequence alignment results with homology scores of the α-gliadin theoretically substituted with 13 alanine residues; (C) full FASTA sequence alignment results with homology scores of the α-gliadin theoretically substituted with 11 alanine residues.
Figure 3
Figure 3
Proposed evaluation criteria to predict the likelihood of a query protein to cause elicitation of CeD. An exact match to any of the 1,041 peptides indicates probable rejection. Alternatively, a FASTA3 alignment with an E-score limit of 1e-14 and minimum alignment length > 100 AA with an identity percent of the protein at 45% should trigger testing or rejection.

References

    1. van Putten MC, Frewer LJ, Gilissen LJWJ, Gremmen B, Peijnenburg AACM, Wichers HJ. Novel foods and food allergies: a review of the issues. Trends Food Sci Technol. (2006) 17:289 10.1016/j.tifs.2005.11.010 - DOI
    1. Codex Alimentarius Commission . In: Alinorm 03/34: Joint FAO/WHO Food Standard Programme, Codex Alimentarious Commission, Twenty-Fifth Session, Rome, Italy, 30 June−5 July, 2003. Appendix III, Guideline for the conduct of food safety assessment of foods derived from recombinant-DNA plants, and Appendix IV, Annex on the assessment of possible allergenicity. Twenty-Fifth Session (FA); Joint FAO/WHO Food Standards Programme. (2003). p. 47–60.
    1. Rubio-Tapia A, Ludvigsson JF, Brantner TL, Murray JA, Everhart JE. The prevalence of celiac disease in the United States. Am J Gastroenterol. (2012) 107:1538–44; quiz 1537, 1545. 10.1038/ajg.2012.219 - DOI - PubMed
    1. Singh P, Arora A, Strand TA, Leffler DA, Catassi C, Green PH, et al. . Global prevalence of celiac disease: systematic review and meta-analysis. Clin Gastroenterol Hepatol. (2018) 16:823–36.e2. 10.1016/j.cgh.2017.06.037 - DOI - PubMed
    1. Cukrowska B, Sowinska A, Bierla JB, Czarnowska E, Rybak A, Grzybowska-Chlebowczyk U. Intestinal epithelium, intraepithelial lymphocytes and the gut microbiota - key players in the pathogenesis of celiac disease. World J Gastroenterol. (2017) 23:7505–18. 10.3748/wjg.v23.i42.7505 - DOI - PMC - PubMed