Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 19:3:24.
doi: 10.1038/s41538-019-0056-6. eCollection 2019.

Food authentication from shotgun sequencing reads with an application on high protein powders

Affiliations

Food authentication from shotgun sequencing reads with an application on high protein powders

Niina Haiminen et al. NPJ Sci Food. .

Abstract

Here we propose that using shotgun sequencing to examine food leads to accurate authentication of ingredients and detection of contaminants. To demonstrate this, we developed a bioinformatic pipeline, FASER (Food Authentication from SEquencing Reads), designed to resolve the relative composition of mixtures of eukaryotic species using RNA or DNA sequencing. Our comprehensive database includes >6000 plants and animals that may be present in food. FASER accurately identified eukaryotic species with 0.4% median absolute difference between observed and expected proportions on sequence data from various sources including sausage meat, plants, and fish. FASER was applied to 31 high protein powder raw factory ingredient total RNA samples. The samples mostly contained the expected source ingredient, chicken, while three samples unexpectedly contained pork and beef. Our results demonstrate that DNA/RNA sequencing of food ingredients, combined with a robust analysis, can be used to find contaminants and authenticate food ingredients in a single assay.

Keywords: Food microbiology; Metagenomics.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe authors were employed by private or academic organizations as described in the author affiliations at the time this work was completed. IBM Corporation, Mars Incorporated, and Bio-Rad Laboratories are members of the Consortium for Sequencing the Food Supply Chain. The authors declare no other competing interests.

Figures

Fig. 1
Fig. 1
Pipeline applied to food sample sequencing data to determine matrix species and their relative proportions. In the taxonomic assignment step with exemplary diagram, reads are placed on the lowest common ancestor (LCA) of the nodes that they hit, in case of multiple hits per read. In the relative quantification step the read counts at internal nodes are re-assigned to the species at the leaf nodes
Fig. 2
Fig. 2
Illustration of the minimum size of a subsample to obtain a desired limit of detection. The required number of reads is shown as a function of frequency of species S (in the full sample). In this example with a total number of N = 300 million reads, we desire with high probability P (here P ≥ 0.9999) to have limit of detection at least L = 100 sampled reads coming from species S when S is present. For example, when frequency of S is 0.1% (x = 0.001), a subsample of 141,499 reads from the total 300 million reads is required (marked with a square). When frequency is S is 2% (x = 0.02), fewer than 10,000 reads are required (marked with a circle)
Fig. 3
Fig. 3
FASER pipeline accuracy on two simulated food mixtures. a Simulated food matrix 1. b Simulated food matrix 2. Insets are shown separately to accommodate different scales. Details regarding the input genomes are given in Supplementary Table 6
Fig. 4
Fig. 4
Raw high protein powder (poultry meal) samples’ FASER results showing unexpected non-chicken components. a Percentage of expected content (chicken). b Percentage of unexpected content showing species with relative proportion >0.1% of total matrix composition. Content from Bos taurus (beef) and Sus scrofa (pork) is detected for samples MFMB-04, MFMB-20, and MFMB-38

References

    1. Vierk K, Falci K, Wolyniak C, Klontz KC. Recalls of foods containing undeclared allergens reported to the US Food and Drug Administration, fiscal year 1999. J. Allergy Clin. Immunol. 2002;109:1022–1026. doi: 10.1067/mai.2002.124500. - DOI - PubMed
    1. Brooks S, Elliott CT, Spence M, Walsh C, Dean M. Four years post-horsegate: an update of measures and actions put in place following the horsemeat incident of 2013. npj Sci. Food. 2017;1:5. doi: 10.1038/s41538-017-0007-z. - DOI - PMC - PubMed
    1. Lo Y-T, Shaw P-C. DNA-based techniques for authentication of processed food and food supplements. Food Chem. 2018;240:767–774. doi: 10.1016/j.foodchem.2017.08.022. - DOI - PubMed
    1. Ripp F, et al. All-Food-Seq (AFS): a quantifiable screen for species in biological samples by deep DNA sequencing. BMC Genomics. 2014;15:639. doi: 10.1186/1471-2164-15-639. - DOI - PMC - PubMed
    1. Barbosa Cristina, Nogueira Sofia, Saraiva Ricardo, Chaves Sandra. Food Traceability and Authenticity. Boca Raton, FL : CRC Press, 2017. | Series: Food biology series | “A science publishers book.”: CRC Press; 2017. Next Generation Sequencing in Food Authenticity and Safety; pp. 164–179.