Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan-Dec;14(1):2106621.
doi: 10.1080/19420862.2022.2106621.

NAb-seq: an accurate, rapid, and cost-effective method for antibody long-read sequencing in hybridoma cell lines and single B cells

Affiliations

NAb-seq: an accurate, rapid, and cost-effective method for antibody long-read sequencing in hybridoma cell lines and single B cells

Hema Preethi Subas Satish et al. MAbs. 2022 Jan-Dec.

Abstract

Despite their common use in research, monoclonal antibodies are currently not systematically sequenced. This can lead to issues with reproducibility and the occasional loss of antibodies with loss of cell lines. Hybridoma cell lines have been the primary means of generating monoclonal antibodies from immunized animals, including mice, rats, rabbits, and alpacas. Excluding therapeutic antibodies, few hybridoma-derived antibody sequences are known. Sanger sequencing has been "the gold standard" for antibody gene sequencing, but this method relies on the availability of species-specific degenerate primer sets for amplification of light and heavy antibody genes and it requires lengthy and expensive cDNA preparation. Here, we leveraged recent improvements in long-read Oxford Nanopore Technologies (ONT) sequencing to develop Nanopore Antibody sequencing (NAb-seq): a three-day, species-independent, and cost-effective workflow to characterize paired full-length immunoglobulin light- and heavy-chain genes from hybridoma cell lines. When compared to Sanger sequencing of two hybridoma cell lines, long-read ONT sequencing was highly accurate, reliable, and amenable to high throughput. We further show that the method is applicable to single cells, allowing efficient antibody discovery in rare populations such as memory B cells. In summary, NAb-seq promises to accelerate identification and validation of hybridoma antibodies as well as antibodies from single B cells used in research, diagnostics, and therapeutics.

Keywords: Antibody sequencing; B cell; hybridoma; long-read; nanopore sequencing; rat B cell cloning; single cell; workflow.

PubMed Disclaimer

Conflict of interest statement

The authors report there are no competing interests to declare.

Figures

Figure 1.
Figure 1.
NAb-seq workflow for parallel sequencing of full-length antibody heavy and light chain sequences from hybridoma cell lines and single B cells.
(a) Electrophoresis showing bands of DNA ranging in size from 400 to 2500 kilobases for cDNA libraries A and B. (b) Bar graph showing 300,000 pass reads for cDNA library A and 200,000 pass reads for cDNA library B. (c) Graph showing that cDNA libraries A and B both have a significant number of sequence reads at lengths in the range of 400–3000 bp. (d) Table of sequence characteristics for the heavy- and light-chain sequences obtained from cDNA libraries A and B.
Figure 2.
NAb-seq of two hybridoma cell lines revealed antibody sequences. (a) cDNA library size and amplification. Total RNA from two hybridoma cell lines was extracted and converted to cDNA followed by amplification and barcoding, generating whole transcriptome cDNA libraries A and B. (b) Basecalling in Guppy’s super-high accuracy mode yielded approximately 0.5 million total pass reads. (c) Read length of pass reads varied from ~400 bp to ~3000 bp. (d) Sequence analysis of cDNA libraries A and B reveals V(D)J recombination, C gene usage and complementarity-determining region 3 (CDR3) amino acid sequence. Consensus calling of antibody transcripts revealed IgG isotype.
(a) Schematics of eight antibody sequences are color-coded to highlight the variable, constant, 5’ UTR, 3ʹUTR, CDR, and unknown regions. Long-read sequences are identical to the Sanger sequences for each of the 7D10 and 3C10 light and heavy chains and contain additional sequences at either end. (b) 16 lines of text shows 100% alignment of the 3C10 light-chain sequences from Sanger and Nanopore sequencing. The Nanopore sequence has extra sequence information at each end that is highlighted as orange and purple, and whose location in GenBank is indicated.
Figure 3.
Antibody sequences from libraries A and B align with 100% accuracy to the 7D10 and 3C10 sequences. (a) Schematic of alignment of 7D10 and 3C10 antibody chains, as derived from Sanger (top row) and Nanopore (bottom row) sequencing methods. Additional bases present in the Nanopore sequence have been annotated with BLAST as indicated. Short sequences at the beginning of reads (gray) were sometimes unable to be annotated with BLAST, nor did they match the primer sequences used during library preparation. (b) Sequence alignment of the light chain sequence of 3C10 as derived from Sanger (top row) and Nanopore (bottom row) sequencing methods. Additional bases present in the Nanopore sequence have been annotated with BLAST. Regions highlighted as in Figure 3a. See Supplementary Figures 1–3 for other chains of 7D10 and 3C10.
(a) Electrophoresis showing bands of DNA ranging in size from 600 to 3500 kilobases for cDNA libraries I, II, III, and IV. (b Bar graph showing >500,000 pass reads for cDNA libraries I, II, III, and IV, and ~300,000 un/misclassified reads. (c) Graph showing that average sequence read length is >1000 for cDNA libraries I and II and <1000 for cDNA libraries III and IV. (d) Table of sequence characteristics for the heavy- and light-chain sequences obtained from cDNA libraries I, II, III, and IV.
Figure 4.
NAb-seq of four single B cells revealed antibody sequences. (a) cDNA library size and amplification. Total RNA from four sorted single B cells were extracted and converted to whole transcriptome cDNA libraries I–IV. (b) Basecalling in Guppy’s super-high accuracy mode yielded approximately 5 million total pass reads. (c) Read length of pass reads varied from ~400 bp to ~5000 bp. (d) Sequence analysis of cDNA libraries I to IV reveals V(D)J recombination, C gene usage and complementarity-determining region 3 (CDR3) amino acid sequence. Consensus calling of antibody transcripts revealed IgM and IgD isotypes.

Similar articles

Cited by

References

    1. Berglund L, Bjorling E, Oksvold P, Fagerberg L, Asplund A, Szigyarto CA, Persson A, Ottosson J, Wernerus H, Nilsson P, et al. A genecentric human protein atlas for expression profiles based on antibodies. Mol Cell Proteomics. 2008;7(10):2019–12. doi:10.1074/mcp.R800013-MCP200. - DOI - PubMed
    1. Slaastad H, Wu W, Goullart L, Kanderova V, Tjonnfjord G, Stuchly J, Kalina T, Holm A, Lund-Johansen F.. Multiplexed immuno-precipitation with 1725 commercially available antibodies to cellular proteins. Proteomics. 2011;11(23):4578–82. doi:10.1002/pmic.201000744. - DOI - PubMed
    1. Begley CG. Six red flags for suspect work. Nature. 2013;497(7450):433–34. doi:10.1038/497433a. - DOI - PubMed
    1. Bradbury A, Pluckthun A. Reproducibility: standardize antibodies used in research. Nature. 2015;518(7537):27–29. doi:10.1038/518027a. - DOI - PubMed
    1. Bradbury AR, Pluckthun A. Getting to reproducible antibodies: the rationale for sequenced recombinant characterized reagents. Protein Eng Des Sel. 2015;28(10):303–05. doi:10.1093/protein/gzv051. - DOI - PubMed

Publication types

Substances

LinkOut - more resources