Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Oct 17;8(1):967.
doi: 10.1038/s41467-017-01104-3.

Mass spectrometry sequencing of long digital polymers facilitated by programmed inter-byte fragmentation

Affiliations

Mass spectrometry sequencing of long digital polymers facilitated by programmed inter-byte fragmentation

Abdelaziz Al Ouahabi et al. Nat Commun. .

Abstract

In the context of data storage miniaturization, it was recently shown that digital information can be stored in the monomer sequences of non-natural macromolecules. However, the sequencing of such digital polymers is currently limited to short chains. Here, we report that intact multi-byte digital polymers can be sequenced in a moderate resolution mass spectrometer and that full sequence coverage can be attained without requiring pre-analysis digestion or the help of sequence databases. In order to do so, the polymers are designed to undergo controlled fragmentations in collision-induced dissociation conditions. Each byte of the sequence is labeled by an identification tag and a weak alkoxyamine group is placed between 2 bytes. As a consequence of this design, the NO-C bonds break first upon collisional activation, thus leading to a pattern of mass tag-shifted intact bytes. Afterwards, each byte is individually sequenced in pseudo-MS3 conditions and the whole sequence is found.Digital information can be stored in monomer sequences of non-natural macromolecules, but only short chains can be read. Here the authors show long multi-byte digital polymers sequenced in a moderate resolution mass spectrometer. Full sequence coverage can be attained without pre-analysis digestion or the help from sequence databases.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Fig. 1
Fig. 1
General concept studied herein for the sequencing of long digital polymer chains. a Molecular structure of the sequence-coded polymers prepared by automated phosphoramidite chemistry. These digital polymers contain n + 1 coded bytes noted in red. A byte is a sequence of eight coded monomers that represent 8 bits. Two consecutive bytes are separated by a linker noted in black, which contains a NO-C bond that can be preferentially cleaved during MS/MS analysis. In order to sort out the bytes after MS/MS cleavage, n bytes of the sequence are labeled with a mass tag noted in blue. b Molecular structure and mass of the two coded synthons that define the binary code in the polymers. c Molecular structure and mass of the mass tags that are used as bytes labels. In order to induce identifiable mass shifts after MS/MS cleavage, the mass of a byte tag (noted in blue) shall not be a multiple of 28, which is the mass difference between a 0 and a 1 coded unit. In addition, the mass difference between two tags (noted in grey) shall not be a multiple of 28. d Schematic representation of the mass spectrometry sequencing of a digital polymer containing 4 bytes of information. The polymer is first analyzed in MS/MS conditions, which lead to the favored cleavage of the weak NO-C bonds (depicted in yellow inside the grey spacers). Since they carry mass tags, the resulting cleaved bytes are sorted out by mass in the MS/MS spectrum (the displayed MS2 cartoon is idealized for clarity). Afterwards, each byte can be easily sequenced in MS3 conditions and the whole binary sequence can be deciphered
Fig. 2
Fig. 2
Sequencing of a 4-byte digital polymer that contains the ASCII-encoded word Byte. a High-resolution electrospray mass spectrum (MS1) obtained in the negative ion mode for a 4-byte digital polymer (Supplementary Table 1, Entry 5). The upper numbers represent the different charge states observed for the polymer. Dark grey diamonds indicate in-source fragments (see Supplementary Table 2 for a detailed interpretation of each peak) and dark grey squares designate different charge states of two different synthesis impurities. b MS2 spectrum (0.56 eV, center of mass frame) obtained by collision-induced dissociation of the [M-12H]12− precursor ion, which carries on average three charges per byte. In this case, the MS/MS single-byte, double-byte, and triple-byte fragments are predominantly observed as trianions, hexa-anions, and nona-anions, respectively. Other charge states can be observed to a minor extent and are denoted by dark grey circles (see Supplementary Table 2 for a detailed interpretation of each peak). c Molecular sequencing (pseudo-MS3) of a byte fragment obtained by collision-induced dissociation (0.56 eV, center of mass frame) of the precursor trianion [M-3H]3−. For clarity, only the sequencing of byte 4 is shown as an example in this figure. The sequencing of bytes 1–3 is shown in Supplementary Fig. 6
Fig. 3
Fig. 3
MS2 spectrum of an 8-byte digital polymer containing the ASCII-encoded word Sequence. This spectrum (0.59 eV, center of mass frame) was obtained by collision-induced dissociation of the [M-24H]24− precursor ion of polymer 11 in Supplementary Table 1. In this case, the MS/MS single-byte, double-byte, triple-byte, and tetra-byte fragments are predominantly observed as trianions, hexa-anions, nona-anions, and dodeca-anions, respectively. Other charge states can be observed to a minor extent and are denoted by dark grey circles (see Supplementary Figs. 21b and 22 for a detailed interpretation of each peak). The subsequent pseudo-MS3 sequencing of all bytes is shown in Supplementary Figs. 23–25

Similar articles

Cited by

References

    1. Zhirnov V, Zadegan RM, Sandhu GS, Church GM, Hughes WL. Nucleic acid memory. Nat. Mater. 2016;15:366–370. doi: 10.1038/nmat4594. - DOI - PMC - PubMed
    1. Colquhoun H, Lutz J-F. Information-containing macromolecules. Nat. Chem. 2014;6:455–456. doi: 10.1038/nchem.1958. - DOI - PubMed
    1. Lutz J-F, Ouchi M, Liu DR, Sawamoto M. Sequence-controlled polymers. Science. 2013;341:1238149. doi: 10.1126/science.1238149. - DOI - PubMed
    1. Church GM, Gao Y, Kosuri S. Next-generation digital information storage in DNA. Science. 2012;337:1628–1628. doi: 10.1126/science.1226355. - DOI - PubMed
    1. Goldman N, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature. 2013;494:77–80. doi: 10.1038/nature11875. - DOI - PMC - PubMed

Publication types