Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 9;3(1):184.
doi: 10.1038/s42004-020-00431-9.

Reading mixtures of uniform sequence-defined macromolecules to increase data storage capacity

Affiliations

Reading mixtures of uniform sequence-defined macromolecules to increase data storage capacity

Maximiliane Frölich et al. Commun Chem. .

Abstract

In recent years, the field of molecular data storage has emerged from a niche to a vibrant research topic. Herein, we describe a simultaneous and automated read-out of data stored in mixtures of sequence-defined oligomers. Therefore, twelve different sequence-defined tetramers and three hexamers with different mass markers and side chains are successfully synthesised via iterative Passerini three-component reactions and subsequent deprotection steps. By programming a straightforward python script for ESI-MS/MS analysis, it is possible to automatically sequence and thus read-out the information stored in these oligomers within one second. Most importantly, we demonstrate that the use of mass-markers as starting compounds eases MS/MS data interpretation and furthermore allows the unambiguous reading of sequences of mixtures of sequence-defined oligomers. Thus, high data storage capacity considering the field of synthetic macromolecules (up to 64.5 bit in our examples) can be obtained without the need of synthesizing long sequences, but by mixing and simultaneously analysing shorter sequence-defined oligomers.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Concept of the automated read-out of a mixture of sequence-defined molecules by varying 12 different aldehydes and specifically designed mass markers (TAGs).
Iterative step synthesis with the Passerini three component reaction (P-3CR), using twelve different aldehydes and three different TAGs. The aldehydes can be introduced at any desired position of the oligomer and provide the sidechains of the macromolecule and thus differentiate each repeating unit. Subsequently, the individual sequences of an oligomer mixture can be analyzed via ESI-MS and ESI-MS/MS, followed by fully automated read-out with the computer program with a clearly defined position of the TAGs.
Fig. 2
Fig. 2. Schematic representation of the variation of the twelve different aldehydes (colored bullets) and SEC traces of three different tetramers, one for each tag.
a Chemical structure and SEC traces of a tetramer with TAG1 and the sidechain variation of the aldehydes 14al (see Supplementary Methods for detailed information) for another three tetramers. b Chemical structure and SEC traces of a tetramer with TAG2 and the sidechain variation of the aldehydes 14al (see Supplementary Methods for detailed information) for another three tetramers. c Chemical structure and SEC traces of a tetramer with TAG3 and the sidechain variation of the aldehydes 14al (see Supplementary Methods for detailed information) for another three tetramers. d Chemical structures of TAG1–3.
Fig. 3
Fig. 3. Characterization of the sequence-defined hexamer with TAG1.
a Chemical structure of the sequence-defined hexamer H2. b SEC traces of each P-3CR product. c High-resolution of the ESI-MS measurement of H2; calculated isotopic pattern (red) and measured isotopic pattern (black).
Fig. 4
Fig. 4. Most common fragmentation patterns of the oligomer during fragmentation by tandem ESI-MS/MS.
a Fragmentation next to the carbonyl, which is preferred in measurements without any additives. b The fragmentation next to the ester group is preferred when sodium trifluoroacetate is used as additive.
Fig. 5
Fig. 5. Read-out of the sequence-defined hexamer H2.
Read-out of the hexamer H2 via tandem ESI-MS/MS with an NCE of 18. In the spectrum the read-out from both ends of the oligomer is shown, using the fragmentation next to the carbonyl.
Fig. 6
Fig. 6. Read-out of a mixture of three hexamers, with clearly defined positions of the TAGs to increase the data storage capacity.
a ESI-MS spectrum of a mixture of three different hexamers H1-H3 that was used for subsequent tandem ESI-MS/MS fragmentation. For the fragmentation, one of the respective molecule peaks was chosen at a time. b fragmentation of hexamer H1. c fragmentation of hexamer H2. d fragmentation of hexamer H3.

Similar articles

Cited by

References

    1. Jenkins AD, Stepto RFT, Kratochvíl P, Suter UW. Glossary of basic terms in polymer science. Pure Appl. Chem. 1996;68:2287–2311. doi: 10.1351/pac199668122287. - DOI
    1. Holloway JO, Van Lijsebetten F, Badi N, Houck HA, Du Prez FE. From sequence‐defined macromolecules to macromolecular pin codes. Adv. Sci. 2020;7:1903698. doi: 10.1002/advs.201903698. - DOI - PMC - PubMed
    1. Lee JM, et al. High-density information storage in an absolutely defined aperiodic sequence of monodisperse copolyester. Nat. Commun. 2020;11:1–9. - PMC - PubMed
    1. Mutlu H, Lutz JF. Reading polymers: sequencing of natural and synthetic macromolecules. Angew. Chem. Int. Ed. 2014;53:13010–13019. doi: 10.1002/anie.201406766. - DOI - PubMed
    1. Zydziak N, et al. Coding and decoding libraries of sequence-defined functional copolymers synthesized via photoligation. Nat. Commun. 2016;7:13672. doi: 10.1038/ncomms13672. - DOI - PMC - PubMed