Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 4;8(1):23.
doi: 10.1038/s41540-022-00233-w.

Adaptive coding for DNA storage with high storage density and low coverage

Affiliations

Adaptive coding for DNA storage with high storage density and low coverage

Ben Cao et al. NPJ Syst Biol Appl. .

Abstract

The rapid development of information technology has generated substantial data, which urgently requires new storage media and storage methods. DNA, as a storage medium with high density, high durability, and ultra-long storage time characteristics, is promising as a potential solution. However, DNA storage is still in its infancy and suffers from low space utilization of DNA strands, high read coverage, and poor coding coupling. Therefore, in this work, an adaptive coding DNA storage system is proposed to use different coding schemes for different coding region locations, and the method of adaptively generating coding constraint thresholds is used to optimize at the system level to ensure the efficient operation of each link. Images, videos, and PDF files of size 698 KB were stored in DNA using adaptive coding algorithms. The data were sequenced and losslessly decoded into raw data. Compared with previous work, the DNA storage system implemented by adaptive coding proposed in this paper has high storage density and low read coverage, which promotes the development of carbon-based storage systems.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
DNA storage versus conventional magnetic storage.
Fig. 2
Fig. 2
Different coding classes in DNA storage.
Fig. 3
Fig. 3
The influence of high base balance constraint on A and T base contents.
Fig. 4
Fig. 4
The influence of high base balance constraint on C and G base contents.
Fig. 5
Fig. 5
Gibson assembles address bits and data bits.
Fig. 6
Fig. 6
Process diagram of an adaptive coding for DNA storage system.
Fig. 7
Fig. 7
The difference between base balance degree constraint and GC content constraint.
Fig. 8
Fig. 8
Constraints and algorithms are optional threshold adaptive non-payload coding algorithms.
Fig. 9
Fig. 9
Classification and overlap of coding constraints.
Fig. 10
Fig. 10
Independent random storage that requires no additional storage.

References

    1. Davis J. Microvenus. Art. J. 1996;55:70–74. doi: 10.1080/00043249.1996.10791743. - DOI
    1. Bancroft C, Bowler T, Bloom B, Clelland C. Long-Term Storage of Information in DNA. Science. 2001;293:1763–1765. doi: 10.1126/science.293.5536.1763c. - DOI - PubMed
    1. Church GM, Gao Y, Kosuri S. Next-generation digital information storage in DNA. Science. 2012;337:1628–1628. doi: 10.1126/science.1226355. - DOI - PubMed
    1. Goldman N, et al. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature. 2013;494:77–80. doi: 10.1038/nature11875. - DOI - PMC - PubMed
    1. Yazdi S, Gabrys R, Milenkovic O. Portable and error-free DNA-based data storage. Sci. Rep. 2017;7:6. doi: 10.1038/s41598-017-00059-1. - DOI - PMC - PubMed

Publication types

LinkOut - more resources