Random access in large-scale DNA data storage
- PMID: 29457795
- DOI: 10.1038/nbt.4079
Random access in large-scale DNA data storage
Abstract
Synthetic DNA is durable and can encode digital data with high density, making it an attractive medium for data storage. However, recovering stored data on a large-scale currently requires all the DNA in a pool to be sequenced, even if only a subset of the information needs to be extracted. Here, we encode and store 35 distinct files (over 200 MB of data), in more than 13 million DNA oligonucleotides, and show that we can recover each file individually and with no errors, using a random access approach. We design and validate a large library of primers that enable individual recovery of all files stored within the DNA. We also develop an algorithm that greatly reduces the sequencing read coverage required for error-free decoding by maximizing information from all sequence reads. These advances demonstrate a viable, large-scale system for DNA data storage and retrieval.
Similar articles
-
Iterative Soft Decoding Algorithm for DNA Storage Using Quality Score and Redecoding.IEEE Trans Nanobioscience. 2024 Jan;23(1):81-90. doi: 10.1109/TNB.2023.3284406. Epub 2024 Jan 3. IEEE Trans Nanobioscience. 2024. PMID: 37294652
-
Driving the Scalability of DNA-Based Information Storage Systems.ACS Synth Biol. 2019 Jun 21;8(6):1241-1248. doi: 10.1021/acssynbio.9b00100. Epub 2019 May 24. ACS Synth Biol. 2019. PMID: 31117362
-
DNA Fountain enables a robust and efficient storage architecture.Science. 2017 Mar 3;355(6328):950-954. doi: 10.1126/science.aaj2038. Science. 2017. PMID: 28254941
-
The zettabyte era is in our DNA.Nat Comput Sci. 2024 Nov;4(11):813-817. doi: 10.1038/s43588-024-00717-1. Epub 2024 Nov 8. Nat Comput Sci. 2024. PMID: 39516373 Review.
-
Trends to store digital data in DNA: an overview.Mol Biol Rep. 2018 Oct;45(5):1479-1490. doi: 10.1007/s11033-018-4280-y. Epub 2018 Aug 2. Mol Biol Rep. 2018. PMID: 30073589 Review.
Cited by
-
Nanopore Detection Assisted DNA Information Processing.Nanomaterials (Basel). 2022 Sep 9;12(18):3135. doi: 10.3390/nano12183135. Nanomaterials (Basel). 2022. PMID: 36144924 Free PMC article. Review.
-
Information decay and enzymatic information recovery for DNA data storage.Commun Biol. 2022 Oct 20;5(1):1117. doi: 10.1038/s42003-022-04062-9. Commun Biol. 2022. PMID: 36266439 Free PMC article.
-
Expanding the Molecular Alphabet of DNA-Based Data Storage Systems with Neural Network Nanopore Readout Processing.Nano Lett. 2022 Mar 9;22(5):1905-1914. doi: 10.1021/acs.nanolett.1c04203. Epub 2022 Feb 25. Nano Lett. 2022. PMID: 35212544 Free PMC article.
-
Parallel molecular computation on digital data stored in DNA.Proc Natl Acad Sci U S A. 2023 Sep 12;120(37):e2217330120. doi: 10.1073/pnas.2217330120. Epub 2023 Sep 5. Proc Natl Acad Sci U S A. 2023. PMID: 37669382 Free PMC article.
-
Erratum: Random access in large-scale DNA data storage.Nat Biotechnol. 2018 Jul 6;36(7):660. doi: 10.1038/nbt0718-660c. Nat Biotechnol. 2018. PMID: 29979658 No abstract available.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous