For First Time, DNA Tech Offers Both Data Storage and Computing Functions

August 22, 2024 lhmarlie

AI generated image of a conceptualized DNA as storage

Researchers from North Carolina State University and Johns Hopkins University have demonstrated a technology capable of a suite of data storage and computing functions – repeatedly storing, retrieving, computing, erasing or rewriting data – that uses DNA rather than conventional electronics. Previous DNA data storage and computing technologies could complete some but not all of these tasks.

“In conventional computing technologies, we take for granted that the ways data are stored and the way data are processed are compatible with each other,” says project leader Albert Keung, co-corresponding author of a paper on the work. “But in reality, data storage and data processing are done in separate parts of the computer, and modern computers are a network of complex technologies.” Keung is an associate professor of chemical and biomolecular engineering and a Goodnight Distinguished Scholar at NC State.

“DNA computing has been grappling with the challenge of how to store, retrieve and compute when the data is being stored in the form of nucleic acids,” Keung says. “For electronic computing, the fact that all of a device’s components are compatible is one reason those technologies are attractive. But, to date, it’s been thought that while DNA data storage may be useful for long-term data storage, it would be difficult or impossible to develop a DNA technology that encompassed the full range of operations found in traditional electronic devices: storing and moving data; the ability to read, erase, rewrite, reload or compute specific data files; and doing all of these things in programmable and repeatable ways.

“We’ve demonstrated that these DNA-based technologies are viable, because we’ve made one.”

The new technology is made possible by recent techniques that have enabled the creation of soft polymer materials that have unique morphologies.

“Specifically, we have created polymer structures that we call dendricolloids – they start at the microscale, but branch off from each other in a hierarchical way to create a network of nanoscale fibers,” says Orlin Velev, co-corresponding author and the S. Frank and Doris Culberson Distinguished Professor of Chemical and Biomolecular Engineering at NC State. “This morphology creates a structure with a high surface area, which allows us to deposit DNA among the nanofibrils without sacrificing the data density that makes DNA attractive for data storage in the first place.”

“You could put a thousand laptops’ worth of data into DNA-based storage that’s the same size as a pencil eraser,” Keung says.

“The ability to distinguish DNA information from the nanofibers it’s stored on allows us to perform many of the same functions you can do with electronic devices,” says Kevin Lin, first author of the paper and a former Ph.D. student at NC State. “We can copy DNA information directly from the material’s surface without harming the DNA. We can also erase targeted pieces of DNA and then rewrite to the same surface, like deleting and rewriting information stored on the hard drive. It essentially allows us to conduct the full range of DNA data storage and computing functions. In addition, we found that when we deposit DNA on the dendricolloid material, the material helps to preserve the DNA.”

“You could say that Keung’s team is providing the equivalent of microcircuits, and the dendricolloidal material that my team creates provides the circuit board,” says Velev. “Our NC State collaborator Adriana San Miguel helped us incorporate the materials into microfluidic channels that direct the flow of nucleic acids and reagents, allowing us to move data and initiate computing commands. Winston Timp’s lab at Johns Hopkins contributed their expertise on nanopore sequencing, which helps us directly read the data in RNA after copying it from DNA on the material’s surface. And James Tuck’s lab – also here at NC State – has developed algorithms that allow us to convert data into nucleic acid sequences and vice versa while controlling for potential errors.”

The researchers have demonstrated that the new data storage and computing technology – which they call a “primordial DNA store and compute engine” – is capable of solving simple sudoku and chess problems. And testing suggests that it could store data securely for thousands of years in commercially available spaces without degrading the information-storing DNA.

“What’s more, the dendrocolloidal host material itself is relatively inexpensive and easy to fabricate,” Velev says.

“There’s a lot of excitement about molecular data storage and computation, but there have been significant questions about how practical the field may be,” says Keung. “We looked back at the history of computing and how the creation of ENIAC inspired the field. We wanted to develop something that would inspire the field of molecular computing. And we hope what we’ve done here is a step in that direction.”

The paper, “A Primordial DNA Store and Compute Engine,” is published in the journal Nature Nanotechnology. The paper was co-authored by Kevin Volkel and Andrew Clark, former Ph.D. students at NC State; Cyrus Cao and Rachel Polak, Ph.D. students at NC State; Adriana San Miguel, an associate professor of chemical and biomolecular engineering at NC State; James Tuck, a professor of electrical and computer engineering at NC State; Winston Timp, an associate professor of biomedical engineering at Johns Hopkins University; and Paul Hook, a postdoctoral researcher at Johns Hopkins.

The work was done with support from the National Science Foundation under grants 2027655 and 1901324.

Keung and Tuck are co-founders of DNAli Data Technologies, so have potential interest in translating and commercializing DNA-based information systems. Keung, Volkel, Tuck, and Lin are inventors on patent application WO 2020/096679, which has been licensed to DNAli Data Technologies and from which some of this work is derived.

This article was written by Matt Shipman for NC State News.

Note to Editors: The study abstract follows.

“A Primordial DNA Store and Compute Engine”

Authors: Kevin N. Lin, Kevin Volkel, Cyrus Cao, Rachel Polak, Andrew S. Clark, Adriana San Miguel, James M. Tuck, Orlin D. Velev and Albert J. Keung, North Carolina State University; and Paul W. Hook and Winston Timp, Johns Hopkins University

Published: Aug. 22, Nature Nanotechnology

DOI: 10.1038/s41565-024-01771-6

Abstract: Any modern information system is expected to feature a set of primordial features and functions: a substrate stably carrying data; the ability to repeatedly write, read, erase, reload and compute on specific data from that substrate; and the overall ability to execute such functions in a seamless and programmable manner. For nascent molecular information technologies, proof-of-principle realization of this set of primordial capabilities would advance the vision for their continued development. Here we present a DNA-based store and compute engine that captures these primordial capabilities. This system comprises multiple image files encoded into DNA and adsorbed onto ~50-μm-diameter, highly porous, hierarchically branched, colloidal substrate particles comprised of naturally abundant cellulose acetate. Their surface areas are over 200 cm² mg⁻¹ with binding capacities of over 10¹² DNA oligos mg⁻¹, 10 TB mg⁻¹ or 10⁴ TB cm⁻³. This ‘dendricolloid’ stably holds DNA files better than bare DNA with an extrapolated ability to be repeatedly lyophilized and rehydrated over 170 times compared with 60 times, respectively. Accelerated ageing studies project half-lives of ~6,000 and 2 million years at 4 °C and −18 °C, respectively. The data can also be erased and replaced, and non-destructive file access is achieved through transcribing from distinct synthetic promoters. The resultant RNA molecules can be directly read via nanopore sequencing and can also be enzymatically computed to solve simplified 3 × 3 chess and sudoku problems. Our study establishes a feasible route for utilizing the high information density and parallel computational advantages of nucleic acids.

Tags:
- department of chemical and biomolecular engineering
- dna data storage