Exploring potential biosafety implications in DNA information storage

Deoxyribonucleic acid (DNA) information storage has emerged as a promising solution to address the challenges of traditional silicon-based data storage systems. However, the biosafety implications of artificially synthesized DNA sequences in this technology remain understudied. This research evaluat...

Full description

Saved in:
Bibliographic Details
Main Authors: Shangzhe Li, Yue Shi, Jing Yang, Haizhou Liu, Lijia Jia, Di Liu
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:Biosafety and Health
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2590053625000424
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849711859882524672
author Shangzhe Li
Yue Shi
Jing Yang
Haizhou Liu
Lijia Jia
Di Liu
author_facet Shangzhe Li
Yue Shi
Jing Yang
Haizhou Liu
Lijia Jia
Di Liu
author_sort Shangzhe Li
collection DOAJ
description Deoxyribonucleic acid (DNA) information storage has emerged as a promising solution to address the challenges of traditional silicon-based data storage systems. However, the biosafety implications of artificially synthesized DNA sequences in this technology remain understudied. This research evaluates the biosafety risks associated with five representative DNA storage encoding methods [Church, Goldman, DNA Fountain, Grass, and movable-type (MT) encoding] by analyzing their sequence similarities to natural biological DNA. Through Kraken2 taxonomic classification and Basic Local Alignment Search Tool for nucleotides (BLASTn) alignment analysis, we found that while most artificially designed DNA sequences showed significant differences from known biological sequences, specific encoding methods produced sequences similar to natural genomes. The MT encoding method showed the highest annotation rate (4.59 %) in Kraken2 analysis, while Goldman and Fountain methods demonstrated significant local sequence alignments in BLASTn analysis. Sequence length positively correlated with annotation rates, suggesting longer sequences pose potentially higher biosafety risks. Furthermore, aligned sequences often exhibited characteristics of tandem repeats, particularly in non-coding regions. These findings highlight the importance of incorporating biosafety considerations in DNA storage encoding method development and suggest that randomization strategies may help mitigate potential risks. Our study provides valuable insights into the safe advancement of DNA storage technology and emphasizes the need for comprehensive biosafety evaluation in synthetic biology applications.
format Article
id doaj-art-0efb542570ec4e299747b34674465e7e
institution DOAJ
issn 2590-0536
language English
publishDate 2025-04-01
publisher Elsevier
record_format Article
series Biosafety and Health
spelling doaj-art-0efb542570ec4e299747b34674465e7e2025-08-20T03:14:31ZengElsevierBiosafety and Health2590-05362025-04-017213213910.1016/j.bsheal.2025.03.006Exploring potential biosafety implications in DNA information storageShangzhe Li0Yue Shi1Jing Yang2Haizhou Liu3Lijia Jia4Di Liu5Key Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, China; University of Chinese Academy of Sciences, Beijing 101408, ChinaKey Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, ChinaKey Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, ChinaKey Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, ChinaKey Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, China; Corresponding authors: Key Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, China (L. Jia and D. Liu).Key Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, China; University of Chinese Academy of Sciences, Beijing 101408, China; Corresponding authors: Key Laboratory of Special Pathogens and Biosafety, Center for Biosafety Mega-Science, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan 430071, China (L. Jia and D. Liu).Deoxyribonucleic acid (DNA) information storage has emerged as a promising solution to address the challenges of traditional silicon-based data storage systems. However, the biosafety implications of artificially synthesized DNA sequences in this technology remain understudied. This research evaluates the biosafety risks associated with five representative DNA storage encoding methods [Church, Goldman, DNA Fountain, Grass, and movable-type (MT) encoding] by analyzing their sequence similarities to natural biological DNA. Through Kraken2 taxonomic classification and Basic Local Alignment Search Tool for nucleotides (BLASTn) alignment analysis, we found that while most artificially designed DNA sequences showed significant differences from known biological sequences, specific encoding methods produced sequences similar to natural genomes. The MT encoding method showed the highest annotation rate (4.59 %) in Kraken2 analysis, while Goldman and Fountain methods demonstrated significant local sequence alignments in BLASTn analysis. Sequence length positively correlated with annotation rates, suggesting longer sequences pose potentially higher biosafety risks. Furthermore, aligned sequences often exhibited characteristics of tandem repeats, particularly in non-coding regions. These findings highlight the importance of incorporating biosafety considerations in DNA storage encoding method development and suggest that randomization strategies may help mitigate potential risks. Our study provides valuable insights into the safe advancement of DNA storage technology and emphasizes the need for comprehensive biosafety evaluation in synthetic biology applications.http://www.sciencedirect.com/science/article/pii/S2590053625000424DNA information storageBiosafetyBiosecurityEncoding methodsSynthetic DNA
spellingShingle Shangzhe Li
Yue Shi
Jing Yang
Haizhou Liu
Lijia Jia
Di Liu
Exploring potential biosafety implications in DNA information storage
Biosafety and Health
DNA information storage
Biosafety
Biosecurity
Encoding methods
Synthetic DNA
title Exploring potential biosafety implications in DNA information storage
title_full Exploring potential biosafety implications in DNA information storage
title_fullStr Exploring potential biosafety implications in DNA information storage
title_full_unstemmed Exploring potential biosafety implications in DNA information storage
title_short Exploring potential biosafety implications in DNA information storage
title_sort exploring potential biosafety implications in dna information storage
topic DNA information storage
Biosafety
Biosecurity
Encoding methods
Synthetic DNA
url http://www.sciencedirect.com/science/article/pii/S2590053625000424
work_keys_str_mv AT shangzheli exploringpotentialbiosafetyimplicationsindnainformationstorage
AT yueshi exploringpotentialbiosafetyimplicationsindnainformationstorage
AT jingyang exploringpotentialbiosafetyimplicationsindnainformationstorage
AT haizhouliu exploringpotentialbiosafetyimplicationsindnainformationstorage
AT lijiajia exploringpotentialbiosafetyimplicationsindnainformationstorage
AT diliu exploringpotentialbiosafetyimplicationsindnainformationstorage