Keywords: DNA storage; De Bruijn Trim graph; Dynamic binary sequence

Source: DOI:10.1016/j.csbj.2023.09.004

Abstract:
DNA is a high-density, long-term stable, and scalable storage medium that can meet the growing demand for storage media driven by the exponential growth of data. Existing DNA storage encoding schemes tend to achieve high-density storage but do not fully consider the local and global stability of DNA sequences or the read and write accuracy of the stored information. To address these problems, this article presents a graph-based encoding scheme, the De Bruijn Trim Rotation Graph (DBTRG). By XORing the proposed dynamic binary sequence with the original binary sequence, k-mers can be divided into the De Bruijn Trim graph, and the stored information can be compressed according to the overlapping relationship between k-mers. Simulated experimental results show that DBTRG ensures base balance and diversity, reduces the likelihood of undesired motifs, and improves the stability of DNA storage and data recovery. Furthermore, DBTRG maintains an encoding rate of 1.92 while storing 510 KB images, and it introduces new approaches and concepts for DNA storage encoding.
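The XOR and k-mer overlap ideas mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the DBTRG algorithm: the abstract does not specify how the dynamic binary sequence is built or how the graph is trimmed and rotated, so the repeating `mask`, the two-bits-per-base mapping, and the function names `xor_bits`, `bits_to_dna`, and `de_bruijn_edges` are all assumptions made for illustration only.

```python
from itertools import cycle

BASES = "ACGT"  # assumed mapping: 00->A, 01->C, 10->G, 11->T


def xor_bits(data: str, mask: str) -> str:
    """XOR a bit string with a repeating mask (hypothetical stand-in
    for the paper's dynamic binary sequence)."""
    return "".join(str(int(a) ^ int(b)) for a, b in zip(data, cycle(mask)))


def bits_to_dna(bits: str) -> str:
    """Map each pair of bits to one base; len(bits) must be even."""
    return "".join(BASES[int(bits[i:i + 2], 2)] for i in range(0, len(bits), 2))


def de_bruijn_edges(seq: str, k: int) -> list[tuple[str, str]]:
    """Each k-mer gives an edge from its (k-1)-prefix to its (k-1)-suffix,
    so consecutive k-mers overlap in k-1 bases."""
    return [(seq[i:i + k - 1], seq[i + 1:i + k]) for i in range(len(seq) - k + 1)]


data = "0000000000000000"   # would map to a long homopolymer run
mask = "00011011"           # illustrative mask, not taken from the paper

plain = bits_to_dna(data)                    # homopolymer: all 'A'
mixed = bits_to_dna(xor_bits(data, mask))    # balanced bases after XOR
restored = xor_bits(xor_bits(data, mask), mask)  # XOR is its own inverse
edges = de_bruijn_edges(mixed, 3)
```

Because XOR is self-inverse, applying the same mask a second time recovers the original bits, which is why a decoder that can regenerate the mask can undo the transformation. The sketch shows only why XORing with a varied sequence can break up homopolymer runs and how overlapping k-mers form graph edges.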