关键词: Chinese text storage DNA long double-stranded structure storage DNA shift coding DNA storage plasmid storage

来  源:   DOI:10.3389/fbinf.2023.1276934   PDF(Pubmed)

Abstract:
DNA, as the storage medium in organisms, can address the shortcomings of existing electromagnetic storage media, such as low information density, high maintenance power consumption, and short storage time. Current research on DNA storage mainly focuses on designing corresponding encoders to convert binary data into DNA base data that meets biological constraints. We have created a new Chinese character code table that enables exceptionally high information storage density for storing Chinese characters (compared to traditional UTF-8 encoding). To meet biological constraints, we have devised a DNA shift coding scheme with low algorithmic complexity, which can encode any strand of DNA even has excessively long homopolymer. The designed DNA sequence will be stored in a double-stranded plasmid of 744bp, ensuring high reliability during storage. Additionally, the plasmid\'s resistance to environmental interference ensuring long-term stable information storage. Moreover, it can be replicated at a lower cost.
摘要:
DNA,作为生物体的储存介质,可以解决现有电磁存储介质的缺点,例如低信息密度,维护功耗高,和短的存储时间。当前对DNA存储的研究主要集中在设计相应的编码器,以将二进制数据转换为满足生物学约束的DNA基础数据。我们创建了一个新的汉字代码表,可以实现非常高的信息存储密度来存储汉字(与传统的UTF-8编码相比)。为了满足生物限制,我们设计了一种低算法复杂度的DNA移位编码方案,可以编码DNA的任何链甚至具有过长的均聚物。设计的DNA序列将存储在744bp的双链质粒中,确保存储过程中的高可靠性。此外,质粒对环境干扰的抵抗力,确保信息长期稳定储存。此外,它可以以较低的成本复制。
公众号