• Journal of the China Computer Federation
  • China Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (01): 72-82.

• Computer Network and Information Security •

A large and mini fountain code model in DNA storage

CUI Jing-song1,2, JIANG Chang-yue1,2, GUO Chi3

  (1. School of Cyber Science and Engineering, Wuhan University, Wuhan 430072;
    2. Key Laboratory of Aerospace Information Security and Trusted Computing,
    Ministry of Education, Wuhan University, Wuhan 430072;
    3. GNSS Research Center, Wuhan University, Wuhan 430072, China)
  • Received: 2022-10-27 Revised: 2023-04-10 Accepted: 2024-01-25 Online: 2024-01-25 Published: 2024-01-15
  • Funding:
    National Key Research and Development Program of China (2022YFB3903801); Major Science and Technology Project of Hubei Province (2022AAA009)

Abstract: In application scenarios such as DNA storage, the traditional fountain code algorithm needs extra channel resources to deliver the number of source file packets, K, to the decoder. In practice, K can be embedded in every coded data packet, but doing so seriously wastes channel bandwidth. To address this problem, a large and mini fountain code model is proposed, which optimizes the delivery of this key parameter by adding the mini fountain code as an out-of-band channel. The mini fountain code reduces the granularity of the space that information about K occupies in each coded packet to 1 bit, effectively reducing bandwidth consumption. In addition, the mini fountain code can adapt to the constraint that coded sequences have variable length, which arises from the inhomogeneity of the DNA storage medium. Under certain conditions, it can even avoid occupying any additional channel bandwidth.

Key words: DNA storage, fountain code, LT code, avoidance sequence
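
The bandwidth argument in the abstract can be illustrated with simple arithmetic. The sketch below is not the authors' method: it only compares the per-packet overhead of a fixed-width field holding K against a 1-bit field. The function names, the value of K, and the payload size are hypothetical choices made for illustration.

```python
def k_field_bits(k: int) -> int:
    """Bits needed by a fixed-width field that can hold the value k."""
    return max(1, k.bit_length())


def overhead_fraction(field_bits: int, payload_bits: int) -> float:
    """Share of each coded packet spent on the K-related field."""
    return field_bits / (field_bits + payload_bits)


# Hypothetical figures, not taken from the paper.
k = 100_000         # assumed number of source packets
payload_bits = 256  # assumed payload size per coded packet

full_field = k_field_bits(k)
print("bits to embed K in every packet:", full_field)
print("overhead with full K field:", overhead_fraction(full_field, payload_bits))
print("overhead with a 1-bit field:", overhead_fraction(1, payload_bits))
```

Under these assumed figures, the full field costs several percent of each packet, while a 1-bit field costs well under one percent. How the decoder recovers K from the accumulated 1-bit fields is the subject of the paper itself and is not modeled here.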