Comparison and Analysis on Scientific Research Programs on DNA Data Storage
ZHANG Da-lu¹, GE Qi², FENG Yi-bo², CHEN Wei-gang²,³,*
1. China National Center for Biotechnology Development, Beijing 100039, China
2. School of Microelectronics, Tianjin University, Tianjin 300072, China
3. Frontiers Science Center for Synthetic Biology of Ministry of Education, Tianjin University, Tianjin 300072, China
Abstract The explosive growth of global data has become an important engine of the digital economy. However, traditional data storage media are limited by power consumption, volume, and cost, and cannot meet the ever-increasing demand for data storage. A new storage method that uses deoxyribonucleic acid (DNA) molecules as the storage medium has therefore attracted great attention in China and abroad. Major countries have carried out top-level planning for its research and deployed a series of important scientific research programs. DNA data storage is a new interdisciplinary research field, however, and the “source” and “flow” of its development still need in-depth analysis. To this end, this paper explores the “source” of DNA data storage from the perspective of the fusion of information technology, semiconductors, and synthetic biology, and analyzes and summarizes the DNA data storage development plans of major countries and regions in recent years. We present the layout of scientific research projects in China and abroad, in particular the basic research program promoted by the Alliance for Semiconductor Synthetic Biology, the application-oriented intensive research programs promoted by the Defense Advanced Research Projects Agency (DARPA) and the Intelligence Advanced Research Projects Activity (IARPA), the Horizon 2020 Program of the European Union, and the major research and development programs of China. The comparison shows that the United States mainly follows a government-led, application-oriented research model, while the European Union and China followed up promptly during the 13th Five-Year Plan period. During the 14th Five-Year Plan period, China has set up the national key research and development program “Fusion of the Biological Technology and Information Technology (BT and IT Fusion)”, which aims to advance DNA data storage and related fields so that progress in DNA data storage drives the development of biochemical instruments and, more broadly, the bioeconomy and the digital economy. By tracing the “source” and “flow” of DNA data storage, this paper offers researchers a reference for identifying the problems that genuinely limit the field, and offers science and technology management departments a reference for identifying international development trends in DNA data storage.
Received: 01 December 2021
Published: 07 July 2022
Corresponding Author: Wei-gang CHEN
E-mail: chenwg@tju.edu.cn