
中国生物工程杂志

China Biotechnology
China Biotechnology  2017, Vol. 37 Issue (3): 124-132    DOI: 10.13523/j.cb.20170317
    
Integrating Distributed Heterogeneous Food Microorganism Data by Semantic Web Technology
WU Lin-huan, LU Zhen-ming, GONG Jin-song, SHI Jin-song, XU Zheng-hong
Key Laboratory of Industrial Biotechnology of Ministry of Education, School of Pharmaceutical Science, Jiangnan University, Wuxi 214122, China

Abstract  

With the rapid development of next-generation sequencing technology and of research on the fermentation mechanisms of food microorganisms, data and knowledge about food microorganisms have increased enormously, including genomic, metagenomic, metabolic, and phylogenetic information. These data are distributed across different resources in various data formats. An integrated data platform is necessary to better extract biological knowledge from this growing body of heterogeneous data. We therefore constructed a food microorganism database using semantic web technology. Drawing on a wide range of open data resources, we describe information on genes, genome sequences, gene ontology, protein sequences and structures, pathways, and enzymes in the form of the Resource Description Framework (RDF). In this database, physiological information on microbes from culture collections can be linked to genomic information and, in turn, to metabolic information, which allows flexible queries across different domains. The database's user-friendly interfaces make it possible to answer a range of research questions about food microorganisms based on the linked data.
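
The linking idea described in the abstract can be illustrated with a minimal sketch. It models RDF-style statements as plain (subject, predicate, object) tuples and follows links from a strain to its genome, genes, enzymes, and pathways. All identifiers and predicate names here (strain:S1, hasGenome, participatesIn, and so on) are hypothetical and chosen for illustration; they are not taken from the paper, from its database, or from any real vocabulary, and a real deployment would more likely use a triple store queried with SPARQL.

```python
# Hypothetical identifiers and predicates, for illustration only;
# none of them come from the paper or from a real RDF vocabulary.
TRIPLES = [
    ("strain:S1", "hasGenome", "genome:G1"),
    ("strain:S1", "growthTemperature", "30"),
    ("genome:G1", "hasGene", "gene:A"),
    ("genome:G1", "hasGene", "gene:B"),
    ("gene:A", "encodesEnzyme", "enzyme:E1"),
    ("gene:B", "encodesEnzyme", "enzyme:E2"),
    ("enzyme:E1", "participatesIn", "pathway:P1"),
    ("enzyme:E2", "participatesIn", "pathway:P2"),
]


def objects(subject, predicate, triples=TRIPLES):
    """Return every object o such that (subject, predicate, o) is a triple."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def pathways_for_strain(strain, triples=TRIPLES):
    """Follow strain -> genome -> gene -> enzyme -> pathway links."""
    found = set()
    for genome in objects(strain, "hasGenome", triples):
        for gene in objects(genome, "hasGene", triples):
            for enzyme in objects(gene, "encodesEnzyme", triples):
                found.update(objects(enzyme, "participatesIn", triples))
    return sorted(found)


if __name__ == "__main__":
    # Cross-domain question: which pathways are reachable from this strain?
    print(pathways_for_strain("strain:S1"))
```

Because every record is reduced to the same three-part statement, data from culture collections, genome archives, and pathway resources can share one store, and a cross-domain question becomes a chain of link traversals.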



Key words: Food microorganisms; Linked data; Semantic web
Received: 19 September 2016      Published: 25 March 2017
ZTFLH:  Q811.4  
Cite this article:

WU Lin-huan, LU Zhen-ming, GONG Jin-song, SHI Jin-song, XU Zheng-hong. Integrating Distributed Heterogeneous Food Microorganism Data by Semantic Web Technology. China Biotechnology, 2017, 37(3): 124-132.

URL:

https://manu60.magtech.com.cn/biotech/10.13523/j.cb.20170317     OR     https://manu60.magtech.com.cn/biotech/Y2017/V37/I3/124

