• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2010, Vol. 32 ›› Issue (8): 104-107.doi: 10.3969/j.issn.1007130X.2010.

• 论文 • 上一篇    下一篇

一种基于KEGG数据库重构代谢网络的新方法

周婷婷1,3,容健锋2,王正华1,董蕴源1,王勇献1,朱云平3   

  1. (1.并行与分布处理国家重点实验室,湖南 长沙 410073;2.香港理工大学计算学系,香港;3.蛋白质组学国家重点实验室北京蛋白质组研究中心,军事医学科学院放射与辐射医学研究所,北京 102206)
  • 收稿日期:2009-06-22 修回日期:2009-10-10 出版日期:2010-07-25 发布日期:2010-07-28
  • 作者简介:周婷婷(1980),女,河南开封人,博士生,研究方向为生物信息学;容健锋,软件工程师;王正华,教授,博士生导师,研究方向为生物信息学;董蕴源,博士生,研究方向为生物信息学;王勇献,副研究员,研究方向为生物信息学;朱云平,研究员,研究方向为生物信息学。
  • 基金资助:

    国家自然科学基金资助项目(60773021,60603054)

A New Approach to Reconstructing Metabolic Networks from KEGG

ZHOU Tingting1,3,YUNG K F Samuel2,WANG Zhenghua1,DONG Yunyuan1,WANG Yongxian1,ZHU Yunping3   

  1. (1.National Laboratory for Parallel and Distributed Processing,Changsha 410073;2.Department of Computing, the Hong Kong Polytechnic University,Hong Kong;3.State Key Laboratory of Proteomics,Beijing Proteome Research Center,Beijing Institute of Radiation Medicine,Beijing 102206,China)
  • Received:2009-06-22 Revised:2009-10-10 Online:2010-07-25 Published:2010-07-28

摘要:

从代谢物、酶和生化反应信息重新构建正确的代谢网络是各项代谢网络相关研究非常关键的第一步。针对以往重构方法存在的数据难以及时更新、数据有冗余、获取数据慢等问题,本文采用分而治之的递归策略,提出了一种基于KEGG数据库自下而上重构全物种代谢网络的新方法。与以前的方法相比,本方法的优点在于:使用KEGG的Web服务获取数据,以保证数据的准确性和及时更新;依靠KEGG/PATHWAY库的数据选择机制选取数据,以保证构建网络的数据无冗余;整个方法基于Java实现,保证程序的跨平台通用性;通过构建MySQL本地数据库将远程数据本地化,大大降低数据读取的时耗。评估结果显示,该方法不仅能够保证重建网络数据的准确性和及时更新,而且有效地提高了多物种多次重构情况下的网络重构效率。

关键词: 代谢网络, 网络构建, KEGG数据库, Web服务, MySQL

Abstract:

The highquality network reconstruction from metabolites, enzymes and reactions is the first step for the study on metabolic networks. However, the previous reconstruction approaches have some disadvantages. For example, the data contains redundancy and could hardly be updated in time. Besides, the data retrieval is always timeconsuming. In this paper, we propose a new bottomup approach for the organismspecific metabolic network reconstruction. The web service of KEGG undertakes the data to be correct and uptodate. The data selection mechanism of KEGG/PATHWAY ensures the reconstructed networks reliable. The whole approach is implemented using Java, which can be used on any platform independently. The MySQL database is deployed to map the remote data locally, which greatly shortens the elapsed time for the data retrieval. As is shown in the evaluation, this method can not only ensure the data for the network reconstruction is accurate and uptodate, but also shorten the time in the case that many metabolic networks need reconstructing repeatedly.

Key words: metabolic network;network reconstruction;KEGG database;web service;MySQL