• 中国计算机学会会刊
  • 中国科技核心期刊
  • 中文核心期刊

J4 ›› 2013, Vol. 35 ›› Issue (2): 96-102.

• 论文 • 上一篇    下一篇

基于流形结构重建的启动子识别

张友新,王立宏   

  1. (烟台大学计算机学院,山东 烟台 264005)
  • 收稿日期:2012-01-04 修回日期:2012-03-28 出版日期:2013-02-25 发布日期:2013-02-25
  • 基金资助:

    国家自然科学基金资助项目(61070118,61170224)

New promoter recognition method based on manifold reconstruction

ZHANG Youxin,WANG Lihong   

  1. (School of Computer Science and Technology,Yantai University,Yantai 264005,China)
  • Received:2012-01-04 Revised:2012-03-28 Online:2013-02-25 Published:2013-02-25

摘要:

启动子识别是生物信息学的一个重要研究方向,根据启动子本身的特点已经有基于信号、内容和CpG岛等多种识别算法。针对基因序列数据数据量大、维数高、非线性的特点,提出了基于流形结构重建的启动子识别算法,先利用非线性降维方法压缩数据,然后再进行启动子识别。实验结果表明,该方法能够取得较好的结果。

关键词: 数据降维;流形学习;启动子识别

Abstract:

Promoter recognition is one of the important research directions in bioinformatics. Due to the characteristics of the promoters, there have been many kinds of identification algorithms based on the signal, the content and the CpG islands. Gene sequences are large scale nonlinear data with high dimensionality, so a new promoter recognition method based on manifold reconstruction is proposed, using nonlinear dimensionality reduction method to compress the data before promoter recognition. The experimental results show that this method can obain better results.

Key words: dimensionality reduction;manifold learning;promoter recognition