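• Official journal of the China Computer Federation
  • Chinese Science and Technology Core Journal
  • Chinese Core Journal

Computer Engineering & Science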


Detection of differential isoform ratios in RNA-Seq data based on KL divergence

欧书华,刘学军,张礼   


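  1. (College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, Jiangsu, China)
  • Received: 2015-09-08 Revised: 2015-11-04 Online: 2017-01-25 Published: 2017-01-25
  • Supported by:

    National Natural Science Foundation of China (61170152)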

Differential isoform ratio detection based on KL divergence for RNA-Seq data

OU Shuhua,LIU Xuejun,ZHANG Li   

  1. (College of Computer Science & Technology,Nanjing University of Aeronautics & Astronautics,Nanjing 211106,China)
  • Received:2015-09-08 Revised:2015-11-04 Online:2017-01-25 Published:2017-01-25

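Abstract:

In recent years, RNA-Seq technology has been widely applied to the detection of differentially expressed genes and isoforms. However, most current methods identify the differential expression of individual isoforms and cannot simultaneously detect differences in the expression proportions of the isoforms contained in the same gene. We therefore propose a method for detecting differential isoform ratios. The method is based on the previously designed sLDASeq model: it takes the probability distributions of the model's latent variables and applies the KL divergence to analyze differential isoform ratios. We first use the latest SEQC dataset to evaluate how well the sLDASeq model estimates expression levels; the results show that the method accurately estimates the proportions of the isoforms within a gene. We then detect differential isoform ratios on simulated datasets; compared with other methods, the experimental results show that the method achieves relatively high accuracy in detecting differential isoform ratios.

Key words: RNA-Seq, gene isoform expression level, smoothed LDA, KL divergence, differential isoform ratio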

Abstract:

RNAseq technology has been widely applied in detecting differential gene and isoform expression. However, many methods have been developed for detecting difference in expression for each individual isoform of a gene, rather than for the ratio of all the isoforms in the same gene. Now we present a new method to test each gene for differential isoform ratio between two conditions. The method is based on the previously designed sLDASeq and adopts the KL divergence for the detection of differential isoform ratio. We first use the new benchmark, SEQC, to validate sLDASeq’s performance on gene and isoform expression calculation. The results show that the model can calculate the proportion of isoforms in a gene accurately. We then use the KL divergence of the probability of the latent variables of the sLDASeq to detect differential isoform ratios between the two conditions of simulation datasets. The results show that the proposed method has a high accuracy in comparison with other methods in detecting differential isoform ratio.

Key words: RNA-Seq, gene isoform expression, smoothed LDA, KL divergence, differential isoform ratio
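
As a rough illustration of the idea in the abstract, the sketch below computes a KL divergence between the isoform proportion vectors of one gene under two conditions. It is not the sLDASeq procedure: the paper applies the KL divergence to the probability distributions of the model's latent variables, whereas this example assumes that plain per-isoform expression estimates are already available. The function names, the pseudocount smoothing, and the symmetrized form are assumptions made for illustration only.

```python
import math


def normalize(weights, pseudocount=1e-6):
    """Turn non-negative isoform expression values into proportions.

    The pseudocount (an assumption of this sketch) keeps every
    proportion strictly positive so that the logarithm is defined.
    """
    smoothed = [w + pseudocount for w in weights]
    total = sum(smoothed)
    return [w / total for w in smoothed]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def symmetric_kl(p, q):
    """Symmetrized divergence KL(p || q) + KL(q || p)."""
    return kl_divergence(p, q) + kl_divergence(q, p)


def isoform_ratio_divergence(cond_a, cond_b, pseudocount=1e-6):
    """Score how much the isoform ratios of one gene differ
    between two conditions (hypothetical helper)."""
    if len(cond_a) != len(cond_b):
        raise ValueError("both conditions must list the same isoforms")
    p = normalize(cond_a, pseudocount)
    q = normalize(cond_b, pseudocount)
    return symmetric_kl(p, q)


if __name__ == "__main__":
    # Hypothetical expression estimates for a gene with three isoforms.
    unchanged = isoform_ratio_divergence([50, 30, 20], [100, 60, 40])
    switched = isoform_ratio_divergence([50, 30, 20], [20, 30, 50])
    print(f"same ratios, different totals: {unchanged:.6f}")
    print(f"switched dominant isoform:     {switched:.6f}")
```

A gene whose total expression changes while its isoform ratios stay fixed scores close to zero, while a switch in the dominant isoform gives a clearly positive score. Turning such a score into a significance call would need a null distribution or a threshold, which this sketch does not provide.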