Computer Engineering & Science ›› 2022, Vol. 44 ›› Issue (04): 707-712.
• Artificial Intelligence and Data Mining • Previous Articles Next Articles
GAN Qiu-yun
Received:
Revised:
Accepted:
Online:
Published:
Abstract: SNP (Single Nucleotide Polymorphism) is the most common variation in biological heritable variation, which occurs between single nucleoside acid-base groups in DNA sequence. ED algorithm and SNP-index algorithm are two commonly used algorithms to calculate SNP sites. The whole genome sequencing data of F2 generation of arabidopsis thaliana are obtained by high-throughput sequencing. The sequencing data are filtered, screened and compared based on Linux platform. The number of SNP sites and the proportion of SNP genotypes detected under different algorithms are compared. The experimental results show that the number of SNP sites obtained by ED algorithm is more and more widely distributed than SNP index algorithm, and the relative distribution density is larger than that of SNP index algorithm, but the number of SNP sites and the proportion of SNP genotypes obtained by the two algorithms are similar.
Key words: single nucleotide polymorphism(SNP), biological information, ED algorithm, SNP-index algorithm
GAN Qiu-yun. Comparison and analysis of ED algorithm and SNP-index algorithm in calculating SNP sites——Take arabidopsis thaliana for example[J]. Computer Engineering & Science, 2022, 44(04): 707-712.
0 / / Recommend
Add to citation manager EndNote|Ris|BibTeX
URL: http://joces.nudt.edu.cn/EN/
http://joces.nudt.edu.cn/EN/Y2022/V44/I04/707