Abstract:
Developing efficient gene prediction algorithms is one of the fundamental efforts in genomics. In genomic signal processing, the basic step in identifying protein coding regions in DNA sequences relies on the period-3 property exhibited by nucleotides in exons. Several approaches based on signal processing tools and numerical representations have been applied to this problem in pursuit of more accurate predictions. This paper presents a new indicator sequence, called the amino acid indicator sequence, which is derived from the DNA string and based on the corresponding amino acid sequence. It is used with existing signal processing based time-domain and frequency-domain methods to predict coding regions within the DNA sequences of eukaryotic cells, which can be billions of bases long, and it reduces the computational load by one-third. Each triplet of bases, called a codon, instructs the cell machinery to synthesize an amino acid. The codon sequence therefore uniquely identifies an amino acid sequence, which in turn defines a protein, so a protein coding region is characterized by the codons underlying its amino acid sequence. This property is used to detect period-3 regions from the amino acid sequence. Physico-chemical properties of the amino acids provide the numerical representation. Several accuracy measures (exonic peaks, discriminating factor, sensitivity, specificity, miss rate, wrong rate and approximate correlation) are used to demonstrate the efficacy of the proposed predictor. The method is validated on various organisms using the standard data sets HMR195, Burset and Guigo, and KEGG. The simulation results show that the proposed method is an effective approach to protein coding region prediction.
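The two building blocks the abstract relies on, codon-to-amino-acid mapping and the period-3 spectral property, can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the abstract does not say which physico-chemical property is used, so the Kyte-Doolittle hydropathy scale here is only a stand-in, and `period3_power` is the classical DFT measure on binary nucleotide indicator sequences rather than the proposed amino acid indicator sequence. All function names are hypothetical.

```python
import cmath

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: Kyte-Doolittle hydropathy as an example physico-chemical property.
# The paper may use a different property; the abstract does not specify one.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def translate(dna):
    """Map each non-overlapping triplet (reading frame 0) to a one-letter amino acid.

    '*' marks a stop codon; 'X' marks a triplet containing an unknown base.
    Trailing bases that do not fill a triplet are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def numeric_track(protein, scale, default=0.0):
    """Replace each amino acid letter with its value on the given property scale."""
    return [scale.get(aa, default) for aa in protein]


def dft_bin(x, k):
    """Single DFT coefficient X[k] of a real sequence x."""
    n = len(x)
    return sum(v * cmath.exp(-2j * cmath.pi * k * i / n) for i, v in enumerate(x))


def period3_power(dna):
    """Classical period-3 measure: sum over the four bases of |U_b[N/3]|^2,
    where U_b is the DFT of the binary indicator sequence of base b.
    The sequence is truncated so that N is a multiple of 3."""
    dna = dna.upper()
    n = len(dna) - len(dna) % 3
    if n == 0:
        return 0.0
    total = 0.0
    for base in "ACGT":
        indicator = [1 if ch == base else 0 for ch in dna[:n]]
        total += abs(dft_bin(indicator, n // 3)) ** 2
    return total
```

For example, `translate("ATGGCCTAA")` returns `"MA*"`, and `numeric_track` turns that string into one number per codon. The numeric track therefore has one-third as many samples as the nucleotide sequence it came from, which is the kind of length reduction an amino acid based representation offers.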
Document information
Title: A reduced computational load protein coding predictor using equivalent amino acid sequence of DNA string with period-3 based time and frequency domain analysis
Source journal: American Journal of Molecular Biology (English)
Discipline: Medicine
Keywords: Genomics; Bioinformatics; Codon; Coding region; Amino acid sequence; Fourier transform; Antinotch filter; Periodicity-3; Indicator sequence
Year, volume (issue): 2011, (2)
Page range: 79-86
Page count: 8
Classification number: R73
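Journal information
Journal: American Journal of Molecular Biology (English)
Frequency: quarterly
ISSN: 2161-6620
Address: Optics Valley Headquarters Space (光谷总部空间), 38 Tangxun Lake North Road, Jiangxia District, Wuhan
Articles published: 191
Total downloads: 0
Total citations: 0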