Self-Organizing Approach for Finding Borders of DNA Coding Regions
会议名称:《CCAST“复杂性问题”研讨会》
会议日期:2001年
学科分类:0710[理学-生物学] 071010[理学-生物化学与分子生物学] 081704[工学-应用化学] 07[理学] 08[工学] 0817[工学-化学工程与技术]
摘 要:正 Motivation: A good estimation of the borders of coding regions by a composition based algorithm will help signal-based algorithms to refine the annotation. The entropic segmentation approach has its limitations. A sell-organizing approach is proposed based on the model of codon usage for coding regions and positional preference for noncoding regions. Results: The symmetry between the direct together with reverse coding regions and phaseshifting are adopted for both reducing the number of parameters to as few as 63+3 and uniquely inferring noncoding regions and the 6 phases of coding regions. Without requiring prior training, parameters can be estimated by iteration. A multi-sampling technique from sliding windows is used to select segments with high confidence, which can provide a training set for other sophisticated algorithms. A mixed-window model is then used to find borders between the accurately inferred segments. Contact: zheng@***