A greedy algorithm for optimal recombination

时间:2025-04-06

denote the collection of all sequences of length n over \Sigma. For any s1 = a1a2 \Delta \Delta \Delta a j a j+1 \Delta \Delta \Delta an, s2 = b1b2 \Delta \Delta \Delta b j b j+1 \Delta \Delta \Delta bn 2 \Sigma

A Greedy Algorithm for Optimal RecombinationShiquan Wu and Xun Gu?Center of Bioinformatics and Biological Statistics Iowa State University, Ames, IA 50011, USA

fsqwu,xgug@iastate.edu

Abstract. Let be an alphabet and n denote the collection of all sequences of length n over . For any s1= a1 a2 aj aj+1 an, s2= b1 b2 bj bj+1 bn 2 n; a recombination of s1 and s2 at position j is de ned as an operation that crosses s1 and s2 at position j and generates t1= a1 a2 aj bj+1 bn and t2= b1 b2 bj aj+1 an: Denote A and S two collections of sequences. In this paper, we discuss generating A from S by a series of recombinations in minimum number of steps. We present a greedy algorithm for nding the optimal recombination evolutionary history from S to any tree A of sequences when jSj= 2.

1 IntroductionVarious types of mutations on sequences play an important role in computational biology. Transformations using insertion, deletion, substitution, and reversal are widely studied by applying statistics and algorithms (cf. 1,2]). Recently, much attention has been paid to recombination of sequences. Hein developed some algorithms for recombination problems(cf. 3,4]). Kececioglu and Gus eld discussed a recombination distance problem on generating a third sequence from a given pair of sequences in optimal recombination cost (cf. 5]). Generally, an edit distance problem of genomes is widely studied and its aim is to nd the optimal evolutionary history from some ancestors to some descendents by using certain types of mutations such as insertion, deletion, substitution(point mutation),reversal, etc. Parsimony trees and phylogenetic trees are some intersting problems in the category. In 6], an alignment with recombination is discussed and it is an edit distance problem involved recombinations. In this paper, we discuss a similar distance problem involved recombinations. The purpose is to generate a collection A of sequences from another collection S of sequences by a series of recombinations in minimum number of steps.

1.1 Problem and example De nition 1. Let be an alphabet (e.g.,= fa; c; g; tg, etc) and collection of all sequences of length n over . For any s= a a aj aj1 1 2

+1

n the an,

?

Research supported in part by NIH Grant RO1 GM62118 (to X.G) and NSF of China (19771025). Wu is on leave from Math Dept,N.U.D.T.,Hunan 410073,China.

…… 此处隐藏:563字,全部文档内容请下载后查看。喜欢就下载吧 ……
A greedy algorithm for optimal recombination.doc 将本文的Word文档下载到电脑

精彩图片

热门精选

大家正在看

× 游客快捷下载通道(下载后可以自由复制和排版)

限时特价:7 元/份 原价:20元

支付方式:

开通VIP包月会员 特价:29元/月

注:下载文档有可能“只有目录或者内容不全”等情况,请下载之前注意辨别,如果您已付费且无法下载或内容有问题,请联系我们协助你处理。
微信:fanwen365 QQ:370150219