A two-phase strategy for detecting recombination in nucleotide sequences

Chan, Cheong Xin, Beiko, Robert G. and Ragan, Mark A. (2007) A two-phase strategy for detecting recombination in nucleotide sequences.

Attached Files (Some files may be inaccessible until you login with your UQ eSpace credentials)
Name Description MIMEType Size Downloads
Chan_SACJ_final.pdf Full-text of the accepted manuscript application/pdf 195.32KB 0

Title A two-phase strategy for detecting recombination in nucleotide sequences
Abstract/Summary Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombination-detecting approaches can vary, depending on evolutionary events that take place after recombination. We previously evaluated the effects of post-recombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach to delineate breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.
Keyword Recombination detection
Sequence analysis
Evolution and phylogenetics
Publisher South African Computer Journal
Date 2007-06-30
Subjects 270200 Genetics
279900 Other Biological Sciences
239901 Biological Mathematics
Author Chan, Cheong Xin
Beiko, Robert G.
Ragan, Mark A.
Open Access Status Other
References K. N. Kreuzer. “Interplay between DNA replication and recombination in prokaryotes”. Annual Review of Microbiology, vol. 59, pp. 43–67, 2005. R. D. Camerini-Otero and P. Hsieh. “Homologous recombination proteins in prokaryotes and eukaryotes”. Annual Review of Genetics, vol. 29, pp. 509–552, 1995. N. Kleckner. “Meiosis: how could it work?” Proceedings of the National Academy of Sciences of the United States of America, vol. 93, no. 16, pp. 8167–8174, 1996. G. F. Weiller. “Phylogenetic profiles: a graphical method for detecting genetic recombinations in homologous sequences”. Molecular Biology and Evolution, vol. 15, no. 3, pp. 326–335, 1998. G. J. Etherington, J. Dicks and I. N. Roberts. “Recombination Analysis Tool (RAT): a program for the high-throughput detection of recombination”. Bioinformatics, vol. 21, no. 3, pp. 278–281, 2005. S. Sawyer. “Statistical tests for detecting gene conversion”. Molecular Biology and Evolution, vol. 6, no. 5, pp. 526–538, 1989. I. B. Jakobsen and S. Easteal. “A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences”. Computer Applications in the Biosciences, vol. 12, no. 4, pp. 291–295, 1996. T. C. Bruen, H. Philippe and D. Bryant. “A simple and robust statistical test for detecting the presence of recombination”. Genetics, vol. 172, no. 4, pp. 2665–2681, 2006. J. Hein. “Reconstructing evolution of sequences subject to recombination using parsimony”. Mathematical Biosciences, vol. 98, no. 2, pp. 185–200, 1990. S. L. Kosakovsky Pond, D. Posada, M. B. Gravenor, C. H. Woelk and S. D. Frost. “Automated phylogenetic detection of recombination using a genetic algorithm”. Molecular Biology and Evolution, vol. 23, no. 10, pp. 1891–1901, 2006. J. Graham, B. McNeney and F. Seillier-Moiseiwitsch. “Stepwise detection of recombination breakpoints in sequence alignments”. Bioinformatics, vol. 21, no. 5, pp. 589–595, 2005. D. Posada and K. A. Crandall. “Evaluation of methods for detecting recombination from DNA sequences: computer simulations”. Proceedings of the National Academy of Sciences of the United States of America, vol. 98, no. 24, pp. 13757–13762, 2001. C. Wiuf, T. Christensen and J. Hein. “A simulation study of the reliability of recombination detection methods”. Molecular Biology and Evolution, vol. 18, no. 10, pp. 1929–1939, 2001. D. Posada. “Evaluation of methods for detecting recombination from DNA sequences: empirical data”. Molecular Biology and Evolution, vol. 19, no. 5, pp. 708–717, 2002. C. X. Chan, R. G. Beiko and M. A. Ragan. “Detecting recombination in evolving nucleotide sequences”. BMC Bioinformatics, vol. 7, no. Art. 412, 2006. J. Maynard Smith. “Analyzing the mosaic structure of genes”. Journal of Molecular Evolution, vol. 34, no. 2, pp. 126–129, 1992. V. N. Minin, K. S. Dorman, F. Fang and M. A. Suchard. “Dual multiple change-point model leads to more accurate recombination detection”. Bioinformatics, vol. 21, no. 13, pp. 3034–3042, 2005. A. Rambaut and N. C. Grassly. “Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees”. Computer Applications in the Biosciences, vol. 13, no. 3, pp. 235–238, 1997. M. Hasegawa, H. Kishino and T. A. Yano. “Dating of the human ape splitting by a molecular clock of mitochondrial DNA”. Journal of Molecular Evolution, vol. 22, no. 2, pp. 160–174, 1985. R. G. Beiko, T. J. Harlow and M. A. Ragan. “Highways of gene sharing in prokaryotes”. Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 40, pp. 14332–14337, 2005. T. J. Harlow, J. P. Gogarten and M. A. Ragan. “A hybrid clustering approach to recognition of protein families in 114 microbial genomes”. BMC Bioinformatics, vol. 5, no. Art. 45, 2004. R. G. Beiko, C. X. Chan and M. A. Ragan. “A word-oriented approach to alignment validation”. Bioinformatics , vol. 21, no. 10, pp. 2230–2239, 2005. M. A. Suchard, R. E. Weiss, K. S. Dorman and J. S. Sinsheimer. “Inferring spatial phylogenetic variation along nucleotide sequences: a multiple change-point model”. Journal of the American Statistical Association , vol. 98, no. 462, pp. 427–437, 2003.
Additional Notes This is an author version of the paper accepted for publication in the June 2007 issue of the South African Computer Journal.

Version Filter Type
Citation counts: Google Scholar Search Google Scholar
Created: Fri, 25 May 2007, 17:47:03 EST by Cheong Xin Chan on behalf of Dorothy Hill Physical Sciences & Engineer Library