Alignment modulates ancestral sequence reconstruction accuracy

Ricardo Assunçao Vialle, Asif U. Tamuri, Nick Goldman

Research output: Contribution to journalArticlepeer-review

54 Scopus citations

Abstract

Accurate reconstruction of ancestral states is a critical evolutionary analysis when studying ancient proteins and comparing biochemical properties between parental or extinct species and their extant relatives. It relies on multiple sequence alignment (MSA) which may introduce biases, and it remains unknown how MSA methodological approaches impact ancestral sequence reconstruction (ASR). Here, we investigate how MSA methodology modulates ASR using a simulation study of various evolutionary scenarios. We evaluate the accuracy of ancestral protein sequence reconstruction for simulated data and compare reconstruction outcomes using different alignment methods. Our results reveal biases introduced not only by aligner algorithms and assumptions, but also tree topology and the rate of insertions and deletions. Under many conditions we find no substantial differences between the MSAs. However, increasing the difficulty for the aligners can significantly impact ASR. The MAFFT consistency aligners and PRANK variants exhibit the best performance, whereas FSA displays limited performance. We also discover a bias towards reconstructed sequences longer than the true ancestors, deriving fromapreference for inferring insertions, in almost allMSAmethodological approaches. In addition, we find measures of MSA quality generally correlate highly with reconstruction accuracy. Thus, we show MSA methodological differences can affect the quality of reconstructions and propose MSA methods should be selected with care to accurately determine ancestral states with confidence.

Original languageEnglish
Pages (from-to)1783-1797
Number of pages15
JournalMolecular Biology and Evolution
Volume35
Issue number7
DOIs
StatePublished - 1 Jul 2018
Externally publishedYes

Keywords

  • Ancestral protein reconstruction
  • Ancestral sequence reconstruction
  • Multiple sequence alignment
  • Phylogenetic analysis
  • Simulation.

Fingerprint

Dive into the research topics of 'Alignment modulates ancestral sequence reconstruction accuracy'. Together they form a unique fingerprint.

Cite this