SATé-II: Very Fast and Accurate Simultaneous Estimation of Multiple Sequence Alignments and Phylogenetic Trees (original) (raw)

Journal Article

,

1Department of Computer Science, University of Texas at Austin, Austin, TX 78712, USA

Search for other works by this author on:

,

1Department of Computer Science, University of Texas at Austin, Austin, TX 78712, USA

Search for other works by this author on:

,

2Department of Ecology and Evolutionary Biology, University of Kansas at Lawrence, Lawrence, KS 66045, USA

Search for other works by this author on:

,

3Department of Computer Science, Calvin College, Grand Rapids, MI 49546, USA

Search for other works by this author on:

,

2Department of Ecology and Evolutionary Biology, University of Kansas at Lawrence, Lawrence, KS 66045, USA

Search for other works by this author on:

,

4Scientific Computing Group, Heidelberg Institute for Theoretical Studies, Schloss-Wolfsbrunnenweg 35, D-69118 Heidelberg, Germany

Search for other works by this author on:

5Section of Integrative Biology, School of Biological Sciences, University of Texas at Austin, Austin, TX, USA

*Correspondence to be sent to: The University of Texas at Austin, Austin, TX 78712, USA; E-mail: rlinder@mail.utexas.edu.

Search for other works by this author on:

Revision received:

09 November 2010

Published:

01 December 2011

Cite

Kevin Liu, Tandy J. Warnow, Mark T. Holder, Serita M. Nelesen, Jiaye Yu, Alexandros P. Stamatakis, C. Randal Linder, SATé-II: Very Fast and Accurate Simultaneous Estimation of Multiple Sequence Alignments and Phylogenetic Trees, Systematic Biology, Volume 61, Issue 1, January 2012, Page 90, https://doi.org/10.1093/sysbio/syr095
Close

Navbar Search Filter Mobile Enter search term Search

Abstract

Highly accurate estimation of phylogenetic trees for large data sets is difficult, in part because multiple sequence alignments must be accurate for phylogeny estimation methods to be accurate. Coestimation of alignments and trees has been attempted but currently only SATé estimates reasonably accurate trees and alignments for large data sets in practical time frames (Liu K., Raghavan S., Nelesen S., Linder C.R., Warnow T. 2009b. Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees. Science. 324:1561–1564). Here, we present a modification to the original SATé algorithm that improves upon SATé (which we now call SATé-I) in terms of speed and of phylogenetic and alignment accuracy. SATé-II uses a different divide-and-conquer strategy than SATé-I and so produces smaller more closely related subsets than SATé-I; as a result, SATé-II produces more accurate alignments and trees, can analyze larger data sets, and runs more efficiently than SATé-I. Generally, SATé is a metamethod that takes an existing multiple sequence alignment method as an input parameter and boosts the quality of that alignment method. SATé-II-boosted alignment methods are significantly more accurate than their unboosted versions, and trees based upon these improved alignments are more accurate than trees based upon the original alignments. Because SATé-I used maximum likelihood (ML) methods that treat gaps as missing data to estimate trees and because we found a correlation between the quality of tree/alignment pairs and ML scores, we explored the degree to which SATé's performance depends on using ML with gaps treated as missing data to determine the best tree/alignment pair. We present two lines of evidence that using ML with gaps treated as missing data to optimize the alignment and tree produces very poor results. First, we show that the optimization problem where a set of unaligned DNA sequences is given and the output is the tree and alignment of those sequences that maximize likelihood under the Jukes–Cantor model is uninformative in the worst possible sense. For all inputs, all trees optimize the likelihood score. Second, we show that a greedy heuristic that uses GTR+Gamma ML to optimize the alignment and the tree can produce very poor alignments and trees. Therefore, the excellent performance of SATé-II and SATé-I is not because ML is used as an optimization criterion for choosing the best tree/alignment pair but rather due to the particular divide-and-conquer realignment techniques employed.

© The Author(s) 2011. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com

You do not currently have access to this article.

Personal account

Get help with access

Institutional access

Access to content on Oxford Academic is often provided through institutional subscriptions and purchases. If you are a member of an institution with an active account, you may be able to access content in one of the following ways:

IP based access

Typically, access is provided across an institutional network to a range of IP addresses. This authentication occurs automatically, and it is not possible to sign out of an IP authenticated account.

Sign in through your institution

Choose this option to get remote access when outside your institution. Shibboleth/Open Athens technology is used to provide single sign-on between your institution’s website and Oxford Academic.

  1. Click Sign in through your institution.
  2. Select your institution from the list provided, which will take you to your institution's website to sign in.
  3. When on the institution site, please use the credentials provided by your institution. Do not use an Oxford Academic personal account.
  4. Following successful sign in, you will be returned to Oxford Academic.

If your institution is not listed or you cannot sign in to your institution’s website, please contact your librarian or administrator.

Sign in with a library card

Enter your library card number to sign in. If you cannot sign in, please contact your librarian.

Society Members

Society member access to a journal is achieved in one of the following ways:

Sign in through society site

Many societies offer single sign-on between the society website and Oxford Academic. If you see ‘Sign in through society site’ in the sign in pane within a journal:

  1. Click Sign in through society site.
  2. When on the society site, please use the credentials provided by that society. Do not use an Oxford Academic personal account.
  3. Following successful sign in, you will be returned to Oxford Academic.

If you do not have a society account or have forgotten your username or password, please contact your society.

Sign in using a personal account

Some societies use Oxford Academic personal accounts to provide access to their members. See below.

Personal account

A personal account can be used to get email alerts, save searches, purchase content, and activate subscriptions.

Some societies use Oxford Academic personal accounts to provide access to their members.

Viewing your signed in accounts

Click the account icon in the top right to:

Signed in but can't access content

Oxford Academic is home to a wide variety of products. The institutional subscription may not cover the content that you are trying to access. If you believe you should have access to that content, please contact your librarian.

Institutional account management

For librarians and administrators, your personal account also provides access to institutional account management. Here you will find options to view and activate subscriptions, manage institutional settings and access options, access usage statistics, and more.

Purchase

Short-term Access

To purchase short-term access, please sign in to your personal account above.

Don't already have a personal account? Register

SATé-II: Very Fast and Accurate Simultaneous Estimation of Multiple Sequence Alignments and Phylogenetic Trees - 24 Hours access

EUR €51.00

GBP £44.00

USD $55.00

Rental

Read this now at DeepDyve

This article is also available for rental through DeepDyve.

Citations

Views

Altmetric

Metrics

Total Views 5,257

3,869 Pageviews

1,388 PDF Downloads

Since 12/1/2016

Month: Total Views:
December 2016 4
January 2017 13
February 2017 20
March 2017 42
April 2017 37
May 2017 12
June 2017 8
July 2017 34
August 2017 22
September 2017 42
October 2017 39
November 2017 25
December 2017 58
January 2018 136
February 2018 77
March 2018 61
April 2018 48
May 2018 54
June 2018 42
July 2018 58
August 2018 37
September 2018 29
October 2018 34
November 2018 61
December 2018 38
January 2019 45
February 2019 44
March 2019 58
April 2019 93
May 2019 73
June 2019 54
July 2019 50
August 2019 59
September 2019 69
October 2019 44
November 2019 45
December 2019 38
January 2020 52
February 2020 54
March 2020 60
April 2020 61
May 2020 37
June 2020 73
July 2020 62
August 2020 135
September 2020 64
October 2020 51
November 2020 32
December 2020 82
January 2021 60
February 2021 65
March 2021 58
April 2021 44
May 2021 44
June 2021 39
July 2021 49
August 2021 42
September 2021 68
October 2021 72
November 2021 65
December 2021 62
January 2022 64
February 2022 68
March 2022 70
April 2022 67
May 2022 99
June 2022 73
July 2022 84
August 2022 90
September 2022 67
October 2022 79
November 2022 56
December 2022 66
January 2023 80
February 2023 50
March 2023 64
April 2023 90
May 2023 54
June 2023 37
July 2023 54
August 2023 25
September 2023 67
October 2023 53
November 2023 57
December 2023 46
January 2024 74
February 2024 66
March 2024 65
April 2024 51
May 2024 58
June 2024 52
July 2024 61
August 2024 81
September 2024 49
October 2024 6

Citations

276 Web of Science

×

Email alerts

Citing articles via

More from Oxford Academic