VNU-UET Repository

QMaker: Fast and Accurate Method to Estimate Empirical Models of Protein Evolution

Minh, Bui Quang and Dang, Cao Cuong and Vinh, Le Sy and Lanfear, Robert (2021) QMaker: Fast and Accurate Method to Estimate Empirical Models of Protein Evolution. Systematic Biology . ISSN 1063-5157

This is the latest version of this item.

[img] PDF
Download (1MB)


Amino acid substitution models play a crucial role in phylogenetic analyses. Maximum likelihood (ML) methods have been proposed to estimate amino acid substitution models; however, they are typically complicated and slow. In this article, we propose QMaker, a new ML method to estimate a general time-reversible Q matrix from a large protein data set consisting of multiple sequence alignments. QMaker combines an efficient ML tree search algorithm, a model selection for handling the model heterogeneity among alignments, and the consideration of rate mixture models among sites. We provide QMaker as a user-friendly function in the IQ-TREE software package ( supporting the use of multiple CPU cores so that biologists can easily estimate amino acid substitution models from their own protein alignments. We used QMaker to estimate new empirical general amino acid substitution models from the current Pfam database as well as five clade-specific models for mammals, birds, insects, yeasts, and plants. Our results show that the new models considerably improve the fit between model and data and in some cases influence the inference of phylogenetic tree topologies.[Amino acid replacement matrices; amino acid substitution models; maximum likelihood estimation; phylogenetic inferences.

Item Type: Article
Subjects: Information Technology (IT)
ISI-indexed journals
Divisions: Faculty of Information Technology (FIT)
Depositing User: Cao Cuong Dang
Date Deposited: 28 Jun 2021 02:33
Last Modified: 28 Jun 2021 02:33

Available Versions of this Item

Actions (login required)

View Item View Item