6533b82dfe1ef96bd12909f0

RESEARCH PRODUCT

GWideCodeML: A python package for testing evolutionary hypotheses at the genome-wide level

Eladio BarrioEladio BarrioLaura G. MacíasLaura G. MacíasChristina Toft

subject

Maximum likelihoodQH426-470Software and Data ResourcesBiologycomputer.software_genreGenomeEvolution Molecular03 medical and health sciencesMolecular levelMolecular evolutionGeneticsCodonMolecular BiologyPhylogenyGenetics (clinical)030304 developmental biologycomputer.programming_languageComparative genomics0303 health sciencesPhylogenetic treeComparative genomicsPositive selectionProtein sequence analysis030302 biochemistry & molecular biologyGenome analysisPython (programming language)Biological EvolutionPositive selectionMolecular evolutionData miningcomputerSoftwarePython

description

One of the most widely used programs for detecting positive selection, at the molecular level, is the program codeml, which is implemented in the Phylogenetic Analysis by Maximum Likelihood (PAML) package. However, it has a limitation when it comes to genome-wide studies, as it runs on a gene-by-gene basis. Furthermore, the size of such studies will depend on the number of orthologous genes the genomes have income and these are often restricted to only account for instances where a one-to-one relationship is observed between the genomes. In this work, we present GWideCodeML, a Python package, which runs a genome-wide codeml with the option of parallelization. To maximize the number of analyzed genes, the package allows for a variable number of taxa in the alignments and will automatically prune the topology to fit each of them, before running codeml.

10.1534/g3.120.401874http://dx.doi.org/10.1534/g3.120.401874