CALANGO: a phylogeny-aware comparative genomics tool for discovering quantitative genotype-phenotype associations across species
High-quality genomic, annotation and phenotypic data are increasingly available for many species, yet there is still no general-purpose comparative genomics software that integrates these data types in a statistically sound framework to produce biologically meaningful knowledge. In this work, we present CALANGO (Comparative AnaLysis with ANnotation-based Genomic cOmponentes), a first-principles comparative genomics tool that searches for annotation terms, such as GO terms or Pfam domain IDs, associated with a quantitative variable used to rank species data, after correcting for phylogenetic relatedness. This information can be used to annotate genomes at any level, including p…