6533b7cffe1ef96bd1258f7b

RESEARCH PRODUCT

Proteome-wide comparison between the amino acid composition of domains and linkers

Daniel BrüneMiguel A. Andrade-navarroPablo Mier

subject

Proteomics570BacteriaProteomeAmino acid compositionlcsh:Rlcsh:MedicineEukaryotaArchaea570 Life sciencesResearch Notelcsh:Biology (General)Sequence Analysis ProteinCatalytic DomainDomainsAmino Acid SequenceLinkerslcsh:Science (General)lcsh:QH301-705.5570 Biowissenschaftenlcsh:Q1-390

description

Objective Amino acid composition is a sequence feature that has been extensively used to characterize proteomes of many species and protein families. Yet the analysis of amino acid composition of protein domains and the linkers connecting them has received less attention. Here, we perform both a comprehensive full-proteome amino acid composition analysis and a similar analysis focusing on domains and linkers, to uncover domain- or linker-specific differential amino acid usage patterns. Results The amino acid composition in the 38 proteomes studied showcase the greater variability found in archaea and bacteria species compared to eukaryotes. When focusing on domains and linkers, we describe the preferential use of polar residues in linkers and hydrophobic residues in domains. To let any user perform this analysis on a given domain (or set of them), we developed a dedicated R script called RACCOON, which can be easily used and can provide interesting insights into the compositional differences between a domain and its surrounding linkers. Electronic supplementary material The online version of this article (10.1186/s13104-018-3221-0) contains supplementary material, which is available to authorized users.

10.1186/s13104-018-3221-0http://europepmc.org/articles/PMC5807739