Author: Pablo Mier

0000000000532515

AUTHOR

Pablo Mier

showing 33 related works from this author

Toward completion of the Earth’s proteome: an update a decade later

2017

Protein databases are steadily growing, driven by the spread of new, more efficient sequencing techniques. This growth is dominated by an increase in redundancy (homologous proteins with various degrees of sequence similarity) and by the inability to process and curate sequence entries as fast as they are created. To understand these trends and aid bioinformatic resources that might be compromised by the increasing size of the protein sequence databases, we have created a less-redundant protein data set. In parallel, we analyzed the evolution of protein sequence databases in terms of size and redundancy. While the SwissProt database has decelerated its growth mostly because of a focus on i…

Proteome; Operations research; Knowledge Bases; 0206 medical engineering; 02 engineering and technology; Computational biology; Biology; 03 medical and health sciences; Annotation; Protein sequencing; Sequence Analysis Protein; Three-domain system; Redundancy (engineering); Animals; Humans; Databases Protein; Molecular Biology; 030304 developmental biology; Sequence (medicine); 0303 health sciences; Computational Biology; Proteins; Protein superfamily; Proteome; UniProt; Software; 020602 bioinformatics; Information Systems; Briefings in Bioinformatics

Toward completion of the Earth’s proteome: an update a decade later

2017

Avoided motifs: short amino acid strings missing from protein datasets.

2020

Traitpedia: a collaborative effort to gather species traits

2018

Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases

2019

FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases.

2016

Between Interactions and Aggregates: The PolyQ Balance

2021

The importance of definitions in the study of polyQ regions: A tale of thresholds, impurities and sequence context

2020

Disentangling the complexity of low complexity proteins

2020

Flanking regions determine the structure of the poly-glutamine homo-repeat in huntingtin through mechanisms common among glutamine-rich human protei…

2020

A novel approach to investigate the evolution of structured tandem repeat protein families by exon duplication.

2020

The latent geometry of the human protein interaction network

2017

The Role of Low Complexity Regions in Protein Interaction Modes: An Illustration in Huntingtin

2021

CRISPR sequences are sometimes erroneously translated and can contaminate public databases with spurious proteins containing spaced repeats

2020

AnABlast: Re-searching for Protein-Coding Sequences in Genomic Regions

2019

The 18S ribosomal RNA m6A methyltransferase Mettl5 is required for normal walking behavior in Drosophila

2020

Repeatability in protein sequences

2019

The Conservation of Low Complexity Regions in Bacterial Proteins Depends on the Pathogenicity of the Strain and Subcellular Location of the Protein

2021

orthoFind Facilitates the Discovery of Homologous and Orthologous Proteins

2015

dAPE: a web server to detect homorepeats and follow their evolution.

2016

Evolutionary Study of Disorder in Protein Sequences

2020

Protein-protein interactions can be predicted using coiled coil co-evolution patterns

2016

Assessing the low complexity of protein sequences via the low complexity triangle.

2020

PlaToLoCo: the first web meta-server for visualization and annotation of low complexity regions in proteins

2020

MAGA: A Supervised Method to Detect Motifs From Annotated Groups in Alignments

2020

Automated selection of homologs to track the evolutionary history of proteins

2018

REP2: A Web Server to Detect Common Tandem Repeats in Protein Sequences

2020

Proteome-wide comparison between the amino acid composition of domains and linkers

2018

SuppFile1.fasta.txt – Supplemental material for MAGA: A Supervised Method to Detect Motifs From Annotated Groups in Alignments

2020

Glutamine Codon Usage and polyQ Evolution in Primates Depend on the Q Stretch Length

2018

SuppFile2.fasta.txt – Supplemental material for MAGA: A Supervised Method to Detect Motifs From Annotated Groups in Alignments

2020

Additional file 2: of Automated selection of homologs to track the evolutionary history of proteins

2018

Additional file 1: of Automated selection of homologs to track the evolutionary history of proteins

2018

MOESM1 of Proteome-wide comparison between the amino acid composition of domains and linkers

2018