0000000000236567

AUTHOR

Marcin Grynberg

0000-0003-1887-7209

showing 4 related works from this author

Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases

2019

AbstractThe widespread occurrence of repetitive stretches of DNA in genomes of organisms across the tree of life imposes fundamental challenges for sequencing, genome assembly, and automated annotation of genes and proteins. This multi-level problem can lead to errors in genome and protein databases that are often not recognized or acknowledged. As a consequence, end users working with sequences with repetitive regions are faced with ‘ready-to-use’ deposited data whose trustworthiness is difficult to determine, let alone to quantify. Here, we provide a review of the problems associated with tandem repeat sequences that originate from different stages during the sequencing-assembly-annotatio…

FOS: Computer and information sciencesBioinformatics[SDV]Life Sciences [q-bio]Sequence assemblyGenomics[SDV.BC]Life Sciences [q-bio]/Cellular BiologyComputational biologyBiologyGenome03 medical and health sciencesAnnotation0302 clinical medicineTandem repeatGeneticsAnimalsSurvey and SummaryDatabases ProteinGeneComputingMilieux_MISCELLANEOUS030304 developmental biology0303 health sciencesEnd user572: BiochemieDNASequence Analysis DNAGenomics[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]WorkflowComputingMethodologies_PATTERNRECOGNITIONGadus morhuaTandem Repeat SequencesScientific Experimental Error[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]Databases Nucleic Acid030217 neurology & neurosurgery
researchProduct

Disentangling the complexity of low complexity proteins

2020

Abstract There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichot…

Protein ConformationComputer scienceReview ArticleComputational biologyMeasure (mathematics)Evolution MolecularLow complexity03 medical and health sciencesProtein DomainsAmino Acid Sequencestructure[SDV.BBM.BC]Life Sciences [q-bio]/Biochemistry Molecular Biology/Biochemistry [q-bio.BM]Databases ProteinMolecular Biology030304 developmental biologyStructure (mathematical logic)0303 health sciencesSequence[SCCO.NEUR]Cognitive science/Neurosciencecomposition bias030302 biochemistry & molecular biologyProteinsdisorderlow complexity regionsStructure and function[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]AlgorithmsInformation SystemsBriefings in Bioinformatics
researchProduct

A model species for agricultural pest genomics: the genome of the Colorado potato beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae)

2018

AbstractThe Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceaeous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the gen…

Male0106 biological sciences0301 basic medicineGenome Insectlcsh:Medicine01 natural sciencesGenomeNucleotide diversityInsecticide Resistancepomme de terreTRIBOLIUM-CASTANEUMlcsh:ScienceLeptinotarsaCYSTEINE PROTEINASESPhylogeny2. Zero hungereducation.field_of_studyGenomeMultidisciplinarybiologyinsecte ravageurEcologyGenètica vegetalAgricultureleptinotarsa decemlineataGenomicsS-TRANSFERASE GENESlutte contre les ravageursColeopteraOther Physical Sciencesphénotypeespèce modèleMultigene FamilyInsect ProteinsRNA InterferenceFemaleBiotechnologyAutre (Sciences du Vivant)Genome evolutiondoryphorecoleopteraEvolutionPopulationPopulationRNA-INTERFERENCEGenomicsGEOGRAPHIC POPULATIONSArticleDNA sequencingHost-Parasite InteractionsEvolution Molecular03 medical and health sciences[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]GeneticsAnimalsanalyse génomiquePest Control BiologicaleducationQH426GeneSolanum tuberosumComparative genomicsbusiness.industrychrysomelidaelcsh:RHuman GenomefungiColorado potato beetlePest controlBiology and Life SciencesMolecularGenetic VariationMolecular Sequence AnnotationBiologicalbiology.organism_classification010602 entomologyGenòmicaGenetics Population030104 developmental biologyGene Expression RegulationDROSOPHILA-MELANOGASTERPROTEINASE-INHIBITORSEvolutionary biologyTRANSPOSABLE ELEMENTSDNA Transposable Elementslcsh:QPest ControlBiochemistry and Cell BiologyPEST analysisCAENORHABDITIS-ELEGANSbusinessInsectTranscription Factors
researchProduct

PlaToLoCo: the first web meta-server for visualization and annotation of low complexity regions in proteins

2020

Abstract Low complexity regions (LCRs) in protein sequences are characterized by a less diverse amino acid composition compared to typically observed sequence diversity. Recent studies have shown that LCRs may co-occur with intrinsically disordered regions, are highly conserved in many organisms, and often play important roles in protein functions and in diseases. In previous decades, several methods have been developed to identify regions with LCRs or amino acid bias, but most of them as stand-alone applications and currently there is no web-based tool which allows users to explore LCRs in protein sequences with additional functional annotations. We aim to fill this gap by providing PlaToL…

Sequence analysisAcademicSubjects/SCI00010Protein domainComputational biologyBiologyDomain (software engineering)Computer graphics03 medical and health sciencesAnnotationProtein DomainsSequence Analysis ProteinGeneticsComputer GraphicsHumansAmino Acids030304 developmental biology0303 health sciencesIntersection (set theory)030302 biochemistry & molecular biologyMembrane ProteinsProteinsMolecular Sequence AnnotationVisualizationMolecular Sequence AnnotationWeb Server IssueSoftwareNucleic Acids Research
researchProduct