Search results for "Repetitive sequence"

showing 10 items of 74 documents

Isolation of the DNA minisatellite probe MZ 1.3 and its application to DNA ‘fingerprinting’ analysis

1990

Abstract A minisatellite probe, MZ 1.3, detecting hypervariable fragment patterns was isolated from a human genomic library. A repetitive sequence of 27 bp length was identified which is contained in the probe approx. 40 times. The MZ 1.3 repeat shows variable homology of 53–73% to the repetitive sequence of the protein III gene of the bacteriophage M13 genome. Polymorphic restriction fragment patterns were found with MZ 1.3 using the enzymes Hinf I, BstN I, Hae III, Mbo I, PstI PvuII , and Rsa I. An average of 18 polymorphic fragments was observed using Hinf I as enzyme. The band sharing frequency after Hinf I digestion among unrelated individuals was determined to be 23.8 ± 7.2%. An examp…

Molecular Sequence DataRestriction MappingDNA SatelliteHomology (biology)Pathology and Forensic MedicineRestriction fragmentchemistry.chemical_compoundHumansGenomic libraryGeneRepetitive Sequences Nucleic AcidGeneticsGenomic LibraryBase SequencebiologyNucleotide MappingDNAMolecular biologyBlotting SouthernVariable number tandem repeatMinisatelliteDNA profilingchemistrybiology.proteinDNA ProbesLawPolymorphism Restriction Fragment LengthDNAForensic Science International

researchProduct

Comparative genomic sequencing reveals a strikingly similar architecture of a conserved syntenic region on human chromosome 11p15.3 (including gene S…

2001

Comparative genomics is a superior way to identify phylogenetically conserved features like genes or regions involved in gene regulation. The comparison of extended orthologous chromosomal regions should also reveal other characteristic traits essential for chromosome or gene function. In the present study we have sequenced and compared a region of conserved synteny from human chromosome 11p15.3 and mouse chromosome 7. In human, this region is known to contain several genes involved in the development of various disorders like Beckwith-Wiedemann overgrowth syndrome and other tumor diseases. Furthermore, in the neighboring chromosome region 11p15.5 extensive imprinting of genes has been repo…

Molecular Sequence DataeducationGenomicsBiologyChromosomesContig MappingMiceGene OrderGeneticsAnimalsHumansCloning MolecularMolecular BiologyGeneConserved SequenceGenetics (clinical)Repetitive Sequences Nucleic AcidSyntenyRegulation of gene expressionChromosome 7 (human)Comparative genomicsGeneticsChromosomes Human Pair 11Tumor Suppressor ProteinsGenomic sequencingChromosomeSequence Analysis DNAGC Rich SequenceDNA-Binding ProteinsCytogenetic and Genome Research

researchProduct

Assessing the low complexity of protein sequences via the low complexity triangle.

2020

Background Proteins with low complexity regions (LCRs) have atypical sequence and structural features. Their amino acid composition varies from the expected, determined proteome-wise, and they do not follow the rules of structural folding that prevail in globular regions. One way to characterize these regions is by assessing the repeatability of a sequence, that is, calculating the local propensity of a region to be part of a repeat. Results We combine two local measures of low complexity, repeatability (using the RES algorithm) and fraction of the most frequent amino acid, to evaluate different proteomes, datasets of protein regions with specific features, and individual cases of proteins…

ProteomeProteomesComputer scienceProtein SequencingBiochemistryDatabase and Informatics MethodsSequence Analysis ProteinProtein methodsPeptide sequencechemistry.chemical_classification0303 health sciencesSequenceMultidisciplinary030302 biochemistry & molecular biologyQRGenomicsAmino acidTandem RepeatsProteomeAmino Acid AnalysisMedicineSequence AnalysisResearch ArticleRepetitive Sequences Amino AcidBioinformaticsSequence analysisScienceResearch and Analysis MethodsGenome Complexity03 medical and health sciencesProtein DomainsAmino Acid Sequence AnalysisTandem repeatGeneticsHumansFraction (mathematics)Repeated SequencesAmino Acid SequenceMolecular Biology TechniquesSequencing TechniquesRepresentation (mathematics)Molecular Biology030304 developmental biologyMolecular Biology Assays and Analysis Techniquesbusiness.industryBiology and Life SciencesProteinsComputational BiologyPattern recognitionchemistryGlobular ProteinsArtificial intelligencebusinessPLoS ONE

researchProduct

RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

2020

The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increasing the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and presents an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification combining top levels based solely on structural similarity (Class > Topology > Fold) with two new lev…

Repetitive Sequences Amino AcidAcademicSubjects/SCI00010BiologíaStatistics as TopicProtein Data Bank (RCSB PDB)Computational biologyBiologyRepetitive SequencesGene Ontology; HEK293 Cells; HeLa Cells; Humans; Proteins; Reproducibility of Results; Statistics as Topic; User-Computer Interface; Databases Protein; Repetitive Sequences Amino Acid; Tandem Repeat SequencesDatabases03 medical and health sciencesAnnotationUser-Computer InterfaceProtein structureSimilarity (network science)Tandem repeatGeneticsDatabase IssueHumansDatabases ProteinCiencias Exactasdatabase030304 developmental biology0303 health sciencesHierarchy (mathematics)Protein030302 biochemistry & molecular biologyProteinsReproducibility of Resultscomputer.file_formatProtein Data BankClass (biology)proteinsAmino AcidComputingMethodologies_PATTERNRECOGNITIONGene OntologyHEK293 CellsclassificationTandem Repeat Sequencesprotein tandem repeat structures[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]computerHeLa CellsNucleic Acids Research

researchProduct

Mapping and structure of DMXL1, a human homologue of the DmX gene from Drosophila melanogaster coding for a WD repeat protein.

2000

The DmX gene was recently isolated from the X chromosome of Drosophila melanogaster. TBLASTN searches of the dbEST databases revealed sequences with a high level of similarity to DmX in a variety of different species, including insects, nematodes, and mammals showing that DmX is an evolutionarily highly conserved gene. Here we describe the cloning of the cDNA and the chromosomal localization of one of the human homologues of DmX, Dmx-like 1 (DMXL1). The human DMXL1 gene codes for a large mRNA of 11 kb with an open reading frame of 3027 amino acids. The putative protein belongs to the superfamily of WD repeat proteins, which have mostly regulatory functions. The DMXL1 protein contains an exc…

Repetitive Sequences Amino AcidDNA ComplementaryMolecular Sequence DataBiologyConserved sequenceMiceGene mappingComplementary DNAGeneticsAnimalsDrosophila ProteinsHumansRadiation hybrid mappingAmino Acid SequenceDinucleotide RepeatsGeneIn Situ Hybridization FluorescenceGeneticsBase SequenceChromosome MappingProteinsbiology.organism_classificationOpen reading frameDrosophila melanogasterChromosomes Human Pair 5Insect ProteinsDrosophila melanogasterDrosophila ProteinGenomics

researchProduct

ZFWD: a novel subfamily of plant proteins containing a C3H zinc finger and seven WD40 repeats

2000

We describe a new subfamily of WD repeat proteins characterised by the presence of a C3H zinc finger at the N-terminal part of the protein associated with seven WD40 repeats. We have identified four members of this subfamily in Arabidopsis thaliana, one of them with associated expressed sequence tags (ESTs). We have also identified homologous ESTs in rice, cotton, maize, poplar, pine tree and the ice plant. We do not observe animal homologues, suggesting that this subfamily could be specific for plants. Our data suggest an important role for these proteins. Based on the high sequence conservation within the conserved domains, we suggest that these proteins could have a regulatory function.

Repetitive Sequences Amino AcidDNA ComplementarySubfamilyDNA PlantMolecular Sequence DataArabidopsisSequence alignmentBiologyEvolution MolecularWD40 repeatGeneticsProtein IsoformsArabidopsis thalianaAmino Acid SequencePeptide sequencePhylogenyPlant ProteinsExpressed Sequence TagsGeneticsZinc fingerExpressed sequence tagProtein subfamilySequence Homology Amino AcidArabidopsis Proteinsfungifood and beveragesZinc FingersSequence Analysis DNAGeneral Medicinebiology.organism_classificationSequence AlignmentGene

researchProduct

Repeatability in protein sequences

2019

Low complexity regions (LCRs) in protein sequences have special properties that are very different from those of globular proteins. The rules that define secondary structure elements do not apply when the distribution of amino acids becomes biased. While there is a tendency towards structural disorder in LCRs, various examples, and particularly homorepeats of single amino acids, suggest that very short repeats could adopt structures very difficult to predict. These structures are possibly variable and dependant on the context of intra- or inter-molecular interactions. In general, short repeats in LCRs can induce structure. This could explain the observation that very short (non-perfect) rep…

Repetitive Sequences Amino AcidGlobular proteinSaccharomyces cerevisiaeContext (language use)Computational biologyProtein–protein interactionEvolution Molecular03 medical and health sciencesSequence Analysis ProteinStructural BiologyHumansArabidopsis thalianaAmino Acid SequenceDatabases ProteinProtein secondary structure030304 developmental biologychemistry.chemical_classification0303 health sciencesbiology030302 biochemistry & molecular biologyProteinsbiology.organism_classificationAmino acidchemistrySequence AlignmentAlgorithmsFunction (biology)Journal of Structural Biology

researchProduct

Flanking regions determine the structure of the poly-glutamine homo- repeat in huntingtin through mechanisms common among glutamine-rich human protei…

2020

International audience; The causative agent of Huntington's disease, the poly-Q homo-repeat in the N-terminal region of huntingtin (httex1), is flanked by a 17-residue-long fragment (N17) and a proline-rich region (PRR), which promote and inhibit the aggregation propensity of the protein, respectively, by poorly understood mechanisms. Based on experimental data obtained from site-specifically labeled NMR samples, we derived an ensemble model of httex1 that identified both flanking regions as opposing poly-Q secondary structure promoters. While N17 triggers helicity through a promiscuous hydrogen bond network involving the side chains of the first glutamines in the poly-Q tract, the PRR prom…

Repetitive Sequences Amino AcidHuntingtinAmino Acid Motifs[SDV.BBM.BP] Life Sciences [q-bio]/Biochemistry Molecular Biology/Biophysics03 medical and health sciencesHuntington's diseaseStructural BiologyHuman proteome projectmedicineHumans[SDV.BBM.BC]Life Sciences [q-bio]/Biochemistry Molecular Biology/Biochemistry [q-bio.BM]Molecular BiologyHuman proteinsProtein secondary structure[SDV.BBM.BC] Life Sciences [q-bio]/Biochemistry Molecular Biology/Biochemistry [q-bio.BM]030304 developmental biology[INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]Huntingtin Protein0303 health sciencesChemistry030302 biochemistry & molecular biologyPromotermedicine.diseaseCell biologyIntrinsically Disordered ProteinsGlutamine[SDV.BBM.BP]Life Sciences [q-bio]/Biochemistry Molecular Biology/BiophysicsPolyglutamic Acid[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]Low Complexity Region

researchProduct

Stimulation of protein (collagen) synthesis in sponge cells by a cardiac myotrophin‐related molecule from Suberites domuncula

2000

The body wall of sponges (Porifera), the lowest metazoan phylum, is formed by two epithelial cell layers of exopinacocytes and endopinacocytes, both of which are associated with collagen fibrils. Here we show that a myotrophin-like polypeptide from the sponge Suberites domuncula causes the expression of collagen in cells from the same sponge in vitro. The cDNA of the sponge myotrophin was isolated; the potential open reading frame of 360 nt encodes a 120 aa long protein (Mr of 12,837). The sequence SUBDOMYOL shares high similarity with the known metazoan myotrophin sequences. The expression of SUBDOMYOL is low in single cells but high after formation of primmorph aggregates as well as in in…

Repetitive Sequences Amino AcidMolecular Sequence DataLysinePolymerase Chain ReactionBiochemistryMyotrophinComplementary DNAGeneticsProtein biosynthesisAnimalsAmino Acid SequenceCloning MolecularGrowth SubstanceseducationMolecular BiologyPhylogenyCell Sizeeducation.field_of_studyDose-Response Relationship DrugSequence Homology Amino AcidbiologySequence Analysis DNAbiology.organism_classificationRecombinant ProteinsIn vitroPoriferaUp-RegulationCell biologySuberites domunculaOpen reading frameSpongeIntercellular Signaling Peptides and ProteinsCollagenBiotechnologyThe FASEB Journal

researchProduct

REP2: A Web Server to Detect Common Tandem Repeats in Protein Sequences

2020

Ensembles of tandem repeats (TRs) in protein sequences expand rapidly to form domains well suited for interactions with proteins. For this reason, they are relatively frequent. Some TRs have known structures and therefore it is advantageous to predict their presence in a protein sequence. However, since most TRs diverge quickly, their detection by classical sequence comparison algorithms is not very accurate. Previously, we developed a method and a web server that used curated profiles and thresholds for the detection of 11 common TRs. Here we present a new web server (REP2) that allows the analysis of TRs in both individual and aligned sequences. We provide currently precomputed analyses f…

Repetitive Sequences Amino AcidWeb serverProteomeComputer scienceComputational biologycomputer.software_genreEvolution Molecular03 medical and health sciences0302 clinical medicineTandem repeatStructural BiologySequence comparisonHumansAmino Acid SequenceMolecular BiologyConserved Sequence030304 developmental biologySequence (medicine)Comparative genomicsInternet0303 health sciencesMultiple sequence alignmentBacteriaProteinsTandem Repeat SequencesProteomeUniProtSequence Alignmentcomputer030217 neurology & neurosurgeryJournal of Molecular Biology

researchProduct