Search results for "Repetitive sequence"
showing 10 items of 74 documents
Isolation of the DNA minisatellite probe MZ 1.3 and its application to DNA ‘fingerprinting’ analysis
1990
Abstract A minisatellite probe, MZ 1.3, detecting hypervariable fragment patterns was isolated from a human genomic library. A repetitive sequence of 27 bp length was identified which is contained in the probe approx. 40 times. The MZ 1.3 repeat shows variable homology of 53–73% to the repetitive sequence of the protein III gene of the bacteriophage M13 genome. Polymorphic restriction fragment patterns were found with MZ 1.3 using the enzymes Hinf I, BstN I, Hae III, Mbo I, PstI PvuII , and Rsa I. An average of 18 polymorphic fragments was observed using Hinf I as enzyme. The band sharing frequency after Hinf I digestion among unrelated individuals was determined to be 23.8 ± 7.2%. An examp…
Comparative genomic sequencing reveals a strikingly similar architecture of a conserved syntenic region on human chromosome 11p15.3 (including gene S…
2001
Comparative genomics is a superior way to identify phylogenetically conserved features like genes or regions involved in gene regulation. The comparison of extended orthologous chromosomal regions should also reveal other characteristic traits essential for chromosome or gene function. In the present study we have sequenced and compared a region of conserved synteny from human chromosome 11p15.3 and mouse chromosome 7. In human, this region is known to contain several genes involved in the development of various disorders like Beckwith-Wiedemann overgrowth syndrome and other tumor diseases. Furthermore, in the neighboring chromosome region 11p15.5 extensive imprinting of genes has been repo…
Assessing the low complexity of protein sequences via the low complexity triangle.
2020
Background Proteins with low complexity regions (LCRs) have atypical sequence and structural features. Their amino acid composition varies from the expected, determined proteome-wise, and they do not follow the rules of structural folding that prevail in globular regions. One way to characterize these regions is by assessing the repeatability of a sequence, that is, calculating the local propensity of a region to be part of a repeat. Results We combine two local measures of low complexity, repeatability (using the RES algorithm) and fraction of the most frequent amino acid, to evaluate different proteomes, datasets of protein regions with specific features, and individual cases of proteins…
RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures
2020
The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increasing the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and presents an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification combining top levels based solely on structural similarity (Class > Topology > Fold) with two new lev…
Mapping and structure of DMXL1, a human homologue of the DmX gene from Drosophila melanogaster coding for a WD repeat protein.
2000
The DmX gene was recently isolated from the X chromosome of Drosophila melanogaster. TBLASTN searches of the dbEST databases revealed sequences with a high level of similarity to DmX in a variety of different species, including insects, nematodes, and mammals showing that DmX is an evolutionarily highly conserved gene. Here we describe the cloning of the cDNA and the chromosomal localization of one of the human homologues of DmX, Dmx-like 1 (DMXL1). The human DMXL1 gene codes for a large mRNA of 11 kb with an open reading frame of 3027 amino acids. The putative protein belongs to the superfamily of WD repeat proteins, which have mostly regulatory functions. The DMXL1 protein contains an exc…
ZFWD: a novel subfamily of plant proteins containing a C3H zinc finger and seven WD40 repeats
2000
We describe a new subfamily of WD repeat proteins characterised by the presence of a C3H zinc finger at the N-terminal part of the protein associated with seven WD40 repeats. We have identified four members of this subfamily in Arabidopsis thaliana, one of them with associated expressed sequence tags (ESTs). We have also identified homologous ESTs in rice, cotton, maize, poplar, pine tree and the ice plant. We do not observe animal homologues, suggesting that this subfamily could be specific for plants. Our data suggest an important role for these proteins. Based on the high sequence conservation within the conserved domains, we suggest that these proteins could have a regulatory function.
Repeatability in protein sequences
2019
Low complexity regions (LCRs) in protein sequences have special properties that are very different from those of globular proteins. The rules that define secondary structure elements do not apply when the distribution of amino acids becomes biased. While there is a tendency towards structural disorder in LCRs, various examples, and particularly homorepeats of single amino acids, suggest that very short repeats could adopt structures very difficult to predict. These structures are possibly variable and dependant on the context of intra- or inter-molecular interactions. In general, short repeats in LCRs can induce structure. This could explain the observation that very short (non-perfect) rep…
Flanking regions determine the structure of the poly-glutamine homo- repeat in huntingtin through mechanisms common among glutamine-rich human protei…
2020
International audience; The causative agent of Huntington's disease, the poly-Q homo-repeat in the N-terminal region of huntingtin (httex1), is flanked by a 17-residue-long fragment (N17) and a proline-rich region (PRR), which promote and inhibit the aggregation propensity of the protein, respectively, by poorly understood mechanisms. Based on experimental data obtained from site-specifically labeled NMR samples, we derived an ensemble model of httex1 that identified both flanking regions as opposing poly-Q secondary structure promoters. While N17 triggers helicity through a promiscuous hydrogen bond network involving the side chains of the first glutamines in the poly-Q tract, the PRR prom…
Stimulation of protein (collagen) synthesis in sponge cells by a cardiac myotrophin‐related molecule from Suberites domuncula
2000
The body wall of sponges (Porifera), the lowest metazoan phylum, is formed by two epithelial cell layers of exopinacocytes and endopinacocytes, both of which are associated with collagen fibrils. Here we show that a myotrophin-like polypeptide from the sponge Suberites domuncula causes the expression of collagen in cells from the same sponge in vitro. The cDNA of the sponge myotrophin was isolated; the potential open reading frame of 360 nt encodes a 120 aa long protein (Mr of 12,837). The sequence SUBDOMYOL shares high similarity with the known metazoan myotrophin sequences. The expression of SUBDOMYOL is low in single cells but high after formation of primmorph aggregates as well as in in…
REP2: A Web Server to Detect Common Tandem Repeats in Protein Sequences
2020
Ensembles of tandem repeats (TRs) in protein sequences expand rapidly to form domains well suited for interactions with proteins. For this reason, they are relatively frequent. Some TRs have known structures and therefore it is advantageous to predict their presence in a protein sequence. However, since most TRs diverge quickly, their detection by classical sequence comparison algorithms is not very accurate. Previously, we developed a method and a web server that used curated profiles and thresholds for the detection of 11 common TRs. Here we present a new web server (REP2) that allows the analysis of TRs in both individual and aligned sequences. We provide currently precomputed analyses f…