Search results for "alignment"
showing 10 items of 627 documents
Differential annotation of tRNA genes with anticodon CAT in bacterial genomes.
2006
We have developed three strategies to discriminate among the three types of tRNA genes with anticodon CAT (tRNA(Ile), elongator tRNA(Met) and initiator tRNA(fMet)) in bacterial genomes. With these strategies, we have classified the tRNA genes from 234 bacterial and several organellar genomes. These sequences, in an aligned or unaligned format, may be used for the identification and annotation of tRNA (CAT) genes in other genomes. The first strategy is based on the position of the problem sequences in a phenogram (a tree-like network), the second on the minimum average number of differences against the tRNA sequences of the three types and the third on the search for the highest score value …
Sequences homologous to the hobo transposable element in E strains of Drosophila melanogaster.
2001
Hobo is one of the three Drosophila melanogaster transposable elements, together with the P and I elements, that seem to have recently invaded the genome of this species. Surveys of the presence of hobo in strains from different geographical and temporal origins have shown that recently collected strains contain complete and deleted elements with high sequence similarity (H strains), but old strains lack hobo elements (E strains). Besides the canonical hobo sequences, both H and E strains show other poorly known hobo-related sequences. In the present work, we analyze the presence, cytogenetic location, and structure of some of these sequences in E strains of D. melanogaster. By in situ hybr…
Ubiquitins (polyubiquitin and ubiquitin extension protein) in marine sponges: cDNA sequence and phylogenetic analysis
1999
The complete nucleotide sequences of twoSuberites domunculacDNAs and oneSycon raphanuscDNA, all encoding ubiquitin, have been determined. One cDNA fromS. domunculacodes for polyubiquitin with four tandemly repeated monomeric units and the second cDNA encodes ubiquitin fused to a ribosomal protein of 78 amino acids (aa).S. domunculapossesses at least one additional polyubiquitin gene, from which the last two monomers were also sequenced. All analysed genes fromS. domunculaencode identical ubiquitin proteins, with only one aa difference (Ala19) to the human/higher animals ubiquitin (Pro19). Ubiquitin inS. domunculais identical with the ubiquitin found in another Demospongia,Geodia cydonium. T…
A mammalian gene evolved from the integrase domain of an LTR retrotransposon.
2001
FIG. 1.—Summary of the structure and coding sequence of the human Gin-1 gene. Sequences of human cDNAs with accession numbers XMp003947.2 (a putative full-length cDNA), BE502574, AW173201.1, AW950418.1, AI631948.1, and AA766836.1 were used to deduce and confirm these data. The full-length protein is 522 amino acids long. The Gin-1 coding region spans nucleotides 36153–15345 in the genomic clone NTp002663.4. Arrowheads and the numbers above them, respectively, indicate the positions and lengths of introns. Several Alu repeats were detected within the two largest introns. Bold letters indicate the region homologous to the most conserved part of the IN domain, detailed in figure 2 and used to …
Comparing DNA sequence collections by direct comparison of compressed text indexes
2012
Popular sequence alignment tools such as BWA convert a reference genome to an indexing data structure based on the Burrows-Wheeler Transform (BWT), from which matches to individual query sequences can be rapidly determined. However the utility of also indexing the query sequences themselves remains relatively unexplored. Here we show that an all-against-all comparison of two sequence collections can be computed from the BWT of each collection with the BWTs held entirely in external memory, i.e. on disk and not in RAM. As an application of this technique, we show that BWTs of transcriptomic and genomic reads can be compared to obtain reference-free predictions of splice junctions that have h…
Heterogeneity of HVR-1 quasispecies is predictive of early but not sustained virological response in genotype 1b-infected patients undergoing combine…
2003
ISDR mutation pattern and HVR-1 quasispecies were analyzed in HCV genotype 1b-infected patients treated with either PEG- or STD-IFN plus ribavirin, in order to find virological correlates of therapy outcome. ISDR region analysis, performed at baseline (T0) and at 4 weeks of therapy (T1), indicated that ISDR mutation pattern was not predictive of response to treatment. Moreover, no selection of putative resistant strains in the first month of therapy was observed. Viral load was not correlated with any parameter of HVR-1 heterogeneity. Among the HVR-1 heterogeneity parameters considered, complexity was inversely correlated to viral load decline at T1. In univariate analysis, complexity, prop…
Genotypic analysis at multiple loci across Kaposi's sarcoma herpesvirus (KSHV) DNA molecules: clustering patterns, novel variants and chimerism
2001
Abstract Background: the genomes of human Kaposi's sarcoma-associated herpesvirus (KSHV) display several levels of DNA sequence heterogeneity and subgrouping that show distinctive clustering patterns in related human populations. The four major subtype patterns for the hypervariable ORF-K1 protein correlate closely with the principal diasporas resulting from the migration of modern humans out of East Africa and suggest that KSHV is an ancient human virus that is transmitted primarily in a familial fashion with consequent very low recombination rates. However, chimeric genomes have also been detected, especially with regard to the presence of P versus M alleles of the ORF-K15 gene. Objective…
Rat mammary-gland transferrin: nucleotide sequence, phylogenetic analysis and glycan structure
1995
The complete cDNA for rat mammary-gland transferrin (Tf) has been sequenced and also the native protein isolated from milk in order to analyse the structure of the main glycan variants present. A lactating-rat mammary-gland cDNA library in lambda gt10 was screened with a partial cDNA copy of rat liver Tf and subsequently rescreened with 5′ fragments of the longest clones. This produced a 2275 bp insert coding for an open reading frame of 695 amino acid residues. This includes a 19-amino acid signal sequence and the mature protein containing 676 amino acids and one N-glycosylation site in the C-terminal domain at residue 490. Phylogenetic analysis was carried out using 14 translated Tf nucle…
Genetic rearrangements in the pathogenicity locus of Clostridium difficile strain 8864 – implications for transcription, expression and enzymatic act…
1998
The pathogenicity locus (PaLoc) of Clostridium difficile isolate 8864 was investigated to locate genetic rearrangements that would explain the exceptional pathogenicity of this particular isolate. Two major changes were defined: an insertion of 1.1 kb between the two genes tcdA and tcdE, coding for the enterotoxin and an accessory protein of unknown function, respectively, and a deletion of 5.9 kb encompassing the 3' ends of tcdA and tcdC. Transcription of the tcdA-E genes is severely affected by both rearrangements, explaining the demonstrated complete lack of TcdA polypeptide. We present a model of coordinate, growth-related transcription of the tcdA-E genes that confirms our previous fin…
Application of an innovative alignment optimisation method to a cross-cultural mean comparison of teacher self-efficacy: A cross-country study
2021
Teacher self-efficacy is a crucial personal characteristic that is important not only for teachers’ well-being but also for the overall teaching and learning. However, the difficulty to ascertain scalar invariance in the measurement of the construct has beset previous attempts of cross-cultural comparisons. This study implements an alignment optimisation method to compare and rank mean teacher self-efficacy of over 150,000 teachers across 48 countries and economies that participated in the Teaching and Learning International Survey (TALIS) that was conducted 2018. The findings show that Columbia, Portugal, United Arab Emirates, Hungary, and South Africa have teachers with the highest mean s…