Search results for "alignment"

showing 10 items of 627 documents

Differential annotation of tRNA genes with anticodon CAT in bacterial genomes.

2006

We have developed three strategies to discriminate among the three types of tRNA genes with anticodon CAT (tRNA(Ile), elongator tRNA(Met) and initiator tRNA(fMet)) in bacterial genomes. With these strategies, we have classified the tRNA genes from 234 bacterial and several organellar genomes. These sequences, in an aligned or unaligned format, may be used for the identification and annotation of tRNA (CAT) genes in other genomes. The first strategy is based on the position of the problem sequences in a phenogram (a tree-like network), the second on the minimum average number of differences against the tRNA sequences of the three types and the third on the search for the highest score value …

Genetics, RNA Transfer Met, Photobacterium profundum, RNA, Computational Biology, Sequence alignment, Genomics, Bacterial genome size, Genomics, Biology, biology.organism_classification, Genome, Bacterial Proteins, Enterobacteriaceae, RNA Transfer, Genes Bacterial, Transfer RNA, Genetics, Anticodon, RNA Transfer Ile, Gene, Sequence Alignment, Genome Bacterial, Tenericutes, Nucleic acids research

researchProduct
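
The second strategy named in the abstract above (assigning a sequence to the type with the minimum average number of differences) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of Hamming distance on pre-aligned sequences, and the toy reference sequences are all assumptions made for illustration, and the sequences are not real tRNA genes.

```python
from statistics import mean


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length (aligned) sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def classify(query: str, references: dict[str, list[str]]) -> tuple[str, dict[str, float]]:
    """Assign query to the type with the smallest average number of differences.

    references maps a type label to its aligned reference sequences.
    Returns the chosen label and the average difference per type.
    """
    averages = {
        label: mean(hamming(query, ref) for ref in seqs)
        for label, seqs in references.items()
    }
    best = min(averages, key=averages.get)
    return best, averages


# Hypothetical toy data: short made-up strings, NOT real tRNA sequences.
TOY_REFERENCES = {
    "tRNA-Ile": ["ACGTAC", "ACGTAA"],
    "tRNA-Met": ["TTGTAC", "TTGTCC"],
    "tRNA-fMet": ["GGGGGG", "GGGGGA"],
}

if __name__ == "__main__":
    label, averages = classify("ACGTAT", TOY_REFERENCES)
    print(label, averages)
```

In this toy case the query differs from each tRNA-Ile reference at one position, so that type has the smallest average and is selected. A real analysis would need properly aligned sequences and a curated reference set.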

Sequences homologous to the hobo transposable element in E strains of Drosophila melanogaster.

2001

Hobo is one of the three Drosophila melanogaster transposable elements, together with the P and I elements, that seem to have recently invaded the genome of this species. Surveys of the presence of hobo in strains from different geographical and temporal origins have shown that recently collected strains contain complete and deleted elements with high sequence similarity (H strains), but old strains lack hobo elements (E strains). Besides the canonical hobo sequences, both H and E strains show other poorly known hobo-related sequences. In the present work, we analyze the presence, cytogenetic location, and structure of some of these sequences in E strains of D. melanogaster. By in situ hybr…

Genetics, Transposable element, biology, Euchromatin, Base Sequence, Chromosome Mapping, DNA, Sequence Analysis DNA, biology.organism_classification, Genome, Chromosomes, Drosophila melanogaster, Sequence Homology Nucleic Acid, Genetics, Melanogaster, Homologous chromosome, DNA Transposable Elements, Animals, Drosophila melanogaster, Molecular Biology, Sequence Alignment, Ecology Evolution Behavior and Systematics, Transposase, In Situ Hybridization, Sequence (medicine), Molecular biology and evolution

researchProduct

Ubiquitins (polyubiquitin and ubiquitin extension protein) in marine sponges: cDNA sequence and phylogenetic analysis

1999

The complete nucleotide sequences of two Suberites domuncula cDNAs and one Sycon raphanus cDNA, all encoding ubiquitin, have been determined. One cDNA from S. domuncula codes for polyubiquitin with four tandemly repeated monomeric units and the second cDNA encodes ubiquitin fused to a ribosomal protein of 78 amino acids (aa). S. domuncula possesses at least one additional polyubiquitin gene, from which the last two monomers were also sequenced. All analysed genes from S. domuncula encode identical ubiquitin proteins, with only one aa difference (Ala19) to the human/higher animals ubiquitin (Pro19). Ubiquitin in S. domuncula is identical with the ubiquitin found in another Demospongia, Geodia cydonium. T…

Genetics, Ubiquitins, Multiple sequence alignment, Ubiquitin, biology, Ribosomal protein, Complementary DNA, biology.protein, Ribosomal RNA, Fusion protein, Gene, Ecology Evolution Behavior and Systematics, Biological Journal of the Linnean Society

researchProduct

A mammalian gene evolved from the integrase domain of an LTR retrotransposon.

2001

FIG. 1.—Summary of the structure and coding sequence of the human Gin-1 gene. Sequences of human cDNAs with accession numbers XM_003947.2 (a putative full-length cDNA), BE502574, AW173201.1, AW950418.1, AI631948.1, and AA766836.1 were used to deduce and confirm these data. The full-length protein is 522 amino acids long. The Gin-1 coding region spans nucleotides 36153–15345 in the genomic clone NT_002663.4. Arrowheads and the numbers above them, respectively, indicate the positions and lengths of introns. Several Alu repeats were detected within the two largest introns. Bold letters indicate the region homologous to the most conserved part of the IN domain, detailed in figure 2 and used to …

Genetics, biology, Integrases, Retroelements, Sequence Homology Amino Acid, Molecular Sequence Data, Terminal Repeat Sequences, Alu element, Retrotransposon, Genome, Homology (biology), Integrase, Complementary DNA, Genetics, biology.protein, Coding region, Animals, Humans, Amino Acid Sequence, Molecular Biology, Gene, Sequence Alignment, Ecology Evolution Behavior and Systematics, Phylogeny, Molecular biology and evolution

researchProduct

Comparing DNA sequence collections by direct comparison of compressed text indexes

2012

Popular sequence alignment tools such as BWA convert a reference genome to an indexing data structure based on the Burrows-Wheeler Transform (BWT), from which matches to individual query sequences can be rapidly determined. However, the utility of also indexing the query sequences themselves remains relatively unexplored. Here we show that an all-against-all comparison of two sequence collections can be computed from the BWT of each collection with the BWTs held entirely in external memory, i.e. on disk and not in RAM. As an application of this technique, we show that BWTs of transcriptomic and genomic reads can be compared to obtain reference-free predictions of splice junctions that have h…

Genomics (q-bio.GN), Sequence, Computer science, business.industry, Search engine indexing, Sequence alignment, Pattern recognition, Construct (python library), Data structure, Burrows-Wheeler Transform; Splice junctions; External memory, External memory, FOS: Biological sciences, Code (cryptography), Quantitative Biology - Genomics, Burrows-Wheeler Transform, Artificial intelligence, business, Splice junctions, Auxiliary memory, Reference genome

researchProduct
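
As background to the BWT-based indexing mentioned in the abstract above, the following is a minimal in-memory sketch of the Burrows-Wheeler Transform and of counting pattern matches by backward search. It is not the paper's external-memory, all-against-all method, nor BWA's implementation: the function names are hypothetical, the construction sorts all rotations (quadratic, toy scale only), and the rank computation rescans the string rather than using precomputed tables.

```python
def bwt(text: str, sentinel: str = "$") -> str:
    """Burrows-Wheeler Transform built from sorted rotations (illustration only)."""
    if sentinel in text:
        raise ValueError("sentinel must not occur in text")
    s = text + sentinel
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    # The transform is the last column of the sorted rotation matrix.
    return "".join(rotation[-1] for rotation in rotations)


def count_occurrences(bwt_string: str, pattern: str) -> int:
    """Count occurrences of pattern in the original text via backward search."""
    # first_col_start[c]: number of characters in the text smaller than c,
    # i.e. the row where the block of rotations starting with c begins.
    counts: dict[str, int] = {}
    for ch in bwt_string:
        counts[ch] = counts.get(ch, 0) + 1
    first_col_start: dict[str, int] = {}
    total = 0
    for ch in sorted(counts):
        first_col_start[ch] = total
        total += counts[ch]

    def rank(ch: str, i: int) -> int:
        # Occurrences of ch in bwt_string[:i]; a real index precomputes this.
        return bwt_string[:i].count(ch)

    lo, hi = 0, len(bwt_string)  # half-open range of matching rows
    for ch in reversed(pattern):
        if ch not in first_col_start:
            return 0
        lo = first_col_start[ch] + rank(ch, lo)
        hi = first_col_start[ch] + rank(ch, hi)
        if lo >= hi:
            return 0
    return hi - lo


if __name__ == "__main__":
    transformed = bwt("banana")
    print(transformed)
    print(count_occurrences(transformed, "ana"))
```

The pattern is processed from its last character to its first, narrowing a range of rows in the sorted rotation matrix at each step. The width of the final range is the number of matches.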

Heterogeneity of HVR-1 quasispecies is predictive of early but not sustained virological response in genotype 1b-infected patients undergoing combine…

2003

ISDR mutation pattern and HVR-1 quasispecies were analyzed in HCV genotype 1b-infected patients treated with either PEG- or STD-IFN plus ribavirin, in order to find virological correlates of therapy outcome. ISDR region analysis, performed at baseline (T0) and at 4 weeks of therapy (T1), indicated that ISDR mutation pattern was not predictive of response to treatment. Moreover, no selection of putative resistant strains in the first month of therapy was observed. Viral load was not correlated with any parameter of HVR-1 heterogeneity. Among the HVR-1 heterogeneity parameters considered, complexity was inversely correlated to viral load decline at T1. In univariate analysis, complexity, prop…

Genotype, Hepacivirus, Interferon alpha-2, Viral Nonstructural Proteins, Antiviral Agents, Polyethylene Glycols, Viral Proteins, Genetic Heterogeneity, Ribavirin, Humans, Viral Protein, Phylogeny, Antiviral Agent, Viral Nonstructural Protein, Interferon-alpha, Sequence Analysis DNA, Hepatitis C Chronic, Recombinant Protein, Viral Load, Recombinant Proteins, Treatment Outcome, Linear Models, Linear Model, Drug Therapy Combination, Sequence Alignment, Human

researchProduct

Genotypic analysis at multiple loci across Kaposi's sarcoma herpesvirus (KSHV) DNA molecules: clustering patterns, novel variants and chimerism

2001

Abstract Background: the genomes of human Kaposi's sarcoma-associated herpesvirus (KSHV) display several levels of DNA sequence heterogeneity and subgrouping that show distinctive clustering patterns in related human populations. The four major subtype patterns for the hypervariable ORF-K1 protein correlate closely with the principal diasporas resulting from the migration of modern humans out of East Africa and suggest that KSHV is an ancient human virus that is transmitted primarily in a familial fashion with consequent very low recombination rates. However, chimeric genomes have also been detected, especially with regard to the presence of P versus M alleles of the ORF-K15 gene. Objective…

Genotype, Population, Molecular Sequence Data, Genome Viral, Biology, Genome, DNA sequencing, Middle East, Open Reading Frames, Africa Northern, Viral Envelope Proteins, Virology, Genotype, medicine, Humans, Amino Acid Sequence, Allele, education, Clade, Kaposi's sarcoma, Gene, Sarcoma Kaposi, Alleles, Phylogeny, Genetics, Recombination Genetic, education.field_of_study, Acquired Immunodeficiency Syndrome, Korea, Membrane Proteins, medicine.disease, Europe, Infectious Diseases, Herpesvirus 8 Human, North America, Sequence Alignment

researchProduct

Rat mammary-gland transferrin: nucleotide sequence, phylogenetic analysis and glycan structure

1995

The complete cDNA for rat mammary-gland transferrin (Tf) has been sequenced and also the native protein isolated from milk in order to analyse the structure of the main glycan variants present. A lactating-rat mammary-gland cDNA library in lambda gt10 was screened with a partial cDNA copy of rat liver Tf and subsequently rescreened with 5′ fragments of the longest clones. This produced a 2275 bp insert coding for an open reading frame of 695 amino acid residues. This includes a 19-amino acid signal sequence and the mature protein containing 676 amino acids and one N-glycosylation site in the C-terminal domain at residue 490. Phylogenetic analysis was carried out using 14 translated Tf nucle…

Glycan, DNA Complementary, Glycosylation, Molecular Sequence Data, Oligosaccharides, Sequence alignment, Animal Population Groups, Biochemistry, chemistry.chemical_compound, Mammary Glands Animal, Sugar Alcohols, Species Specificity, Polysaccharides, Complementary DNA, Neuraminic acid, Carbohydrate Conformation, Animals, Rats Wistar, Molecular Biology, Phylogeny, Base Sequence, biology, cDNA library, Transferrin, Nucleic acid sequence, Cell Biology, Milk Proteins, Molecular biology, N-Acetylneuraminic Acid, Rats, Sialic acid, Milk, Carbohydrate Sequence, Genes, chemistry, Biochemistry, Multigene Family, Sialic Acids, biology.protein, Female, Neuraminic Acids, Protein Processing Post-Translational, Sequence Alignment, N-Acetylneuraminic acid, Research Article, Biochemical Journal

researchProduct

Genetic rearrangements in the pathogenicity locus of Clostridium difficile strain 8864 – implications for transcription, expression and enzymatic act…

1998

The pathogenicity locus (PaLoc) of Clostridium difficile isolate 8864 was investigated to locate genetic rearrangements that would explain the exceptional pathogenicity of this particular isolate. Two major changes were defined: an insertion of 1.1 kb between the two genes tcdA and tcdE, coding for the enterotoxin and an accessory protein of unknown function, respectively, and a deletion of 5.9 kb encompassing the 3' ends of tcdA and tcdC. Transcription of the tcdA-E genes is severely affected by both rearrangements, explaining the demonstrated complete lack of TcdA polypeptide. We present a model of coordinate, growth-related transcription of the tcdA-E genes that confirms our previous fin…

Glycosylation, Glycoside Hydrolases, Transcription Genetic, Bacterial Toxins, Molecular Sequence Data, Locus (genetics), Chromosomal translocation, Enterotoxin, Biology, Homology (biology), law.invention, Bacterial Proteins, GTP-Binding Proteins, law, Transcription (biology), Genetics, Amino Acid Sequence, Molecular Biology, Gene, Genetics, Clostridioides difficile, Gene Expression Regulation Bacterial, Molecular biology, Recombinant Proteins, Antisense RNA, Genes Bacterial, Glucosyltransferases, Recombinant DNA, Sequence Alignment, Molecular and General Genetics MGG

researchProduct

Application of an innovative alignment optimisation method to a cross-cultural mean comparison of teacher self-efficacy: A cross-country study

2021

Teacher self-efficacy is a crucial personal characteristic that is important not only for teachers’ well-being but also for overall teaching and learning. However, the difficulty of establishing scalar invariance in the measurement of the construct has beset previous attempts at cross-cultural comparison. This study implements an alignment optimisation method to compare and rank the mean teacher self-efficacy of over 150,000 teachers across 48 countries and economies that participated in the Teaching and Learning International Survey (TALIS) conducted in 2018. The findings show that Colombia, Portugal, United Arab Emirates, Hungary, and South Africa have teachers with the highest mean s…

H1-99, Self-efficacy, Czech, Latent mean comparison, Science (General), Multidisciplinary, Cross country, Scalar invariance, Rank (computer programming), Teacher self-efficacy, International survey, VDP::Matematikk og Naturvitenskap: 400, language.human_language, Social sciences (General), Q1-390, Alignment optimisation method, TALIS 2018, Mathematics education, language, Cross-cultural, Construct (philosophy), Psychology, Research Article, Heliyon

researchProduct