Search results for "ALIGNMENT"
showing 10 items of 627 documents
Molecular evolution of antioxidant and hypoxia response in long-lived, cancer-resistant blind mole rats: The Nrf2-Keap1 pathway.
2015
The Nrf2-Keap1 pathway is crucial for the cellular antioxidant and hypoxia response in vertebrates. Deciphering its modifications in hypoxia-adapted animals will help understand its functionality under environmental stress and possibly allow for knowledge transfer into biomedical research. The blind mole rat Spalax, a long-lived cancer-resistant rodent, lives in burrows underground and is adapted to severely hypoxic conditions. Here we have conducted a bioinformatical survey of Spalax core genes from the Nrf2-Keap1 pathway on the coding sequence level in comparison to other hypoxia-tolerant and -sensitive rodents. We find strong sequence conservation across all genes, illustrating the pathw…
dAPE: a web server to detect homorepeats and follow their evolution.
2016
Abstract Summary Homorepeats are low complexity regions consisting of repetitions of a single amino acid residue. There is no current consensus on the minimum number of residues needed to define a functional homorepeat, nor even if mismatches are allowed. Here we present dAPE, a web server that helps following the evolution of homorepeats based on orthology information, using a sensitive but tunable cutoff to help in the identification of emerging homorepeats. Availability and Implementation dAPE can be accessed from http://cbdm-01.zdv.uni-mainz.de/∼munoz/polyx. Supplementary information Supplementary data are available at Bioinformatics online.
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems
2016
This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of recordJorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, https://doi.org/10.1093/bioinformatics/btw558is available online at: https://doi.org/10.1093/bioinformatics/btw558 [Abstracts] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…
An effective extension of the applicability of alignment-free biological sequence comparison algorithms with Hadoop
2016
Alignment-free methods are one of the mainstays of biological sequence comparison, i.e., the assessment of how similar two biological sequences are to each other, a fundamental and routine task in computational biology and bioinformatics. They have gained popularity since, even on standard desktop machines, they are faster than methods based on alignments. However, with the advent of Next-Generation Sequencing Technologies, datasets whose size, i.e., number of sequences and their total length, is a challenge to the execution of alignment-free methods on those standard machines are quite common. Here, we propose the first paradigm for the computation of k-mer-based alignment-free methods for…
Identification of transcribed protein coding sequence remnants within lincRNAs
2018
Abstract Long intergenic non-coding RNAs (lincRNAs) are non-coding transcripts >200 nucleotides long that do not overlap protein-coding sequences. Importantly, such elements are known to be tissue-specifically expressed and to play a widespread role in gene regulation across thousands of genomic loci. However, very little is known of the mechanisms for the evolutionary biogenesis of these RNA elements, especially given their poor conservation across species. It has been proposed that lincRNAs might arise from pseudogenes. To test this systematically, we developed a novel method that searches for remnants of protein-coding sequences within lincRNA transcripts; the hypothesis is that we can t…
Evolutionary conserved mechanisms pervade structure and transcriptional modulation of allograft inflammatory factor-1 from sea anemone Anemonia virid…
2017
Gene family encoding allograft inflammatory factor-1 (AIF-1) is well conserved among organisms; however, there is limited knowledge in lower organisms. In this study, the first AIF-1 homologue from cnidarians was identified and characterised in the sea anemone Anemonia viridis. The full-length cDNA of AvAIF-1 was of 913 bp with a 5' -untranslated region (UTR) of 148 bp, a 3'-UTR of 315 and an open reading frame (ORF) of 450 bp encoding a polypeptide with149 amino acid residues and predicted molecular weight of about 17 kDa. The predicted protein possesses evolutionary conserved EF hand Ca2+ binding motifs, post-transcriptional modification sites and a 3D structure which can be superimposed …
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters
2016
Computing alignments between two or more sequences are common operations frequently performed in computational molecular biology. The continuing growth of biological sequence databases establishes the need for their efficient parallel implementation on modern accelerators. This paper presents new approaches to high performance biological sequence database scanning with the Smith-Waterman algorithm and the first stage of progressive multiple sequence alignment based on the ClustalW heuristic on a Xeon Phi-based compute cluster. Our approach uses a three-level parallelization scheme to take full advantage of the compute power available on this type of architecture; i.e. cluster-level data par…
A giant type I polyketide synthase participates in zygospore maturation in Chlamydomonas reinhardtii
2017
Polyketide synthases (PKSs) occur in many bacteria, fungi and plants. They are highly versatile enzymes involved in the biosynthesis of a large variety of compounds including antimicrobial agents, polymers associated with bacterial cell walls and plant pigments. While harmful algae are known to produce polyketide toxins, sequences of the genomes of non-toxic algae, including those of many green algal species, have surprisingly revealed the presence of genes encoding type I PKSs. The genome of the model alga Chlamydomonas reinhardtii (Chlorophyta) contains a single type I PKS gene, designated PKS1 (Cre10.g449750), which encodes a giant PKS with a predicted mass of 2.3 MDa. Here, we show that…
Avoided motifs: short amino acid strings missing from protein datasets.
2020
Abstract According to the amino acid composition of natural proteins, it could be expected that all possible sequences of three or four amino acids will occur at least once in large protein datasets purely by chance. However, in some species or cellular context, specific short amino acid motifs are missing due to unknown reasons. We describe these as Avoided Motifs, short amino acid combinations missing from biological sequences. Here we identify 209 human and 154 bacterial Avoided Motifs of length four amino acids, and discuss their possible functionality according to their presence in other species. Furthermore, we determine two Avoided Motifs of length three amino acids in human proteins…
Unexpected associated microalgal diversity in the lichen Ramalina farinacea is uncovered by pyrosequencing analyses
2017
The current literature reveals that the intrathalline coexistence of multiple microalgal taxa in lichens is more common than previously thought, and additional complexity is supported by the coexistence of bacteria and basidiomycete yeasts in lichen thalli. This replaces the old paradigm that lichen symbiosis occurs between a fungus and a single photobiont. The lichen Ramalina farinacea has proven to be a suitable model to study the multiplicity of microalgae in lichen thalli due to the constant coexistence of Trebouxia sp. TR9 and T. jamesii in long-distance populations. To date, studies involving phycobiont diversity within entire thalli are based on Sanger sequencing, but this method see…