Search results for " Sequence alignment"
showing 10 items of 32 documents
Comprehensive analysis of a Vibrio parahaemolyticus strain extracellular serine protease VpSP37
2015
Proteases play an important role in the field of tissue dissociation combined with regenerative medicine. During the years new sources of proteolytic enzymes have been studied including proteases from different marine organisms both eukaryotic and prokaryotic. Herein we have purified a secreted component of an isolate of Vibrio parahaemolyticus, with electrophoretic mobilities corresponding to 36 kDa, belonging to the serine proteases family. Sequencing of the N-terminus enabled the in silico identification of the whole primary structure consisting of 345 amino acid residues with a calculated molecular mass of 37.4 KDa. The purified enzyme, named VpSP37, contains a Serine protease domain be…
Mutational Characterization of the Bile Acid Receptor TGR5 in Primary Sclerosing Cholangitis
2010
Background: TGR5, the G protein-coupled bile acid receptor 1 (GPBAR1), has been linked to inflammatory pathways as well as bile homeostasis, and could therefore be involved in primary sclerosing cholangitis (PSC) a chronic inflammatory bile duct disease. We aimed to extensively investigate TGR5 sequence variation in PSC, as well as functionally characterize detected variants.Methodology/Principal Findings: Complete resequencing of TGR5 was performed in 267 PSC patients and 274 healthy controls. Six nonsynonymous mutations were identified in addition to 16 other novel single-nucleotide polymorphisms. To investigate the impact from the nonsynonymous variants on TGR5, we created a receptor mod…
Multiple Protein Sequence Alignment with MSAProbs
2013
Multiple sequence alignment (MSA) generally constitutes the foundation of many bioinformatics studies involving functional, structural, and evolutionary relationship analysis between sequences. As a result of the exponential computational complexity of the exact approach to producing optimal multiple alignments, the majority of state-of-the-art MSA algorithms are designed based on the progressive alignment heuristic. In this chapter, we outline MSAProbs, a parallelized MSA algorithm for protein sequences based on progressive alignment. To achieve high alignment accuracy, this algorithm employs a hybrid combination of a pair hidden Markov model and a partition function to calculate posterior…
REP2: A Web Server to Detect Common Tandem Repeats in Protein Sequences
2020
Ensembles of tandem repeats (TRs) in protein sequences expand rapidly to form domains well suited for interactions with proteins. For this reason, they are relatively frequent. Some TRs have known structures and therefore it is advantageous to predict their presence in a protein sequence. However, since most TRs diverge quickly, their detection by classical sequence comparison algorithms is not very accurate. Previously, we developed a method and a web server that used curated profiles and thresholds for the detection of 11 common TRs. Here we present a new web server (REP2) that allows the analysis of TRs in both individual and aligned sequences. We provide currently precomputed analyses f…
Algorithms for Graph and Network Analysis: Graph Alignment
2019
In this article we discuss the problem of graph alignment, which has been longly referred to for the purpose of analyzing and comparing biological networks. In particular, we describe different facets of graph alignment, according to the number of input networks, the fixed output objective, the possible heterogeneity of input data. Accordingly, we will discuss pairwise and multiple alignment, global and local alignment, etc. Moreover, we provide a comprehensive overview of the algorithms and techniques proposed in the literature to solve each of the specific considered types of graph alignment. In order to make the material presented here complete and useful to guide the reader in the use o…
Subunit sequences of the 4 x 6-mer hemocyanin from the golden orb-web spider, Nephila inaurata. Intramolecular evolution of the chelicerate hemocyani…
2003
The transport of oxygen in the hemolymph of many arthropod and mollusc species is mediated by large copper-proteins that are referred to as hemocyanins. Arthropod hemocyanins are composed of hexamers and oligomers of hexamers. Arachnid hemocyanins usually form 4 x 6-mers consisting of seven distinct subunit types (termed a-g), although in some spider taxa deviations from this standard scheme have been observed. Applying immunological and electrophoretic methods, six distinct hemocyanin subunits were identified in the red-legged golden orb-web spider Nephila inaurata madagascariensis (Araneae: Tetragnathidae). The complete cDNA sequences of six subunits were obtained that corresponded to a-,…
CARE: context-aware sequencing read error correction.
2020
Abstract Motivation Error correction is a fundamental pre-processing step in many Next-Generation Sequencing (NGS) pipelines, in particular for de novo genome assembly. However, existing error correction methods either suffer from high false-positive rates since they break reads into independent k-mers or do not scale efficiently to large amounts of sequencing reads and complex genomes. Results We present CARE—an alignment-based scalable error correction algorithm for Illumina data using the concept of minhashing. Minhashing allows for efficient similarity search within large sequencing read collections which enables fast computation of high-quality multiple alignments. Sequencing errors ar…
NMR structure of a non-conjugatable, ADP-ribosylation associated, ubiquitin-like domain from Tetrahymena thermophila polyubiquitin locus.
2019
Abstract Background Ubiquitin-like domains (UbLs), in addition to being post-translationally conjugated to the target through the E1-E2-E3 enzymatic cascade, can be translated as a part of the protein they ought to regulate. As integral UbLs coexist with the rest of the protein, their structural properties can differ from canonical ubiquitin, depending on the protein context and how they interact with it. In this work, we investigate T.th-ubl5, a UbL present in a polyubiquitin locus of Tetrahymena thermophila, which is integral to an ADP-ribosyl transferase protein. Only one other co-occurrence of these two domains within the same protein has been reported. Methods NMR, multiple sequence al…
Vairāku sekvenču izlīdzināšanas metožu salīdzinājums
2021
Šajā maģistra darbā paredzēts izpētīt, aprakstīt un salīdzināt dažādas praksē pieejamas vairāku sekvenču izlīdzināšanas metodes. Darbā tiek aprakstīti vairāku sekvenču izlīdzināšanas metožu galvenie pielietojumi bioinformātikā, biežāk sastopamie algoritmi, kuri tiek izmantoti darbā tālāk apskatītajās programmā. Īsi aprakstītas atvērtā koda programmas, kuras industrijā tiek izmantotas visbiežāk. Maģistra darba ietvaros veikts praktisks pētījums par šo metožu priekšrocībām un trūkumiem. Salīdzinājums veikts gan uz reāliem datu masīviem, gan simulētiem, lai spētu pēc iespējas daudzpusīgāk salīdzināt pieejamo programmatūru. Veikta iegūto rezultātu grafiska atspoguļošana un analīze par novērojam…
Type II keratin cDNAs from the rainbow trout: implications for keratin evolution.
2002
From a teleost fish, the rainbow trout Oncorhynchus mykiss, we have cloned and sequenced cDNAs encoding five different type II keratins. The corresponding protein spots, as separated by 2D-PAGE of trout cytoskeletal preparations, have been identified by peptide mass mapping using MALDI mass spectrometry. Three of the sequenced keratins are expressed in the epidermis (subtype IIe), and two in simple epithelia and mesenchymal cells (subtype IIs). The IIs keratins are both orthologs of human K8. This leaves unsequenced only the trace component S3 of the biochemically established trout keratin catalog. A phylogenetic tree has been constructed from a multiple alignment of the rod domains of the …