Search results for "CLUSTER"
showing 10 items of 3640 documents
Protein denaturation caused by heat inactivation detrimentally affects biomolecular corona formation and cellular uptake
2018
Adsorption of blood proteins to the surface of nanocarriers is known to be the critical factor influencing cellular interactions and eventually determining the successful application of nanocarriers as drug carriers in vivo. There is an increasing number of reports summarizing large data sets of all identified corona proteins. However, to date our knowledge about the multiple mechanisms mediating interactions between proteins and nanocarriers is still limited. In this study, we investigate the influence of protein structure on the adsorption process and focus on the effect of heat inactivation of serum and plasma, which is a common cell culture procedure used to inactivate the complement sy…
FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases.
2016
The accelerated growth of protein databases offers great possibilities for the study of protein function using sequence similarity and conservation. However, the huge number of sequences deposited in these databases requires new ways of analyzing and organizing the data. It is necessary to group the many very similar sequences, creating clusters with automated derived annotations useful to understand their function, evolution, and level of experimental evidence. We developed an algorithm called FastaHerder2, which can cluster any protein database, putting together very similar protein sequences based on near-full-length similarity and/or high threshold of sequence identity. We compressed 50…
Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.
2016
Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine str…
Identification of a large, fast-expanding HIV-1 subtype B transmission cluster among MSM in Valencia, Spain
2017
We describe and characterize an exceptionally large HIV-1 subtype B transmission cluster occurring in the Comunidad Valenciana (CV, Spain). A total of 1806 HIV-1 protease-reverse transcriptase (PR/RT) sequences from different patients were obtained in the CV between 2004 and 2014. After subtyping and generating a phylogenetic tree with additional HIV-1 subtype B sequences, a very large transmission cluster which included almost exclusively sequences from the CV was detected (n = 143 patients). This cluster was then validated and characterized with further maximum-likelihood phylogenetic analyses and Bayesian coalescent reconstructions. With these analyses, the CV cluster was delimited to 11…
Snapshots of a shrinking partner: Genome reduction inSerratia symbiotica
2016
AbstractGenome reduction is pervasive among maternally-inherited endosymbiotic organisms, from bacteriocyte- to gut-associated ones. This genome erosion is a step-wise process in which once free-living organisms evolve to become obligate associates, thereby losing non-essential or redundant genes/functions. Serratia symbiotica (Gammaproteobacteria), a secondary endosymbiont present in many aphids (Hemiptera: Aphididae), displays various characteristics that make it a good model organism for studying genome reduction. While some strains are of facultative nature, others have established co-obligate associations with their respective aphid host and its primary endosymbiont (Buchnera). Further…
Clustering of low-correlated spatial gene expression patterns in the mouse brain in the Allen Brain Atlas
2018
In this paper, clustering techniques are applied to spatial gene expression patterns with a low genomic correlation between the sagittal and coronal projections. The data analysed here are hosted on an available public DB named ABA (Allen Brain Atlas). The results are compared to those obtained by Bohland et al. on the complementary dataset (high correlation values). We prove that, by analysing a reduced dataset,hence reducing the computational burden, we get the same accuracy in highlighting different neuroanatomical region.
Regulatory effects of simvastatin and apoJ on APP processing and amyloid-beta clearance in blood-brain barrier endothelial cells
2017
Amyloid-β peptides (Aβ) accumulate in cerebral capillaries indicating a central role of the blood-brain barrier (BBB) in the pathogenesis of Alzheimer’s disease (AD). Although a relationship between apolipoprotein-, cholesterol- and Aβ metabolism is evident, the interconnecting mechanisms operating in brain capillary endothelial cells (BCEC) are poorly understood. ApoJ (clusterin) is present in HDL that regulates cholesterol metabolism which is disturbed in AD. ApoJ levels are increased in AD brains and in plasma of cerebral amyloid angiopathy (CAA) patients. ApoJ may bind, prevent fibrillization, and enhance clearance of Aβ. We here define a connection of apoJ and cellular cholesterol home…
parSRA: A framework for the parallel execution of short read aligners on compute clusters
2018
The growth of next generation sequencing datasets poses as a challenge to the alignment of reads to reference genomes in terms of both accuracy and speed. In this work we present parSRA, a parallel framework to accelerate the execution of existing short read aligners on distributed-memory systems. parSRA can be used to parallelize a variety of short read alignment tools installed in the system without any modification to their source code. We show that our framework provides good scalability on a compute cluster for accelerating the popular BWA-MEM and Bowtie2 aligners. On average, it is able to accelerate sequence alignments on 16 64-core nodes (in total, 1024 cores) with speedup of 10.48 …
CUDA-enabled hierarchical ward clustering of protein structures based on the nearest neighbour chain algorithm
2015
Clustering of molecular systems according to their three-dimensional structure is an important step in many bioinformatics workflows. In applications such as docking or structure prediction, many algorithms initially generate large numbers of candidate poses (or decoys), which are then clustered to allow for subsequent computationally expensive evaluations of reasonable representatives. Since the number of such candidates can easily range from thousands to millions, performing the clustering on standard central processing units (CPUs) is highly time consuming. In this paper, we analyse and evaluate different approaches to parallelize the nearest neighbour chain algorithm to perform hierarc…
Assessing statistical significance in multivariable genome wide association analysis
2016
Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whe…