Search results for "cluster analysis."
showing 10 items of 805 documents
Dissection of DLBCL microenvironment provides a gene expression-based predictor of survival applicable to formalin-fixed paraffin-embedded tissue
2018
Abstract Background Gene expression profiling (GEP) studies recognized a prognostic role for tumor microenvironment (TME) in diffuse large B-cell lymphoma (DLBCL), but the routinely adoption of prognostic stromal signatures remains limited. Patients and methods Here, we applied the computational method CIBERSORT to generate a 1028-gene matrix incorporating signatures of 17 immune and stromal cytotypes. Then, we carried out a deconvolution on publicly available GEP data of 482 untreated DLBCLs to reveal associations between clinical outcomes and proportions of putative tumor-infiltrating cell types. Forty-five genes related to peculiar prognostic cytotypes were selected and their expression …
Pharmacogenomics of Scopoletin in Tumor Cells
2016
Drug resistance and the severe side effects of chemotherapy necessitate the development of novel anticancer drugs. Natural products are a valuable source for drug development. Scopoletin is a coumarin compound, which can be found in several Artemisia species and other plant genera. Microarray-based RNA expression profiling of the NCI cell line panel showed that cellular response of scopoletin did not correlate to the expression of ATP-binding cassette (ABC) transporters as classical drug resistance mechanisms (ABCB1, ABCB5, ABCC1, ABCG2). This was also true for the expression of the oncogene EGFR and the mutational status of the tumor suppressor gene, TP53. However, mutations in the RAS onc…
MicroRNA as crucial regulators of gene expression in estradiol-treated human endothelial cells.
2018
Background/Aims: Estrogen signalling plays an important role in vascular biology as it modulates vasoactive and metabolic pathways in endothelial cells. Growing evidence has also established microRNA (miRNA) as key regulators of endothelial function. Nonetheless, the role of estrogen regulation on miRNA profile in endothelial cells is poorly understood. In this study, we aimed to determine how estrogen modulates miRNA profile in human endothelial cells and to explore the role of the different estrogen receptors (ERα, ERβ and GPER) in the regulation of miRNA expression by estrogen. Methods: We used miRNA microarrays to determine global miRNA expression in human umbilical vein endothelial cel…
FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases.
2016
The accelerated growth of protein databases offers great possibilities for the study of protein function using sequence similarity and conservation. However, the huge number of sequences deposited in these databases requires new ways of analyzing and organizing the data. It is necessary to group the many very similar sequences, creating clusters with automated derived annotations useful to understand their function, evolution, and level of experimental evidence. We developed an algorithm called FastaHerder2, which can cluster any protein database, putting together very similar protein sequences based on near-full-length similarity and/or high threshold of sequence identity. We compressed 50…
Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.
2016
Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine str…
Snapshots of a shrinking partner: Genome reduction inSerratia symbiotica
2016
AbstractGenome reduction is pervasive among maternally-inherited endosymbiotic organisms, from bacteriocyte- to gut-associated ones. This genome erosion is a step-wise process in which once free-living organisms evolve to become obligate associates, thereby losing non-essential or redundant genes/functions. Serratia symbiotica (Gammaproteobacteria), a secondary endosymbiont present in many aphids (Hemiptera: Aphididae), displays various characteristics that make it a good model organism for studying genome reduction. While some strains are of facultative nature, others have established co-obligate associations with their respective aphid host and its primary endosymbiont (Buchnera). Further…
Clustering of low-correlated spatial gene expression patterns in the mouse brain in the Allen Brain Atlas
2018
In this paper, clustering techniques are applied to spatial gene expression patterns with a low genomic correlation between the sagittal and coronal projections. The data analysed here are hosted on an available public DB named ABA (Allen Brain Atlas). The results are compared to those obtained by Bohland et al. on the complementary dataset (high correlation values). We prove that, by analysing a reduced dataset,hence reducing the computational burden, we get the same accuracy in highlighting different neuroanatomical region.
CUDA-enabled hierarchical ward clustering of protein structures based on the nearest neighbour chain algorithm
2015
Clustering of molecular systems according to their three-dimensional structure is an important step in many bioinformatics workflows. In applications such as docking or structure prediction, many algorithms initially generate large numbers of candidate poses (or decoys), which are then clustered to allow for subsequent computationally expensive evaluations of reasonable representatives. Since the number of such candidates can easily range from thousands to millions, performing the clustering on standard central processing units (CPUs) is highly time consuming. In this paper, we analyse and evaluate different approaches to parallelize the nearest neighbour chain algorithm to perform hierarc…
Assessing statistical significance in multivariable genome wide association analysis
2016
Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whe…
ParDRe: faster parallel duplicated reads removal tool for sequencing studies
2016
This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record [insert complete citation information here] is available online at: https://doi.org/10.1093/bioinformatics/btw038 [Abstract] Summary: Current next generation sequencing technologies often generate duplicated or near-duplicated reads that (depending on the application scenario) do not provide any interesting biological information but can increase memory requirements and computational time of downstream analysis. In this work we present ParDRe , a de novo parallel tool to remove duplicated and near-duplicated reads through the clustering of S…