Search results for "Mining"

showing 10 items of 1730 documents

A new parallel pipeline for DNA methylation analysis of long reads datasets

2017

Background DNA methylation is an important mechanism of epigenetic regulation in development and disease. New generation sequencers allow genome-wide measurements of the methylation status by reading short stretches of the DNA sequence (Methyl-seq). Several software tools for methylation analysis have been proposed over recent years. However, the current trend is that the new sequencers and the ones expected for an upcoming future yield sequences of increasing length, making these software tools inefficient and obsolete. Results In this paper, we propose a new software based on a strategy for methylation analysis of Methyl-seq sequencing data that requires much shorter execution times while…

0301 basic medicineComputer scienceParallel pipelineADN02 engineering and technologycomputer.software_genreBiochemistrySensitivity and SpecificityDNA sequencingEpigenesis Genetic03 medical and health scienceschemistry.chemical_compoundStructural BiologyRNA analysisInformàticaDatabases Genetic0202 electrical engineering electronic engineering information engineeringHumansEpigeneticsMolecular Biology020203 distributed computingDNA methylationGenome HumanApplied MathematicsParallel pipelineMethylationSequence Analysis DNASupercomputerComputer Science ApplicationsGenòmica030104 developmental biologychemistryGene Expression RegulationDNA methylationMutationData miningHigh performance computingDNA microarraycomputerSequence AlignmentDNASoftware
researchProduct

Rocker: Open source, easy-to-use tool for AUC and enrichment calculations and ROC visualization

2016

Receiver operating characteristics (ROC) curve with the calculation of area under curve (AUC) is a useful tool to evaluate the performance of biomedical and chemoinformatics data. For example, in virtual drug screening ROC curves are very often used to visualize the efficiency of the used application to separate active ligands from inactive molecules. Unfortunately, most of the available tools for ROC analysis are implemented into commercially available software packages, or are plugins in statistical software, which are not always the easiest to use. Here, we present Rocker, a simple ROC curve visualization tool that can be used for the generation of publication quality images. Rocker also…

0301 basic medicineComputer scienceautomatic calculationLibrary and Information Sciencescomputer.software_genre01 natural sciences03 medical and health sciencesSoftwareArea under curvePlug-inPhysical and Theoretical ChemistryVirtual screeningReceiver operating characteristicbusiness.industryComputer Graphics and Computer-Aided Design0104 chemical sciencesComputer Science ApplicationsVisualizationreceiver operating characteristics010404 medicinal & biomolecular chemistryIdentification (information)ComputingMethodologies_PATTERNRECOGNITION030104 developmental biologyarea under curvesRockerCheminformaticsData miningbusinesscomputerSoftwaresoftwaresJournal of Cheminformatics
researchProduct

DNA Injury and Repair Systems

2018

n/a

0301 basic medicineDNA ReplicationDNA RepairMEDLINEDiseaseComputational biologyGenomeCatalysisInorganic Chemistrylcsh:Chemistry03 medical and health sciencesText miningMedicineAnimalsHumansDiseasePhysical and Theoretical ChemistryPhosphorylationMolecular BiologyDNA injurylcsh:QH301-705.5Spectroscopybusiness.industryGenome HumanOrganic ChemistryGeneral MedicineHuman geneticsComputer Science Applications030104 developmental biologyn/aEditoriallcsh:Biology (General)lcsh:QD1-999businessIntroductory Journal ArticleDNA DamageInternational Journal of Molecular Sciences
researchProduct

Reactome pathway analysis: a high-performance in-memory approach

2016

Reactome aims to provide bioinformatics tools for visualisation, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modelling, systems biology and education. Pathway analysis methods have a broad range of applications in physiological and biomedical research; one of the main problems, from the analysis methods performance point of view, is the constantly increasing size of the data samples. Here, we present a new high-performance in-memory implementation of the well-established over-representation analysis method. To achieve the target, the over-representation analysis method is divided in four different steps and, for each of them, specific data st…

0301 basic medicineData structuresDatabases FactualPathway analysisComputer scienceInterface (Java)Systems biologycomputer.software_genreGenomeBiochemistry03 medical and health sciences0302 clinical medicineStructural BiologyNucleic AcidsHumansMolecular BiologyApplied MathematicsComputational BiologyProteinsPathway analysisComputer Science ApplicationsTree (data structure)030104 developmental biology030220 oncology & carcinogenesisGraph (abstract data type)Data miningOver-representation analysiscomputerAlgorithmsSoftwareBMC Bioinformatics
researchProduct

Applications of Chemoinformatics in Predictive Toxicology for Regulatory Purposes, Especially in the Context of the EU REACH Legislation

2018

Chemoinformatics methodologies such as QSAR/QSPR have been used for decades in drug discovery projects, especially for the finding of new compounds with therapeutic properties and the optimization of ADME properties on chemical series. The application of computational techniques in predictive toxicology is much more recent, and they are experiencing an increasingly interest because of the new legal requirements imposed by national and international regulations. In the pharmaceutical field, the US Food and Drug Administration (FDA) support the use of predictive models for regulatory decision-making when assessing the genotoxic and carcinogenic potential of drug impurities. In Europe, the REA…

0301 basic medicineEngineeringbusiness.industryManagement scienceLegislationContext (language use)Predictive toxicology010501 environmental sciencescomputer.software_genre01 natural sciences03 medical and health sciences030104 developmental biologyCheminformaticsData miningbusinesscomputer0105 earth and related environmental sciencesInternational Journal of Quantitative Structure-Property Relationships
researchProduct

Q-nexus: a comprehensive and efficient analysis pipeline designed for ChIP-nexus

2016

Background: ChIP-nexus, an extension of the ChIP-exo protocol, can be used to map the borders of protein-bound DNA sequences at nucleotide resolution, requires less input DNA and enables selective PCR duplicate removal using random barcodes. However, the use of random barcodes requires additional preprocessing of the mapping data, which complicates the computational analysis. To date, only a very limited number of software packages are available for the analysis of ChIP-exo data, which have not yet been systematically tested and compared on ChIP-nexus data. Results: Here, we present a comprehensive software package for ChIP-nexus data that exploits the random barcodes for selective removal …

0301 basic medicineFOS: Computer and information sciencesDuplication ratesChromatin ImmunoprecipitationBioinformaticsPipeline (computing)610Biologycomputer.software_genre600 Technik Medizin angewandte Wissenschaften::610 Medizin und Gesundheit03 medical and health sciencesSoftwareChIP-nexusGeneticsPreprocessorNucleotide MotifsLibrary complexityChIP-exoGeneticsProtocol (science)Binding Sitesbusiness.industryfungiComputational BiologyHigh-Throughput Nucleotide SequencingReproducibility of ResultsChipChromatin immunoprecipitationData mappingDNA-Binding ProteinsAlgorithm030104 developmental biologyChIP-exoData miningbusinessPeak callingcomputerAlgorithmsSoftwareProtein BindingTranscription FactorsResearch ArticleBiotechnologyBMC Genomics
researchProduct

Feasibility of sample size calculation for RNA-seq studies

2017

Sample size calculation is a crucial step in study design but is not yet fully established for RNA sequencing (RNA-seq) analyses. To evaluate feasibility and provide guidance, we evaluated RNA-seq sample size tools identified from a systematic search. The focus was on whether real pilot data would be needed for reliable results and on identifying tools that would perform well in scenarios with different levels of biological heterogeneity and fold changes (FCs) between conditions. We used simulations based on real data for tool evaluation. In all settings, the six evaluated tools provided widely different answers, which were strongly affected by FC. Although all tools failed for small FCs, s…

0301 basic medicineFold (higher-order function)Sequence Analysis RNAComputer scienceHigh-Throughput Nucleotide SequencingRNA-Seqcomputer.software_genre03 medical and health sciences030104 developmental biology0302 clinical medicineResearch DesignSample size determinationSample SizeFeasibility StudiesHumansData miningMolecular BiologycomputerSoftware030217 neurology & neurosurgeryInformation SystemsSystematic searchBriefings in Bioinformatics
researchProduct

Common Hits Approach: Combining Pharmacophore Modeling and Molecular Dynamics Simulations.

2017

We present a new approach that incorporates flexibility based on extensive MD simulations of protein-ligand complexes into structure-based pharmacophore modeling and virtual screening. The approach uses the multiple coordinate sets saved during the MD simulations and generates for each frame a pharmacophore model. Pharmacophore models with the same pharmacophore features are pooled. In this way the high number of pharmacophore models that results from the MD simulation is reduced to only a few hundred representative pharmacophore models. Virtual screening runs are performed with every representative pharmacophore model; the screening results are combined and rescored to generate a single hi…

0301 basic medicineGeneral Chemical EngineeringDrug Evaluation PreclinicalLibrary and Information SciencesMolecular Dynamics Simulationcomputer.software_genreLigandsLigandScoutCommon Hits Approach (CHA)03 medical and health sciencesMolecular dynamicsUser-Computer InterfaceComputational chemistryPharmacophore ModelingFlexibility (engineering)Virtual screeningChemistryFrame (networking)ProteinsGeneral ChemistryInto-structureSettore CHIM/08 - Chimica FarmaceuticaComputer Science Applications030104 developmental biologyData miningPharmacophorecomputerJournal of chemical information and modeling
researchProduct

Mutations in theERCC2(XPD) gene associated with severe fetal ichthyosis and dysmorphic features

2016

0301 basic medicineGeneticsFetusbusiness.industryIchthyosisObstetrics and Gynecology030105 genetics & hereditymedicine.disease03 medical and health sciencesText miningMedicineERCC2businessGeneGenetics (clinical)Prenatal Diagnosis
researchProduct

CLOVE: classification of genomic fusions into structural variation events

2017

Background A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensitivity, precision, and positional accuracy. Furthermore, many existing tools only identify breakpoints, and so not collect related breakpoints and classify them as a particular type of SV. Due to the rapidly increasing usage of high throughput sequencing technologies in this area, there is an urgent need for algorithms that can accurately classify complex genomic rearrangements (involving more than …

0301 basic medicineGenomicsBiologycomputer.software_genrelcsh:Computer applications to medicine. Medical informaticsBiochemistryChromosomesDNA sequencingSet (abstract data type)Structural variationUser-Computer Interface03 medical and health sciencesStructural BiologyEscherichia coliHumansCopy-number variationMolecular Biologylcsh:QH301-705.5InternetMethodology ArticleApplied MathematicsBreakpointGenomic rearrangementsDNAGenomicsStructural variationsComputer Science ApplicationsIdentification (information)030104 developmental biologylcsh:Biology (General)Nucleic Acid ConformationGraph (abstract data type)lcsh:R858-859.7Data miningcomputerAlgorithmsBMC Bioinformatics
researchProduct