Search results for "DATA"

showing 10 items of 12992 documents

Model selection for factorial Gaussian graphical models with an application to dynamic regulatory networks.

2016

Abstract. Factorial Gaussian graphical models (fGGMs) have recently been proposed for inferring dynamic gene regulatory networks from genomic high-throughput data. In the search for true regulatory relationships amongst the vast space of possible networks, these models allow the imposition of certain restrictions on the dynamic nature of these relationships, such as Markov dependencies of low order (some entries of the precision matrix are a priori zeros) or equal dependency strengths across time lags (some entries of the precision matrix are assumed to be equal). The precision matrix is then estimated by ℓ1-penalized maximum likelihood, imposing a further constraint on the absolute value…

0301 basic medicine; Statistics and Probability; Factorial; Dependency (UML); Computer science; Gaussian; Normal Distribution; penalized inference; sparse networks; computer.software_genre; Machine learning; 01 natural sciences; Normal distribution; 010104 statistics & probability; 03 medical and health sciences; symbols.namesake; Sparse networks; Genetics; Computer Simulation; Gene Regulatory Networks; Graphical model; 0101 mathematics; gene-regulatory system; Molecular Biology; Probability; Markov chain; Models Genetic; Penalized inference; business.industry; Model selection; graphical model; Gene-regulatory systems; Computational Mathematics; 030104 developmental biology; symbols; A priori and a posteriori; Data mining; Artificial intelligence; Graphical models; Settore SECS-S/01 - Statistica; business; computer; Neisseria; Algorithms; Statistical applications in genetics and molecular biology
researchProduct

Reverse screening on indicaxanthin from Opuntia ficus-indica as natural chemoactive and chemopreventive agent

2018

Indicaxanthin is a bioactive and bioavailable betalain pigment extracted from Opuntia ficus-indica fruits. Indicaxanthin has pharmacokinetic properties rarely found in other phytochemicals, and it has been shown to provide a broad spectrum of pharmaceutical activity, exerting anti-proliferative, anti-inflammatory, and neuromodulatory effects. Discovering the physiological targets of indicaxanthin plays an important role in understanding its biochemical mechanism. In this study, combined reverse pharmacophore mapping, reverse docking, and text-based database search identified Inositol Trisphosphate 3-Kinase (ITP3K-A), Glutamate carboxypeptidase II (GCPII), Leukotriene-A4 hydr…

0301 basic medicine; Statistics and Probability; Molecular dynamic; Pyridines; Kainate receptor; Indicaxanthin; Phytochemical; 01 natural sciences; General Biochemistry Genetics and Molecular Biology; Docking; 03 medical and health sciences; chemistry.chemical_compound; Neoplasms; Glutamate carboxypeptidase II; Data Mining; Humans; Enzyme Inhibitors; MM-GBSA; Pharmacophore modeling; Binding Sites; General Immunology and Microbiology; Reverse screening; 010405 organic chemistry; Anti-cancer; Applied Mathematics; Phosphodiesterase; Opuntia; Phosphoserine phosphatase; Inositol trisphosphate; General Medicine; Antineoplastic Agents Phytogenic; 0104 chemical sciences; Betaxanthins; Neoplasm Proteins; Neuromodulator; Molecular Docking Simulation; Anti-inflammatory agent; 030104 developmental biology; chemistry; Biochemistry; Docking (molecular); Modeling and Simulation; Pharmacophore; General Agricultural and Biological Sciences; Indicaxanthin
researchProduct

LEGO-based generalized set of two linear algebraic 3D bio-macro-molecular descriptors: Theory and validation by QSARs

2019

Abstract. Novel 3D protein descriptors based on bilinear, quadratic and linear algebraic maps in ℝ^n are proposed. The latter employs the kth 2-tuple (dis)similarity matrix to codify information related to covalent and non-covalent interactions in these biopolymers. The calculation of the inter-amino acid distances is generalized by using several dissimilarity coefficients, where normalization procedures based on the simple stochastic and mutual probability schemes are applied. A new local-fragment approach based on amino acid-types and amino acid-groups is proposed to characterize regions of interest in proteins. Topological and geometric macromolecular cutoffs are defined using local and…

0301 basic medicine; Statistics and Probability; Normalization (statistics); Generalization; Quantitative Structure-Activity Relationship; General Biochemistry Genetics and Molecular Biology; 03 medical and health sciences; 0302 clinical medicine; Linear regression; Amino Acids; Mathematics; General Immunology and Microbiology; Applied Mathematics; Statistical parameter; Proteins; General Medicine; Collinearity; Structural Classification of Proteins database; Support vector machine; 030104 developmental biology; Modeling and Simulation; Test set; Linear Models; General Agricultural and Biological Sciences; Algorithm; Software; 030217 neurology & neurosurgery; Journal of Theoretical Biology
researchProduct

Reference genome assessment from a population scale perspective: an accurate profile of variability and noise.

2017

Abstract. Motivation: Current plant and animal genomic studies are often based on newly assembled genomes that have not been properly consolidated. In this scenario, misassembled regions can easily lead to false-positive findings. Although quality control scores are included within genotyping protocols, they are usually employed to evaluate individual sample quality rather than reference sequence reliability. We propose a statistical model that combines quality control scores across samples in order to detect incongruent patterns at every genomic region. Our model is inherently robust since common artifact signals are expected to be shared between independent samples over misassembled regions …

0301 basic medicine; Statistics and Probability; Quality Control; Genotype; Computer science; media_common.quotation_subject; Population; Genomics; Bioinformatics; computer.software_genre; Biochemistry; Genome; 03 medical and health sciences; Genetic variation; Animals; Humans; Quality (business); Allele; education; Molecular Biology; Genotyping; Reliability (statistics); media_common; Protocol (science); education.field_of_study; Genome; Models Statistical; Genetic Variation; Reproducibility of Results; Genomics; Genome Analysis; Original Papers; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Data mining; computer; Software; Reference genome
researchProduct

dAPE: a web server to detect homorepeats and follow their evolution.

2016

Abstract. Summary: Homorepeats are low-complexity regions consisting of repetitions of a single amino acid residue. There is no current consensus on the minimum number of residues needed to define a functional homorepeat, nor even on whether mismatches are allowed. Here we present dAPE, a web server that helps follow the evolution of homorepeats based on orthology information, using a sensitive but tunable cutoff to help identify emerging homorepeats. Availability and Implementation: dAPE can be accessed from http://cbdm-01.zdv.uni-mainz.de/~munoz/polyx. Supplementary information: Supplementary data are available at Bioinformatics online.

0301 basic medicine; Statistics and Probability; Repetitive Sequences Amino Acid; Web server; Internet; Computer science; computer.software_genre; Biochemistry; Applications Notes; Computer Science Applications; World Wide Web; Evolution Molecular; 03 medical and health sciences; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Animals; Humans; Data mining; Molecular Biology; computer; Sequence Alignment; Sequence Analysis; Software; Bioinformatics (Oxford, England)
researchProduct
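
The homorepeat definition quoted in the dAPE summary above (a run of a single amino acid residue, with a tunable minimum length) can be illustrated with a minimal sketch. This is not dAPE's implementation: the function name `find_homorepeats`, the default cutoff of 4 and the example sequence are all assumptions made for illustration, and mismatches are not allowed.

```python
from itertools import groupby


def find_homorepeats(sequence, min_length=4):
    """Return (residue, start, length) for every run of one residue
    that is at least min_length long. Exact runs only, no mismatches.
    min_length is a hypothetical tunable cutoff, not a dAPE default."""
    repeats = []
    position = 0
    for residue, run in groupby(sequence):
        length = len(list(run))
        if length >= min_length:
            repeats.append((residue, position, length))
        position += length
    return repeats


# Made-up protein fragment: six Q, three A and five S in a row.
example = "MKQQQQQQAAAGSSSSSL"
print(find_homorepeats(example, min_length=4))
# -> [('Q', 2, 6), ('S', 12, 5)]
print(find_homorepeats(example, min_length=3))
# -> [('Q', 2, 6), ('A', 8, 3), ('S', 12, 5)]
```

Lowering `min_length` makes the scan more sensitive, so shorter runs such as the three-residue alanine stretch start to be reported.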

MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems

2016

This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record, Jorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, https://doi.org/10.1093/bioinformatics/btw558, is available online at: https://doi.org/10.1093/bioinformatics/btw558 [Abstract] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…

0301 basic medicine; Statistics and Probability; Source code; Computer science; media_common.quotation_subject; 02 engineering and technology; Parallel computing; computer.software_genre; Biochemistry; Execution time; 03 medical and health sciences; 0202 electrical engineering electronic engineering information engineering; Cluster (physics); Point (geometry); Amino Acid Sequence; Molecular Biology; media_common; Sequence; Multiple sequence alignment; Protein multiple sequence; Computational Biology; Proteins; Markov Chains; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Distributed memory systems; MSAProbs; 020201 artificial intelligence & image processing; MPI; Data mining; Sequence Alignment; computer; Algorithms; Software
researchProduct

REGGAE: a novel approach for the identification of key transcriptional regulators

2019

Abstract. Motivation: Transcriptional regulators play a major role in most biological processes. Alterations in their activities are associated with a variety of diseases and in particular with tumor development and progression. Hence, it is important to assess the effects of deregulated regulators on pathological processes. Results: Here, we present REGulator-Gene Association Enrichment (REGGAE), a novel method for the identification of key transcriptional regulators that have a significant effect on the expression of a given set of genes, e.g. genes that are differentially expressed between two sample groups. REGGAE uses a Kolmogorov–Smirnov-like test statistic that implicitly combines assoc…

0301 basic medicine; Statistics and Probability; Transcription Genetic; 610; Computational biology; Biology; Biochemistry; 03 medical and health sciences; Neoplasms; Humans; Two sample; Molecular Biology; Gene; Probability; Supplementary data; Regulation of gene expression; Systems Biology; 500; Original Papers; Computer Science Applications; 004; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Gene Expression Regulation; Key (cryptography); Identification (biology); Female; Software
researchProduct

Screening of potent phytochemical inhibitors against SARS-CoV-2 protease and its two Asian mutants

2021

Abstract. Background: COVID-19, declared a pandemic in March 2020 by the World Health Organization, is caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). The virus has already killed more than 2.3 million people worldwide. Objective: The principal intent of this work was to identify lead compounds by screening a natural product library (NPASS) for possible treatment of COVID-19. Methods: Pharmacophore features were used to screen a large database to obtain a small dataset for structure-based virtual screening of natural product compounds. In the structure-based screening, molecular docking was performed to find a potent inhibitor molecule against the main protease (Mpro) of SARS-…

0301 basic medicine; Stereochemistry; medicine.medical_treatment; Phytochemicals; Protein Data Bank (RCSB PDB); Health Informatics; medicine.disease_cause; Molecular Docking Simulation; Antiviral Agents; Article; Docking; 03 medical and health sciences; chemistry.chemical_compound; 0302 clinical medicine; medicine; Humans; Protease Inhibitors; Coronavirus; Virtual screening; Natural products; Protease; Chemistry; SARS-CoV-2; COVID-19; Computer Science Applications; Protease; Coronavirus; Molecular Docking Simulation; 030104 developmental biology; Docking (molecular); Pharmacophore; Lead compound; 030217 neurology & neurosurgery; Mpro; Peptide Hydrolases; Computers in Biology and Medicine
researchProduct

Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data

2016

Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive construction approach that builds the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After obtaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has b…

0301 basic medicine; Theoretical computer science; Burrows–Wheeler transform; Computer science; Genomics; Data_CODINGANDINFORMATIONTHEORY; Parallel computing; Genome; law.invention; 03 medical and health sciences; law; Genetics; Humans; Ensembl; Multi-core processor; Applied Mathematics; Linear space; Suffix array; Chromosome Mapping; High-Throughput Nucleotide Sequencing; Genomics; Sequence Analysis DNA; 030104 developmental biology; Algorithms; Biotechnology; Reference genome; IEEE/ACM Transactions on Computational Biology and Bioinformatics
researchProduct
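
For readers unfamiliar with the two structures named in the ParaBWT summary above, the following is a naive textbook sketch of a suffix array and the Burrows-Wheeler transform derived from it. It is not the ParaBWT algorithm: it is neither parallel nor space-efficient, and it builds the suffix array first and the BWT second, whereas the summary describes the opposite order. The function names and the `$` sentinel convention are assumptions made for illustration.

```python
def suffix_array(text):
    """Start positions of all suffixes of text, in lexicographic order.
    Naive construction by sorting suffix slices; fine for short strings only."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def bwt_from_suffix_array(text, sa):
    """BWT: for each sorted suffix, take the character that precedes it.
    For the suffix starting at 0, text[-1] wraps around to the sentinel."""
    return "".join(text[i - 1] for i in sa)


# Assumes text ends with a unique sentinel '$' that sorts before every
# other character (true for '$' versus letters in ASCII).
text = "banana$"
sa = suffix_array(text)
print(sa)                               # -> [6, 5, 3, 1, 0, 4, 2]
print(bwt_from_suffix_array(text, sa))  # -> annb$aa
```

Sorting suffix slices copies up to the whole string for each comparison key, which is exactly the kind of memory cost that dedicated construction algorithms for genome-scale inputs are designed to avoid.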

Identification of control targets in Boolean molecular network models via computational algebra

2015

Motivation: Many problems in biomedicine and other areas of the life sciences can be characterized as control problems, with the goal of finding strategies to change a disease or otherwise undesirable state of a biological system into another, more desirable, state through an intervention, such as a drug or other therapeutic treatment. The identification of such strategies is typically based on a mathematical model of the process to be altered through targeted control inputs. This paper focuses on processes at the molecular level that determine the state of an individual cell, involving signaling or gene regulation. The mathematical model type considered is that of Boolean networks. The pot…

0301 basic medicine; Theoretical computer science; Computer science; Process (engineering); Molecular Networks (q-bio.MN); Systems biology; System of polynomial equations; ENCODE; Boolean networks; Set (abstract data type); 03 medical and health sciences; 0302 clinical medicine; Structural Biology; Modelling and Simulation; Quantitative Biology - Molecular Networks; Molecular Biology; Edge deletions; Applied Mathematics; Computer Science Applications; Network control; Identification (information); 030104 developmental biology; Boolean network; Blocking transitions; FOS: Biological sciences; Modeling and Simulation; Algebraic control; State (computer science); 030217 neurology & neurosurgery; Research Article; BMC Systems Biology
researchProduct