Search results for "Mining"

showing 10 items of 1730 documents

ParDRe: faster parallel duplicated reads removal tool for sequencing studies

2016

This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record [insert complete citation information here] is available online at: https://doi.org/10.1093/bioinformatics/btw038 [Abstract] Summary: Current next generation sequencing technologies often generate duplicated or near-duplicated reads that (depending on the application scenario) do not provide any interesting biological information but can increase memory requirements and computational time of downstream analysis. In this work we present ParDRe , a de novo parallel tool to remove duplicated and near-duplicated reads through the clustering of S…

0301 basic medicineStatistics and ProbabilityFASTQ formatDNA stringsSource codeDownstream (software development)Computer sciencemedia_common.quotation_subjectParallel computingcomputer.software_genreBiochemistryDNA sequencing03 medical and health scienceschemistry.chemical_compound0302 clinical medicineHybrid MPI/multithreadingCluster AnalysisParDReMolecular BiologyGenemedia_commonHigh-Throughput Nucleotide SequencingSequence Analysis DNAParallel toolComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicschemistryData miningcomputerAlgorithms030217 neurology & neurosurgeryDNABioinformatics
researchProduct

Gene-based and semantic structure of the Gene Ontology as a complex network

2012

The last decade has seen the advent and consolidation of ontology based tools for the identification and biological interpretation of classes of genes, such as the Gene Ontology. The information accumulated time-by-time and included in the GO is encoded in the definition of terms and in the setting up of semantic relations amongst terms. This approach might be usefully complemented by a bottom-up approach based on the knowledge of relationships amongst genes. To this end, we investigate the Gene Ontology from a complex network perspective. We consider the semantic network of terms naturally associated with the semantic relationships provided by the Gene Ontology consortium and a gene-based …

0301 basic medicineStatistics and ProbabilityFOS: Computer and information sciencesPhysics - Physics and SocietyComplex systemComputer scienceMolecular Networks (q-bio.MN)Complex systemFOS: Physical sciencesNetworkCondensed Matter PhysicPhysics and Society (physics.soc-ph)computer.software_genreQuantitative Biology - Quantitative MethodsStatistics - ApplicationsGeneSemantic network03 medical and health sciencesSemantic similarityQuantitative Biology - Molecular NetworksApplications (stat.AP)GeneQuantitative Methods (q-bio.QM)Community detectionGene ontologybusiness.industryOntologyOntology-based data integrationComplex networkCondensed Matter PhysicsBipartite system030104 developmental biologyBipartite system; Community detection; Complex systems; Genes; Networks; Ontology; Condensed Matter Physics; Statistics and ProbabilityFOS: Biological sciencesOntologyWeighted networkData miningArtificial intelligenceComputingMethodologies_GENERALbusinesscomputerNatural language processing
researchProduct

Model selection for factorial Gaussian graphical models with an application to dynamic regulatory networks.

2016

Abstract Factorial Gaussian graphical Models (fGGMs) have recently been proposed for inferring dynamic gene regulatory networks from genomic high-throughput data. In the search for true regulatory relationships amongst the vast space of possible networks, these models allow the imposition of certain restrictions on the dynamic nature of these relationships, such as Markov dependencies of low order – some entries of the precision matrix are a priori zeros – or equal dependency strengths across time lags – some entries of the precision matrix are assumed to be equal. The precision matrix is then estimated by l 1-penalized maximum likelihood, imposing a further constraint on the absolute value…

0301 basic medicineStatistics and ProbabilityFactorialDependency (UML)Computer scienceGaussianNormal Distributionpenalized inferencesparse networkscomputer.software_genreMachine learning01 natural sciencesNormal distribution010104 statistics & probability03 medical and health sciencessymbols.namesakeSparse networksGeneticsComputer SimulationGene Regulatory NetworksGraphical model0101 mathematicsgene-regulatory systemMolecular BiologyProbabilityMarkov chainModels GeneticPenalized inferencebusiness.industryModel selectiongraphical modelGene-regulatory systemsComputational Mathematics030104 developmental biologysymbolsA priori and a posterioriData miningArtificial intelligenceGraphical modelsSettore SECS-S/01 - StatisticabusinesscomputerNeisseriaAlgorithmsStatistical applications in genetics and molecular biology
researchProduct

Reverse screening on indicaxanthin from Opuntia ficus-indica as natural chemoactive and chemopreventive agent

2018

Indicaxanthin is a bioactive and bioavailable betalain pigment extracted from Opuntia ficus indica fruits. Indicaxanthin has pharmacokinetic proprieties, rarely found in other phytochemicals, and it has been demonstrated that it provides a broad-spectrum of pharmaceutical activity, exerting anti-proliferative, anti-inflammatory, and neuromodulator effects. The discovery of the Indicaxanthin physiological targets plays an important role in understanding the biochemical mechanism. In this study, combined reverse pharmacophore mapping, reverse docking, and text-based database search identified Inositol Trisphosphate 3-Kinase (ITP3K-A), Glutamate carboxypeptidase II (GCPII), Leukotriene-A4 hydr…

0301 basic medicineStatistics and ProbabilityMolecular dynamicPyridinesKainate receptorIndicaxanthinPhytochemical01 natural sciencesGeneral Biochemistry Genetics and Molecular BiologyDocking03 medical and health scienceschemistry.chemical_compoundNeoplasmsGlutamate carboxypeptidase IIData MiningHumansEnzyme InhibitorsMM-GBSAPharmacophore modelingBinding SitesGeneral Immunology and MicrobiologyReverse screening010405 organic chemistryAnti-cancerApplied MathematicsPhosphodiesteraseOpuntiaPhosphoserine phosphataseInositol trisphosphateGeneral MedicineAntineoplastic Agents Phytogenic0104 chemical sciencesBetaxanthinsNeoplasm ProteinsNeuromodulatorMolecular Docking SimulationAnti-inflammatory agent030104 developmental biologychemistryBiochemistryDocking (molecular)Modeling and SimulationPharmacophoreGeneral Agricultural and Biological SciencesIndicaxanthin
researchProduct

Reference genome assessment from a population scale perspective: an accurate profile of variability and noise.

2017

Abstract Motivation Current plant and animal genomic studies are often based on newly assembled genomes that have not been properly consolidated. In this scenario, misassembled regions can easily lead to false-positive findings. Despite quality control scores are included within genotyping protocols, they are usually employed to evaluate individual sample quality rather than reference sequence reliability. We propose a statistical model that combines quality control scores across samples in order to detect incongruent patterns at every genomic region. Our model is inherently robust since common artifact signals are expected to be shared between independent samples over misassembled regions …

0301 basic medicineStatistics and ProbabilityQuality ControlGenotypeComputer sciencemedia_common.quotation_subjectPopulationGenomicsBioinformaticscomputer.software_genreBiochemistryGenome03 medical and health sciencesGenetic variationAnimalsHumansQuality (business)AlleleeducationMolecular BiologyGenotypingReliability (statistics)media_commonProtocol (science)education.field_of_studyGenomeModels StatisticalGenetic VariationReproducibility of ResultsGenomicsGenome AnalysisOriginal PapersComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsData miningcomputerSoftwareReference genome
researchProduct

dAPE: a web server to detect homorepeats and follow their evolution.

2016

Abstract Summary Homorepeats are low complexity regions consisting of repetitions of a single amino acid residue. There is no current consensus on the minimum number of residues needed to define a functional homorepeat, nor even if mismatches are allowed. Here we present dAPE, a web server that helps following the evolution of homorepeats based on orthology information, using a sensitive but tunable cutoff to help in the identification of emerging homorepeats. Availability and Implementation dAPE can be accessed from http://cbdm-01.zdv.uni-mainz.de/∼munoz/polyx. Supplementary information Supplementary data are available at Bioinformatics online.

0301 basic medicineStatistics and ProbabilityRepetitive Sequences Amino AcidWeb serverInternetComputer sciencecomputer.software_genreBiochemistryApplications NotesComputer Science ApplicationsWorld Wide WebEvolution Molecular03 medical and health sciencesComputational Mathematics030104 developmental biologyComputational Theory and MathematicsAnimalsHumansData miningMolecular BiologycomputerSequence AlignmentSequence AnalysisSoftwareBioinformatics (Oxford, England)
researchProduct

MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems

2016

This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of recordJorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, https://doi.org/10.1093/bioinformatics/btw558is available online at: https://doi.org/10.1093/bioinformatics/btw558 [Abstracts] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…

0301 basic medicineStatistics and ProbabilitySource codeComputer sciencemedia_common.quotation_subject02 engineering and technologyParallel computingcomputer.software_genreBiochemistryExecution time03 medical and health sciences0202 electrical engineering electronic engineering information engineeringCluster (physics)Point (geometry)Amino Acid SequenceMolecular Biologymedia_commonSequenceMultiple sequence alignmentProtein multiple sequenceComputational BiologyProteinsMarkov ChainsComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsDistributed memory systemsMSAProbs020201 artificial intelligence & image processingMPIData miningSequence AlignmentcomputerAlgorithmsSoftware
researchProduct

Do next-generation sequencing results drive diagnostic and therapeutic decisions in MDS?

2019

Este artículo se encuentra disponible en la siguiente URL: https://ashpublications.org/bloodadvances/article/3/21/3454/422749/Do-next-generation-sequencing-results-drive

0301 basic medicineSíndromes mielodisplásicos - Aspectos moleculares.Clinical Decision-MakingMEDLINEComputational biologyDNA sequencing03 medical and health sciences0302 clinical medicineText miningHumansMedicineGenetic Predisposition to DiseaseSangre - Células - Aspectos moleculares.Molecular Targeted TherapyGenes.Genetic Association StudiesBlood cells - Molecular aspects.business.industryDecision TreesDisease ManagementHigh-Throughput Nucleotide SequencingGenomicsHematologyPrognosisCombined Modality TherapyMyelodysplastic syndrome - Molecular aspects.030104 developmental biologyMyelodysplastic Syndromes030220 oncology & carcinogenesisMutationPoint-CounterpointMolecular biology.Biología molecular.businessBiomarkersBlood Advances
researchProduct

Ultrasound for Hepatocellular Carcinoma Surveillance: Still Looking for the Fortune Teller.

2018

0301 basic medicineTransplantationmedicine.medical_specialtyCarcinoma HepatocellularHepatologybusiness.industrymedicine.medical_treatmentUltrasoundLiver NeoplasmsLiver transplantationmedicine.diseaseLiver Transplantation03 medical and health sciences030104 developmental biology0302 clinical medicineText miningHepatocellular carcinomamedicineHumans030211 gastroenterology & hepatologySurgeryRadiologyalpha-FetoproteinsbusinessUltrasonographyLiver transplantation : official publication of the American Association for the Study of Liver Diseases and the International Liver Transplantation Society
researchProduct

In response: Neuronal networks in epileptic encephalopathies with CSWS

2017

0301 basic medicinebusiness.industryElectroencephalographyBrain Waves03 medical and health sciences030104 developmental biology0302 clinical medicineText miningNeurologyMedicineEpilepsy GeneralizedNeurology (clinical)businessNeuroscience030217 neurology & neurosurgeryEpilepsia
researchProduct