Search results for "artificial intelligence"

showing 10 items of 6122 documents

Discovering discriminative graph patterns from gene expression data

2016

We consider the problem of mining gene expression data in order to single out interesting features characterizing healthy/unhealthy samples of an input dataset. We present an approach based on a network model of the input gene expression data, where there is a labelled graph for each sample. To the best of our knowledge, this is the first attempt to build a different graph for each sample and, then, to have a database of graphs for representing a sample set. Our main goal is that of singling out interesting differences between healthy and unhealthy samples, through the extraction of "discriminative patterns" among graphs belonging to the two different sample sets. Differently from the other…

0301 basic medicineSettore INF/01 - Informaticabusiness.industryComputer science0206 medical engineeringpattern discovery subgraph extraction biological networksPattern recognition02 engineering and technologyGraph03 medical and health sciencesComputingMethodologies_PATTERNRECOGNITION030104 developmental biologyDiscriminative modelGraph patternsArtificial intelligencebusiness020602 bioinformaticsBiological networkNetwork modelProceedings of the 31st Annual ACM Symposium on Applied Computing

researchProduct

Identification of novel compounds against three targets of SARS CoV-2 coronavirus by combined virtual screening and supervised machine learning.

2021

Coronavirus disease 2019 (COVID-19) is a major threat worldwide due to its fast spreading. As yet, there are no established drugs available. Speeding up drug discovery is urgently required. We applied a workflow of combined in silico methods (virtual drug screening, molecular docking and supervised machine learning algorithms) to identify novel drug candidates against COVID-19. We constructed chemical libraries consisting of FDA-approved drugs for drug repositioning and of natural compound datasets from literature mining and the ZINC database to select compounds interacting with SARS-CoV-2 target proteins (spike protein, nucleocapsid protein, and 2′-o-ribose methyltransferase). Supported by…

0301 basic medicineSimeprevirArtificial intelligencevirusesMERS Middle East Respiratory SyndromeHealth InformaticsBiologyMachine learningcomputer.software_genremedicine.disease_causeAntiviral AgentsArticleWHO World Health OrganizationAUC area under the curve03 medical and health sciences0302 clinical medicinessRNA single-stranded RNA virusmedicineChemotherapyHumansSARS severe acute respiratory syndromeCOVID-19 coronavirus disease 2019CoronavirusNatural productsVirtual screeningACE2 angiotensin converting enzyme 2Drug discoverybusiness.industrySARS-CoV-2COVID-19LBE lowest binding energyFDA Food and Drug AdministrationROC receiver operating characteristicComputer Science ApplicationsHIV human immunodeficiency virusMolecular Docking SimulationDrug repositioning030104 developmental biologyDrug developmentSevere acute respiratory syndrome-related coronavirusParitaprevirInfectious diseasesRespiratory virusArtificial intelligenceSupervised Machine Learningbusinesscomputer030217 neurology & neurosurgeryComputers in biology and medicine

researchProduct

Combining multiple hypothesis testing with machine learning increases the statistical power of genome-wide association studies

2016

Mieth, Bettina et al.

0301 basic medicineStatistical methodsComputer scienceGenome-wide association studyMachine learningcomputer.software_genreGenome-wide association studiesStatistical powerArticle[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]Set (abstract data type)03 medical and health sciences[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG][MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]10007 Department of EconomicsStatistical significanceReplication (statistics)genomeStatistical hypothesis testingGenetic association1000 MultidisciplinaryMultidisciplinarybusiness.industryComputational scienceInstitut für Mathematik330 EconomicsSupport vector machine030104 developmental biologyMultiple comparisons problemwide association studiesstatistical methodsArtificial intelligencebusinesscomputer

researchProduct

Partitioned learning of deep Boltzmann machines for SNP data.

2016

Abstract Motivation Learning the joint distributions of measurements, and in particular identification of an appropriate low-dimensional manifold, has been found to be a powerful ingredient of deep leaning approaches. Yet, such approaches have hardly been applied to single nucleotide polymorphism (SNP) data, probably due to the high number of features typically exceeding the number of studied individuals. Results After a brief overview of how deep Boltzmann machines (DBMs), a deep learning approach, can be adapted to SNP data in principle, we specifically present a way to alleviate the dimensionality problem by partitioned learning. We propose a sparse regression approach to coarsely screen…

0301 basic medicineStatistics and ProbabilityComputer scienceMachine learningcomputer.software_genre01 natural sciencesBiochemistryPolymorphism Single NucleotideMachine Learning010104 statistics & probability03 medical and health sciencessymbols.namesakeJoint probability distributionHumans0101 mathematicsMolecular BiologyStatistical hypothesis testingArtificial neural networkbusiness.industryGene Expression Regulation LeukemicDeep learningUnivariateComputational BiologyManifoldComputer Science ApplicationsData setComputational Mathematics030104 developmental biologyComputingMethodologies_PATTERNRECOGNITIONComputational Theory and MathematicsLeukemia MyeloidBoltzmann constantsymbolsData miningArtificial intelligencebusinesscomputerSoftwareCurse of dimensionalityBioinformatics (Oxford, England)

researchProduct

Gene-based and semantic structure of the Gene Ontology as a complex network

2012

The last decade has seen the advent and consolidation of ontology based tools for the identification and biological interpretation of classes of genes, such as the Gene Ontology. The information accumulated time-by-time and included in the GO is encoded in the definition of terms and in the setting up of semantic relations amongst terms. This approach might be usefully complemented by a bottom-up approach based on the knowledge of relationships amongst genes. To this end, we investigate the Gene Ontology from a complex network perspective. We consider the semantic network of terms naturally associated with the semantic relationships provided by the Gene Ontology consortium and a gene-based …

0301 basic medicineStatistics and ProbabilityFOS: Computer and information sciencesPhysics - Physics and SocietyComplex systemComputer scienceMolecular Networks (q-bio.MN)Complex systemFOS: Physical sciencesNetworkCondensed Matter PhysicPhysics and Society (physics.soc-ph)computer.software_genreQuantitative Biology - Quantitative MethodsStatistics - ApplicationsGeneSemantic network03 medical and health sciencesSemantic similarityQuantitative Biology - Molecular NetworksApplications (stat.AP)GeneQuantitative Methods (q-bio.QM)Community detectionGene ontologybusiness.industryOntologyOntology-based data integrationComplex networkCondensed Matter PhysicsBipartite system030104 developmental biologyBipartite system; Community detection; Complex systems; Genes; Networks; Ontology; Condensed Matter Physics; Statistics and ProbabilityFOS: Biological sciencesOntologyWeighted networkData miningArtificial intelligenceComputingMethodologies_GENERALbusinesscomputerNatural language processing

researchProduct

Model selection for factorial Gaussian graphical models with an application to dynamic regulatory networks.

2016

Abstract Factorial Gaussian graphical Models (fGGMs) have recently been proposed for inferring dynamic gene regulatory networks from genomic high-throughput data. In the search for true regulatory relationships amongst the vast space of possible networks, these models allow the imposition of certain restrictions on the dynamic nature of these relationships, such as Markov dependencies of low order – some entries of the precision matrix are a priori zeros – or equal dependency strengths across time lags – some entries of the precision matrix are assumed to be equal. The precision matrix is then estimated by l 1-penalized maximum likelihood, imposing a further constraint on the absolute value…

0301 basic medicineStatistics and ProbabilityFactorialDependency (UML)Computer scienceGaussianNormal Distributionpenalized inferencesparse networkscomputer.software_genreMachine learning01 natural sciencesNormal distribution010104 statistics & probability03 medical and health sciencessymbols.namesakeSparse networksGeneticsComputer SimulationGene Regulatory NetworksGraphical model0101 mathematicsgene-regulatory systemMolecular BiologyProbabilityMarkov chainModels GeneticPenalized inferencebusiness.industryModel selectiongraphical modelGene-regulatory systemsComputational Mathematics030104 developmental biologysymbolsA priori and a posterioriData miningArtificial intelligenceGraphical modelsSettore SECS-S/01 - StatisticabusinesscomputerNeisseriaAlgorithmsStatistical applications in genetics and molecular biology

researchProduct

MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems

2016

This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of recordJorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, https://doi.org/10.1093/bioinformatics/btw558is available online at: https://doi.org/10.1093/bioinformatics/btw558 [Abstracts] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…

0301 basic medicineStatistics and ProbabilitySource codeComputer sciencemedia_common.quotation_subject02 engineering and technologyParallel computingcomputer.software_genreBiochemistryExecution time03 medical and health sciences0202 electrical engineering electronic engineering information engineeringCluster (physics)Point (geometry)Amino Acid SequenceMolecular Biologymedia_commonSequenceMultiple sequence alignmentProtein multiple sequenceComputational BiologyProteinsMarkov ChainsComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsDistributed memory systemsMSAProbs020201 artificial intelligence & image processingMPIData miningSequence AlignmentcomputerAlgorithmsSoftware

researchProduct

Towards Self-explanatory Ontology Visualization with Contextual Verbalization

2016

Ontologies are one of the core foundations of the Semantic Web. To participate in Semantic Web projects, domain experts need to be able to understand the ontologies involved. Visual notations can provide an overview of the ontology and help users to understand the connections among entities. However, the users first need to learn the visual notation before they can interpret it correctly. Controlled natural language representation would be readable right away and might be preferred in case of complex axioms, however, the structure of the ontology would remain less apparent. We propose to combine ontology visualizations with contextual ontology verbalizations of selected ontology (diagram) e…

0301 basic medicineStructure (mathematical logic)Computer sciencebusiness.industry05 social sciences050301 educationRepresentation (arts)Ontology (information science)computer.software_genreNotationlanguage.human_languageDomain (software engineering)03 medical and health sciences030104 developmental biologyControlled natural languagelanguageArtificial intelligencebusiness0503 educationcomputerSemantic WebNatural language processingAxiom

researchProduct

Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs

2017

Detecting higher-order epistatic interactions in Genome-Wide Association Studies (GWAS) remains a challenging task in the fields of genetic epidemiology and computer science. A number of algorithms have recently been proposed for epistasis discovery. However, they suffer from a high computational cost since statistical measures have to be evaluated for each possible combination of markers. Hence, many algorithms use additional filtering stages discarding potentially non-interacting markers in order to reduce the overall number of combinations to be examined. Among others, Mutual Information Clustering (MIC) is a common pre-processing filter for grouping markers into partitions using K-Means…

0301 basic medicineTheoretical computer scienceComputer sciencebusiness.industryContrast (statistics)Genome-wide association study02 engineering and technologyMutual informationMachine learningcomputer.software_genreReduction (complexity)03 medical and health sciences030104 developmental biologyGenetic epidemiology0202 electrical engineering electronic engineering information engineeringEpistasis020201 artificial intelligence & image processingArtificial intelligenceCluster analysisbusinesscomputerGenetic association

researchProduct

Deep learning models for bacteria taxonomic classification of metagenomic data.

2018

Background An open challenge in translational bioinformatics is the analysis of sequenced metagenomes from various environmental samples. Of course, several studies demonstrated the 16S ribosomal RNA could be considered as a barcode for bacteria classification at the genus level, but till now it is hard to identify the correct composition of metagenomic data from RNA-seq short-read data. 16S short-read data are generated using two next generation sequencing technologies, i.e. whole genome shotgun (WGS) and amplicon (AMP); typically, the former is filtered to obtain short-reads belonging to a 16S shotgun (SG), whereas the latter take into account only some specific 16S hypervariable regions.…

0301 basic medicineTime FactorsDBNComputer scienceBiochemistryStructural BiologyRNA Ribosomal 16SDatabases Geneticlcsh:QH301-705.5Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazionibiologySettore INF/01 - InformaticaShotgun sequencingApplied MathematicsAmpliconClassificationComputer Science Applicationslcsh:R858-859.7DNA microarrayShotgunAlgorithmsCNN030106 microbiologyk-mer representationlcsh:Computer applications to medicine. Medical informaticsDNA sequencing03 medical and health sciencesMetagenomicDeep LearningMolecular BiologyBacteriaModels GeneticPhylumbusiness.industryDeep learningResearchReproducibility of ResultsPattern recognitionBiological classification16S ribosomal RNAbiology.organism_classificationAmpliconHypervariable region030104 developmental biologyTaxonlcsh:Biology (General)MetagenomicsMetagenomeArtificial intelligenceMetagenomicsNeural Networks ComputerbusinessClassifier (UML)BacteriaBMC bioinformatics

researchProduct