Search results for "Data type"

showing 10 items of 1183 documents

Dry selection and wet evaluation for the rational discovery of new anthelmintics

2017

Helminth infections remain a major medical and public health problem. In this report, atom-based 2D bilinear indices, a TOMOCOMD-CARDD (QuBiLs-MAS module) molecular descriptor family, and linear discriminant analysis (LDA) were used to find models that differentiate between anthelmintic and non-anthelmintic compounds. Two classification models, obtained by using non-stochastic and stochastic 2D bilinear indices, correctly classified 86.64% and 84.66% of the compounds, respectively, in the training set. Equations 1 and 2 correctly classified 141 and 135 of the 165 compounds (85.45% and 81.82%, respectively) in the external validation set. Additional LDA models were built to identify the most likely mechanism of action of anthelmin…

0301 basic medicine; Biophysics; Non-stochastic and stochastic atom-based bilinear indices; Bilinear interpolation; LDA-based QSAR model; QuBiLs-MAS module; 01 natural sciences; Set (abstract data type); 03 medical and health sciences; Molecular descriptor; Statistics; Physical and Theoretical Chemistry; Molecular Biology; Selection (genetic algorithm); Mathematics; Free and open source software; Training set; TOMOCOMD-CARDD software; External validation; Anthelmintic activity; Atom (order theory); Computational screening; Condensed Matter Physics; Linear discriminant analysis; 0104 chemical sciences; Indazole; 010404 medicinal & biomolecular chemistry; 030104 developmental biology; Lead generation; Molecular Physics

researchProduct

EFMviz

2020

Elementary Flux Modes (EFMs) are a tool for constraint-based modeling and metabolic network analysis. However, systematic and automated visualization of EFMs, capable of integrating various data types, is still a challenge. In this study, we developed an extension for the widely adopted COBRA Toolbox, EFMviz, for analysis and graphical visualization of EFMs as networks of reactions, metabolites and genes. The analysis workflow offers a platform for EFM visualization to improve EFM interpretability by connecting the COBRA Toolbox with the network analysis and visualization software Cytoscape. The biological applicability of EFMviz is demonstrated in two use cases on medium (Escherichia coli, iAF1…

0301 basic medicine; Computer science; Endocrinology Diabetes and Metabolism; genome-scale metabolic models; lcsh:QR1-502; computer.software_genre; Biochemistry; Data type; lcsh:Microbiology; SBML; 03 medical and health sciences; 0302 clinical medicine; Data visualization; Graph drawing; Protocol; ACETATE; data visualization; CELL; SBML; CYTOSCAPE; Molecular Biology; GENE-EXPRESSION; Software visualization; business.industry; PATHWAY ANALYSIS; network visualization; elementary flux modes; Toolbox; Visualization; 030104 developmental biology; Workflow; DEFINITION; ESCHERICHIA-COLI; GROWTH; Data mining; business; computer; SET; 030217 neurology & neurosurgery; Metabolites

researchProduct

Reactome pathway analysis: a high-performance in-memory approach

2016

Reactome aims to provide bioinformatics tools for visualisation, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modelling, systems biology and education. Pathway analysis methods have a broad range of applications in physiological and biomedical research; one of the main problems, from the point of view of analysis-method performance, is the constantly increasing size of the data samples. Here, we present a new high-performance in-memory implementation of the well-established over-representation analysis method. To achieve this, the over-representation analysis method is divided into four steps and, for each of them, specific data st…

0301 basic medicine; Data structures; Databases Factual; Pathway analysis; Computer science; Interface (Java); Systems biology; computer.software_genre; Genome; Biochemistry; 03 medical and health sciences; 0302 clinical medicine; Structural Biology; Nucleic Acids; Humans; Molecular Biology; Applied Mathematics; Computational Biology; Proteins; Pathway analysis; Computer Science Applications; Tree (data structure); 030104 developmental biology; 030220 oncology & carcinogenesis; Graph (abstract data type); Data mining; Over-representation analysis; computer; Algorithms; Software; BMC Bioinformatics

researchProduct

Reactome graph database: Efficient access to complex pathway data

2018

Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high-quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because queries that traverse highly interconnected data perform poorly. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…

0301 basic medicine; Databases Factual; Computer science; Data management; Knowledge Bases; Social Sciences; Information Storage and Retrieval; NoSQL; computer.software_genre; Computer Applications; Database and Informatics Methods; User-Computer Interface; 0302 clinical medicine; Knowledge extraction; Psychology; Database Searching; lcsh:QH301-705.5; Data Management; Language; Biological data; Ecology; Systems Biology; Genomics; Genomic Databases; Computational Theory and Mathematics; Modeling and Simulation; Web-Based Applications; Graph (abstract data type); Information Technology; Research Article; Computer and Information Sciences; Relational database; Query language; Research and Analysis Methods; Ecosystems; 03 medical and health sciences; Cellular and Molecular Neuroscience; Databases; Genetics; Computer Graphics; Humans; Molecular Biology; Ecology Evolution Behavior and Systematics; Internet; Information retrieval; Graph database; business.industry; Ecology and Environmental Sciences; Cognitive Psychology; Biology and Life Sciences; Computational Biology; Genome Analysis; Relational Databases; 030104 developmental biology; Biological Databases; lcsh:Biology (General); Cognitive Science; business; computer; 030217 neurology & neurosurgery; Software; Neuroscience; PLoS Computational Biology

researchProduct

A deeper look into natural sciences with physics-based and data-driven measures

2021

Summary: With the development of machine learning in recent years, it is possible to glean much more information from an experimental data set to study matter. In this perspective, we discuss some state-of-the-art data-driven tools for analyzing latent effects in data and explain their applicability in natural science, focusing on two recently introduced, physics-motivated, computationally cheap tools: latent entropy and latent dimension. We exemplify their capabilities by applying them to several examples in the natural sciences and show that they reveal previously unobserved features such as, for example, a gradient in a magnetic measurement and a latent network of glymphatic channels from the mous…

0301 basic medicine; Dynamical systems theory; 02 engineering and technology; Machine learning; computer.software_genre; Data-driven; Set (abstract data type); 03 medical and health sciences; Artificial Intelligence; Entropy (information theory); Dimension (data warehouse); lcsh:Science; Applied Physics; Multidisciplinary; business.industry; Physics; Perspective (graphical); Magnetism; Experimental data; Physik (inkl. Astronomie); 021001 nanoscience & nanotechnology; 030104 developmental biology; Perspective; Computer Science; lcsh:Q; Relaxation (approximation); Artificial intelligence; 0210 nano-technology; business; computer; iScience

researchProduct

Graph Theoretical Framework of Brain Networks in Multiple Sclerosis: A Review of Concepts.

2019

Abstract: Network science provides powerful access to essential organizational principles of the human brain. It has been applied in combination with graph theory to characterize brain connectivity patterns. In multiple sclerosis (MS), analysis of the brain networks derived from either structural or functional imaging provides new insights into pathological processes within the gray and white matter. Beyond focal lesions and diffuse tissue damage, network connectivity patterns could be important for closely tracking and predicting the disease course. In this review, we describe concepts of graph theory, highlight novel issues of tissue reorganization in acute and chronic neuroinflammation an…

0301 basic medicine; Encephalomyelitis Autoimmune Experimental; Multiple Sclerosis; Models Neurological; Network science; 03 medical and health sciences; 0302 clinical medicine; Neural Pathways; medicine; Animals; Humans; General Neuroscience; Multiple sclerosis; Brain; Graph theory; Human brain; medicine.disease; Functional imaging; 030104 developmental biology; medicine.anatomical_structure; Graph (abstract data type); Disconnection; Psychology; Neuroscience; 030217 neurology & neurosurgery; Network analysis; Neuroscience

researchProduct

Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms

2018

Abstract. Motivation: Information-theoretic and compositional/linguistic analysis of genomes has a central role in bioinformatics, even more so since the associated methodologies are also becoming very valuable for epigenomic and meta-genomic studies. The kernel of those methods is based on the collection of k-mer statistics, i.e. how many times each k-mer in {A,C,G,T}^k occurs in a DNA sequence. Although this problem is computationally very simple and efficiently solvable on a conventional computer, the sheer amount of data now available in applications demands resorting to parallel and distributed computing. Indeed, algorithms of that type have been developed to collect k-mer statistics in…

0301 basic medicine; Epigenomics; genomic analysis; hadoop; distributed computing; Statistics and Probability; Computer science; Big data; Sequence assembly; Genome; Biochemistry; Domain (software engineering); Set (abstract data type); 03 medical and health sciences; distributed computing; Software; Computational Theory and Mathematic; Animals; Cluster Analysis; Humans; A-DNA; k-mer counting distributed computing hadoop map reduce; Molecular Biology; Epigenomics; Bacteria; business.industry; k-mer counting; Eukaryota; Linguistics; Computer Science Applications; 1707 Computer Vision and Pattern Recognition; Genomics; Sequence Analysis DNA; Computer Science Applications; Computational Mathematics; 030104 developmental biology; map reduce; Computational Theory and Mathematics; Distributed algorithm; genomic analysis; Kernel (statistics); Metagenome; hadoop; business; Algorithm; Algorithms; Software

researchProduct
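
The k-mer statistics mentioned in the abstract above (how many times each k-mer over {A,C,G,T} occurs in a DNA sequence) can be illustrated with a minimal single-machine sketch. This is not the Hadoop implementation the paper describes; the function name `count_kmers` and the choice to skip windows containing non-ACGT symbols are assumptions made for illustration.

```python
from collections import Counter


def count_kmers(sequence: str, k: int) -> Counter:
    """Count occurrences of every length-k substring over {A,C,G,T}.

    Hypothetical helper for illustration only: a plain sliding window,
    not the distributed algorithm from the paper.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")

    seq = sequence.upper()
    alphabet = set("ACGT")
    counts: Counter = Counter()

    # Slide a window of width k across the sequence (overlapping k-mers).
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Assumption: windows with ambiguous bases (e.g. N) are ignored.
        if set(kmer) <= alphabet:
            counts[kmer] += 1
    return counts


if __name__ == "__main__":
    for kmer, n in sorted(count_kmers("ACGTACGT", 2).items()):
        print(kmer, n)
```

On a single machine the table of counts fits in memory for small k; the abstract's point is that the volume of input data, not the counting logic, is what motivates a distributed approach.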

Lost Strings in Genomes: What Sense Do They Make?

2017

We studied the sets of avoided strings observed over a family of genomes. The length of the minimal avoided string rarely exceeds 9 nucleotides, regardless of the phylogeny of the genome under consideration. The lists of the avoided strings observed over sets of (related) genomes were analyzed. A very low correlation between the phylogeny and the set of those strings was found.

0301 basic medicine; Genetics; animal structures; genetic structures; information science; String (physics); Genome; Combinatorics; Set (abstract data type); 03 medical and health sciences; 030104 developmental biology; 0302 clinical medicine; Phylogenetics; cardiovascular system; Low correlation; 030217 neurology & neurosurgery; Selection (genetic algorithm); Mathematics

researchProduct
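
The notion of a minimal avoided string in the "Lost Strings in Genomes" abstract above can be sketched as follows, assuming it means a shortest string over {A,C,G,T} that never occurs in the sequence. The abstract does not give the authors' exact definition or algorithm, so the function name `minimal_avoided_strings`, the brute-force enumeration, and the `max_k` cap are all assumptions for illustration.

```python
from itertools import product


def minimal_avoided_strings(sequence: str, max_k: int = 10):
    """Return (k, absent) for the smallest k at which some k-mer is missing.

    Hypothetical brute-force sketch: for each length k, collect the k-mers
    that occur, then enumerate all 4**k candidates and keep the absent ones.
    Returns None if every string up to length max_k occurs.
    """
    seq = sequence.upper()
    for k in range(1, max_k + 1):
        # All substrings of length k that actually occur in the sequence.
        observed = {seq[i:i + k] for i in range(len(seq) - k + 1)}
        # Candidates are generated in lexicographic order over "ACGT".
        absent = [
            "".join(p)
            for p in product("ACGT", repeat=k)
            if "".join(p) not in observed
        ]
        if absent:
            return k, absent
    return None


if __name__ == "__main__":
    k, absent = minimal_avoided_strings("ACGT")
    print(k, len(absent))
```

Enumerating all 4**k candidates is only practical for small k, which is consistent with the abstract's observation that the minimal length rarely exceeds 9 nucleotides.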

CLOVE: classification of genomic fusions into structural variation events

2017

Background: A precise understanding of structural variants (SVs) in DNA is important in the study of cancer and population diversity. Many methods have been designed to identify SVs from DNA sequencing data. However, the problem remains challenging because existing approaches suffer from low sensitivity, precision, and positional accuracy. Furthermore, many existing tools only identify breakpoints, and so do not collect related breakpoints and classify them as a particular type of SV. Due to the rapidly increasing usage of high throughput sequencing technologies in this area, there is an urgent need for algorithms that can accurately classify complex genomic rearrangements (involving more than …

0301 basic medicine; Genomics; Biology; computer.software_genre; lcsh:Computer applications to medicine. Medical informatics; Biochemistry; Chromosomes; DNA sequencing; Set (abstract data type); Structural variation; User-Computer Interface; 03 medical and health sciences; Structural Biology; Escherichia coli; Humans; Copy-number variation; Molecular Biology; lcsh:QH301-705.5; Internet; Methodology Article; Applied Mathematics; Breakpoint; Genomic rearrangements; DNA; Genomics; Structural variations; Computer Science Applications; Identification (information); 030104 developmental biology; lcsh:Biology (General); Nucleic Acid Conformation; Graph (abstract data type); lcsh:R858-859.7; Data mining; computer; Algorithms; BMC Bioinformatics

researchProduct

The Metabolic Building Blocks of a Minimal Cell

2020

This article belongs to the Section Evolutionary Biology.

0301 basic medicine; Minimal gene set machinery; Metabolic network; Bacterial genome size; Computational biology; Metabolic networks; Biology; Genome; General Biochemistry Genetics and Molecular Biology; Article; 03 medical and health sciences; 0302 clinical medicine; minimal gene set machinery; lcsh:QH301-705.5; Nasuia deltocephalinicola; General Immunology and Microbiology; directed acyclic graphs; Directed acyclic graph; Directed acyclic graphs; 030104 developmental biology; lcsh:Biology (General); Essential gene; minimal cells; Minimal cells; Core (graph theory); metabolic networks; Graph (abstract data type); General Agricultural and Biological Sciences; 030217 neurology & neurosurgery; Biology

researchProduct