Search results for "software"

showing 10 items of 7396 documents

miRToolsGallery: a tag-based and rankable microRNA bioinformatics resources database portal

2017

Abstract Hundreds of bioinformatics tools have been developed for MicroRNA (miRNA) investigations including those used for identification, target prediction, structure and expression profile analysis. However, finding the correct tool for a specific application requires the tedious and laborious process of locating, downloading, testing and validating the appropriate tool from a group of nearly a thousand. In order to facilitate this process, we developed a novel database portal named miRToolsGallery. We constructed the portal by manually curating > 950 miRNA analysis tools and resources. In the portal, a query to locate the appropriate tool is expedited by being searchable, filterable and …

0301 basic medicineComputer scienceProcess (engineering)media_common.quotation_subjectmiRToolsGallerycomputer.software_genreBioinformaticsGeneral Biochemistry Genetics and Molecular Biology03 medical and health sciencesUpload0302 clinical medicinetyövälineetFunction (engineering)Data Curationmedia_commonStructure (mathematical logic)DatabaseData curationSequence Analysis RNAbioinformatiikkabioinformaticsMicroRNAsIdentification (information)Database Tool030104 developmental biologyRankingFeature (computer vision)toolsta1181Databases Nucleic AcidGeneral Agricultural and Biological SciencescomputerAlgorithms030217 neurology & neurosurgeryInformation SystemsDatabase
researchProduct

Revealing community structures by ensemble clustering using group diffusion

2018

We propose an ensemble clustering approach using group diffusion to reveal community structures in data. We represent data points as a directed graph and assume each data point belong to single cluster membership instead of multiple memberships. The method is based on the concept of ensemble group diffusion with a parameter to represent diffusion depth in clustering. The ability to modulate the diffusion-depth parameter by varying it within a certain interval allows for more accurate construction of clusters. Depending on the value of the diffusion-depth parameter, the presented approach can determine very well both local clusters and global structure of data. At the same time, the ability …

0301 basic medicineComputer scienceProperty (programming)Markov chain02 engineering and technologyInterval (mathematics)03 medical and health sciencesdiffuusio (fysikaaliset ilmiöt)0202 electrical engineering electronic engineering information engineeringCluster (physics)SegmentationDiffusion (business)Cluster analysista113ta213diffusionDirected graph030104 developmental biologyData pointHardware and ArchitectureSignal Processingyhdyskuntarakenne020201 artificial intelligence & image processingsocial networkcommunity structureAlgorithmSoftwareInformation Systemsclustering
researchProduct

SpCLUST: Towards a fast and reliable clustering for potentially divergent biological sequences

2019

International audience; This paper presents SpCLUST, a new C++ package that takes a list of sequences as input, aligns them with MUSCLE, computes their similarity matrix in parallel and then performs the clustering. SpCLUST extends a previously released software by integrating additional scoring matrices which enables it to cover the clustering of amino-acid sequences. The similarity matrix is now computed in parallel according to the master/slave distributed architecture, using MPI. Performance analysis, realized on two real datasets of 100 nucleotide sequences and 1049 amino-acids ones, show that the resulting library substantially outperforms the original Python package. The proposed pac…

0301 basic medicineComputer science[INFO.INFO-SE] Computer Science [cs]/Software Engineering [cs.SE]Health Informatics[INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE][INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing03 medical and health sciences[INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]0302 clinical medicineSoftware[INFO.INFO-ET] Computer Science [cs]/Emerging Technologies [cs.ET][INFO.INFO-DC] Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]Cluster AnalysisHumansCluster analysis[INFO.INFO-CR] Computer Science [cs]/Cryptography and Security [cs.CR]computer.programming_languagebusiness.industry[INFO.INFO-IU] Computer Science [cs]/Ubiquitous ComputingSimilarity matrixPattern recognitionDNAGenomicsSequence Analysis DNAPython (programming language)Mixture model[INFO.INFO-MO]Computer Science [cs]/Modeling and SimulationSpectral clusteringComputer Science Applications030104 developmental biologyComputingMethodologies_PATTERNRECOGNITION[INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA][INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET][INFO.INFO-MA] Computer Science [cs]/Multiagent Systems [cs.MA][INFO.INFO-MO] Computer Science [cs]/Modeling and SimulationArtificial intelligence[INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]businesscomputerAlgorithmsSoftware030217 neurology & neurosurgery
researchProduct

Rocker: Open source, easy-to-use tool for AUC and enrichment calculations and ROC visualization

2016

Receiver operating characteristics (ROC) curve with the calculation of area under curve (AUC) is a useful tool to evaluate the performance of biomedical and chemoinformatics data. For example, in virtual drug screening ROC curves are very often used to visualize the efficiency of the used application to separate active ligands from inactive molecules. Unfortunately, most of the available tools for ROC analysis are implemented into commercially available software packages, or are plugins in statistical software, which are not always the easiest to use. Here, we present Rocker, a simple ROC curve visualization tool that can be used for the generation of publication quality images. Rocker also…

0301 basic medicineComputer scienceautomatic calculationLibrary and Information Sciencescomputer.software_genre01 natural sciences03 medical and health sciencesSoftwareArea under curvePlug-inPhysical and Theoretical ChemistryVirtual screeningReceiver operating characteristicbusiness.industryComputer Graphics and Computer-Aided Design0104 chemical sciencesComputer Science ApplicationsVisualizationreceiver operating characteristics010404 medicinal & biomolecular chemistryIdentification (information)ComputingMethodologies_PATTERNRECOGNITION030104 developmental biologyarea under curvesRockerCheminformaticsData miningbusinesscomputerSoftwaresoftwaresJournal of Cheminformatics
researchProduct

Principal components analysis: theory and application to gene expression data analysis

2018

Advances in computational power have enabled research to generate significant amounts of data related to complex biological problems. Consequently, applying appropriate data analysis techniques has become paramount to tackle this complexity. However, theoretical understanding of statistical methods is necessary to ensure that the correct method is used and that sound inferences are made based on the analysis. In this article, we elaborate on the theory behind principal components analysis (PCA), which has become a favoured multivariate statistical tool in the field of omics-data analysis. We discuss the necessary prerequisites and steps to produce statistically valid results and provide gui…

0301 basic medicineComputer sciencebusiness.industryAssociation (object-oriented programming)Big dataGenomicsMachine learningcomputer.software_genreField (computer science)03 medical and health sciences030104 developmental biology0302 clinical medicineSoftwareWorkflowPrincipal component analysisData analysisArtificial intelligencebusinesscomputer030217 neurology & neurosurgeryGenomics and Computational Biology
researchProduct

Deep Learning Architectures for DNA Sequence Classification

2017

DNA sequence classification is a key task in a generic computational framework for biomedical data analysis, and in recent years several machine learning technique have been adopted to successful accomplish with this task. Anyway, the main difficulty behind the problem remains the feature selection process. Sequences do not have explicit features, and the commonly used representations introduce the main drawback of the high dimensionality. For sure, machine learning method devoted to supervised classification tasks are strongly dependent on the feature extraction step, and in order to build a good representation it is necessary to recognize and measure meaningful details of the items to cla…

0301 basic medicineComputer sciencebusiness.industryProcess (engineering)Deep learningFeature extractionFeature selection02 engineering and technologyMachine learningcomputer.software_genreConvolutional neural networkTask (project management)03 medical and health sciences030104 developmental biologyRecurrent neural network0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingArtificial intelligenceRepresentation (mathematics)businesscomputer
researchProduct

On the Use of Binary Trees for DNA Hydroxymethylation Analysis

2017

DNA methylation (mC) and hydroxymethylation (hmC) can have a significant effect on normal human development, health and disease status. Hydroxymethylation studies require specific treatment of DNA, as well as software tools for their analysis. In this paper, we propose a parallel software tool for analyzing the DNA hydroxymethylation data obtained by TAB-seq. The software is based on the use of binary trees for searching the different occurrences of methylation and hydroxymethylation in DNA samples. The binary trees allow to efficiently store and access the information about the methylation of each methylated/hydroxymethylated cytosines in the samples. Evaluation results shows that the perf…

0301 basic medicineDNA Hydroxymethylation020203 distributed computingBinary treebusiness.industryComputer science02 engineering and technologyMethylationComputational biologySupercomputer03 medical and health scienceschemistry.chemical_compound030104 developmental biologySoftwareParallel softwarechemistryDNA methylation0202 electrical engineering electronic engineering information engineeringheterocyclic compoundsbusinessDNA
researchProduct

Reactome pathway analysis: a high-performance in-memory approach

2016

Reactome aims to provide bioinformatics tools for visualisation, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modelling, systems biology and education. Pathway analysis methods have a broad range of applications in physiological and biomedical research; one of the main problems, from the analysis methods performance point of view, is the constantly increasing size of the data samples. Here, we present a new high-performance in-memory implementation of the well-established over-representation analysis method. To achieve the target, the over-representation analysis method is divided in four different steps and, for each of them, specific data st…

0301 basic medicineData structuresDatabases FactualPathway analysisComputer scienceInterface (Java)Systems biologycomputer.software_genreGenomeBiochemistry03 medical and health sciences0302 clinical medicineStructural BiologyNucleic AcidsHumansMolecular BiologyApplied MathematicsComputational BiologyProteinsPathway analysisComputer Science ApplicationsTree (data structure)030104 developmental biology030220 oncology & carcinogenesisGraph (abstract data type)Data miningOver-representation analysiscomputerAlgorithmsSoftwareBMC Bioinformatics
researchProduct

2015

The University of Minnesota Biocatalysis/Biodegradation Database and Pathway Prediction System (UM-BBD/PPS) has been a unique resource covering microbial biotransformation pathways of primarily xenobiotic chemicals for over 15 years. This paper introduces the successor system, enviPath (The Environmental Contaminant Biotransformation Pathway Resource), which is a complete redesign and reimplementation of UM-BBD/PPS. enviPath uses the database from the UM-BBD/PPS as a basis, extends the use of this database, and allows users to include their own data to support multiple use cases. Relative reasoning is supported for the refinement of predictions and to allow its extensions in terms of previo…

0301 basic medicineDatabasecomputer.file_formatBiologyPrediction systemcomputer.software_genreBioinformatics03 medical and health sciences030104 developmental biologyWorkflowResource (project management)BiotransformationGeneticsRDFcomputerNucleic Acids Research
researchProduct

Reactome graph database: Efficient access to complex pathway data

2018

Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…

0301 basic medicineDatabases FactualComputer scienceData managementKnowledge BasesSocial SciencesInformation Storage and RetrievalNoSQLcomputer.software_genreComputer ApplicationsDatabase and Informatics MethodsUser-Computer Interface0302 clinical medicineKnowledge extractionPsychologyDatabase Searchinglcsh:QH301-705.5Data ManagementLanguageBiological dataEcologySystems BiologyGenomicsGenomic DatabasesComputational Theory and MathematicsModeling and SimulationWeb-Based ApplicationsGraph (abstract data type)Information TechnologyResearch ArticleComputer and Information SciencesRelational databaseQuery languageResearch and Analysis MethodsEcosystems03 medical and health sciencesCellular and Molecular NeuroscienceDatabasesGeneticsComputer GraphicsHumansMolecular BiologyEcology Evolution Behavior and SystematicsInternetInformation retrievalGraph databasebusiness.industryEcology and Environmental SciencesCognitive PsychologyBiology and Life SciencesComputational BiologyGenome AnalysisRelational Databases030104 developmental biologyBiological Databaseslcsh:Biology (General)Cognitive Sciencebusinesscomputer030217 neurology & neurosurgerySoftwareNeurosciencePLoS Computational Biology
researchProduct