Search results for "Data mining"

showing 10 items of 907 documents

Managing Multi-center Flow Cytometry Data for Immune Monitoring.

2014

With the recent results of promising cancer vaccines and immunotherapy 1 – 5 , immune monitoring has become increasingly relevant for measuring treatment-induced effects on T cells, and an essential tool for shedding light on the mechanisms responsible for a successful treatment. Flow cytometry is the canonical multi-parameter assay for the fine characterization of single cells in solution, and is ubiquitously used in pre-clinical tumor immunology and in cancer immunotherapy trials. Current state-of-the-art polychromatic flow cytometry involves multi-step, multi-reagent assays followed by sample acquisition on sophisticated instruments capable of capturing up to 20 parameters per cell at a…

Cancer ResearchComputer scienceData managementREST APIdata provenancecomputer.software_genrelcsh:RC254-282automated analysisData modelinglaboratory informatics03 medical and health sciences0302 clinical medicineLaboratory informaticsreproducible analysisFlow cytometry030304 developmental biologyOriginal Research0303 health sciencesApplication programming interfacebusiness.industrymetadatalcsh:Neoplasms. Tumors. Oncology. Including cancer and carcinogensData scienceAutomationMetadataManagement information systemsOncologyData miningdata managementbusinesscomputer030215 immunologyCommunication channelCancer informatics
researchProduct

Precision sampling fuels precision oncology: an evolutionary perspective.

2021

Intratumor heterogeneity (ITH) develops in malignant tumors. Precision sampling that captures this tumor variability is essential for the implementation of precision oncology. We highlight the necessity to update current sampling protocols and implement a strategy to ensure ITH detection and characterization. A cost-effective strategy for such sampling has been modeled in clear cell renal cell carcinoma (CCRCC).

Cancer ResearchComputer sciencePerspective (graphical)Sampling (statistics)medicine.diseasecomputer.software_genreBiological EvolutionKidney NeoplasmsClear cell renal cell carcinomaOncologyIntratumor heterogeneityPrecision oncologymedicineHumansData miningPrecision MedicinecomputerCarcinoma Renal CellTrends in cancer
researchProduct

Fuzzy subgroup mining for gene associations

2004

When studying the therapeutic efficacy of potential new drugs, it would be much more efficient to use predictors in order to assess their toxicity before going into clinical trials. One promising line of research has focused on the discovery of sets of candidate gene profiles to be used as toxicity indicators in future drug development. In particular genomic microarrays may be used to analyze the causality relationship between the administration of the drugs and the so-called gene expression, a parameter typically used by biologists to measure its influence at gene level. This kind of experiments involves a high throughput analysis of noisy and particularly unreliable data, which makes the …

Candidate geneApriori algorithmMeasure (data warehouse)Fuzzy control systemBiologycomputer.software_genreCausalityFuzzy logicComputingMethodologies_PATTERNRECOGNITIONDrug developmentData miningddc:004Throughput (business)computer
researchProduct

Gene Set to Diseases (GS2D): disease enrichment analysis on human gene sets with literature data

2016

Large sets of candidate genes derived from high-throughput biological experiments can be characterized by functional enrichment analysis. The analysis consists of comparing the functions of one gene set against that of a background gene set. Then, functions related to a significant number of genes in the gene set are expected to be relevant. Web tools offering disease enrichment analysis on gene sets are often based on gene-disease associations from manually curated or experimental data that is accurate but does not cover all diseases discussed in the literature. Using associations automatically derived from literature data could be a cost effective method to improve the coverage of disease…

Candidate genebusiness.industryBig dataExperimental dataGenomicsBiologycomputer.software_genreSet (abstract data type)WorkflowData miningToxicogenomicsbusinesscomputerGeneGenomics and Computational Biology
researchProduct

Searching for repetitions in biological networks: methods, resources and tools

2013

We present here a compact overview of the data, models and methods proposed for the analysis of biological networks based on the search for significant repetitions. In particular, we concentrate on three problems widely studied in the literature: ‘network alignment’, ‘network querying’ and ‘network motif extraction’. We provide (i) details of the experimental techniques used to obtain the main types of interaction data, (ii) descriptions of the models and approaches introduced to solve such problems and (iii) pointers to both the available databases and software tools. The intent is to lay out a useful roadmap for identifying suitable strategies to analyse cellular data, possibly based on t…

Cellular datanetwork global alignmentnetwork local alignmentbiological networks analysiSettore INF/01 - Informaticabusiness.industryComputer sciencenetwork queryingComputational Biologynetwork motif extractionModels Theoreticalcomputer.software_genreData typeNetwork motifSoftwareNetwork alignmentData miningbusinessMolecular Biologycomputerasymmetric alignmentBiological networkSoftwareInformation Systems
researchProduct

Effectiveness of local feature selection in ensemble learning for prediction of antimicrobial resistance

2008

In the real world concepts are often not stable but change over time. A typical example of this in the biomedical context is antibiotic resistance, where pathogen sensitivity may change over time as pathogen strains develop resistance to antibiotics that were previously effective. This problem, known as concept drift (CD), complicates the task of learning a robust model. Different ensemble learning (EL) approaches (that instead of learning a single classifier try to learn and maintain a set of classifiers over time) have been shown to perform reasonably well in the presence of concept drift. In this paper we study how much local feature selection (FS) can improve ensemble performance for da…

Change over timeConcept driftbusiness.industryComputer sciencemedia_common.quotation_subjectSystem testingFeature selectionMachine learningcomputer.software_genreEnsemble learningStatistical classificationVotingArtificial intelligenceData miningbusinesscomputerClassifier (UML)media_common
researchProduct

Anomaly detection in dynamic systems using weak estimators

2011

Accepted version of an article from the journal: ACM transactions on internet technology. Published version available from the ACM: http://dx.doi.org/10.1145/1993083.1993086 Anomaly detection involves identifying observations that deviate from the normal behavior of a system. One of the ways to achieve this is by identifying the phenomena that characterize “normal” observations. Subsequently, based on the characteristics of data learned from the “normal” observations, new observations are classified as being either “normal” or not. Most state-of-the-art approaches, especially those which belong to the family of parameterized statistical schemes, work under the assumption that the underlying…

Change over timeVDP::Mathematics and natural science: 400::Mathematics: 410::Applied mathematics: 413education.field_of_studyComputer Networks and CommunicationsComputer sciencePopulationEstimatorParameterized complexityVDP::Technology: 500::Information and communication technology: 550Network monitoringcomputer.software_genreOutlierAnomaly detectionData miningeducationcomputer
researchProduct

World Influence of Infectious Diseases from Wikipedia Network Analysis

2019

AbstractWe consider the network of 5 416 537 articles of English Wikipedia extracted in 2017. Using the recent reduced Google matrix (REGOMAX) method we construct the reduced network of 230 articles (nodes) of infectious diseases and 195 articles of world countries. This method generates the reduced directed network between all 425 nodes taking into account all direct and indirect links with pathways via the huge global network. PageRank and CheiRank algorithms are used to determine the most influential diseases with the top PageRank diseases being Tuberculosis, HIV/AIDS and Malaria. From the reduced Google matrix we determine the sensitivity of world countries to specific diseases integrat…

CheiRankComputer scienceHuman immunodeficiency virus (HIV)medicine.disease_cause01 natural sciences[INFO.INFO-SI]Computer Science [cs]/Social and Information Networks [cs.SI]law.invention03 medical and health sciencesPageRanklaw0103 physical sciencesGlobal networkmedicine010306 general physics030304 developmental biology0303 health sciencesInformation retrievalGoogle matrixMarkov processes[PHYS.PHYS.PHYS-SOC-PH]Physics [physics]/Physics [physics]/Physics and Society [physics.soc-ph]complex networksdata mining[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]ranking (statistics)3. Good healthInfectious diseaseslcsh:Electrical engineering. Electronics. Nuclear engineeringlcsh:TK1-9971Network analysisWikipedia
researchProduct

Improving the QM/MM Description of Chemical Processes:  A Dual Level Strategy To Explore the Potential Energy Surface in Very Large Systems.

2005

Potential energy surfaces are fundamental tools for the analysis of reaction mechanisms. The accuracy of these surfaces for reactions in very large systems is often limited by the size of the system even if hybrid quantum mechanics/molecular mechanics (QM/MM) strategies are employed. The large number of degrees of freedom of the system requires hundreds or even thousands of optimization steps to reach convergence. Reactions in condensed media (such as enzymes or solutions) are thus usually restricted to be analyzed using low level quantum mechanical methods, thus introducing a source of error in the description of the QM region. In this paper, an alternative method is proposed, coupled to t…

Chemical processComputer scienceDegrees of freedom (physics and chemistry)computer.software_genreTopologyPotential energyComputer Science ApplicationsQM/MMConvergence (routing)Potential energy surfaceData miningPhysical and Theoretical ChemistrycomputerQuantumEnergy (signal processing)Journal of chemical theory and computation
researchProduct

A computer program suitable for analysis of choice of categories in biomedical data recognition problems.

1980

The optimum choice of categories in problems of medical data recognition is governed by the choice of categories, the selection of appropriate features, and by the choice of a loss function. Under these circumstances it is often difficult to find out the suitable classification scheme. The computer program described here serves for the design of the optimum recognition procedure. The Bayes rule is used as decision rule. A criterion for the comparison of different choice of categories is given. The program can be performed after estimation of the underlying prior probabilities and the conditional densities obtained from a training set, and before testing the decision rule with real data.

Choice setComputer programComputer sciencebusiness.industryComputersDecision theoryMedicine (miscellaneous)Decision ruleFunction (mathematics)Machine learningcomputer.software_genreClassificationBayes' theoremDecision TheoryBiomedical dataResearch DesignData miningArtificial intelligencebusinesscomputerSelection (genetic algorithm)Computer programs in biomedicine
researchProduct