Search results for "Computer Science - Machine Learning"

showing 10 items of 155 documents

Sparse and Smooth: improved guarantees for Spectral Clustering in the Dynamic Stochastic Block Model

2020

In this paper, we analyse classical variants of the Spectral Clustering (SC) algorithm in the Dynamic Stochastic Block Model (DSBM). Existing results show that, in the relatively sparse case where the expected degree grows logarithmically with the number of nodes, guarantees in the static case can be extended to the dynamic case and yield improved error bounds when the DSBM is sufficiently smooth in time, that is, the communities do not change too much between two time steps. We improve over these results by drawing a new link between the sparsity and the smoothness of the DSBM: the more regular the DSBM is, the more sparse it can be, while still guaranteeing consistent recovery. In particu…

FOS: Computer and information sciencesStatistics and ProbabilityComputer Science - Machine Learning[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]Statistics - Machine LearningFOS: MathematicsMachine Learning (stat.ML)Mathematics - Statistics TheoryStatistics Theory (math.ST)Statistics Probability and Uncertainty[STAT.ML] Statistics [stat]/Machine Learning [stat.ML]Machine Learning (cs.LG)

researchProduct

Causal Effect Identification from Multiple Incomplete Data Sources: A General Search-Based Approach

2021

Causal effect identification considers whether an interventional probability distribution can be uniquely determined without parametric assumptions from measured source distributions and structural knowledge on the generating system. While complete graphical criteria and procedures exist for many identification problems, there are still challenging but important extensions that have not been considered in the literature. To tackle these new settings, we present a search algorithm directly over the rules of do-calculus. Due to generality of do-calculus, the search is capable of taking more advanced data-generating mechanisms into account along with an arbitrary type of both observational and…

FOS: Computer and information sciencesStatistics and ProbabilityComputer Science - Machine LearningcausalityComputer Science - Artificial IntelligenceHeuristic (computer science)Computer scienceeducationMachine Learning (stat.ML)transportabilitycomputer.software_genre01 natural sciencesMachine Learning (cs.LG)R-kielimissing dataQA76.75-76.765; QA273-280010104 statistics & probabilitydo-calculuscausality; do-calculus; selection bias; transportability; missing data; case-control design; meta-analysisStatistics - Machine LearningSearch algorithmselection bias0101 mathematicsParametric statisticspäättelymeta-analyysicase-control designhakualgoritmit113 Computer and information sciencesMissing datameta-analysisIdentification (information)Artificial Intelligence (cs.AI)Causal inferencekausaliteettiIdentifiabilityProbability distributionData miningStatistics Probability and UncertaintycomputerSoftwareJournal of Statistical Software

researchProduct

Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

2022

International audience; Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yield sequences of Jacobians converging toward the exact Jacobian. Using implicit differentiation, we show it is possible to leverage the non-smoothness of the inner problem to speed up the computation. Finally, we provide a bound on the error made on the hypergradient when the inner optimization problem is solved approxim…

FOS: Computer and information sciencesbilevel optimizationComputer Science - Machine Learninghyperparameter selec- tionMachine Learning (stat.ML)[MATH.MATH-OC] Mathematics [math]/Optimization and Control [math.OC]generalized linear modelsMachine Learning (cs.LG)Convex optimizationStatistics - Machine Learning[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]Optimization and Control (math.OC)FOS: Mathematics[MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]hyperparameter optimizationLassoMathematics - Optimization and Control[MATH.MATH-ST] Mathematics [math]/Statistics [math.ST]

researchProduct

Identifying Causal Effects via Context-specific Independence Relations

2019

Causal effect identification considers whether an interventional probability distribution can be uniquely determined from a passively observed distribution in a given causal structure. If the generating system induces context-specific independence (CSI) relations, the existing identification procedures and criteria based on do-calculus are inherently incomplete. We show that deciding causal effect non-identifiability is NP-hard in the presence of CSIs. Motivated by this, we design a calculus and an automated search procedure for identifying causal effects in the presence of CSIs. The approach is provably sound and it includes standard do-calculus as a special case. With the approach we can …

FOS: Computer and information sciencescontext-specific independence relationsComputer Science - Machine LearningArtificial Intelligence (cs.AI)Computer Science - Artificial Intelligenceeducationkausaliteetticausal effect identification113 Computer and information sciencesMachine Learning (cs.LG)

researchProduct

Domain-specific transfer learning in the automated scoring of tumor-stroma ratio from histopathological images of colorectal cancer

2023

Tumor-stroma ratio (TSR) is a prognostic factor for many types of solid tumors. In this study, we propose a method for automated estimation of TSR from histopathological images of colorectal cancer. The method is based on convolutional neural networks which were trained to classify colorectal cancer tissue in hematoxylin-eosin stained samples into three classes: stroma, tumor and other. The models were trained using a data set that consists of 1343 whole slide images. Three different training setups were applied with a transfer learning approach using domain-specific data i.e. an external colorectal cancer histopathological data set. The three most accurate models were chosen as a classifie…

FOS: Computer and information sciencessmooth musclesvisionComputer Science - Machine LearningMultidisciplinaryComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognitioncolorectal cancerforecastingennusteetneuroverkotsuolistosyövätneural networksQuantitative Biology - Quantitative MethodsMachine Learning (cs.LG)machine learningkoneoppiminenFOS: Biological sciencessyöpätauditcancers and neoplasmsmalignant tumorsQuantitative Methods (q-bio.QM)

researchProduct

Balancing Profit, Risk, and Sustainability for Portfolio Management

2022

Author's accepted manuscript © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. Stock portfolio optimization is the process of continuous reallocation of funds to a selection of stocks. This is a particularly well-suited problem for reinforcement learning, as daily rewards are compounding and objective functions may include more than just profit, e.g., risk and su…

FOS: Economics and businessFOS: Computer and information sciencesComputer Science - Machine LearningVDP::Teknologi: 500Artificial Intelligence (cs.AI)Portfolio Management (q-fin.PM)Computer Science - Artificial IntelligenceQuantitative Finance - Portfolio ManagementMachine Learning (cs.LG)

researchProduct

Measuring the Novelty of Natural Language Text Using the Conjunctive Clauses of a Tsetlin Machine Text Classifier

2020

Most supervised text classification approaches assume a closed world, counting on all classes being present in the data at training time. This assumption can lead to unpredictable behaviour during operation, whenever novel, previously unseen, classes appear. Although deep learning-based methods have recently been used for novelty detection, they are challenging to interpret due to their black-box nature. This paper addresses \emph{interpretable} open-world text classification, where the trained classifier must deal with novel classes during operation. To this end, we extend the recently introduced Tsetlin machine (TM) with a novelty scoring mechanism. The mechanism uses the conjunctive clau…

I.2FOS: Computer and information sciencesComputer Science - Machine LearningI.5Computer Science - Artificial IntelligenceComputer scienceI.2; I.5; I.7computer.software_genreI.7Novelty detectionMeasure (mathematics)Machine Learning (cs.LG)Representation (mathematics)Computer Science - Computation and Languagebusiness.industryDeep learningNoveltyPropositional calculusArtificial Intelligence (cs.AI)Artificial intelligencebusinessClassifier (UML)computerComputation and Language (cs.CL)Natural language processingNatural language

researchProduct

Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software Platf…

2023

Learned Indexes are a novel approach to search in a sorted table. A model is used to predict an interval in which to search into and a Binary Search routine is used to finalize the search. They are quite effective. For the final stage, usually, the lower_bound routine of the Standard C++ library is used, although this is more of a natural choice rather than a requirement. However, recent studies, that do not use Machine Learning predictions, indicate that other implementations of Binary Search or variants, namely k-ary Search, are better suited to take advantage of the features offered by modern computer architectures. With the use of the Searching on Sorted Sets SOSD Learned Indexing bench…

I.2FOS: Computer and information sciencesComputer Science - Machine Learninglearned index structuresH.2Databases (cs.DB)search on sorted data platformComputer Science - Information RetrievalMachine Learning (cs.LG)E.1; I.2; H.2Computer Science - Databasesbinary search variantsComputer Science - Data Structures and AlgorithmsData Structures and Algorithms (cs.DS)E.1algorithms with predictionSoftwareInformation Retrieval (cs.IR)

researchProduct

Convolutional Neural Networks for Multispectral Image Cloud Masking

2020

Convolutional neural networks (CNN) have proven to be state of the art methods for many image classification tasks and their use is rapidly increasing in remote sensing problems. One of their major strengths is that, when enough data is available, CNN perform an end-to-end learning without the need of custom feature extraction methods. In this work, we study the use of different CNN architectures for cloud masking of Proba-V multispectral images. We compare such methods with the more classical machine learning approach based on feature extraction plus supervised classification. Experimental results suggest that CNN are a promising alternative for solving cloud masking problems.

Masking (art)FOS: Computer and information sciencesComputer Science - Machine Learning010504 meteorology & atmospheric sciencesContextual image classificationbusiness.industryComputer scienceComputer Vision and Pattern Recognition (cs.CV)Feature extractionMultispectral image0211 other engineering and technologiesComputer Science - Computer Vision and Pattern RecognitionCloud computingPattern recognition02 engineering and technology01 natural sciencesConvolutional neural networkMachine Learning (cs.LG)Artificial intelligenceState (computer science)business021101 geological & geomatics engineering0105 earth and related environmental sciences

researchProduct

Rapid parameter estimation of discrete decaying signals using autoencoder networks

2021

Machine learning: science and technology 2(4), 045024 (2021). doi:10.1088/2632-2153/ac1eea

Signal Processing (eess.SP)FOS: Computer and information sciencesAccuracy and precisionComputer Science - Machine LearningComputer scienceddc:621.3FOS: Physical sciences01 natural sciencesSignalMachine Learning (cs.LG)010309 opticsExponential growthArtificial Intelligence0103 physical sciencesFOS: Electrical engineering electronic engineering information engineeringLimit (mathematics)Neural and Evolutionary Computing (cs.NE)Electrical Engineering and Systems Science - Signal Processing010306 general physicsSignal processingArtificial neural networkEstimation theoryComputer Science - Neural and Evolutionary ComputingAutoencoder621.3Human-Computer InteractionPhysics - Data Analysis Statistics and ProbabilityAlgorithmSoftwareData Analysis Statistics and Probability (physics.data-an)Machine Learning: Science and Technology

researchProduct