Search results for "Learning automata"

showing 10 items of 76 documents

Explainable Reinforcement Learning with the Tsetlin Machine

2021

The Tsetlin Machine is a recent supervised machine learning algorithm that has obtained competitive results in several benchmarks, both in terms of accuracy and resource usage. It has been used for convolution, classification, and regression, producing interpretable rules. In this paper, we introduce the first framework for reinforcement learning based on the Tsetlin Machine. We combined the value iteration algorithm with the regression Tsetlin Machine, as the value function approximator, to investigate the feasibility of training the Tsetlin Machine through bootstrapping. Moreover, we document robustness and accuracy of learning on several instances of the grid-world problem.

Learning automataComputer sciencebusiness.industryBootstrappingMachine learningcomputer.software_genreRegressionConvolutionRobustness (computer science)Bellman equationReinforcement learningMarkov decision processArtificial intelligenceMathematics::Representation Theorybusinesscomputer

researchProduct

Dynamic Ordering of Firewall Rules Using a Novel Swapping Window-based Paradigm

2016

Designing and implementing efficient firewall strategies in the age of the Internet of Things (IoT) is far from trivial. This is because, as time proceeds, an increasing number of devices will be connected, accessed and controlled on the Internet. Additionally, an ever-increasingly amount of sensitive information will be stored on various networks. A good and effi- cient firewall strategy will attempt to secure this information, and to also manage the large amount of inevitable network traffic that these devices create. The goal of this paper is to propose a framework for designing optimized firewalls for the IoT. This paper deals with two fundamental challenges/problems encountered in such…

Learning automataComputer sciencebusiness.industryDistributed computingSuiteEstimator020206 networking & telecommunicationsLearning Automata02 engineering and technologyFirewall OptimizationNon-Stationary EnvironmentsInformation sensitivityFirewall (construction)Batch UpdateMatching time0202 electrical engineering electronic engineering information engineeringRule matchingThe InternetWeak EstimatorsInternet of Thingsbusiness

researchProduct

A formal proof of the e-optimality of discretized pursuit algorithms

2015

Learning Automata (LA) can be reckoned to be the founding algorithms on which the field of Reinforcement Learning has been built. Among the families of LA, Estimator Algorithms (EAs) are certainly the fastest, and of these, the family of discretized algorithms are proven to converge even faster than their continuous counterparts. However, it has recently been reported that the previous proofs for ??-optimality for all the reported algorithms for the past three decades have been flawed. We applaud the researchers who discovered this flaw, and who further proceeded to rectify the proof for the Continuous Pursuit Algorithm (CPA). The latter proof examines the monotonicity property of the proba…

Learning automataDiscretizationInequalityBasis (linear algebra)Computer sciencemedia_common.quotation_subjectField (mathematics)Monotonic function02 engineering and technologyMathematical proofFormal proof020202 computer hardware & architectureAlgebraArtificial Intelligence0202 electrical engineering electronic engineering information engineeringReinforcement learning020201 artificial intelligence & image processingAlgorithmmedia_common

researchProduct

Stochastic discretized learning-based weak estimation: a novel estimation method for non-stationary environments

2016

The task of designing estimators that are able to track time-varying distributions has found promising applications in many real-life problems.Existing approaches resort to sliding windows that track changes by discarding old observations. In this paper, we report a novel estimator referred to as the Stochastic Discretized Weak Estimator (SDWE), that is based on the principles of discretized Learning Automata (LA). In brief, the estimator is able to estimate the parameters of a time varying binomial distribution using finite memory. The estimator tracks changes in the distribution by operating a controlled random walk in a discretized probability space. The steps of the estimator are discre…

Learning automataEstimator020206 networking & telecommunications02 engineering and technologyBinomial distributionUnivariate distributionEfficient estimatorArtificial IntelligenceSignal Processing0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingMultinomial distributionComputer Vision and Pattern RecognitionMinimax estimatorAlgorithmSoftwareInvariant estimatorMathematicsPattern Recognition

researchProduct

Distributed learning automata for solving a classification task

2016

In this paper, we propose a novel classifier in two-dimensional feature spaces based on the theory of Learning Automata (LA). The essence of our scheme is to search for a separator in the feature space by imposing a LA based random walk in a grid system. To each node in the gird we attach an LA, whose actions are the choice of the edges forming the separator. The walk is self-enclosing, i.e, a new random walk is started whenever the walker returns to starting node forming a closed classification path yielding a many edged polygon. In our approach, the different LA attached at the different nodes search for a polygon that best encircles and separates each class. Based on the obtained polygon…

Learning automataFeature vector020206 networking & telecommunications02 engineering and technologySupport vector machinesymbols.namesakeKernel methodKernel (statistics)PolygonRadial basis function kernel0202 electrical engineering electronic engineering information engineeringGaussian functionsymbols020201 artificial intelligence & image processingAlgorithmMathematics2016 IEEE Congress on Evolutionary Computation (CEC)

researchProduct

Scheduling Domestic Shiftable Loads in Smart Grids: A Learning Automata-Based Scheme

2017

In this paper, we consider the problem of scheduling shiftable loads, over multiple users, in smart grids. We approach the problem, which is becoming increasingly pertinent in our present energy-thirsty society, using a novel distributed game-theoretic framework. From a modeling perspective, the distributed scheduling problem is formulated as a game, and in particular, a so-called “Potential” game. This game has at least one pure strategy Nash Equilibrium (NE), and we demonstrate that the NE point is a global optimal point. The solution that we propose, which is the pioneering solution that incorporates the theory of Learning Automata (LA), permits the total supplied loads to approach the p…

Learning automataJob shop schedulingComputer scienceDistributed computing02 engineering and technology010501 environmental sciences01 natural sciencesSubnetScheduling (computing)symbols.namesakeSmart gridStrategyNash equilibrium0202 electrical engineering electronic engineering information engineeringsymbols020201 artificial intelligence & image processingPotential game0105 earth and related environmental sciences

researchProduct

Generalized Bayesian Pursuit: A Novel Scheme for Multi-Armed Bernoulli Bandit Problems

2011

In the last decades, a myriad of approaches to the multi-armed bandit problem have appeared in several different fields. The current top performing algorithms from the field of Learning Automata reside in the Pursuit family, while UCB-Tuned and the e-greedy class of algorithms can be seen as state-of-the-art regret minimizing algorithms. Recently, however, the Bayesian Learning Automaton (BLA) outperformed all of these, and other schemes, in a wide range of experiments. Although seemingly incompatible, in this paper we integrate the foundational learning principles motivating the design of the BLA, with the principles of the so-called Generalized Pursuit algorithm (GPST), leading to the Gen…

Learning automatabusiness.industryComputer scienceBayesian probabilityMachine learningcomputer.software_genreBayesian inferenceConjugate priorField (computer science)Probability vectorPrinciples of learningArtificial intelligenceSet (psychology)businesscomputer

researchProduct

Using Learning Automata to Enhance Local-Search Based SAT Solvers with Learning Capability

2010

In this work, we have introduced a new approach based on combining Learning Automata with Random Walk and GSAT w/Random Walk. In order to get a comprehensive overview of the new algorithms' performance, we used a set of benchmark problems containing different problems from various domains. In these benchmark problems, both RW and GSATRW suffers from stagnation behaviour which directly affects their performance. This phenomenon is, however, only observed for LA-GSATRW on the largest problem instances. Finally, the

Learning automatabusiness.industryComputer scienceLocal search (optimization)Artificial intelligencebusinessMachine learningcomputer.software_genrecomputer

researchProduct

A Stochastic Search on the Line-Based Solution to Discretized Estimation

2012

Published version of a chapter in the book: Advanced Research in Applied Artificial Intelligence. Also available from the publisher at: http://dx.doi.org/10.1007/978-3-642-31087-4_77 Recently, Oommen and Rueda [11] presented a strategy by which the parameters of a binomial/multinomial distribution can be estimated when the underlying distribution is nonstationary. The method has been referred to as the Stochastic Learning Weak Estimator (SLWE), and is based on the principles of continuous stochastic Learning Automata (LA). In this paper, we consider a new family of stochastic discretized weak estimators pertinent to tracking time-varying binomial distributions. As opposed to the SLWE, our p…

Mathematical optimizationDiscretizationLearning automataComputer scienceStochastic Point Locationlearning automataEstimatorVDP::Technology: 500::Information and communication technology: 550020206 networking & telecommunications02 engineering and technologyOracleVDP::Mathematics and natural science: 400::Information and communication science: 420::Knowledge based systems: 425weak estimatorsnon-stationary environmentsLine (geometry)Convergence (routing)0202 electrical engineering electronic engineering information engineeringApplied mathematics020201 artificial intelligence & image processingMultinomial distributionFinite set

researchProduct

On Using a Hierarchy of Twofold Resource Allocation Automata to Solve Stochastic Nonlinear Resource Allocation Problems

2007

Recent trends in AI attempt to solve difficult NP-hard problems using intelligent techniques so as to obtain approximately-optimal solutions. In this paper, we consider a family of such problems which fall under the general umbrella of "knapsack-like" problems, and demonstrate how we can solve all of them fast and accurately using a hierarchy of Learning Automata (LA). In a multitude of real-world situations, resources must be allocated based on incomplete and noisy information, which often renders traditional resource allocation techniques ineffective. This paper addresses one such class of problems, namely, Stochastic Non-linear Fractional Knapsack Problems. We first present a completely …

Mathematical optimizationHierarchyLearning automataKnapsack problemComponent (UML)Convergence (routing)Resource allocationField (computer science)MathematicsAutomaton

researchProduct