6533b829fe1ef96bd1289728

RESEARCH PRODUCT

A Scheme for Continuous Input to the Tsetlin Machine with Applications to Forecasting Disease Outbreaks

Ole-christopher GranmoKuruge Darshana AbeyrathnaMorten GoodwinXuan Zhang

subject

Learning automataArtificial neural networkComputer scienceDecision tree02 engineering and technologycomputer.software_genreThresholdingField (computer science)020202 computer hardware & architectureAutomatonSupport vector machine0202 electrical engineering electronic engineering information engineeringPreprocessor020201 artificial intelligence & image processingData miningcomputer

description

In this paper, we apply a new promising tool for pattern classification, namely, the Tsetlin Machine (TM), to the field of disease forecasting. The TM is interpretable because it is based on manipulating expressions in propositional logic, leveraging a large team of Tsetlin Automata (TA). Apart from being interpretable, this approach is attractive due to its low computational cost and its capacity to handle noise. To attack the problem of forecasting, we introduce a preprocessing method that extends the TM so that it can handle continuous input. Briefly stated, we convert continuous input into a binary representation based on thresholding. The resulting extended TM is evaluated and analyzed using an artificial dataset. The TM is further applied to forecast dengue outbreaks of all the seventeen regions in Philippines using the spatio-temporal properties of the data. Experimental results show that dengue outbreak forecasts made by the TM are more accurate than those obtained by a Support Vector Machine (SVM), Decision Trees (DTs), and several multi-layered Artificial Neural Networks (ANNs), both in terms of forecasting precision and F1-score.

https://doi.org/10.1007/978-3-030-22999-3_49