6533b7d3fe1ef96bd1261372

RESEARCH PRODUCT

Validation of a Reinforcement Learning Policy for Dosage Optimization of Erythropoietin

Emilio Soria-olivasJosé D. Martín-guerreroMónica Climente-martíN. Víctor Jiménez-torresMarcelino Martínez-soberTeresa De Diego-santos

subject

business.industryManagement scienceComputer scienceMachine learningcomputer.software_genreData setWork (electrical)Robustness (computer science)ErythropoietinmedicineReinforcement learningArtificial intelligencebusinesscomputermedicine.drug

description

This paper deals with the validation of a Reinforcement Learning (RL) policy for dosage optimization of Erythropoietin (EPO). This policy was obtained using data from patients in a haemodialysis program during the year 2005. The goal of this policy was to maintain patients' Haemoglobin (Hb) level between 11.5 g/dl and 12.5 g/dl. An individual management was needed, as each patient usually presents a different response to the treatment. RL provides an attractive and satisfactory solution, showing that a policy based on RL would be much more successful in achieving the goal of maintaining patients within the desired target of Hb than the policy followed by the hospital so far. In this work, this policy is validated using a cohort of patients treated during 2006. Results show the robustness of the policy that is also successful with this new data set.

https://doi.org/10.1007/978-3-540-76928-6_84