Publication
INFORMS 2020
Talk
Automated Derivation Of MDP And Reinforcement Learning Models From Historical Data
Abstract
While optimization models can provide immense value, creating them requires substantial time and expertise. In this presentation, using inventory replenishment optimization as a running example, we will describe how sequential discrete-time optimization models can be generated automatically through a combination of Markov Decision Process (MDP) and Reinforcement Learning (RL) models. Our goal is to significantly reduce the time and skills required to create such models, thereby making the benefits of optimization much more widely available. We will also demonstrate the application of our work to the supply chain inventory management problem.
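The abstract does not specify how the models are derived, so the following is only an illustrative sketch of one common way such a derivation could look: estimate an empirical MDP from logged (state, action, reward, next state) records, then solve it with value iteration. The function names `estimate_mdp` and `value_iteration`, the two-level inventory state, the order/no-order action, and every number in `history` are hypothetical choices made for illustration; none of them come from the talk.

```python
from collections import defaultdict


def estimate_mdp(transitions):
    """Estimate transition probabilities and mean rewards from logged tuples.

    transitions: iterable of (state, action, reward, next_state).
    Returns P[(s, a)] = {next_state: probability} and R[(s, a)] = mean reward.
    """
    counts = defaultdict(lambda: defaultdict(int))
    reward_sum = defaultdict(float)
    for s, a, r, s_next in transitions:
        counts[(s, a)][s_next] += 1
        reward_sum[(s, a)] += r
    P, R = {}, {}
    for sa, nexts in counts.items():
        n = sum(nexts.values())
        P[sa] = {s2: c / n for s2, c in nexts.items()}
        R[sa] = reward_sum[sa] / n
    return P, R


def q_value(P, R, V, sa, gamma):
    """One-step lookahead value of a state-action pair under estimates P, R."""
    return R[sa] + gamma * sum(p * V[s2] for s2, p in P[sa].items())


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Solve the estimated MDP; only actions seen in the data are considered."""
    states = {s for s, _ in P} | {s2 for d in P.values() for s2 in d}
    actions = defaultdict(list)
    for s, a in P:
        actions[s].append(a)
    V = {s: 0.0 for s in states}
    for _ in range(max_iter):
        delta = 0.0
        for s in states:
            if not actions[s]:
                continue  # never observed as a source state; value stays 0
            new_v = max(q_value(P, R, V, (s, a), gamma) for a in actions[s])
            delta = max(delta, abs(new_v - V[s]))
            V[s] = new_v
        if delta < tol:
            break
    policy = {
        s: max(actions[s], key=lambda a: q_value(P, R, V, (s, a), gamma))
        for s in states
        if actions[s]
    }
    return V, policy


# Hypothetical history: state = units on hand (0 or 1), action = units ordered.
history = [
    (0, 0, -5.0, 0),  # empty shelf, no order: stockout penalty
    (0, 1, -1.0, 1),  # empty shelf, order one unit: ordering cost
    (1, 0, 4.0, 0),   # sell the unit, do not replenish
    (1, 1, 3.0, 1),   # sell the unit and replenish
    (1, 1, 3.0, 1),
]

P, R = estimate_mdp(history)
V, policy = value_iteration(P, R, gamma=0.9)
print(policy)
```

A sketch like this only covers states and actions that appear in the historical data; a Reinforcement Learning component, as the abstract suggests, would be one way to handle problems where an explicit transition table is impractical.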