Empirical comparison of various reinforcement learning strategies for sequential targeted marketing

Naoki Abe; Edwin Pednault; Haixun Wang; Bianca Zadrozny; Wei Fan; Chid Apte

ICDM 2002

Conference paper

01 Dec 2002

Empirical comparison of various reinforcement learning strategies for sequential targeted marketing

Abstract

We empirically evaluate the performance of various reinforcement learning methods in applications to sequential targeted marketing. In particular, we propose and evaluate a progression of reinforcement learning methods, ranging from the "direct" or "batch" methods to "indirect" or "simulation based" methods, and those that we call "semi-direct" methods that fall between them. We conduct a number of controlled experiments to evaluate the performance of these competing methods. Our results indicate that while the indirect methods can perform better in a situation in which nearly perfect modeling is possible, under the more realistic situations in which the system's modeling parameters have restricted attention, the indirect methods' performance tend to degrade. We also show that semi-direct methods are effective in reducing the amount of computation necessary to attain a given level of performance, and often result in more profitable policies. © 2002 IEEE.

Conference paper