Department of Mathematics,
University of California San Diego
****************************
Math 288 - Probability
Ernst Presman
Central Economics and Mathematics Institute, Russian Academy of Sciences
A simple proof of an (Gittens) index theorem for graphs
Abstract:
We consider the problem which informally can be formulated as follows. Initially a finite set of independent trials is available. If a Decision Maker (DM) chooses to test a particular trial she receives a reward depending on the trial tested. As a result of testing a random finite set (possibly empty) of new independent trials is added to the set of available trials, and so on, but the total number of potential trials is finite. A DM knows the rewards and transition probabilities of all trials. On each step she can either stop testing or continue. Her goal is to select a testing order and a stopping time to maximize the expected total reward. This problem has a long history and is related to the Multi-armed Bandit Problem with independent arms. We prove that an index can be assigned to each possible trial, and the optimal strategy is to use on each step a trial with maximal index among those available. We give a simple procedure for constructing this index. This is joint work with Isaac Sonin.
Host: Ruth Williams
June 6, 2003
11:00 AM
AP&M 6438
****************************