DocumentCode :
928237
Title :
On finite memory solutions to the two-armed bandit problem (Corresp.)
Author :
Lakshmanan, K.B. ; Chandrasekaran, B.
Volume :
24
Issue :
2
fYear :
1978
fDate :
3/1/1978 12:00:00 AM
Firstpage :
244
Lastpage :
248
Abstract :
The least upper bound on the asymptotic proportion of the choice of the correct coin, achievable by {em expedient} finite-memory algorithms in certain two-armed bandit problems, is derived and schemes which achieve these bounds in a limiting sense are displayed. A deterministic automaton whose performance is close to optimal is also presented.
Keywords :
Automata; Decision procedures; Finite-memory methods; Stochastic automata; Autocorrelation; Automata; Context modeling; Information theory; Integral equations; Random variables; Reactive power; Stochastic processes; Time frequency analysis; Uncertainty;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/TIT.1978.1055854
Filename :
1055854
Link To Document :
بازگشت