مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning in infinite-horizon inventory competition with total demand observations

DocumentCode :

3488006

Title :

Learning in infinite-horizon inventory competition with total demand observations

Author :

Zeinalzadeh, A. ; Alptekinoglu, A. ; Arslan, Gurdal

Author_Institution :

Dept. of Electr. Eng., Univ. of Hawaii at Manoa, Honolulu, HI, USA

fYear :

2012

fDate :

27-29 June 2012

Firstpage :

1382

Lastpage :

1387

Abstract :

We consider single-period and infinite-horizon inventory competition between two firms that replenish their inventories as in the well-known newsvendor model. Normally customers have a preference for shopping in one firm or the other. A fixed percentage of them who encounter a stockout in the firm of their first choice, though, visits the other firm. This substitution behavior makes the firm´s replenishment decisions strategically related. Our main contribution is to introduce a simple learning algorithm to inventory competition. The learning algorithm requires each firm (a) to have the knowledge of its own critical fractile, which the firm can calculate using the values of its own per unit revenue, order cost, and holding cost; and (b) to observe its own total demand realizations. They do not necessarily know their true demand distributions. The firms need not even have any information about each other, beyond the implicit information encoded in their own total demand realizations affected by their competitors´ inventory decisions. In fact, the firms need not even be aware that they are engaged in inventory competition. We prove that the inventory decisions generated by the learning algorithm converge, with probability one, to certain threshold values that constitute an equilibrium in pure Markov strategies for an infinite-horizon discounted-reward inventory competition game.

Keywords :

Markov processes; costing; game theory; inventory management; learning (artificial intelligence); probability; Markov strategies; competitor inventory decisions; firm replenishment decisions; holding cost; infinite-horizon discounted-reward inventory competition game; learning algorithm; newsvendor model; order cost; probability; single-period inventory competition; total demand observations; total demand realizations; Convergence; Educational institutions; Games; Learning systems; Markov processes; USA Councils; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

American Control Conference (ACC), 2012

Conference_Location :

Montreal, QC

ISSN :

0743-1619

Print_ISBN :

978-1-4577-1095-7

Electronic_ISBN :

0743-1619

Type :

conf

DOI :

10.1109/ACC.2012.6315678

Filename :

6315678

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3488006