Title of article :
Exploration versus exploitation in assortment optimization with limited inventory and substitutable demand
Author/Authors :
Arhami, Omid Graduate School of Management and Economics - Sharif University of Technology - Tehran, Iran , Aslani, Shirin Graduate School of Management and Economics - Sharif University of Technology - Tehran, Iran , Talebian, Masoud Graduate School of Management and Economics - Sharif University of Technology - Tehran, Iran
Pages :
8
From page :
55
To page :
62
Abstract :
This study considers an online multi-period assortment optimization problem over multiple replenishment cycles where the seller chooses a subset from N substitutable products and decides the limited amount of each to order and sell at every period. The seller is constrained by a total inventory capacity, a cardinality constraint on the product variety (shelf space), and predetermined replenishment time intervals. The assortment selection is modeled as a Multi-armed bandit problem and the customers' choice is modeled by the MNL choice model. The objective is to optimize the revenue by learning the demand parameters and improve the offering composition at every period. In this novel approach, the offering and consequently the exploration-exploitation decision has two dimensions: the assortment and the inventory allocation. The present research develops a model and policy for learning and optimization that demonstrates good performance in numerical simulations. The results suggest that capacity constraint has a significant impact on the total profit of a seller who tries to learn the demand and the best inventory composition on the fly.
Keywords :
Multi-Armed Bandit (MAB) , Thompson sampling , multinomial logit choice model , computational modeling and simulation
Journal title :
Journal of Industrial and Systems Engineering (JISE)
Serial Year :
2020
Record number :
2630482
Link To Document :
بازگشت