Title :
A note on the relation between weak derivatives and perturbation realization
Author :
Heidergott, Bernd ; Cao, Xi-Ren
Author_Institution :
Hong Kong Univ. of Sci. & Technol., Kowloon, China
fDate :
7/1/2002 12:00:00 AM
Abstract :
Studies the relationship between two important approaches in perturbation analysis (PA)-perturbation realization (PR) and weak derivatives (WDs). Specifically, we study the relation between PR and WDs for estimating the gradient of stationary performance measures of a finite state-space Markov chain. We show that the WDs expression for the gradient of a stationary performance measure can be interpreted as the expected PR factor where the expectation is carried out with respect to a distribution that is given through the weak derivative of the transition kernel of the Markov chain. Moreover, we present unbiased gradient estimators.
Keywords :
Markov processes; gradient methods; matrix algebra; perturbation techniques; probability; finite state-space Markov chain; perturbation analysis; perturbation realization; stationary performance measures; transition kernel; unbiased gradient estimators; weak derivatives; Kernel; Mathematics; Performance analysis; State estimation; Time measurement; Timing;
Journal_Title :
Automatic Control, IEEE Transactions on
DOI :
10.1109/TAC.2002.800648