DocumentCode :
735496
Title :
Effect of within-sample choice distribution and sample size on the estimation accuracy of logit model
Author :
Minhui Zeng ; Ming Zhong ; Hunt, John Douglas
Author_Institution :
Intell. Transp. Syst. Res. Center, Wuhan Univ. of Technol., Wuhan, China
fYear :
2015
fDate :
25-28 June 2015
Firstpage :
304
Lastpage :
310
Abstract :
Within-Sample Choice Distribution and Sample size are important considerations in the estimation of logit model, but their effects on the estimation accuracy have not been systematically studied. Therefore, the objective of this paper is to provide an empirical examination to the above issues through a set of simulated choice datasets. In this paper, the utility function coefficients and alternative specific constants (ASCs) are specified as a prior. Then, assuming alternative attributes and error components follow a normal distribution, both revealed preference (RP) and stated preference (SP) synthetic choice datasets are simulated. Based on these simulated datasets, the utility coefficients and ASCs are re-estimated and compared with the original values specified as the prior. It is found that the utility coefficients can be re-estimated with reasonable accuracy, but the estimates of the ASCs are confronted with much larger errors. The Sum of Square Errors (SSEs) between the “original” and the estimated utility coefficients and ASCs using RP and SP datasets of varying sample size are calculated, plotted and the corresponding diminishing marginal return points are identified. Regarding within-sample choice distribution, study results show that, as the within-sample choice distribution becomes more balanced, the hit-ratio decreases. It appears that, when alternatives are chosen with similar frequency, choosing one alternative vs. another does not make much difference in terms of utility perceived by each decision-maker. Therefore, it is suggested that a population with varying socioeconomic characteristics be created and used in future studies.
Keywords :
estimation theory; normal distribution; ASCs; RP synthetic choice dataset; SP synthetic choice dataset; SSEs; alternative specific constants; error components; estimation accuracy; logit model estimation; normal distribution; revealed preference synthetic choice dataset; sample size; simulated choice datasets; socioeconomic characteristics; stated preference synthetic choice dataset; sum of square errors; utility function coefficients; within-sample choice distribution; Accuracy; Analytical models; Data models; Estimation; Mathematical model; Safety; Transportation; Estimation Accuracy; Logit Model; Sample Size; Within-Sample Choice Distribution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Transportation Information and Safety (ICTIS), 2015 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4799-8693-4
Type :
conf
DOI :
10.1109/ICTIS.2015.7232160
Filename :
7232160
Link To Document :
بازگشت