• DocumentCode
    735496
  • Title

    Effect of within-sample choice distribution and sample size on the estimation accuracy of logit model

  • Author

    Minhui Zeng ; Ming Zhong ; Hunt, John Douglas

  • Author_Institution
    Intell. Transp. Syst. Res. Center, Wuhan Univ. of Technol., Wuhan, China
  • fYear
    2015
  • fDate
    25-28 June 2015
  • Firstpage
    304
  • Lastpage
    310
  • Abstract
    Within-Sample Choice Distribution and Sample size are important considerations in the estimation of logit model, but their effects on the estimation accuracy have not been systematically studied. Therefore, the objective of this paper is to provide an empirical examination to the above issues through a set of simulated choice datasets. In this paper, the utility function coefficients and alternative specific constants (ASCs) are specified as a prior. Then, assuming alternative attributes and error components follow a normal distribution, both revealed preference (RP) and stated preference (SP) synthetic choice datasets are simulated. Based on these simulated datasets, the utility coefficients and ASCs are re-estimated and compared with the original values specified as the prior. It is found that the utility coefficients can be re-estimated with reasonable accuracy, but the estimates of the ASCs are confronted with much larger errors. The Sum of Square Errors (SSEs) between the “original” and the estimated utility coefficients and ASCs using RP and SP datasets of varying sample size are calculated, plotted and the corresponding diminishing marginal return points are identified. Regarding within-sample choice distribution, study results show that, as the within-sample choice distribution becomes more balanced, the hit-ratio decreases. It appears that, when alternatives are chosen with similar frequency, choosing one alternative vs. another does not make much difference in terms of utility perceived by each decision-maker. Therefore, it is suggested that a population with varying socioeconomic characteristics be created and used in future studies.
  • Keywords
    estimation theory; normal distribution; ASCs; RP synthetic choice dataset; SP synthetic choice dataset; SSEs; alternative specific constants; error components; estimation accuracy; logit model estimation; normal distribution; revealed preference synthetic choice dataset; sample size; simulated choice datasets; socioeconomic characteristics; stated preference synthetic choice dataset; sum of square errors; utility function coefficients; within-sample choice distribution; Accuracy; Analytical models; Data models; Estimation; Mathematical model; Safety; Transportation; Estimation Accuracy; Logit Model; Sample Size; Within-Sample Choice Distribution;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Transportation Information and Safety (ICTIS), 2015 International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4799-8693-4
  • Type

    conf

  • DOI
    10.1109/ICTIS.2015.7232160
  • Filename
    7232160