Mixed logit

Mixed logit

Mixed logit is a fully general statistical model for examining discrete choices. The motivation for the mixed logit model arises from the limitations of the standard logit model. The standard logit model has three primary limitations, which mixed logit solves: "It [Mixed Logit] obviates the three limitations of standard logit by allowing for random taste variation, unrestricted substitution patterns, and correlation in unobserved factors over time."[1] Mixed logit can also utilize any distribution for the random coefficients, unlike probit which is limited to the normal distribution. It has been shown that a mixed logit model can approximate to any degree of accuracy any true random utility model of discrete choice, given an appropriate specification of variables and distribution of coefficients."[2] The following discussion draws from Ch. 6 of Discrete Choice Methods with Simulation, by Kenneth Train (Cambridge University Press), to which the reader is referred for more details and citations. See also the article on discrete choice for information on how the mixed logit relates to discrete choice analysis in general and to other specific types of choice models.

Contents

Random taste variation

The standard logit model's "taste" cofficients, or β's, are fixed, which means the β's are the same for everyone. Mixed logit has different β's for each person (i.e., each decision maker.)

In the standard logit model, the utility of person n for alternative i is:

Uni = βnxn + εni

with

εni ~ iid extreme value

For the mixed logit model, this specification is generalized by allowing βn to be random. The utility of person n for alternative i in the mixed logit model is:

Uni = βnxni + εni

with

εni ~ iid extreme value
 \quad \beta_n \sim f(\beta_n | \theta)

where θ are the parameters of the distribution of βn's over the population, such as the mean and variance of βn.

Conditional on βn, the probability that person n chooses alternative i is the standard logit formula:

 L_{ni} (\beta_{n}) = \frac{e^{\beta_{n}X_{ni}}} {\sum_{j} e^{\beta_{n}X_{nj}}}

However, since βn is random and not known, the (unconditional) choice probability is the integral of this logit formula over the density of βn.

 P_{ni} = \int L_{ni} (\beta) f(\beta | \theta) d\beta

This model is also called the random coefficient logit model since βn is a random variable. It allows the slopes of utility (i.e., the marginal utility) to be random, which is an extension of the random effects model where only the intercept was stochastic.

Any probability density function can be specified for the distribution of the coefficients in the population, i.e., for fn | θ). The most widely used distribution is normal, mainly for its simplicity. For coefficients that take the same sign for all people, such as a price coefficient that is necessarily negative or the coefficient of a desirable attribute, distributions with support on only one side of zero, like the lognormal, are used.[3][4] When coefficients cannot logically be unboundedly large or small, then bounded distributions are often used, such as the Sb or triangular distributions.

Unrestricted substitution patterns

The mixed logit model can represent general substitution pattern because it does not exhibit logit's restrictive independence of irrelevant alternatives (IIA) property. The percentage change in the probability for one alternative given a percentage change in the mth attribute of another alternative is

 E_{nix_{nj}^m} = -\frac{x_{nj}^m} {P_{ni}} \int \beta^m L_{ni}(\beta) L_{nj}(\beta) f(\beta) d \beta = - x_{nj}^m \int \beta^m L_{nj} (\beta) \frac{L_{ni} (\beta)} {P_{ni}} f(\beta) d \beta

where β m is the mth element of β.[1] It can be seen from this formula that "A ten-percent reduction for one alternative need not imply (as with logit) a ten-percent reduction in each other alternative."[1] The relative percentages depend on correlation between the likelihood that respondent n will choose alternative i, L ni , and the likelihood that respondent n will choose alternative j, L nj , over various draws of β.

Correlation in unobserved factors over time

Standard logit does not take into account any unobserved factors that persist over time for a given decision maker. This can be a problem if you are using panel data, which represent repeated choices over time. By applying a standard logit model to panel data you are making the assumption that the unobserved factors that affect a person's choice are new every time the person makes the choice. That is a very unlikely assumption. To take into account both random taste variation and correlation in unobserved factors over time, the utility for respondent n for alternative i at time t is specified as follows:

Unit = βnXnit + εnit

where the subscript t is the time dimension. We still make the logit assumption which is that ε is i.i.d extreme value. That means that ε is independent over time, people, and alternatives. ε is essentially just white noise. However, correlation over time and over alternatives arises from the common effect of the β's, which enter utility in each time period and each alternative.

To examine the correlation explicitly, assume that the β 's are normally distributed with mean \bar{\beta} and variance σ2. Then the utility equation becomes:

 U_{nit} = (\bar{\beta} + \sigma \eta_{n}) X_{nit} + \varepsilon_{nit}

and η is a draw from the standard normal density. Rearranging, the equation becomes:

 U_{nit} = \bar{\beta} X_{nit} + (\sigma \eta_{n} X_{nit} + \varepsilon_{nit})
 U_{nit} = \bar{\beta} X_{nit} + e_{nit}

where the unobserved factors are collected in enit = σηnXnit + εnit. Of the unobserved factors, εnit is independent over time, and σηnXnit is not independent over time or alternatives.

Then the covariance between alternatives i and j is,

Cov(enit,enjt) = σ2(XnitXnjt)

and the covariance between time t and q is

Cov(enit,eniq) = σ2(XnitXniq)

By specifying the X's appropriately, one can obtain any pattern of covariance over time and alternatives.

Conditional on βn, the probability of the sequence of choices by a person is simply the product of the logit probability of each individual choice by that person:

 L_{n} (\beta_{n}) = \prod_{t} \frac{e^{\beta_{n}X_{nit}}} {\sum_{j} e^{\beta_{n}X_{njt}}}

since  \varepsilon_{nit} is independent over time. Then the (unconditional) probability of the sequence of choices is simply the integral of this product of logits over the density of β.

 P_{ni} = \int L_{n} (\beta) f(\beta | \theta) d\beta

Simulation

Unfortunately there is no closed form for the integral that enters the choice probability, and so the researcher must simulate Pn. Fortunately for the researcher, simulating Pn can be very simple. There are four basic steps to follow

1. Take a draw from the probability density function that you specified for the 'taste' coefficients. That is, take a draw from f(β | θ) and label the draw βr, for r = 1 representing the first draw.

2. Calculate Lnr). (The conditional probability.)

3. Repeat many times, for r = 2,...,R.

4. Average the results

Then the formula for the simulation look like the following,

 \tilde{P}_{ni} = \frac {\sum_{r} L_{n}(\beta^r)} {R}

where R is the total number of draws taken from the distribution, and r is one draw.

Once this is done you will have a value for the probability of each alternative i for each respondent n.

References

  1. ^ a b c Train, K. (2003) Discrete Choice Methods with Simulation
  2. ^ McFadden, D. and Train, K. (2000). “Mixed MNL Models for Discrete Response,” Journal of Applied Econometrics, Vol. 15, No. 5, pp. 447-470,
  3. ^ David Revelt and Train, K (1998). "Mixed Logit with Repeated Choices: Households' Choices of Appliance Efficiency Level," Review of Economics and Statistics, Vol. 80, No. 4, pp. 647-657
  4. ^ Train, K (1998)."Recreation Demand Models with Taste Variation," Land Economics, Vol. 74, No. 2, pp. 230-239.

Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • Discrete choice — In economics, discrete choice problems involve choices between two or more discrete alternatives, such as entering or not entering the labor market, or choosing between modes of transport. Such choices contrast with standard consumption models in …   Wikipedia

  • Discrete choice analysis — is a statistical technique. In these models the dependent variable is a binary variable. Instances of discrete choice analysis are probit, logit and multinomial models. They are applied in econometrics, marketing research and transportation mode… …   Wikipedia

  • List of statistics topics — Please add any Wikipedia articles related to statistics that are not already on this list.The Related changes link in the margin of this page (below search) leads to a list of the most recent changes to the articles listed below. To see the most… …   Wikipedia

  • Independence of irrelevant alternatives — (IIA) is an axiom of decision theory and various social sciences. The word is used in different meanings in different contexts. Although they all attempt to provide a rational account of individual behavior or aggregation of individual… …   Wikipedia

  • List of mathematics articles (M) — NOTOC M M estimator M group M matrix M separation M set M. C. Escher s legacy M. Riesz extension theorem M/M/1 model Maass wave form Mac Lane s planarity criterion Macaulay brackets Macbeath surface MacCormack method Macdonald polynomial Machin… …   Wikipedia

  • Choice modelling — attempts to model the decision process of an individual or segment in a particular context. Choice modelling may also be used to estimate non market environmental benefits and costs[1]. Well specified choice models are sometimes able to predict… …   Wikipedia

  • Choice model simulation — Although the concept choice models is widely understood and practiced these days, it is often difficult to acquire hands on knowledge in simulating choice models. While many stat packages provide useful tools to simulate, researchers attempting… …   Wikipedia

  • Joachim Prinz (Ökonom) — Joachim Prinz (* 1971) ist ein deutscher Ökonom. Er hatte im Sommersemester 2009 die Vertretungsprofessur für Betriebswirtschaftslehre mit Schwerpunkt Personalmanagement an der Universität Passau inne. Leben Prinz studierte von 1993 bis 1999… …   Deutsch Wikipedia

  • Generalized linear model — In statistics, the generalized linear model (GLM) is a flexible generalization of ordinary least squares regression. It relates the random distribution of the measured variable of the experiment (the distribution function ) to the systematic (non …   Wikipedia

  • Exponential family — Not to be confused with the exponential distribution. Natural parameter links here. For the usage of this term in differential geometry, see differential geometry of curves. In probability and statistics, an exponential family is an important… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”