3 DISCRETE CHOICE MODELLING AND TRANSPORT MODE CHOICE
3.1 Multinomial Logit and Conditional Logit Models
Discrete choice models such as binary logit and multinomial logit are probably the most widely used methods for mapping the freight mode choice process (Greene and Hensher, 2013; Arunotayanun and Polak, 2011; Train and Wilson, 2008; Rich et al., 2009, 2011; Jong and Ben- Akiva, 2007; Danielis et al., 2005; Bergkvist., 2001; Sayed et al., 2000; Nijkamp et al., 1999; Abdelwahab et al., 1998). Typical multinomial logit forms are (Liao, 1994):
ππππ(π¦ = 1) = πβ π½Μ1ππ₯π π π=1 1+πβππ=1π½Μ1ππ₯π+πβππ=1π½Μ2ππ₯π , ππππ(π¦ = 2) = πβππ=1π½Μ2ππ₯π 1+πβππ=1π½Μ1ππ₯π+πβππ=1π½Μ2ππ₯π ππππ(π¦ = 3) = 1 1+πβππ=1π½Μ1ππ₯π+πβππ=1π½Μ2ππ₯π β― ππππ(π¦ = π) = πβ π½ Μπππ₯π π π=1 1+ βπ½β1πβππ=1π½Μπππ₯π π=1 (3.1)
43
These approaches model the choice or market shares for the available transport modes. The functional forms are based on the utility maximizing choice process of the shipper. This approach is of a behavioural nature. The decision taker is assumed to base the choice on the characteristics of the offered transport services, such as delivery time, reliability and frequency of service. When making their decision not all possible attributes are included in the model due to random taste and variation. Thus, a random error is introduced into the model. All this is captured in the random utility theory (McFadden, 1973).
The logit model was first derived by Luce (1959), and it is the most widely used model because of the fact that the choice probabilities take a closed form and are readily interpretable. In the MNL, the probability that the choice outcome π¦π is alternative π from all alternatives available to the individual can be expressed as the logit formula:
π(π¦π = π) = πππ = exp (π₯ππ½π)
βπ½π=0exp (π₯ππ½π)
for π = 0, β¦ , π½ (3.2)
The vector π½π is a vector of coefficients specific to the π th alternative, π₯π is a vector of characteristics specific to the π th individual, π¦π indicates the choice made. To identify the model, we assume without loss of generality that π½0= 0. The model can also be written in terms of the odds for each pair of options π and π:
Ξ©ππ|ππ = exp(π₯π[π½πβ π½π]) (3.3)
This equation shows that the odds (Ξ©ππ) of choosing π versus π do not depend on any other additional alternatives on the choice set; the odds are determined only by the coefficient vectors for π and π, π½π and π½π. Assuming that unobserved utilities for each alternative are independently
44
and identically distributed (IID), and are described by the Gumbel distribution, produces the MNL model (Domencich and McFadden, 1975).
The utility functions are linear in the parameter forms and the parameter π₯ is related to the variance of π (Ben-Akiva and Lerman, 1985). Thus, for the MNL model, a scale parameter π½2= π2/6π2 . The key assumption of the MNL model is that the errors are independent of each other. This independence means that the unobserved portion of utility for one alternative is unrelated to the unobserved portion of utility for another alternative.
If one thinks that the unobserved portion of utility of one alternative is correlated with that of other alternatives, then there are three options (Train, 2003): (1) use a different model that allows for correlated errors, such as the nested logit or mixed logit models, (2) re-specify the representative utility so that the source of the correlation is captured explicitly and thus the remaining errors are independent, or (3) use the logit model under the current specification of representative utility, considering the model to be an approximation.
The conditional logit (CL) model is closely related to the better-known MNL model, but it derives from different behavioural assumptions and is estimated in a different form. The CL model is similarly defined when choice-specific data are available (Maddala, 1983; Train, 2003; Garcia- Menendez et al., 2004; Hansen, 2011). Both multinomial logit and conditional logit are used to analyse the choice of an individual among a set of π alternatives. The main distinction between the two can be put very simply: the MNL model focuses on the individual as the unit of analysis and uses the individual's characteristics as explanatory variables; in contrast, the CL model focuses on the set of alternatives for each individual and the explanatory variables are characteristics of those alternatives. Hence, the CL model is appropriate for a different class of
45
models in which a choice among alternatives is treated as a function of the characteristics of the alternatives, rather than (or in addition to) the characteristics of the individual making the choice (characteristics of the chooser) which the MNL model currently uses. The models do, however, share a common likelihood function.
Using the properties of the Gumbel distribution, the probability that individual π chooses alternative π from among the choices in the choice set ππ is
π(π¦π = π) = πππ = exp (π₯πππ½)
βπ½πβππexp (π₯πππ½) (3.4)
where π₯ππ is a vector of attributes specific to the π th alternative as perceived by the π th individual. It is assumed that there are π choices in each individualβs choice set, ππ.
Within the CL model, the parameters are assumed to be constant across the alternatives. As a result, the CL model can be used to predict the probability that an individual will choose a previously unavailable alternative, given knowledge of π½ and the vector π₯ππ of choice-specific characteristics. Consequently, conditional logit models are often used when the number of possible choices is large. The CL model is explained in detail in the articles by McFadden (1973) Boxall and Adamowicz (2002) and Shen and Saijo (2007).
The MNL model was the most widely used modelling methodology to measure shippersβ mode choice behaviour in the early stage of freight transport modelling (Jong and Ben-Akiva, 2007; Yannis and Golias, 2005; Catalani, 2001; McGinnis et al., 1981; Nam, 1997; Wilson et al., 1986). However, Oum (1990) gave an early warning against the use of MNL models estimated at the aggregate level, because of the many restrictions this model type imposes on the
46
parameters (e.g. IIA). Nevertheless, because of the relative ease of obtaining aggregate data, the MNL model specification is still the one used most in modelling freight mode choice in practice (Jong et al., 2012). A few studies have explored the estimation of a freight transport demand function using a CL model (Train, 2003; Garcia-Menendez et al., 2004).