• No results found

The Entropy Maximising Facility Location Problem

Chapter 4 Metropolitan container terminals

4.2 Methodology

4.2.2 The Entropy Maximising Facility Location Problem

The goal is to determine the most likely 𝑝 IMT locations based on available information on transport budget, cargo distribution patterns, transport costs of cargo distribution, candidate IMT locations with important features such as cargo handling capacities. Here, the entropy maximising principle is used to combine these diverse pieces of information to find the most likely 𝑝 IMT locations and the least biased probability distribution of located IMT usage. The general entropy maximisation framework comprises an entropy objective function and a set of constraints representing the available information. The entropy function comprises the possible ways that a given state of the system can occur:

𝐸 = 𝑍!

βˆπ‘–βˆˆπ’ͺβˆπ‘—βˆˆπ’Ÿπ‘‹π‘–π‘—! (βˆπ‘‘βˆˆπ’― 𝑉𝑖𝑑𝑗!)

(4.6)

where 𝐸 is the number of possible ways that the state (𝑉𝑖𝑑𝑗, 𝑋𝑖𝑗) such that βˆ‘π‘–βˆˆπ’ͺβˆ‘π‘—βˆˆπ’Ÿπ‘‹π‘–π‘— +

βˆ‘π‘–βˆˆπ’ͺβˆ‘π‘‘βˆˆπ’―βˆ‘π‘—βˆˆπ’Ÿπ‘‰π‘–π‘‘π‘— = 𝑍 can occur. The important question is: based on what we know about the system, which of the many states (values of 𝑋𝑖𝑗 and 𝑉𝑖𝑑𝑗 ) is most likely to represent the system? The principle of entropy maximisation is simply asking us to select the states with the maximum number of ways of occurring and consistent with all we know about the system. In general, we seek the values of 𝑋𝑖𝑗 and 𝑉𝑖𝑑𝑗 that maximises 𝐸 and also satisfy all the constraints representing the available information about the system. Statistically, the values of 𝑋𝑖𝑗 and 𝑉𝑖𝑑𝑗 that maximises 𝐸 also maximises ln𝐸. However, it is easier to maximise ln𝐸 so we maximised ln𝐸 instead. Thus, Equation (4.6) reduces to:

ln𝐸 = ln𝑍! βˆ’ βˆ‘ βˆ‘ ln(𝑋𝑖𝑗!)

π‘—βˆˆπ’Ÿ π‘–βˆˆπ’ͺ

βˆ’ βˆ‘ βˆ‘ βˆ‘ ln(𝑉𝑖𝑑𝑗!)

π‘—βˆˆπ’Ÿ π‘‘βˆˆπ’― π‘–βˆˆπ’ͺ

(4.7)

It has been shown in Chapter 3 that the term ln𝐸 has special meaning and desirable properties and Boltzmann (1972) referred to it as entropy. Thus, maximising (4.7) can equivalently be stated as maximising entropy. One of the desirable properties of ln𝐸 (entropy) is that its corresponds to the amount of missing information (or uncertainty or entropy) in the constructed of the probability distribution and that the maximum amount of missing information is attained when there is no known information about the system under investigation. These properties are expressed in propositions (4.1), (4.2) and (4.3) below:

Proposition 4.1: If the set β„³ = {𝑑: 𝑑 ∈ {0, 𝒯}} is set of modal alternatives, where {0} is the index for road alone and the set 𝒯 is the set of indices of IMTs forming the intermodal transport alternatives. Also, let the set of origin-destination movements β„› = {π‘Ÿ = (𝑖, 𝑗): 𝑖 ∈ π’ͺ, 𝑗 ∈ π’Ÿ}

then set the of elemental alternatives 𝒲 = {𝑀 = (π‘Ÿ, 𝑑): π‘Ÿ ∈ β„›, 𝑑 ∈ β„³} with cardinality 𝑛 =

|𝒲|. If the probability of each elemental alternative 𝑀 ∈ 𝒲 is defined by Equation (4.8):

𝑃𝑀 = 𝑍𝑀

𝑍 ; βˆ€π‘€ ∈ 𝒲 (4.8)

Then, equation (4.7) or entropy can be expressed as:

ln𝐸 = βˆ’ 𝑍 βˆ‘ 𝑃𝑀ln𝑃𝑀

π‘€βˆˆπ’²

(4.9)

Proof 4.1: By definition, equation (4.7) can be simplified as:

In𝐸 = ln𝑍! βˆ’ βˆ‘ ln𝑍𝑀!

π‘€βˆˆπ’²

Applying Stirling's approximation, the above equation simplifies to:

ln𝐸 = 𝑍(ln𝑍 βˆ’ 1) βˆ’ βˆ‘ 𝑍𝑀(ln𝑍𝑀 βˆ’ 1)

π‘€βˆˆπ’²

(4.10)

Substituting Equation (4.8) into (4.10) and performing some algebraic manipulation we have:

ln𝐸 = βˆ’ 𝑍 βˆ‘ 𝑃𝑀ln𝑃𝑀

π‘€βˆˆπ’²

(4.11)

Proposition 4.2: In the absence of any other information about the freight system, maximising equation (4.11) produces uniform probability distributions of modal flows:

𝑃𝑀 = 1

𝑛 ; βˆ€π‘€ ∈ 𝒲 (4.12)

with corresponding maximum entropy:

𝐻 = In𝐸̃ = 𝑍ln(𝑛) (4.13)

where 𝑛 is the cardinality of the set 𝒲

Proof 4.2: Now if we assume there is no information available other than obeying the normalisation axiom of probability:

βˆ‘ 𝑃𝑀

π‘€βˆˆπ’²

= 1 (4.14)

then from equation (4.11) the first order condition for maximum ln𝐸 with respect to 𝑃𝑀 and subject to (4.14) satisfy the following equation:

βˆ’π‘ln(𝑃𝑀) βˆ’ 1 βˆ’ πœ‘ = 0; βˆ€π‘€ ∈ 𝒲 (4.15)

where πœ‘ is the Lagrangian multiplier associated with constraint (4.14). Solving for 𝑃𝑀 in (4.15) by enforcing constraint (4.14) we have:

𝑃𝑀 = 1

𝑛 ; βˆ€π‘€ ∈ 𝒲

and substituting the above into Equation (4.11) the maximum entropy can be computed:

𝐻 = 𝑍ln(𝑛)

Proposition 4.3: The entropy (or amount of missing information (𝐻) in the probability distributions) constructed based on any amount of available information about the system cannot be greater than the entropy in equation (4.13). That is 𝐻 ≀ 𝑍ln(𝑛)

Proof 4.3: Define a convex function πœ™(π‘₯) = π‘₯ln(π‘₯); βˆ€π‘₯ β‰₯ 0. Following Jensen’s inequality, the following must hold:

Applying the definition of the convex function πœ™ to the term on the left-hand side of Equation (4.16) and using Equations (4.11) we have:

βˆ’ 1

Also, applying the definition of πœ™ to the term on the right-hand side of (4.16) we have:

βˆ’πœ™ (1

It therefore follows from equation (4.16) that

𝐻 ≀ 𝑍ln(𝑛) (4.17)

The result in equation (4.17) is intuitive since it implies that the more information we have, the less entropy or uncertainty we have about the resulting probability distribution and vice versa.

In other words, on average, the amount of missing information (entropy) about the system under investigation is never increased by learning something about it. Equation (4.12) simply states that applying the principle of maximum entropy to the MCTM with no evidence to suggest why a particular modal alternative should be preferred more than the others will result in a uniform probability distribution. The next section presents the information available in the form of constraints to form the entropy maximising facility location problem (EMFLP).

4.2.3 Available evidence as constraints

For the purpose of this exercise, the evidence available are summarised as follows:

1.Budget constraint: It is assumed that the transport budget 𝑐 is known. This evidence is added as constraint (4.18). The first component captures the weighted cost of using intermodal transport and the second captures the weighted cost of using road alone transport (e.g. truck only), with the sum not exceeding the total allocated transport budget.

βˆ‘ βˆ‘ βˆ‘ 𝑐𝑖𝑑𝑗𝑉𝑖𝑑𝑗

π‘—βˆˆπ’Ÿ π‘‘βˆˆπ’― π‘–βˆˆπ’ͺ

+ βˆ‘ βˆ‘ 𝑐𝑖𝑗𝑋𝑖𝑗

π‘—βˆˆπ’Ÿ π‘–βˆˆπ’ͺ

≀ 𝑐 (4.18)

2. Conservation of cargo flow constraint. Information on the distribution of origin-destination flows of cargo (π‘žπ‘–π‘—) by modes is added as constraint (4.19). It ensures that for each origin-destination pair, the sum of cargo by all available modes equals the total cargo associated with this origin-destination pair.

βˆ‘ 𝑉𝑖𝑑𝑗

π‘‘βˆˆπ’―

+ 𝑋𝑖𝑗 = π‘žπ‘–π‘—; βˆ€ 𝑖 ∈ π’ͺ, 𝑗 ∈ π’Ÿ (4.19)

3. Definitional constraint. The information on the required number of IMTs (𝑝) to locate is presented by constraint (4.20). It ensures that only the required number of IMTs are located.

βˆ‘ π‘Œπ‘‘

π‘‘βˆˆπ’―

= 𝑝 (4.20)

4. Capacity constraint. The cargo handling capacity limit of the each IMT is captured in constraints (4.21) and guarantees that no located IMT exceeds its cargo handling capacity. The constraints also ensure that only open IMTs are used.

βˆ‘ βˆ‘ 𝑉𝑖𝑑𝑗

π‘—βˆˆπ’Ÿ π‘–βˆˆπ’ͺ

≀ π‘Œπ‘‘π‘π‘‘ ; βˆ€π‘‘ ∈ 𝒯 (4.21)

Finally, the entropic objection function in (4.7) can be simplified by applying Stirling's approximation to the factorial terms, ignoring the constant term, ln𝑍! as it does not influence the optimisation process:

ln𝐸~ βˆ‘ βˆ‘ βˆ‘ 𝑉𝑖𝑑𝑗(1 βˆ’ ln𝑉𝑖𝑑𝑗)

π‘—βˆˆπ’Ÿ π‘‘βˆˆπ’― π‘–βˆˆπ’ͺ

+ βˆ‘ βˆ‘ 𝑋𝑖𝑗(1 βˆ’ ln𝑋𝑖𝑗)

π‘—βˆˆπ’Ÿ π‘–βˆˆπ’ͺ

(4.22)

Once the available information are converted to constraints (4.18-4.21), the EMFLP is presented as follows:

EMFLP ∢ Max Ξ› = βˆ‘ βˆ‘ βˆ‘ 𝑉𝑖𝑑𝑗(1 βˆ’ ln𝑉𝑖𝑑𝑗)

π‘—βˆˆπ’Ÿ π‘‘βˆˆπ’― π‘–βˆˆπ’ͺ

+ βˆ‘ βˆ‘ 𝑋𝑖𝑗(1 βˆ’ ln𝑋𝑖𝑗)

π‘—βˆˆπ’Ÿ π‘–βˆˆπ’ͺ

Subject to constraint (4.18) to (4.21) and the following integer and non-negativity constraints:

π‘Œπ‘‘ ∈ {0,1} ; 𝑑 ∈ 𝒯 (4.23)

𝑉𝑖𝑑𝑗 β‰₯ 0; 𝑋𝑖𝑗 β‰₯ 0 ; βˆ€π‘‘ ∈ 𝒯; βˆ€ 𝑖 ∈ π’ͺ; 𝑗 ∈ π’Ÿ (4.24)