Surge Pricing and Dynamic Matching for Hotspot Demand Shock in Ride-Hailing Networks
4.2 Model and Problem Formulation
4.2.1 Hotspot Demand Shock
Hotspot Demand Shock. The hotspot demand shock has a magnitude, Λ(p) ≤ ¯Λ, a decreas-ing function of the rider price charged at the hotspot. We consider the shock duration to be deterministic or exponential with mean ¯T , T ∼ exp(1/ ¯T ).
Matching Decision. The platform offers wage surge signal to drivers within distance ut, the wage surge range, at time t. The offered wage within ut should be high enough to incentivize drivers to reposition to the hotspot. The surge duration is denoted by τ , which can be the same as the demand shock duration T or longer.
Drivers can be matched if they are in the rider patience zone—within distance θ from the hotspot. Therefore the actual matching rate at time t is the sum of instantaneous driver available rate within distance θ and the driver entering rate at distance θ. Let φxt be the matching rate at time t. We have
xt= min{ut, θ} + Z t
0
1(us≥ θ + t − s)ds, t ∈ [0, T ], (4.1) where the first term is the instant matching rate and the second term counts the entering rate through the θ-zone. To get the second term, refer to Figure 4.1 (a). First note that drivers entering the θ-zone at time t may be those that start repositioning at time s and distance t − s from the θ-zone, for s ∈ [0, t], which is depicted by the −45 degree dashed line; then integrating over [0, t] the time ds when the wage surge range at time s is larger than θ + t − s (the orange solid segments on the −45 degree dashed line) yields this entering rate.
!
Figure 4.1: Derivation of xt
Notice from Figure 4.1 (a) that, in the general case, xt involves summing the time segments before t where u is above or on the −45 degree line, which can be mathematically cumbersome. To simplify the exposition without loss of generality, we focus on the control policies where u is continuous and does not decrease too fast so that xthas a straightforward form. This is formalized by the following assumption and proposition.
Assumption 4.1. ut is continuous in t and ˙ut≥ −1, ∀t ≥ 0.
Under Assumption 4.1, we can derive xtgiven us, s ∈ [0, t] in the following proposition.
(All proofs are in the Appendix.)
Proposition 4.1. Given the surge range profile us≥ 0, s ∈ [0, t] and assuming Assumption 4.1, the matching rate at time t is given by
xt=
is the “delay of control” at time t—the state at time t is determined by the control δu(t) time units ago. When {ut−s≥ θ + s} = ∅, we follow the convention that δu(t) = 0.
Figure 4.1 (b) illustrates the derivation of xtunder Assumption 4.1. When ˙ut≥ −1, ∀t ≥ 0, the integral in (4.1) involves at most one time segment of length δu(t) instead of several disconnected time segments. As a result, xt is only determined by the control δu(t) time units ago.
The following corollary of Proposition 4.1 gives the derivative of x as a function of u and its derivative.
Corollary 4.1. Under Assumption 4.1, the derivative of the matching rate at time t is given by
The third case in (4.5) captures the monotonic relationship between ˙u and ˙x (with time delay) and is summarized in Table 4.1.
˙
u
Figure 4.2: Exemplary control and state trajectories
Figure 4.2 shows exemplary control and state trajectories that contain all three cases in Proposition 4.1 and Corollary 4.1. The delayed effect of u on x, xt= ut−δu(t), is depicted by the three red dashed squares as examples. To distinguish the timelines of control u and state x and avoid ambiguity, we use tu and tx respectively, instead of t, when necessary.
Let tx be the timeline of x (so xtx is an explicit form of xt), then let
Using Lemma 4.1, we can prove the following key lemma which relates the integral of functions of utu and xtx.
4.2.2 Rider Price and Driver Wage
Let p0 and w0 be the constant rider price and driver wage at non-hotspot locations, respec-tively. We assume the platform does not change non-hotspot pricing at any time. Next we consider the pricing at the hotspot.
Hotspot rider price pt is a function of the matching rate, pt = p(xt), where p(x) is the highest price to get φx ≤ ¯Λ hotspot demand:
Λ(p) = φx ⇒ p(x) = Λ−1(φx). (4.9)
At time t, the platform’s hotspot revenue rate is therefore φp(xt)xt.
Let wt(y) be the incentive compatible (IC) hotspot wage at time t that makes a (marginal) driver at distance y indifferent between staying local and repositioning to the hotspot. The wage is quoted when the driver starts to reposition but is paid when the driver is matched.
When the IC condition—hence drivers’ repositioning decision—is time independent (which is true under deterministic or exponential surge duration), the IC wage is also time indepen-dent, denoted by w(y). Thereafter we are restricted to w(y) without explicit notification.
We consider two classes of wage quoting policies: broadcast and personalized message.
Under the broadcast policy, a common wage is quoted to all drivers (but paid individually upon matching). Given that the platform chooses wage surge range ut and the IC wage w(y) is increasing in y, all drivers that start repositioning at t will be paid w(ut) if matched eventually. Hence the platform’s wage payment rate at time t is
φWtb= φ
Under time-independent wage w(y) and using Proposition 4.1, we can show (DETAILS can be a lemma) that (4.10) is simplified as
φWtb = φ Z xt
0
w(ut−(y−θ)+)dy. (4.11)
Note that if all repositioning drivers can be matched (which is true under deterministic shock duration), we may adopt the wage-payable rate fWtb = w(ut)ut. In this case the common wage w(ut) is IC to the marginal drivers at ut but overpaying drivers at x < ut.
Under the personalized message policy, each driver at x is quoted his/her own IC wage w(x). Hence the platform’s wage payment rate at time t is
φWtp = φ
Z ut∧θ 0
wt(y)dy + Z t
0
1(us≥ θ + t − s)ws(θ + t − s)ds
. (4.12)
Under time-independent wage w(y) and using Proposition 4.1, we can show (DETAILS can be a lemma) that (4.12) is simplified as
φWtp = φ Z xt
0
w(x)dx. (4.13)
4.2.3 Sequence of Events
In this continuous time model, the sequence of events is as follows.
(1) Platform determines wage surge range ut at time t.
(2) Platform quotes surged wage to drivers at u ∈ [0, ut] at time t using broadcast or personalized message policy.
(3) Drivers within [0, ut] reposition toward the hotspot and are matched immediately (if within [0, θ]) or later (if within (θ, ut] by the end of the surge duration).
(4) Matching rate φxtrealized (xt yielded by us, 0 ≤ s ≤ t through (4.1)) (5) φxt riders are matched and charged total price φp(xt)xt;
φxt drivers are matched and paid total wage φWtb or φWtp.