Structure of the maximum damage attack - Optimal Control of Mobile Malware Epidemics

Whether in practice the worm can indeed inflict the maximum damages developed in this chap- ter depends on implementability of the optimal strategies. Specifically, if the optimal policies that inflict the maximum damage are complex to execute, then the worm may not be able to per- form them since they are limited by the capabilities of their resource constrained hosts as well. Inauspiciously though, we show that optimal attack strategies follow simple structures (theorems 4.3.1,4.3.2) which make them conducive to implementation. Fig. 4.2 provides visualization of the theorems.

Recall that one of the basic trade-offs that the attacker were dynamically faced with was the perfect timing to kill an infective node. Specifically, should an attacker kill a node as soon as it is infected so as to have claimed a casualty and secured a large damage on the network? Theo- rem 4.3.1 states the opposite:

Theorem 4.3.1. Consider an optimal solution pair(ν, u)that jointly maximizes the worm’s damage function in(4.1.3)subject to the constraints in(4.1.4a)and(4.1.4b); thenν(t)has the following characteristics:

∃t1∈[0. . . T)such thatν(t) = 0for0< t < t1andν(t) =νmaxfort1< t < T.

In words, an optimalν(.)is ofbang-bangform, that is, it possesses only two possible values νmaxand0,and switches abruptly between them. It has at most one such jump, which necessarily culminates atνmax.Thus, the theorem says that although killing a node early on would ensure partial damage, the overall damage is more if this decision is deferred until toward the end of the attacking period despite the risk of recovery of the infective node by the system. Specifically, at the start of the outbreak, number of susceptibles is high and infective nodes can be used to further

propagate the infection. As time passes by, the level of susceptibles drops due to both spread of infection and immunization effort by the system. At a certain threshold, the risk of recovery of the infective nodes in the remaining time outweigh the potential benefit by spreading the infection. At this point, whose exact value depends on the parameters of the case, the malware starts killing the nodes with maximum possible rate. This will ensure that an infective nodes are maximally used for spread of the infection and for attacker’s malicious activities.

In order to present the result for optimalu,we need to introduce an extra assumption. Recall that bothb(I)andq(S)satisfyb(0) =q(0) = 0,andb(I), q(S)are increasing functions ofI, Sfor I, S∈[0. . .1].Assume further that there exist positive constantsˆbandqˆsuch that

∀I, S∈[0. . .1], b(I)≥ˆbIandq(S)≥qS.ˆ (4.3.1)

Now, considering the supremum of such constants, we assume to have:

b+ ˆq≥βumax (4.3.2)

βumax is the maximum rate of the spread of the infection, and intuitively, the above condition describes the scenario in which the recovery rate (healing + immunization) is larger than the maximum rate of the spread of the infection. We will refer to this assumption as fast-recovery regime assumption.

Theorem 4.3.2. Assuming fast-recovery regime, any optimal u(t)for the case of strictly convexh(u)

consists of the followingphases:

1. u=umaxon0< t≤t0< Tfor somet0≥0;

2. ustrictly and continually decreases ont0< t≤t1< Tfor somet1≥t0; 3. u=uminont1< t≤T.

For the case of a concaveh(u),any optimaluconsists of only phase 1 followed by phase 2, i.e.,u=umax until timet0and subsequentlyu=umin.

Although theorem 4.3.2 is presented assuming fast-recovery regime, our numerical results show that these structural results hold even when this assumption is relaxed (§4.5).

According to theorem 4.3.2, optimal media scanning rate and malware transmission rates is always a non-increasing function of time. Specifically, the most intense (battery-consuming) spreading effort of malware should take place at the start of the outbreak. Intuitively, this is because initially the number of susceptibles is high, and hence an infective contacts more susceptibles by using the sameuinitially than later. Moreover, if a node is infected earlier, it will have more time to further propagate the infection. Subsequently, owing to the overall energy limita- tions, as the level of susceptibles as potential preys drop, infectives should reduce their media scanning rates so as not to exhaust their spreading battery budget. The structure of an optimal uturns out to subtly depend on whether thehfunction is strictly convex or concave. We can have an intuitive explanation for this phenomenon: a slight reduction in the value ofuresults in a higher decrease in the instantaneous power for a strictly convexh(u)than for a concaveh(u). Therefore, in the case of strictly convexh(u),it is beneficial to reduceucontinuously (instead of keepinguatumaxbefore a threshold time) and prolong the battery lifetime for malicious activities of the malware.

In summary, theorems 4.3.1 and 4.3.2 provide the joint optimal attack as follows: initially, the highest effort of the malware is focused on spreading the malware and amassing infectives without killing any. Subsequently, the reverse course of action is taken: battery is used at lower rates, and at a threshold time, the amassed nodes are slaughtered at the highest rate which lasts till the end of the interval.

Note that the optimal killing policy (ν), as well as the media scanning rate of the infectives (u) for the case of concaveh(u),are completely specified by the (only possible) jump points. Also, the optimal media scanning strategy (u) for a strictly convex hcan be simply divided into at most three phases, characterized by at most two time epochs. Therefore, no continuous global coordination of attack is necessary. In particular, the threshold times can be computed once an

estimate of the parameters of the network (β, q(S),b(I), C) is made, and can be subsequently incorporated into the code of the malware. Given the flexibility provided by software-driven devices, the infective nodes can subsequently execute these strategies without coordinating any further among themselves or with any central entity. The transition times can be determined by solving a system of differential equations, as described in previous sections. Such systems can be solved very fast due to the existence of efficient numerical algorithms for solving differential equations, and the computation time is constant in that it does not depend on the number of nodes N.Note also that our algorithms do not require any local or global information as time progresses and only the initial information is sufficient to determine the decision of infective nodes for the entire interval.

In practice, due to the drifts in local clocks, different infectives may slow down their transmission rates and start the killing at slightly different times. Our simulations presented in§4.5 reveal that the overall damages are robust to clock drifts for our dynamic optimal policies, yet another unfavorable news from the defense point of view. In the next section we provide the proofs for both of the theorems.

In document Optimal Control of Mobile Malware Epidemics (Page 73-76)