Complexity Analysis of Bit Weaving - Equivalent Transformation Techniques

Part I Equivalent Transformation Techniques

5.4 Complexity Analysis of Bit Weaving

There are two computationally expensive stages to bit weaving:¿nding the minimal cross-free partition and bit merging. For analyzing both stages, letwbe the number of bits within a rule predicate, and letnbe the number of rules in the input. We show that bit merging’s worst case time complexity isO(w×n2 lg 3₎_{(where lg 3}₌_log

23), which makes bit weaving the¿rst polynomial-time algorithm with a worst-case time complexity that is independent of the number of¿elds in the input classi¿er.

Finding a minimal cross-free partition is composed of a loop ofnsteps. Each of these steps checks whether or not the adding the next rule to the current partition will introduce a cross pattern; this requires a linear scan comparing the next rule to each rule of the current partition. Each comparison takesΘ(w)time. In the worst case, we get scanning behavior similar to insertion sort which requiresΘ(wn2₎_time andΘ(wn)space.

We now analyze the complexity of bit merging, the most expensive stage of bit weaving. The key to our analysis is determining how many ternary covers are generated from our input ofnrules. This analysis is complicated by the fact that the generation of ternary covers proceeds in rounds. We sidestep this complication by observing that if the total number of ternary covers generated in all rounds is f(n), then the total space required by bit merging isO(w f(n))and the total time required by bit merging isO(w(f(n))2_{). The time complexity follows because in the worst} case, each ternary cover is compared against every other ternary cover to see if a new ternary cover can be created. This is an overestimate since we only compare the ternary covers generated in each round to each other, and we only compare ternary covers within the same pre¿x chunk to each other. We now show that the total number of ternary covers generated isO(nlg 3₎_{which means that bit merging} has a worst case time complexity ofO(wn2 lg 3₎_{and a worst case space complexity} ofO(wnlg 3₎_. 0∗∗∗0⇒ 00000 00010 00100 00110 01000 01010 01100 01110 Table 5.2 The output of bit merging determines the input.

Based on Lemma 5.3, we restrict our attention to individual pre¿x chunks (where all the rules have the same decision) since we never merge rules from different pre¿x chunks. Furthermore, based again on Lemma 5.3, we assume that all input rules end with a 0 or 1 by eliminating all the∗’s at the right end of all rules in this pre¿x chunk. We now perform our counting analysis by starting with the output of bit merging for this pre¿x chunk rather than the input to bit merging.

5.4 Complexity Analysis of Bit Weaving 55

First consider the case where we have a single output rule with b∗’s such as 0∗∗∗0. We observe that the number of input rules in this pre¿x chunk with the same decision must be exactly 2b_{because the initial input is generated by the partial}

list minimization algorithm which only generates classi¿ers with pre¿x rules. Thus, the original input rules could not include non-pre¿x ternary rules like 000∗0. We illustrate this observation in Table 5.2 where we list all 8 input rules that must exist to produce the single output of 0∗∗∗0.

Pass 0 1 2 3 000_∗0 00000 001_∗0 00_∗∗0 00010 010_∗0 01_∗∗0 00100 011_∗0 00110 00_∗00 00∗10 0∗0∗0 0∗∗∗0 01∗00 0∗1∗0 01000 01∗10 01010 0∗000 01100 0∗010 0∗∗00 01110 0∗100 0∗∗10 0∗110 (3 0 ) ×23−0(3 1 ) ×23−1(3 2 ) ×23−2(3 3 ) ×23−3 Ternary Covers

Table 5.3 The set of all ternary covers generated per round

We now count the total number of ternary covers that are generated during each round of bit merging. If we consider the input set of rules as round 0, we observe that the number of ternary covers at roundkis exactly(b_k)×2b−k _{for 0}_≤_k_≤_b_.

Table 5.3 illustrates this property for the bit merging output 0∗∗∗0. If we sum over each round, the total number of ternary covers is exactly∑b

k=0

(_b

)

2b−k₌₃b_{by the}

binomial theorem. Since the input sizen=2b_{, the total number of ternary covers is}

3b_{= (2}b₎lg3₌_nlg 3_.

We now consider the case where the output for a single pre¿x chunk (with the same decision) isqrulesR={ri∣1≤i≤q}whereq>1. Letbifor 1≤i≤qbe

the number of∗’s in output ruleri. LetIibe the set of input rules associated with

ruleri and letCibe the set of ternary covers generated byIi during the course of

bit merging. For any subsetR′_⊆_R_,_I₍_R′_{) =}_∩

ri∈R′IiandC(R′) =∩ri∈R′Ciwhere the intersection operator for a single set returns the same set. Stated another way,I(R′₎ is the set of input rules common to all the output rules inR′_{, and}_C₍_R′₎_{is the set of} ternary covers common to all the output rules inR′_{. Let}_c₍_R′₎_{be the ternary cover} inC(R′₎_{with the most stars. This ternary cover is uniquely de}_¿_{ned for any}_R′_{. In} particular,c(R′)would be the output rule if the input rules were exactlyI(R′), and

C(R′)would be exactly the set of ternary covers generated fromI(R′). LetbR′ be

the number of stars inc(R′). This impliesI(R′)contains exactly 2b_R′ _{input rules and}

56 5 Bit Weaving

LetT I(R)be the total number of unique input rules associated with any output rule inRand letTC(R)be the total number of unique ternary covers associated with any output rule inR. We now show how to computeT I(R)andTC(R)by applying the inclusion-exclusion principle. LetRk _{for 1}_≤_k_≤_q _{be the set of subsets of}_R

containing exactlykrules fromR.

T I(R) =

∑

w k=1 (−1)k−1

∑

R′_∈Rk ∣I(R′)∣=

∑

w k=1 (−1)k−1

∑

R′_∈Rk 2bR′ TC(R) =

∑

w k=1 (−1)k−1

∑

R′_∈Rk ∣C(R′)∣=

∑

w k=1 (−1)k−1

∑

R′_∈Rk 3bR′ It follows that the total number of ternary covers generated isO(nlg 3_).

For example, supposeq=2 andr1=0∗∗∗0 andr2=∗∗∗00. Thenc(R) =0∗∗00,

I(R) ={00000,00100,01000,01100}, and the remaining four elements ofC(R′₎ are{00∗00,0∗000,01∗00,0∗100}, andT I(R) =23₊₂3₋₂2₌₈₊₈₋₄₌_{12 and}

Chapter 6 All-Match Redundancy Removal

We present an all-match based complete redundancy removal algorithm. This is the ¿rst algorithm that attempts to solve¿rst-match problems from an all-match per- spective. We formally prove that the resulting packet classi¿ers have no redundant rules after running our redundancy removal algorithm.

We have improved upon [Liu and Gouda(2005)] in two ways. First, the redundancy theorem becomes simpler. The redundancy theorem in [Liu and Gouda(2005)] distinguishes upward and downward redundant rules, and detects them separately. In contrast, the redundancy theorem presented here gives a single criterion that can detect both upward and downward redundant rules. Second, the new redundancy removal algorithm is more ef¿cient. The algorithm in [Liu and Gouda(2005)] scans a packet classi¿er twice and build FDDs twice in order to remove the two types of redundant rules. In comparison, the new algorithm only scans a packet classi¿er once and builds one all-match FDD with a cost similar cost to building an FDD. The new algorithm is about twice as ef¿cient as the algorithm in [Liu and Gouda(2005)].

In document Hardware Based Packet Classification for High Speed Internet Routers (Page 58-61)