Sufficient Ground Term Sets - Relational Reasoning - Constraint Solving, Deduction, and Program

Definition 18 (Sufficient ground term sets). Given a variable x in a clause C of an SMT formula A (in CNF), a set of ground terms S⊆ ℋ(^A)is sufficient for x w.r.t. a theory𝒯 if A and A[S/x]⁴are equisatisfiable modulo𝒯.

A variable x in a formula A can have more than one sufficient set of ground terms.

ℋ(A), for example, is always a sufficient set of ground terms as a result of the G ¨ odel-Herbrand-Skolem theorem which states that a formula A in Skolem Normal Form (SNF) is satisfiable iff its Herbrand expansion{A[gt1/x1]. . .[gtn/xn]|gt1:n∈ ℋ(A)}

is satisfiable where x1:nare the variables of A [69]. But,ℋ(^A)is usually infinite, and our goal is to determine whether a finite set of sufficient ground terms exists, and to

1(2) and (4) imply f(c1) =c1. y≤z holds for some pair of integers, thus (3) implies f(y) =c2for some y.

But f(y) =f(c₁)by (2) and so f(c₁) =c₂=c₁. This contradicts, however, (1).

2A structure is M^′(c₁) =1, M^′(c₂) =2, M^′(c₃) =0, M^′(c₄) =4, M^′(f)≡1

3A more convenient notation would be essentially interpreted literals. However, for the sake of simplicity we use the shorter notation.

4In practice, one will use A[C[S/x]/C]instate where C is the enclosing clause of x.

7.2 Sufficient Ground Term Sets 77

compute it if one exists. This is done, in our approach, by generating and solving a system of set constraints𝒮Aover sets of ground terms.

Figure 7.2 presents our (syntactic) rules to generate the set constraints system𝒮A

for a formula A in CNF. The notation t ˙∈C denotes that a term t occurs as a subterm of a clause C of A. We use𝒮Ato denote the set constraints system that results from applying these rules exhaustively to all the clauses of A. The constraints range over the sets vGT(x)⊆Gr for all variables x in A. These sets denote the relevant instantiations for the respective variables. Auxiliary sets fGT(f , i)⊆Gr are introduced to denote the set of relevant ground terms for an uninterpreted function f ∈ ℱ at an argument position i∈ N. We assume that the theory of integers is part of the considered𝒯, and that integers are included in the universe of every𝒯-structureℳ, i.e. Z⊆M. The^¯ integer operators<,≤^,+,−^,≥^,>are fixed with their obvious meaning.

Rule R1establishes a relationship between sets of ground terms for variables and function arguments. Rule R2ensures that the ground terms that occur as arguments of a function f are added to the corresponding ground term set of f . Rule R3states that if a term t[x1:n]with variables x1:noccurs as the i-th argument of f , then all the instantiations of t with the respective sets vGT(xi)must be in fGT(f , i). Rules R0, R4

and R14state that our approach does not currently handle the case where a variable x occurs as atom of a literal or as an argument of an unsupported interpreted function (supported operators are{=,<,≤,>,≥}, where “=” refers to integer equality), thus sets vGT(x)to infinity. Moreover, we do not handle the case where a supported interpreted operator has more than one variable argument (rule R5). The remaining rules infer additional constraints for vGT(x)where x occurs as an argument of a supported interpreted function. They constrain vGT(x)to contain at least one ground term that falsifies the corresponding (interpreted) literal.

It should be noted that, in fact, we use infinity as a label to denote that our (current) approach cannot compute a finite sufficient ground term set for a variable. This happens in three main cases: (1) if the ground term set of a variable is explicitly set to infinity (using rules R0, R4, R5and R14), (2) if we can conclude that the ground term set of a variable subsumes an infinite ground term set (using rules R1and R3) and (3) if we can conclude that the ground term set of a variable subsumes itself. A ground term set S subsumes a ground term set R, denoted by R ˙⊆S, if for every ground term gt1∈R there exists a ground term gt2∈S such that gt1is a subterm of gt2.

Before formulating and proving our main theorem, we first motivate the intuition behind our rules in the example of R1, R2and R3. Therfore consider the formula A

A^′∧

78 Chapter 7 Variable Elimination via Sufficient Ground Term Sets

Figure 7.2: The syntactic rules for generating the set constraints system (𝒮A). C denotes a clause of A, gts denote ground terms, f , and o p denote uninterpreted and interpreted function symbols, t[x1:n]denotes a term with variables x1:n.

Considering the sub formula 6.1, A restricts, abstractly seen, the interpretation of f on the value of gt₁with ϕ1and on all values with ϕ2. Now having this view on A, one can easily see that if the variable x is eliminated without considering its instantiation with gt₁, one could miss a possible inconsistency of A —namely between ϕ1and ϕ2— and thus possibly report a spurious counterexample. This case is exactly prevented by the exhaustive application of the rules R1and R2.

Considering the sub formula 6.2, A further restricts the interpretation of f for all values of the image of g with ϕ3. In this case and in order to prevent missing a possible inconsistency between ϕ2and ϕ3one should guarantee that x gets instantiated (at least) with all ground terms of the form g(gt)where gt∈vGT(y)(in our example g(gt2)). This is guaranteed by the exhaustive application of R1, R2and R3. This argumentation also holds even if g is the identity and this is exactly the reason for the (at first sight not intuitive) left to right inclusion of the equality of rule R1.

Let vGT_𝒮_A denote a collection of finite sets of ground terms which satisfies the constraints of𝒮A. We show that, if finite, vGT(x)_𝒮_A is a sufficient ground term set for x in A. The variable x can hence be eliminated by instantiating it with all the ground

7.2 Sufficient Ground Term Sets 79 modified structureℳ^π^x (as defined below) and show in lemma 16 thatℳ^π^x satisfies A.

Intuitively, if the ground term set of x does not subsume the ground term set of the i^thargument of f , or if viis a value that M assigns to a ground term for the i^th argument of f , then M^π^x(f)(.., vi, ..):=M(f)(.., vi, ..). Otherwise, πx(f , i)maps vito a value that M assigns to some ground term for the i^thargument of f . Integers must be mapped to the closest such value (see the proof of Lemma 15).

πx(f , i)(v) =

We also define the value projection with respect to a variable x πxas in Equation 7.4. If vGT(x)_𝒮_A =fGT(f , i)_𝒮_A, for instance because x occurs as the i^thargument of f , then πx=π_x(f , i).

Before showing the proof of lemma 16 used in our main theorem, we introduce and prove some auxiliary corollaries and lemmas.

Corollary 4. If vGT(x)_𝒮_A̸=∞, then πx(v)∈^M(vGT(x)_𝒮_A), for all v∈M.^¯ Proof. The claim follows directly from the definition of πx

Corollary 5. For all ground terms t and variables x occurring in A, val_ℳπx,β(t) =val_M,β(t) for any variable assignment β.

80 Chapter 7 Variable Elimination via Sufficient Ground Term Sets

Proof. We perform the proof by induction over the structure of t.

If t∈Const, the claim follows directly from the definition of M^π^x.

If, without loss of generality, t :=f(s), where f ∈ ℱ and s∈^Gr(A), we get by the induction hypothesis,

val_ℳπx,β(f(s))

val= M^π^x(f)(val_ℳπx,β(s))

I H= M^π^x(f)(val_ℳ,β(s))

val= M^π^x(f)(M(s))

Now, we have to distinguish between interpreted and uninterpreted functions. If f is interpreted, the claim follows directly from the definition of M^π^x. If f is uninterpreted, we first get M^π^x(f)(M(s)) =M(f)(πx(f , 1)(M(s))). Furthermore, we know, because of rule R2and since f(s)occurs in A, that s∈fGT(f , 1)_𝒮_A and consequently M(s)∈ M(fGT(f , 1)_𝒮_A). Now, we can conclude the proof using the definition of πx(f , 1)for values in M(fGT(f , 1)_𝒮_A).

The following lemmas show that if a structureℳtogether with the variable as-signment β^′=λy. if vGT(y)_𝒮_A⊆^˙^vGT(x)_𝒮_A then πy(β(y))else β(y)for some variable assignment β satisfies a literal l in a CNF formula A, then the modified structureℳ^π^x together with β satisfies l. These lemmas are insofar important as they describe how our structure modification M^π^x can be reduced to a modification of variable assign-ments. Lemma 14 gives a stronger variant (with value equality rather than implication) for uninterpreted literals, and Lemma 15 formulates the claim for interpreted literals.

Lemma 14. Let x be a variable with vGT(x)_𝒮_A ̸=∞,ℳa structure, β a variable assign-ment, and β^′=λy. if vGT(y)_𝒮_A⊆^˙^vGT(x)_𝒮_A then πy(β(y))else β(y). Then val_ℳ,β′(l) = val_Mπx,β(l)for all uninterpreted literals l in A.

Proof. To prove the claim, we show the statement val_ℳ,β′(l) =valM^πx,β(l) for all uninterpreted terms l∈^TermΣoccurring as literals of the CNF of A using structural induction.

If l is a ground term in A, then the claim follows directly from corollary 5.

If l=y is a variable, then because of rule R0vGT(y)_𝒮_A=*^˙ vGT(x)_𝒮_A. Consequently, val_ℳ,β′(l) =β^′(y) =β(y) =val_Mπx,β(l).

Let l= f(t1:n)be a function application in A with f an uninterpreted function. The evaluations of l are

val_ℳπx,β(f(t_1:n)) =M^π^x(f)(val_ℳπx,β(t₁), . . . , val_ℳπx,β(tn))

=M(f)(πx(f , 1)(val_ℳπx,β(t₁)), . . . , πx(f , n)(val_ℳπx,β(tn))) val_ℳ,β′(f(t_1:n) =M(f)(val_ℳ,β′(t₁), . . . , val_ℳ,β′(tn))

7.2 Sufficient Ground Term Sets 81

It suffices to show that πx(f , i)(val_ℳπx,β(ti)) =val_M,β′(ti)for 1≤ⁱ≤n. We do this by a case distinction over the structure of the terms ti.

If ti =y is a variable with vGT(y)_𝒮_A*^˙ vGT(x)_𝒮_A, then β^′(y) =β(y). Because of rule R1we additionally get fGT(f , i)_𝒮_A*^˙vGT(x)_𝒮_A, which implies that πx(f , i)is the identity.

If ti=y is a variable with vGT(y)_𝒮_A⊆^˙ vGT(x)_𝒮_A, then β^′(y) =πy(β(y)). Because of rule R1we get fGT(f , i)_𝒮_A =vGT(y)_𝒮_A⊆^˙ ^vGT(x)_𝒮_A, which implies that πx(f , i) =πy.

If tiis a function application, we assume ti=s[x1:m]for some term s. By induction hypothesis,

π_x(f , i)(val_ℳπx,β(s[x1:m]))^{I H}=π_x(f , i)(val_ℳ,β′(s[x1:m])). It suffices now to show that

πx(f , i)(val_ℳ,β′(s[x1:m])) =valM,β^′(s[x1:m]). With respect to fGT(f , i)_𝒮_A, there is two possible cases to consider.

1. fGT(f , i)_𝒮_A*^˙^vGT(x)_𝒮_A: in this case, πx(f , i)is the identity and the claim follows directly.

2. fGT(f , i)_𝒮_A⊆^˙ ^vGT(x)_𝒮_A: in this case, vGT(x_i)_𝒮_A⊆^˙ ^fGT(f , i)_𝒮_A⊆^˙ ^vGT(x)_𝒮_A, for all 1≤ⁱ≤m (because of rule R3). This implies that β^′(xi) =πx_i(β(xi)) for all 1≤ⁱ≤m. Using this fact together with corollary 4, there exists for each xi a ground term gti, with πx_i(β(xi)) =M(gti)and gti ∈^vGT(xi)_𝒮_A. There-fore, we can write, w.l.o.g., πx(f , i)(val_ℳ,β′(s[x_1:m])) =πx(f , i)(M(s[gt_1:m]))_. Because of rule R3we know that s[gt1:m]∈^fGT(f , i)_𝒮_A, and so M(s[gt1:m])∈ M(fGT(f , i)_𝒮_A). Finally, the claim follows from the definition of πx(f , i)for values in M(fGT(f , i)_𝒮_A)and the assumption that fGT(f , i)_𝒮_A⊆^˙vGT(x)_𝒮_A. Let l := f(t_1:n)be a function application with f an uninterpreted function. Since l is uninterpreted, all tis are non-variables and we can use the induction hypotheses on them. Having this together with the definition of M^π^x for interpreted functions, we can now conclude the proof for this last case as follow:

val_ℳ^πx,β(f(t_1:n))

val= M^π^x(f)(val_ℳπx,β(t1), . . . , val_ℳπx,β(tn))

M^πx

= M(f)(val_ℳπx,β(t1), . . . , val_ℳπx,β(tn))

I H= M(f)(val_ℳ,β′(t1), . . . , val_ℳ,β′(tn))

val= val_ℳ,β′(f(t_1:n))

82 Chapter 7 Variable Elimination via Sufficient Ground Term Sets

Lemma 15. Let x be a variable with vGT(x)_𝒮_A̸=^∞,ℳa structure, β a variable assignment, and β^′=λy. if vGT(y)_𝒮_A⊆^˙^vGT(x)_𝒮_A then πy(β(y))else β(y). Then(M, β^′)|=^limplies (M^π^x, β)|=lfor all interpreted literals l in A.

Proof. We prove the claim by induction on the structure of uninterpreted literals l. The structure of uninterpreted literals, however, has only one case more than the structure of uninterpreted literals, namely the case of l=op(y1:n)where o p is an interpreted function symbol and y1:nare variables. Having already proved Lemma 14, we can focus, w.l.o.g., on this case.

Fist, we considre the case in which vGT(yi)_𝒮_A*^˙vGT(x)_𝒮_A for all 1≤i≤n. In this case, β^′(y_i) =β for all 1≤ⁱ≤n. Having this, the claim follows directly using the defintion of M^π^x for interpreted function symbols,

val_ℳπx,β(op(y_1:n))

val= M^π^x(op)(β(y1), . . . , β(yn))

M^π^x

= M(op)(β(y1), . . . , β(yn))

β^′

= M(op)(β^′(y₁), . . . , β^′(yn))

val= val_ℳ,β′(op(y1:n)).

For the remaining case, we can assume vGT(yi)_𝒮_A⊆^˙ vGT(x)_𝒮_A for some 1≤i≤n and consequently β^′(y_i) =πy_i(β(y_i)). In addition to that, we can because of rules R4, R5and R14further restrict, without loss of generality, the form of l to l :=op(yi, gt) where o p∈ {=^,̸=^,<,≤^,>,≥}and α_Σ(y_i) = Z. Let us now assume that(ℳ^{, β}^′)|=^l and(ℳ^π^x, β)̸|=l and show by case distinction on op∈ {=,̸=,<,≤,>,≥}that this assumption is wrong and consequently the claim holds.

1. For o p∈ {<^,>}, we get from rule R11, gt∈^vGT(y_i)_𝒮_A and from the assump-tion the inequality system(π_y_i(β(yi)) <gt)∧ (^β(yi)≥gt), which implies that

|^β(y_i)−^πy_i(β(y_i))|is not minimal, since|^β(y_i)−^gt|is strictly smaller.

2. For o p∈ {≤^,≥}, the proofs go similar to the previous case using rule R6for the

“≤”-case and rule R7for the “≥”-case.

3. For o p :=”=”, we get from rule R13,{^gt−1, gt+1} ⊆^vGT(y_i)_𝒮_A and from the assumption, the inequality system(π_y_i(β(yi)) =gt)∧ (^β(yi)̸=^gt), which is equivalent to(πy_i(β(y_i)) =gt)∧ ((^β(y_i)≤^gt−¹)∨ (^gt+1≤^β(y_i)))_and implies that|^β(yi)−^πy_i(β(yi))|is not minimal, since in the case(β(yi)≤^gt−¹),

|^β(y_i)− (^gt−¹)|is strictly smaller and in the case(gt+1≤^β(y_i)),|^β(y_i)− (gt+1)|is strictly smaller.

4. For o p :=”̸=”, the proof goes similar to the previous case using rule R12.

In document Relational Reasoning - Constraint Solving, Deduction, and Program Verification (Page 96-103)