Multi-parameter STTs - Properties and variants

5.5 Properties and variants

5.5.5 Multi-parameter STTs

In our basic transducer model, the value of each variable can contain at most one hole. Now we generalize this definition to allow a value to contain multiple parameters.

Such a definition can be useful in designing an expressive high-level language for transducers, and it is also used to simplify constructions in later proofs.

We begin by defining nested words with parameters. The setH(Σ,Π)ofparameterized

nested wordsover the alphabetΣusing the parameters inΠ, is defined by the grammar

H:=ε|a|π| ha H bi |H H fora,b∈Σandπ ∈Π.

For example, the nested word haπ1hbiπ2ai represents an incomplete tree with a-

labeled root that has ab-labeled leaf as a child, such that trees can be added to its left as well as right by substituting the parameter symbolsπ1andπ2with nested words. We

can view such a nested word with 2 parameters as a function of arity 2 that takes two well-matched nested words as inputs and returns a well-matched nested word.

In the generalized transducer model, the variables range over parameterized nested words over the output alphabet. Given an alphabetΣ, a setXof variables, and a setΠ

of parameters, the setE(Σ,X,Π)of expressions is defined by the grammar

E:=ε|a|π|x| ha E bi |E E|E[π 7→E] fora,b_∈Σ,x_∈ X, andπ∈ Π. A valuationαfromXto H(Σ,Π)naturally extends to a function from the expressions E(Σ,X,Π)toH(Σ,Π).

To stay within the class of regular transductions, we need to ensure that each variable is used only once in the final output and each parameter appears only once in the right-hand side at each step. To understand how we enforce single-use restriction on parameters, consider the update x := xy associated with a transition from stateq to

state q0_{. To conclude that each parameter can appear at most once in the value of} _x after the update, we must know that the sets of parameters occurring in the values of

x andybefore the update are disjoint. To be able to make such an inference statically, we associate, with each state of the transducer, an occurrence-type that limits, for each variable x, the subset of parameters that are allowed to appear in the valuation forx

in that state. Formally, given parametersΠand variables X, anoccurrence-type ϕis a

function fromXto 2Π_{. A valuation}_α_from_X_to_H₍_Σ_,_Π₎_{is said to be}_consistent_{with the} occurrence-typeϕif for every parameterπ ∈ Πand variablex ∈ X, ifπ ∈ ϕ(x), then the parametrized nested word α(x)contains exactly one occurrence of the parameter π, and ifπ 6∈ ϕ(x), then π does not occur in α(x). An occurrence-type from Xto Π

naturally extends to expressions inE(Σ,X,Π): for example, for the expressione1e2, if

the parameter-sets ϕ(e1)andϕ(e2)are disjoint, thenϕ(e1e2) = ϕ(e1)∪ϕ(e2), else the

from variablesXto 2Π_{is said to be}_{type-consistent}_{with an occurrence-type} _ϕ_Y _from_Y to 2Π_{and an assignment}_ρ_from_Y_to_X_{, if for every variable}_x_in_X_:

• the expressionρ(x)is consistent with the occurrence-typeϕY, and

• the parameters resulting from performing the assignment to x are consistent:

ϕY(ρ(x)) =ϕX(x).

Type-consistency ensures that for every valuationαfromYtoH(Σ,Π)consistent with ϕ, the updated valuationα·ρfromXtoH(Σ,Π)is guaranteed to be consistent withϕ0.

Now we can define the transducer model that uses multiple parameters. A multi-

parameter STT§ from input alphabetΣto output alphabetΓconsists of:

• a finite set of statesQ; • an initial stateq0;

• a set of stack symbolsP;

• state-transition functions δi, δc, and δr, that are defined in the same way as for STTs;

• a finite set of typed variablesXequipped with a reflexive symmetric binary conflict relationη;

• for each stateq, an occurrence-type ϕ(q): X7→ 2Π_{, and for each stack symbol}_p_, an occurrence-typeϕ(p):X7→ 2Π_;

• a partial output function F : Q 7→ E(X,Γ,Π)such that for each state q, the ex-

pressionF(q)is consistent withη, andϕ(q)(F(q))is the empty set;

• for each stateqand input symbola, the update functionρi(q,a)from variablesX toXoverΓis consistent withηand is such that the occurrence-typeϕ(δi(q,a))is type-consistent with the occurrence-typeϕ(q)and the updateρi(q,a);

• for each stateqand input symbol a, the update functionρc(q,a)from variables

X to X over Γ is consistent with η and it is such that, if δc(q,a) = (q0,p) the occurrence-types ϕ(p) and ϕ(q0) are type-consistent with the occurrence-type ϕ(q)and the updateρc(q,a);

• for each state q and input symbol a and stack symbol p, the update function

ρr(q,p,a) from variables X∪Xp to X over Γ is consistent withη and it is such that the occurrence-typeϕ(δr(q,p,a))is type-consistent with the occurrence-type ϕ(q)andϕ(p)and the updateρr(q,p,a).

We can assume thatϕ(q0) =∅, and therefore all variables are initialized toε.

Configurations of a multi-parameter STT are of the form (q,Λ,α), where q ∈ Qis a state,αis a valuation from variablesXtoH(Γ,Π)that is consistent with the occurrence- type ϕ(q), and Λ is a sequence of pairs (p,β)such that p ∈ P is a stack symbol and

β is a valuation from variables X to H(Γ,Π) that is consistent with the occurrence-

type ϕ(p). The clauses defining internal, call, and return transitions are the same as for STTs, and the transduction[[S]]is defined as before. In the same way as before we

define a copyless multi-parameter STT as a multi-parameter STT with a purely reflexive reflexive conflict relation (i.e.η= {(x,x)|x∈ X}) .

Now we establish that multiple parameters do not add to expressiveness. We first prove the property for copyless STTs. Then we add regular look-ahead and show and use it to show that the property holds for general STTs.

Theorem 5.11(Copyless multi-parameter STTs). A nested-word transduction is definable

by a copyless STT iff it is definable by a copyless multi-parameter STT.

Proof. Given a copyless STTSconstructing a multi-parameter copyless STTS0_{is trivial.}

The parameter setΠofS0 _{is the singleton}_{_?_}_{. For every state}_q_of_S_{, there is a corre-} sponding stateqinS0_{. For every type-0 variable}_x _{and state}_q_in _S_, _ϕ₍_q_,_x_{) =} _∅_while for every type-1 variabley,ϕ(q,y) ={?}.

We now prove the other direction of the iff. LetSbe a multi-parameter copyless STT with states Q, initial stateq0, stack symbols P, parametersΠwith |Π| = k, variables

X with |X| = n, occurrence-type ϕ, output function F, state-transition functions δi, δc, and δr, and variable-update functions ρi, ρc, and ρr. We construct an equivalent copyless STTS0 _{= (}_Q0_,_q0

0,P0,X0,F0,δ0,ρ0).

Variable summarization intuition. We need to simulate the multi-parameter variables using only variables with a single hole. We do this by using multiple variables to represent a single multi-parameter variable, and by maintaining in the state extra information on how to combine them.

The construction maintains a compact representation of every multi-parameter vari-

able. To understand the construction, consider a variable x with value

ha hbπ1 bi hci hbπ2 biai. One possible way to represent xusing multiple variables,

each with only one parameter in its value, is the following: x1 = ha?ai,x2 = hb?bihci,

and x3 = hb?bi. Next, we need to maintain in the state some information regarding

how to combine these three values to reconstructx. For this purpose, we use a function of the form f(x1) = (x2,x3),f(x2) =π1,f(x3) = π2, that tells us to replace the ? inx1

with x2x3 and the holes inx2 andx3, withπ1 andπ2 respectively. The state will also

maintain a functiongthat remembers the starting variable (the root of the tree): in this caseg(x) = x1means thatx1 is the root of the symbolic tree representing the variable

We now formalize this idea. The set of variables X0 _contains ₍₂_k₋₁₎_n _{variables of} type-1 andnvariables of type-0. WhenS0 _{is in state}_q_{, for every variable}_x_∈ _X_,

• ifϕ(x)₆= ∅, the value ofxis represented by 2|ϕ(x)_{| −}1 type-1 variables inX0_; • ifϕ(x) =∅, the value ofxis represented by one type-0 variable inX0_.

Since ϕ(x) _≤ k, we can assume that for every variablex ∈ X, there are exactly 2k−1 type-1 variables and one type-0 variable inS0 _{corresponding to it. We denote this set} byV(x) =_{x0,x1, . . . ,x2k−1}, wherex0is the only type-0 variable. Therefore, the STT

S0_{has the set of variables}_X0 ₌S_x

∈XV(x).

State components and invariants. Each state ofQ0 is a triplet(q,g,f), whereq _∈ Q

keeps track of the current state of S, g : X 7→ X0 _{keeps track of the root of the sym-} bolic tree representing each variable, and f : X0 _7→ ₍_X0_×_X0₎_∪_Π_{∪ {}_ε_{} ∪ ⊥}_maintains information on the symbolic tree representing each variable. Given a variablex ∈ X0_,

f(x) =⊥means thatxis not being used in any symbolic tree.

We now define the unfolding f∗ _{of the function} _f _{that, given a variable in} _x _∈ _X0 provides the value inH(Σ,Π)corresponding to the symbolic tree rooted inx:

• if f(x) =ε, then f∗(x) =x;

• if f(x) =πi, then f∗(x) =x[πi];

• if f(x) = (y,z), then f∗₍_x_{) =}_x_[_f∗₍_y₎_f∗₍_z_)]_; • if f(x) =_⊥, then f∗₍_x_{) =}_⊥_.

Our construction maintains the following invariant: at every point in the computation, the value of f∗₍_g₍_x₎₎_in _S0 _{is exactly the same as the value of} _x _in _S_{. We can assume} that at the beginning every variable x _∈ X0 _{has value} _ε_{, and we represent this with}

g(x) =x0and f(x0) =ε. For this base case the invariant holds.

Similarly to what we did for STTs (Corollary 5.5), we observe that every assignment can be expressed as a sequence of elementary updates of the following form:

Constant assignment: x :=wwherewis a constant (wis of the forma,π,x,haπbi);

Concatenation: {x := xy;y:=ε}(and similar cases such as{x :=yx;y:= ε});

Parameter substitution: {x := x[π7→ y];y:=ε}(and similar cases such as{x:= y[π7→

x];y:=ε});

Swap: {x:=y;y := x}.

Update functions. We now describe at the same time the transition relationδ0and the

variable update functionρ0ofS0. Consider a state(q, f,g). We call(q0,f0,g0)the target

state and we only write the parts that are updated skipping the trivial cases. Every time a variablevis unused we set f0₍_v₎_to_⊥_{. We show that the state invariant is inductively} preserved:

{x :=w}: wherewis a constant. Similarly to what we showed earlier in the informal description, the content ofxcan be summarized using 2_|ϕ(x)_{| −}1 variables.

{x := xy;y:=ε}: in order for the assignment to be well-defined ϕ(q0_,_x₎_{must be the} same as ϕ(q,x)∪ϕ(q,y), and|ϕ(q0,x)| = |ϕ(q,x)|+|ϕ(q,y)| ≤ k. Let’s assume wlog that bothϕ(q,x)andϕ(q,y)are not empty. By IHxandyuse 2|ϕ(x)_{| −}1+

2|ϕ(y)| −1=2(|ϕ(q,x)|+|ϕ(q,y)|)−2≤ 2k−2 variables. First we assign each

yi in the tree rooted ing(y)to some unused xi0 inV(x). Let a(y_i) = x_i0 be such a mapping. From the IH we know that at least one type-1 variablexj ∈ V(x)is unused. We can usexjto concatenate the variables summarizingxandy. We can reflect such an update in the tree shape ofxas follows: for everyz∈V(x),

• if there existsy0 such thata(y0) =z, we copy the summary fromy0 tozand

replace each variable in the summary with the corresponding one inV(x):

f(z) = f(y){y0_/_a₍_y0₎_}_,_z₌ _y_{, and}_y0 ₌_ε_;

• if z = xj we concatenate the previous summaries of x andy: if g(x) = x0, andg(y) =y0, anda(y0) = x00, theng0(x) = z, and f0(z) = x0x00. Finally the

variablezneeds to hold the ?: ρ(z) =?.

Finally ify0is the type-0 variable inV(y),g0(y) =y0, f(y0) =ε, andy0 =ε. {x := x[π7→y];y:=ε}: in order for the assignment to be well-definedϕ(q0_,_x₎_{must be}

the same as(ϕ(q,x)\ {π})∪ϕ(q,y), and|ϕ(q0,x)|= (|ϕ(q,x)| −1) +|ϕ(q,y)| ≤

k. Let’s assume wlog that bothϕ(q,x)andϕ(q,y)are not empty. By IHx andy

use 2|ϕ(x)| −1+2|ϕ(y)| −1 = 2(|ϕ(q,x)|+|ϕ(q,y)|)−2 ≤ 2(k+1)−2 = 2k

variables. First we assign eachy_i ₆=g(y)in the tree rooted ing(y)to some unused xi0 inV(x). Let a(y_i) = x_i0 be such a mapping. So far we used 2k−1 variables

x5 x3 x2 x1 x00 x3 x5 x2 x7 x1 x6 x7 x4 x4 x6 x0 x:=ax[π1←π1π5]

FIGURE5.2: Example of paramter tree update.

inV(x). When performing the updates we show how the variable representing the rootg(y)need not be copied allowing us to use at most 2k−1 variables to summarize the value ofx. The root of the tree summarizingxwill be the same as before: ifg0₍_x_{) =}_g₍_x₎_{. Every variable}_z_∈_V₍_x₎_{is updated as follows:}

• if there existsysuch that a(y) = z, we copy the summary fromy to zand

replace each variable in the summary with the corresponding one inV(x): f(z) = f(y)_{y0_/_a₍_y0₎_}_,_z₌ _y_{, and}_y0 ₌_ε_;

• ifz =xπwe append the summary ofyto it: ifg(y) =y0, then

– if f(y0_{) =} _y₁_y₂_, _a₍_y0_{) =} _x0_, _a₍_y₁_{) =} _x₁ _and _a₍_y₂_{) =} _x₂_{, then} _f0₍_z_{) =} (x1,x2), andρ(z) =z[y0];

– if f(y0) =π0, anda(y0) =x0, then f0(z) =π0, andρ(z) =z[y0].

Finally ify0is the type-0 variable inV(y),g0(y) =y0, f(y0) =ε, andy0 =ε. {x :=y;y:= x}: we simply swap the summaries ofxandy. Leta : V(x) 7→ V(y)be

a bijection fromV(x)toV(y), and letb: V(y) _7→ V(x)be the inverse ofa. Then

g0₍_x_{) =}_a₍_g₍_y₎₎_,_g0₍_y_{) =}_b₍_g₍_x₎_, _f0₍_x_{) =} _f₍_y₎_{_y0_/_a₍_y0₎_}_, _f0₍_y_{) =} _f₍_x₎_{_x0_/_b₍_x0₎_}_, for eachx0 _∈_V₍_x₎_,_x0 ₌ _a₍_x0₎_{, and for each}_y0 _∈ _V₍_y₎_,_y0 ₌_b₍_y0₎_.

Figure 5.2 shows an example of update involving a combination of elementary updates. The parameter tree on the left shows represents the content π1π2π3π4 for a

variable x. Therefore, for the tree on the left, g(x) = x5and f(x5) = (x1,x6),f(x6) =

(x2,x7),f(x7) = (x3,x4),f(x1) = π1,f(x2) = π2,f(x3) = π3,f(x4) = π4. Each vari-

able is of type-1. After the update we havex5 := ax5, and we take two fresh variables

x0_,_x00 _{to update the tree to the one on the right where we set}

f(x00_{) = (}_x₁_,_x0₎_,_f₍_x0_{) =} _π₅_{. Since we have 5 parameters and 9 nodes, the counting} argument still holds. Before the update f∗₍_x₅₎_{evaluates to}_π₁_π₂_π₃_π₄_{, while after the} update f∗₍_x₅₎_{evaluates to}_a_π₁_π₅_π₂_π₃_π₄_.

We finally show howδ0 andρ0 are defined at calls and returns. The functions main-

the corresponding return to create the updated tree. Since all variables are reset at calls, this step is quite straightforward and we omit it. By inspection of the variable update function, it is easy to see that the assignments are still copyless.

Output function. Last, for every state(q,f,g)_∈ Q0_{, the output function}_F0₍_q_, _f_,_g_{) =}

f∗₍_F₍_q₎₎_{, where} _f∗ _{is naturally extended to sequences:} _f∗₍_ab_{) =} _f∗₍_a₎_f∗₍_b₎_.

We can then equip Theorem 5.11 with regular look-ahead and get the following result. Corollary 5.12(Copyless multi-parameter STTs RLA). A nested-word transduction is de-

finable by a copyless STT with regular look-ahead iff it is definable by a copyless multi-parameter STT with regular look-ahead.

We then extend the result of Theorem 5.9 to multi-parameter STTs.

Lemma 5.13. A nested-word transduction is definable by a copyless multi-parameter STT with

regular look-ahead iff it is definable by a multi-parameter STT.

Proof. The proof of Theorem 5.9 does not use parameter assignment and can therefore

be used for this theorem as well.

Finally, we can conclude that multi-parameter STTs capture the class of STT definable transformations.

Theorem 5.14(Multi-parameter STTs). A nested-word transduction is definable by an STT

iff it is definable by a multi-parameter STT.

Proof. From Theorems 5.13, 5.12, and 5.14.

In document Programming Using Automata and Transducers (Page 150-157)