Temporal Logics Over Finite Traces with Uncertainty (Full Version)


Fabrizio M. Maggi

University of Tartu

[email protected]

Marco Montali

Free University of Bozen-Bolzano

[email protected]

Rafael Peñaloza

University of Milano-Bicocca

[email protected]

Abstract

Temporal logics over finite traces have recently seen wide application in a number of areas, from business process modelling, monitoring, and mining to planning and decision making. However, real-life dynamic systems contain a degree of uncertainty which cannot be handled with classical logics. We thus propose a new probabilistic temporal logic over finite traces using superposition semantics, where all evolutions are possible until observed. We study the properties of the logic and provide automata-based mechanisms for deriving probabilistic inferences from its formulas. We then study a fragment of the logic with better computational properties. Notably, formulas in this fragment can be discovered from event log data using off-the-shelf declarative process discovery techniques.

Introduction

Linear temporal logic (LTL) is one of the most important formalisms to declaratively specify and reason about the evolution of systems and processes (Baier and Katoen 2008). Traditionally, LTL adopts a linear, infinite model of time where formulas are interpreted over infinite traces. In recent years, increasing attention has been given to a different version of the logic, LTL over finite traces or LTLf (De Giacomo and Vardi 2013), which adopts instead a finite-trace semantics. From the modelling point of view, LTLf matches settings where each execution of the system is eventually expected to end (even though there is no bound on the number of steps required to reach the termination point). From the reasoning point of view, the automata-theoretic characterisation of LTLf relies on classical finite-state automata (De Giacomo and Vardi 2013; De Giacomo, De Masellis, and Montali 2014), which are easier to manipulate and pave the way for the development of robust and efficient reasoning techniques. In fact, LTLf and extensions thereof have been widely employed in a number of application domains relevant for AI: from declarative business process modelling (Pesic, Schonenberg, and van der Aalst 2007; Montali 2010), monitoring (Maggi et al. 2011; De Giacomo et al. 2014), and mining (Maggi, Chandra Bose, and van der Aalst 2012; De Giacomo et al. 2017), to planning (Gerevini et al. 2009; De Giacomo and Rubin 2018) and decision making (Brafman, De Giacomo, and Patrizi 2018).

Copyright © 2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

When considering real-world processes, uncertainty (which is inexpressible in classical logics) is unavoidable. For example, some pieces may be defective, external events may delay a service, and a loan may lead to an outcome that depends on various implicit factors. Handling these scenarios requires a logical formalism capable of expressing uncertainty. Surprisingly, to the best of our knowledge no probabilistic extension of LTLf has been considered so far. Although several probabilistic variants of infinite-time temporal logics exist (Ognjanovic 2006; Morão 2011; Konur 2010; Kovtunova and Peñaloza 2018; Woltzenlogel Paleo 2016), the complex interaction of probabilities and time usually requires syntactic or semantic restrictions in the logic, and does not directly carry over to the finite-trace setting.

To overcome both challenges at once, we propose a new probabilistic extension of LTLf, called PLTLf, that essentially predicates over the possible evolutions of a trace. The main novelty of PLTLf lies in its superposition semantics, where every evolution is possible (with different probabilities) until it is observed. This semantics accommodates a seamless interaction of probabilities and time that was not possible in previous formalisms, and elegantly fits over finite traces. PLTLf is a direct generalisation of LTLf: PLTLf formulas without probabilistic constructors are in fact LTLf formulas. PLTLf adequately describes probabilistic temporal or dynamic properties of process executions; e.g., it can express that a shipped package will eventually reach its destination with probability 0.95, or that a machine will fail in the next 100 timepoints with probability below 0.001.

Our second main contribution is an investigation of the logical and computational properties of PLTLf, introducing automata-based algorithms for deciding satisfiability of PLTLf formulas and for computing the most likely executions of a system described in this logic. These core reasoning services provide the basis for sophisticated, domain-specific tasks such as a probabilistic version of conformance checking (Carmona et al. 2018) and (prefix) monitoring (Maggi et al. 2011). Unsurprisingly, due to the intertwined connection of temporal and probabilistic constructors, handling PLTLf formulas becomes EXPTIME-hard. This leads us to our third contribution: a study of a fragment of PLTLf, called PLTL0f, where the complexity of reasoning falls to PSPACE in the length of the formula, matching the classical LTLf case. Notably, formulas in this fragment can be discovered from event log data using off-the-shelf declarative process discovery techniques (Maggi, Chandra Bose, and van der Aalst 2012).

This manuscript extends the published work (Maggi, Montali, and Peñaloza 2020) with full proofs and an extended example.

Preliminaries

We briefly introduce tree and weighted string automata, assuming basic formal language knowledge. For more details, see (Comon et al. 2007; Droste, Kuich, and Vogler 2009).

Tree Automata. A tree is a set of words of natural numbers T ⊆ ℕ*, which is closed under prefixes and preceding siblings; i.e., if wi ∈ T, then w ∈ T, and wj ∈ T for all 1 ≤ j ≤ i. A tree is finite if its cardinality is finite. Each finite tree T has a maximum number k ∈ ℕ (its width) s.t. wk ∈ T for some w ∈ ℕ*. The empty word ε is the root, and a leaf is a node w ∈ T s.t. w1 ∉ T. A labelling of T on a set Σ is a mapping T → Σ. A tree with a labelling is a labelled tree. A branch of the tree T is a sequence w_1, …, w_n of nodes of T such that w_1 = ε, w_n is a leaf node, and for every i, 1 ≤ i < n, w_{i+1} = w_i m for some m ∈ ℕ. If T is labelled, we also call branch the sequence of labels of a branch.

A k-ary tree automaton is a tuple A = (Q, ∆, I, F) where Q is a finite set of states, I, F ⊆ Q are the initial and final states, respectively, and ∆ ⊆ Q × ⋃_{i≤k} Q^i is the transition relation. A run of A on the tree T is a labelling ρ : T → Q s.t. ρ(ε) ∈ I and for every w ∈ T, if wn ∈ T but w(n+1) ∉ T, then (ρ(w), ρ(w1), …, ρ(wn)) ∈ ∆. It is successful if for every leaf node w ∈ T, ρ(w) ∈ F. The language accepted by A is the set L(A) of finite trees for which there is a successful run of A. The emptiness problem asks whether L(A) = ∅.

The emptiness problem of k-ary tree automata is decidable in time O(|Q|^{k+2}) (Vardi and Wolper 1986) by computing good states; i.e., those that appear in a successful run. States q ∈ F without transitions are good. The set of good states is iteratively extended, adding any state that has a transition leading to only good states. This iteration reaches a fixpoint after checking the transitions of each state at most |Q| times. As there are at most |Q|^{k+1} transitions, the set of good states is computable in time O(|Q|^{k+2}). A is not empty iff at least one initial state is good. The reduced automaton Â of A is obtained by removing all bad (i.e., non-good) states from A. A and Â accept the same language, and L(Â) ≠ ∅ iff Â contains at least one initial state.
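The good-state iteration can be sketched in a few lines of Python. This is only an illustration under stated assumptions: transitions are encoded as tuples (q, q1, ..., qn), every final state is taken as good from the start (a leaf may carry any final state), and the automaton `delta` is a hypothetical example, not one from the paper.

```python
def good_states(final, transitions):
    """Least fixpoint of the 'good' states of a tree automaton.

    transitions: iterable of tuples (q, q1, ..., qn) with n >= 1.
    """
    good = set(final)  # assumption: any final state may label a leaf
    changed = True
    while changed:
        changed = False
        for q, *successors in transitions:
            # q becomes good once one of its transitions leads only to good states
            if q not in good and all(s in good for s in successors):
                good.add(q)
                changed = True
    return good


def is_empty(initial, final, transitions):
    """L(A) is empty iff no initial state is good."""
    return not (set(initial) & good_states(final, transitions))


# Hypothetical automaton: q2 and q3 only reach each other, so both are bad.
delta = {("q0", "q1", "q2"), ("q1", "qf"), ("q2", "q3"), ("q3", "q2")}
```

With this encoding, `is_empty({"q0"}, {"qf"}, delta)` holds, and adding the transition `("q2", "qf")` makes the language non-empty.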

An alternative emptiness test constructs a successful run top-down. It guesses an initial state to label the root node, and iteratively guesses transitions for every node not labelled with a final state. Through a depth-first construction, the algorithm preserves in memory only one branch at a time, together with the information of the chosen transitions. Since every branch can be restricted to depth |Q|, the process requires O(|Q|·k) space (Baader, Hladik, and Peñaloza 2008).

Weighted Automata. Consider the probabilistic semiring ([0,1], max, ×) with the usual max and product on [0,1]. A weighted automaton is a tuple A = (Q, in, wt, F) where Q is a finite set of states, F ⊆ Q are the final states, in : Q → [0,1] is the initialization function and wt : Q × Q → [0,1] is the weight function. A run of A is a finite sequence ρ = q_0, q_1, …, q_n with q_n ∈ F; run(A) is the set of all runs of A. The weight of ρ = q_0, …, q_n ∈ run(A) is wt(ρ) := ∏_{i=0}^{n−1} wt(q_i, q_{i+1}). The behaviour of A is ‖A‖ := max_{ρ ∈ run(A)} in(q_0)·wt(ρ). To compute the behaviour of the weighted automaton A, we adapt the emptiness test for unweighted automata to consider the weights of the transitions, computing a function w : Q → [0,1], where w(q) is the maximum weight of all runs starting in q. Initialize w_0(q) = 1 if q ∈ F and w_0(q) = 0 if q ∉ F. Iteratively compute w_{i+1}(q) := max_{q′ ∈ Q} wt(q, q′)·w_i(q′). After polynomially many iterations, we reach a fixpoint where w_i ≡ w_{i+1}. If w := w_i, then ‖A‖ = max_{q ∈ Q} in(q)·w(q).
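The fixpoint computation of the behaviour can be illustrated as follows. This is a sketch rather than a reference implementation: the dictionary encoding of `init` and `wt`, the use of exact fractions, and the three-state automaton are assumptions made for the example. The sketch also keeps the previous value w_i(q) inside the maximum, so that a final state retains the weight 1 of the run that stops there.

```python
from fractions import Fraction as F


def behaviour(states, final, init, wt):
    """Return (||A||, w) for a weighted automaton over ([0,1], max, x).

    init: dict state -> weight; wt: dict (state, state) -> weight.
    Missing entries count as weight 0.
    """
    w = {q: F(1) if q in final else F(0) for q in states}
    while True:
        # Keeping w[q] in the max is an assumption of this sketch: it
        # preserves the weight 1 of the run that stops in a final state.
        new = {
            q: max([w[q]] + [wt.get((q, r), F(0)) * w[r] for r in states])
            for q in states
        }
        if new == w:  # fixpoint reached
            break
        w = new
    return max(init.get(q, F(0)) * w[q] for q in states), w


# Hypothetical automaton: s -> t directly, or s -> u -> t.
states = ["s", "t", "u"]
wt = {("s", "t"): F(1, 2), ("s", "u"): F(9, 10), ("u", "t"): F(4, 5)}
value, w = behaviour(states, {"t"}, {"s": F(1)}, wt)
```

In this example the detour through u (weight 9/10 · 4/5) beats the direct transition (weight 1/2), so the behaviour is 18/25.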

The Probabilistic Temporal Logic PLTLf

PLTLf extends the linear temporal logic on finite traces LTLf (De Giacomo and Vardi 2013) with a probabilistic constructor expressing uncertainty about the evolution of traces. The only syntactic difference between LTLf and PLTLf is this new constructor. Formally, PLTLf formulas are built by the following syntactic rule, where a is a propositional variable, p ∈ [0,1], and ⋈ ∈ {≤, ≥, <, >}:

$$\varphi ::= a \mid \neg\varphi \mid \varphi\wedge\varphi \mid \bigcirc\varphi \mid \varphi\,\mathcal{U}\,\varphi \mid \odot_{\bowtie p}\varphi.$$

Intuitively, ⊙_{⋈p} ϕ means that, at the next point in time, ϕ holds with probability ⋈ p. To formalise this, we use tree-shaped interpretations providing a class of alternatives branching into the future in a new superposition semantics. A PLTLf interpretation is a triple I = (T, ·^I, P), where T is a finite tree, ·^I is a labelling of T on the set of propositional valuations,¹ and P : T \ {ε} → [0,1] is s.t. for all non-leaf nodes w ∈ T, ∑_{wi ∈ T} P(wi) = 1. Satisfiability of a formula in a tree node is defined inductively, extending the LTLf semantics. For an interpretation I = (T, ·^I, P) and w ∈ T:

• I, w |= a iff a ∈ w^I

• I, w |= ¬ϕ iff I, w ⊭ ϕ

• I, w |= ϕ ∧ ψ iff I, w |= ϕ and I, w |= ψ

• I, w |= ○ϕ iff w is not a leaf node and for all i ∈ ℕ, if wi ∈ T then I, wi |= ϕ

• I, w |= ϕ U ψ iff either (i) I, w |= ψ or (ii) I, w |= ϕ and for all i ∈ ℕ, if wi ∈ T then I, wi |= ϕ U ψ

• I, w |= ⊙_{⋈p} ϕ iff ∑_{wi ∈ T; I, wi |= ϕ} P(wi) ⋈ p.

I is a model of φ if I, ε |= φ. φ is satisfiable if it has a model.

Example 1. Figure 1 shows three models of the formula φ_0 := ⊙_{≤0.5} a ∧ ⊙_{≥0.6} b. In (a), a and b are observed at the next timepoint with probability 0 and 1, respectively; (c) depicts the other extreme, where a and b are observed with probability 0.5 and 0.6, respectively. Many intermediate models exist. The formula φ_1 := ⊙_{≥0.5} a ∧ ⊙_{≥0.6} ¬a is unsatisfiable: no model allows a and ¬a to hold with probability at least 0.5 and 0.6, respectively, since these probabilities would add up to more than 1.

[Figure 1: Three models of the formula φ_0 from Example 1, shown in panels (a), (b), and (c).]

1. As usual, we describe a propositional valuation by the set of variables it makes true.
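The semantic clauses can be turned into a small recursive evaluator. The following is a sketch under assumptions that are not part of the paper: formulas are nested tuples, tree nodes are tuples of child indices with `()` as the root, and the two interpretations are hypothetical (they are not the models drawn in Figure 1). The clauses are implemented literally as stated above.

```python
from fractions import Fraction as F
import operator

OPS = {"<=": operator.le, ">=": operator.ge, "<": operator.lt, ">": operator.gt}


def children(labels, w):
    """Children of node w in a tree given as a dict node -> valuation."""
    return [v for v in labels if len(v) == len(w) + 1 and v[:len(w)] == w]


def holds(interp, w, phi):
    """Decide I, w |= phi, clause by clause."""
    labels, prob = interp
    kind = phi[0]
    kids = children(labels, w)
    if kind == "var":
        return phi[1] in labels[w]
    if kind == "not":
        return not holds(interp, w, phi[1])
    if kind == "and":
        return holds(interp, w, phi[1]) and holds(interp, w, phi[2])
    if kind == "next":  # w must not be a leaf
        return bool(kids) and all(holds(interp, v, phi[1]) for v in kids)
    if kind == "until":
        return holds(interp, w, phi[2]) or (
            holds(interp, w, phi[1]) and all(holds(interp, v, phi) for v in kids)
        )
    if kind == "prob":  # total probability of the children satisfying the body
        _, op, p, body = phi
        mass = sum(prob[v] for v in kids if holds(interp, v, body))
        return OPS[op](mass, p)
    raise ValueError(f"unknown constructor: {kind}")


a, b = ("var", "a"), ("var", "b")
phi0 = ("and", ("prob", "<=", F(1, 2), a), ("prob", ">=", F(3, 5), b))
phi1 = ("and", ("prob", ">=", F(1, 2), a), ("prob", ">=", F(3, 5), ("not", a)))

# Hypothetical interpretations: a single successor, and three successors.
one = ({(): frozenset(), (1,): frozenset("b")}, {(1,): F(1)})
three = (
    {(): frozenset(), (1,): frozenset("a"), (2,): frozenset("b"), (3,): frozenset("ab")},
    {(1,): F(2, 5), (2,): F(1, 2), (3,): F(1, 10)},
)
```

Both interpretations satisfy `phi0` at the root, while neither satisfies `phi1`; the evaluator only checks a given interpretation and does not decide satisfiability.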

As usual, the main reasoning task for a PLTLf formula φ is checking satisfiability. This problem becomes harder than in LTLf by the need to verify the compatibility of the potential future steps w.r.t. their probabilities. Thus, we build a tree automaton accepting a class of models of φ. For satisfiability, we can ignore the specific probabilities used, as long as they are compatible. Thus an unweighted automaton suffices. Building this automaton requires some definitions.

For a PLTLf formula φ, csub(φ) is the smallest set of PLTLf formulas that contains all subformulas of φ, is closed under negation (modulo double negations), and s.t.

$$x_{\{2\}} + x_{\{1,2\}} \ge 0.6, \qquad x_Q \ge 0 \ \ (Q \in S_0), \qquad x_{\{1\}} + x_{\{2\}} + x_{\{1,2\}} = 1,$$

has a solution; e.g., x_{1} = 0.4, x_{2} = 0.5, x_{1,2} = 0.1. As shown later, this means that from the atom a_0, it is possible to branch in three scenarios to satisfy the probabilities (Figure 1 (c)). For S_1 = {∅, {1}, {1,2}}, the system I(S_1)

$$x_{\{1\}} + x_{\{1,2\}} \le 0.5, \qquad x_{\{1,2\}} \ge 0.6, \qquad x_{\emptyset} + x_{\{1\}} + x_{\{1,2\}} = 1$$

has no non-negative solution: x_{1,2} needs to be ≤ 0.5 and ≥ 0.6 simultaneously. Hence S_0 ∈ S(a_0) but S_1 ∉ S(a_0). The latter means that to satisfy a_0, one must find a transition where the formula b is satisfied, but a is not.

If P(a) = ∅, then 2^{P(a)} = {∅}, and I({∅}) = {1 = x_∅}, which has a trivial solution; i.e., if an atom a contains no probabilistic subformulas, S(a) contains only one element, and the construction reduces to classical LTLf. Each element in S(a) defines a set of tuples of atoms yielding the transition relation of the automaton. Assume w.l.o.g. that the elements of each S ∈ S(a) are ordered as Q_1, …, Q_{|S|}. T_S(a) is the set of |S|-tuples of atoms (a_1, …, a_{|S|}) s.t. for all ○ψ, ⊙_{⋈p} ψ ∈ csub(φ): (i) ○ψ ∈ a iff ψ ∈ a_i for all i, and (ii) for every i, 1 ≤ i ≤ |S|, ⊙_{⋈p} ψ ∈ Q_i iff ψ ∈ a_i.

Example 3. From Example 2, S_0 ∈ S(a) defines 3-tuples of atoms s.t. the first two elements contain one of the probabilistic subformulas each, and the last contains both probabilistic subformulas. Hence, the tuple (a_1, a_2, a_3) formed by the following atoms belongs to T_{S_0}(a):

<0.6b,b, a,¬b}

We define an automaton which can decide satisfiability of PLTLf formulas.

Definition 4. The tree automaton A_φ = (Q, ∆, I, F) is given by Q = At(φ), ∆ = {{a} × ⋃_{S ∈ S(a)} T_S(a) | a ∈ Q}, I = {a ∈ Q | φ ∈ a}, and F the set of all atoms not containing formulas of the form ○ψ, ⊙_{>p} ψ, or ⊙_{≥p′} ψ, p′ > 0.

A_φ naturally generalises the automata-based approach for satisfiability of LTLf formulas: if φ has no probabilistic constructor (i.e., it is an LTLf formula), A_φ is the standard automaton for this setting (De Giacomo, De Masellis, and Montali 2014). A_φ accepts a class of well-structured quasi-models of the formula φ, merging redundant branches. The only elements missing to obtain a model are the probabilistic values attached to each successor of a node. These are found by solving the system of inequalities built from each transition.

Theorem 5. The formula φ is satisfiable iff L(A_φ) ≠ ∅.

Automata emptiness is decidable in time O(|Q|^{k+2}), where k is the rank of the automaton. In this case, the rank of A_φ depends on the size of the formula φ; specifically, on the number of probabilistic subformulas that it contains: if csub(φ) has n probabilistic subformulas, the rank of A_φ is bounded by 2^n; i.e., emptiness of A_φ runs in time O(|Q|^{2^n}). There is also a non-deterministic algorithm that uses space O(|Q|·2^n). As the number of states is bounded exponentially on the length of φ, Savitch's theorem (Savitch 1970) yields the following result.

Theorem 6. PLTLf satisfiability is decidable in exponential space in the number of probabilistic formulas, but only exponential time in the size of the formula.

If the total number of probabilistic formulas is bounded by some constant, or if we parameterise the problem over the number of probabilistic subformulas (Downey and Fellows 2012), then satisfiability of PLTLf formulas is in EXPTIME. Conversely, satisfiability is EXPTIME-hard on the length of the input formula φ. The proof of this fact is based on a reduction from the intersection non-emptiness problem for deterministic tree automata (Comon et al. 2007; Seidl 1994).

Theorem 7. PLTLf satisfiability is EXPTIME-hard.

Probabilistic Entailment

We have focused on a decision problem considering only the existence of a model of the PLTLf formula φ, disregarding the probabilistic information included in φ. We now consider reasoning problems dealing with the likelihood of different traces. A basic probabilistic reasoning problem on PLTLf is computing the most likely trace, along with its probability. To handle the multiplicity of models, we use an optimistic approach, which selects the model maximising the likelihood of observing a given trace. One could have chosen a pessimistic approach minimising the likelihood instead. Such a case can be handled analogously by replacing all relevant maxima with minima in the following.

Definition 8. A trace is a finite sequence of propositional valuations. The interpretation I = (T, ·^I, P) contains the trace t if there is a branch b of T such that b^I = t. The probability of t in I is P_I(t) = ∏_{w ∈ b} P(w). The probability of t w.r.t. the PLTLf formula φ is P_φ(t) = max_{I |= φ} P_I(t).

Example 9. Using the formula φ_0 from Example 1, let I_0 and I_1 be the models (b) and (c) from Figure 1, respectively. I_1 contains the trace (∅, {a}), but I_0 does not. Both models contain the trace t = (∅, {a}, {b}); P_{I_0}(t) = 0.5 and P_{I_1}(t) = 0.1. P_{φ_0}(t) = 0.5, as witnessed by the model I_0.

We want to find the traces with a maximal probability. Formally, t is a most likely trace (mlt) w.r.t. φ iff for every trace t′, P_φ(t′) ≤ P_φ(t). In our running example, a most likely trace is (∅, ∅, {b}), which has probability 1 (Figure 1 (a)); however, mlts are not necessarily unique; indeed, (∅, ∅, {a, b}) and (∅, ∅, {b}, ∅) are also mlts w.r.t. φ_0. To find the mlts w.r.t. φ, we transform the tree automaton A_φ into a weighted string automaton B_φ which keeps track of the most likely transitions available from a given state of A_φ. For brevity, we do not distinguish between the valuation forming a model, and the atom (containing the valuation) of the run of the automaton. Using the probabilistic semiring, the behaviour of B_φ yields the probability of the mlts. We later show how to use this information to extract the actual traces.
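The probability of a trace within a single interpretation (Definition 8) amounts to multiplying the values of P along a matching branch. The sketch below assumes a hypothetical encoding (nodes as tuples of child indices, `()` for the root) and a hypothetical tree that is not one of the models of Figure 1. Since P is not defined on the root, the product starts at the first successor; returning the largest product when several branches carry the same labels is likewise an assumption.

```python
from fractions import Fraction as F


def trace_probability(labels, prob, trace):
    """P_I(t): product of P(w) along a branch whose labels spell `trace`.

    Returns None if no branch matches. If several branches match, the
    largest product is returned (an assumption of this sketch).
    """
    best = None
    for leaf in labels:
        # a node is a leaf if no other node has it as parent
        is_leaf = not any(v[:-1] == leaf for v in labels if v)
        if not is_leaf or len(leaf) + 1 != len(trace):
            continue
        branch = [leaf[:i] for i in range(len(leaf) + 1)]  # root ... leaf
        if [labels[w] for w in branch] != list(trace):
            continue
        p = F(1)
        for w in branch[1:]:  # P is undefined on the root
            p *= prob[w]
        best = p if best is None else max(best, p)
    return best


E, A, B, AB = frozenset(), frozenset("a"), frozenset("b"), frozenset("ab")
# Hypothetical interpretation: the {b}-successor has one further step.
LABELS = {(): E, (1,): A, (2,): B, (2, 1): E, (3,): AB}
PROB = {(1,): F(2, 5), (2,): F(1, 2), (2, 1): F(1), (3,): F(1, 10)}
```

Here `trace_probability(LABELS, PROB, (E, B, E))` multiplies 1/2 by 1, while `(E, B)` is not a trace of the interpretation because the corresponding node is not a leaf.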

Recall that the reduced automaton Â_φ of the emptiness test excludes the bad states from A_φ and accepts the same language as A_φ, but from every state in Â_φ one can build a successful run. Deleting bad states also removes all transitions (produced by the different combinations of the probabilistic subformulas that appear in an atom) which cannot be used due to semantic incompatibilities. Let (a, a_1, …, a_n) ∈ Δ̂; i.e., a transition from Â_φ. There exists an S ∈ S(a) such that (a_1, …, a_n) ∈ T_S. For each Q ∈ S, we solve the optimisation problem

$$\text{maximize } x_Q \quad \text{subject to } I(S).$$

Intuitively, we compute the largest probability that a branch satisfying the probabilistic constraints in Q can obtain, given the other branches defined by S. Call the result of this optimisation problem m_{S,a}(Q). As each optimisation problem is solved independently, the maxima may not add up to 1; e.g., for a_0, S_0 from Example 2, m_{S_0,a_0}({2}) = 1, m_{S_0,a_0}({1,2}) = 0.5 and m_{S_0,a_0}(∅) = m_{S_0,a_0}({1}) = 0.4. This is intended; we try to identify the largest probability that can be assigned to a path in a model; i.e., a trace.
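The optimisation step can be illustrated with a small exact solver. The sketch below is not the procedure of the paper: instead of a linear-programming library it enumerates candidate vertices of the bounded solution space with exact fractions, and it only handles non-strict constraints. The variable order (x_{1}, x_{2}, x_{1,2}) and the first constraint of I(S_0), x_{1} + x_{1,2} ≤ 0.5, are assumptions mirrored from the system I(S_1) of Example 2.

```python
from fractions import Fraction as F
from itertools import combinations


def solve_square(rows, rhs):
    """Gauss-Jordan elimination over exact fractions; None if singular."""
    n = len(rows)
    m = [[F(v) for v in r] + [F(b)] for r, b in zip(rows, rhs)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return None
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [v / m[col][col] for v in m[col]]
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[r][n] for r in range(n)]


def satisfied(coeffs, op, bound, x):
    value = sum(c * v for c, v in zip(coeffs, x))
    return {"<=": value <= bound, ">=": value >= bound, "==": value == bound}[op]


def maximise(objective, constraints, n):
    """Maximum of objective . x over the system, or None if it has no solution."""
    # Non-negativity of every variable is part of the system.
    system = list(constraints) + [
        ([F(int(i == j)) for j in range(n)], ">=", F(0)) for i in range(n)
    ]
    best = None
    for tight in combinations(system, n):  # candidate vertex: n tight constraints
        x = solve_square([c for c, _, _ in tight], [b for _, _, b in tight])
        if x is None or not all(satisfied(c, op, b, x) for c, op, b in system):
            continue
        value = sum(c * v for c, v in zip(objective, x))
        best = value if best is None else max(best, value)
    return best


# I(S_0) over (x_{1}, x_{2}, x_{1,2}); the first row is an assumption (see above).
I_S0 = [([1, 0, 1], "<=", F(1, 2)), ([0, 1, 1], ">=", F(3, 5)), ([1, 1, 1], "==", F(1))]
# I(S_1) over (x_{}, x_{1}, x_{1,2}), as given in Example 2.
I_S1 = [([0, 1, 1], "<=", F(1, 2)), ([0, 0, 1], ">=", F(3, 5)), ([1, 1, 1], "==", F(1))]
```

Under these assumptions the sketch reproduces the values 1, 0.5 and 0.4 quoted above for {2}, {1,2} and {1}, and reports that I(S_1) has no solution.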

For an atom a and a set Q ⊆ P(a), let now

$$m_a(Q) := \max_{S \in S(a),\ Q \in S} m_{S,a}(Q).$$

We obtain the automaton B_φ by flattening Â_φ into a string automaton, and weighting every transition according to m.

Definition 10. B_φ = (Q̂, in, wt, F̂) is the weighted automaton where Q̂ and F̂ are obtained from Â_φ, in(a) = 1 iff a ∈ Î (0 o.w.), and wt(a, a′) = m_a(Q), where Q ∈ a′.

Importantly, B_φ is constructed from Â_φ and not from A_φ. This ensures that branches defined by unsatisfiable constraints are ignored. For example, if an atom a satisfies {⊙_{≤p} a, ⊙_{≤q} ¬a} ⊆ a (where the formulas refer to the propositional variable a), with p + q < 1, then S(a) ≠ ∅ and m_a({⊙_{≤p} a}) = p. However, the atom a is not a final state, but a bad state: it has no transition. Constructing B_φ from A_φ, the atom a would have a transition to an atom a′ containing the variable a (with weight p), which is incorrect.

Theorem 11. Let B_φ be the weighted automaton constructed from φ by Definition 10. The probability of the mlt w.r.t. φ is ‖B_φ‖.

The behaviour of B_φ is computable in polynomial time on the number of states, i.e., exponential on the length of the formula, but is not affected by the number of probabilistic subformulas appearing in φ. Yet, to build B_φ, we need first to construct and manipulate A_φ, which may be doubly-exponential on the number of probabilistic subformulas. Indeed, there is a trace with positive probability iff φ is satisfiable. Thus, deciding whether the probability of the most likely trace is higher than some bound has the same complexity as satisfiability.

Corollary 12. The probability of the mlts is computable in exponential space in the number of probabilistic formulas, but exponential time in the size of the formula. Deciding if it is greater than 0 is EXPTIME-hard.

To find the mlts (and not just their probability), we adapt the computation process for the behaviour of B_φ. At each iteration of the computation, associate each state with the maximum probability that can be derived from a trace starting from it. Together with this number, we also store the successor states yielding that maximum probability, getting an automaton that accepts all the most likely traces.

Definition 13. Given a PLTLf formula φ, its weighted automaton B_φ = (Q, in, wt, F), and a state a ∈ Q, let w(a) be obtained through the computation of the behaviour of B_φ. The unweighted automaton B̄_φ = (Q, I, ∆, F) is given by

$$I = \{a \in Q \mid \mathit{in}(a)\cdot w(a) = \|B_\varphi\|\}, \qquad \Delta = \{(a,a') \in Q \times Q \mid \mathit{wt}(a,a')\cdot w(a') = w(a)\}.$$

Theorem 14. B̄_φ accepts the most likely traces.
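Definition 13 can be read as a filter over the weighted automaton: keep the initial states that realise the behaviour and the transitions that preserve the optimal weight. The sketch below follows that reading literally; the dictionary encoding, the three-state automaton and the precomputed values `w` are hypothetical and only serve as an illustration.

```python
from fractions import Fraction as F


def most_likely_automaton(states, init, wt, w):
    """Initial states and transitions of the automaton in Definition 13.

    w: dict state -> maximum weight of a run starting there, assumed to be
    precomputed by the behaviour computation.
    """
    norm = max(init.get(q, F(0)) * w[q] for q in states)  # ||B_phi||
    initial = {q for q in states if init.get(q, F(0)) * w[q] == norm}
    delta = {
        (a, b)
        for a in states
        for b in states
        if wt.get((a, b), F(0)) * w[b] == w[a]  # transition keeps the optimum
    }
    return initial, delta


# Hypothetical weighted automaton with final state t and precomputed w.
states = ["s", "t", "u"]
init = {"s": F(1)}
wt = {("s", "t"): F(1, 2), ("s", "u"): F(9, 10), ("u", "t"): F(4, 5)}
w = {"s": F(18, 25), "t": F(1), "u": F(4, 5)}
initial, delta = most_likely_automaton(states, init, wt, w)
```

Only the path s, u, t survives in this example; the direct transition from s to t is dropped because its weight 1/2 is below w(s) = 18/25.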

While it is important to understand the mlts, it is often more useful to compute the likelihood of observing a specific trace or an element from a set of traces; e.g., to verify that an unwanted outcome is unlikely. This problem can be reduced to that of computing the most likely traces, as long as the set of desired traces is a recognisable language.

Definition 15. Let L be a recognisable set of finite traces and φ a PLTLf formula. The probability of L w.r.t. φ is

$$P_\varphi(L) = \max_{t \in L} P_\varphi(t).$$

A trace t ∈ L is a most likely trace of L w.r.t. φ if it holds that P_φ(t) = P_φ(L).

Since L is recognisable, there exists an automaton A_L that accepts exactly the traces in L. We can see this automaton as a very simple weighted automaton, whose weights are all in {0,1}. To find the probability of the mlts of L and the mlts themselves, we intersect A_L with the weighted automaton B_φ and with the unweighted automaton B̄_φ of Definition 13, respectively, and compute ‖A_L ∩ B_φ‖ and the language accepted by A_L ∩ B̄_φ, respectively.

Consider now the same probabilistic problems but in relation to an observed prefix. Given a sequence s of propositional valuations, we want to analyse only traces t that extend s. Formally, if φ is a PLTLf formula, and s is a finite sequence of propositional valuations, a most likely trace extending s is a trace t = s·u such that for every trace t′ = s·u′, P_φ(t′) ≤ P_φ(t). We want to find all the most likely traces extending s, and their probability. Note that the set of all finite words over the alphabet of propositional valuations which extend s is recognisable; indeed, a simple concatenation of the universal automaton to the automaton that accepts only s recognises this language. Hence, our previous results answer this question.

The PLTL0f Fragment of PLTLf

We have seen that even the basic task of satisfiability is, in the PLTLf case, EXPTIME-hard on the length of the formula. To mitigate this complexity, we now focus on the fragment of PLTLf where probabilities can only appear as the top-most temporal constructor of a conjunctive formula. We call this fragment PLTL0f. Formally, a PLTL0f formula is a finite set of expressions of the form ⊙_{⋈p} ϕ, where ϕ is a classical LTLf formula, ⋈ ∈ {≤, ≥, <, >}, and p ∈ [0,1].³ In terms of processes, ⊙_{⋈p} ϕ expresses that the proportion of traces of the process that satisfy ϕ is ⋈ p. The set of formulas is interpreted as a conjunction of the probabilistic formulas appearing in it; that is, a PLTL0f formula is a conjunction of probabilistic constraints. Note that in this restricted setting, the probabilistic constructor ⊙ refers only to the probability of observing a specific LTLf formula, without a reference to the next point in time. We preserve the same notation to remain consistent with the general logic PLTLf.

[Figure 2: A probabilistic model P = ({I_1, I_2, I_3}, P) of Φ_0, shown in panels (a) and (b). The probability of each interpretation appears on the left.]

This logic is interesting for two reasons. On the one hand, reasoning about PLTL0f falls down to PSPACE, matching the complexity of the classical LTLf case (without probabilities). On the other hand, PLTL0f is suited for describing declarative constraints mined from historical log data of business process executions. More specifically, some PLTL0f patterns can be readily mined from log data using existing declarative process discovery techniques.

Reasoning in PLTL0f

The superposition semantics of PLTLf collapses in PLTL0f to the more standard multiple-world semantics. To simplify the presentation, we define a probabilistic interpretation as a pair P = (I, P_I), where I is a finite set of LTLf interpretations and P_I is a discrete probability distribution over I. Satisfiability of an LTLf formula by an LTLf interpretation is defined as usual (De Giacomo and Vardi 2013). The probabilistic interpretation P = (I, P_I) is a model of the PLTL0f formula Φ = {⊙_{⋈p_i} φ_i | 1 ≤ i ≤ n}, analyse the 2^n possible scenarios of a trace, depending on which of the constraints are satisfied or violated. More precisely, consider the 2^n sets of constraints in the Cartesian product ∏_{i=1}^{n} {φ_i, ¬φ_i}; i.e., each set chooses for every formula whether it will be satisfied or violated. If each of these sets, seen as the conjunction of the formulas that it contains, is satisfiable, then the input PLTL0f formula is satisfiable as well. On the other hand, if any of these sets is unsatisfiable, it means that it is impossible to build a trace that satisfies that combination of formulas; hence that scenario should be assigned probability 0.

To verify that probabilities for the remaining branches can still be assigned consistently with the values in Φ, we build a system of inequalities whose solution space is precisely the valid probability assignments. We consider one variable for each case. For readability, we name these variables x_0, …, x_{2^n−1} using a binary subindex which indicates satisfaction or violation of constraints, assuming w.l.o.g. that the constraints are linearly ordered. That is, the subindex is a chain of length n of 0s and 1s; a 0 or a 1 at position i means that ¬φ_i or φ_i is satisfied, respectively. We use the same subindex convention to refer to the sets of constraints S_i; e.g., if Φ contains three formulas, then x_010 is the variable corresponding to the set S_010 = {¬φ_1, φ_2, ¬φ_3}. Using this information, L_Φ is the system of inequalities

$$\begin{aligned}
x_i &\ge 0 && 0 \le i < 2^n\\
\sum_{i=0}^{2^n-1} x_i &= 1\\
\sum_{i\,:\,j\text{th position of } i \text{ is } 1} x_i &\bowtie p_j && 1 \le j \le n\\
x_i &= 0 && \text{if } S_i \text{ is unsatisfiable}
\end{aligned}$$

The first two lines guarantee that we assign a non-negative value to each variable, and that their sum is one; we can see these assignments as probabilities. The third line verifies the probability associated to each constraint in Φ: all the variables that correspond to cases making φ_j true should add up to a value ⋈ p_j. The last line ensures that the unsatisfiable cases are never assigned a positive probability. This system of inequalities has a solution iff the PLTL0f formula is satisfiable.

Example 16. Let Φ_1 := {⊙_{≤0.8} ♦a, ⊙_{≤0.7} (a → ♦b)}. Since Φ_1 has two probabilistic constraints, we build four variables and sets

S_00 := {¬♦a, ¬(a → ♦b)},   S_01 := {¬♦a, (a → ♦b)},
S_10 := {♦a, ¬(a → ♦b)},   S_11 := {♦a, (a → ♦b)}.

S_00 is clearly unsatisfiable, but the remaining three sets are satisfiable. The system of inequalities must enforce that x_00 is 0. Specifically, the system L_{Φ_1} is

$$\begin{aligned}
& x_{00} = 0, \quad x_{01} \ge 0, \quad x_{10} \ge 0, \quad x_{11} \ge 0,\\
& x_{00} + x_{01} + x_{10} + x_{11} = 1,\\
& x_{10} + x_{11} \le 0.8, \quad x_{01} + x_{11} \le 0.7.
\end{aligned}$$

A solution of L_{Φ_1} is x_00 = 0, x_01 = 0.2, x_10 = 0.3, and x_11 = 0.5, which yields a probabilistic model of Φ_1 consisting of three interpretations, I_1, I_2, I_3; each interpretation I_i satisfies the constraints in the set S_i and is assigned the probability x_i. An interpretation for S_00 is not needed because these constraints are unsatisfiable (and the combination of formulas is assigned probability 0).
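The system L_Φ is mechanical to build from the list of constraints and the set of unsatisfiable scenarios. The sketch below only checks a candidate assignment against the four groups of conditions; it does not solve the system, and it takes the unsatisfiable scenarios as given instead of running LTLf satisfiability tests. The encoding of constraints as pairs (relation, bound) and of scenarios as binary strings is an assumption made for the illustration.

```python
from fractions import Fraction as F
from itertools import product
import operator

OPS = {"<=": operator.le, ">=": operator.ge, "<": operator.lt, ">": operator.gt}


def scenarios(n):
    """All 2^n binary indices; position j is '1' iff the j-th formula holds."""
    return ["".join(bits) for bits in product("01", repeat=n)]


def satisfies_system(x, constraints, unsatisfiable):
    """Check an assignment x (index -> probability) against L_Phi."""
    idx = scenarios(len(constraints))
    if any(x[i] < 0 for i in idx):  # x_i >= 0
        return False
    if sum(x[i] for i in idx) != 1:  # the values form a distribution
        return False
    if any(x[i] != 0 for i in unsatisfiable):  # impossible scenarios get 0
        return False
    for j, (op, p) in enumerate(constraints):
        # total mass of the scenarios in which the j-th formula is satisfied
        mass = sum(x[i] for i in idx if i[j] == "1")
        if not OPS[op](mass, p):
            return False
    return True


# Phi_1 from Example 16: two constraints, scenario 00 unsatisfiable.
PHI_1 = [("<=", F(4, 5)), ("<=", F(7, 10))]
SOLUTION = {"00": F(0), "01": F(1, 5), "10": F(3, 10), "11": F(1, 2)}
UNIFORM = {i: F(1, 4) for i in scenarios(2)}
```

The assignment given in Example 16 passes the check, whereas the uniform distribution fails because it gives positive probability to the unsatisfiable scenario S_00.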

Theorem 17. The PLTL0f formula Φ is satisfiable iff L_Φ has a solution.

To construct L_Φ, one must solve 2^n LTLf satisfiability tests, each requiring polynomial space (Sistla and Clarke 1985). Solving this system of inequalities requires polynomial time on the number of variables; i.e., exponential on n. Overall, it needs exponential time on n, but only polynomial space on the length of Φ.

Theorem 18. PLTL0f satisfiability is decidable in exponential time on the number of probabilistic formulas, but in polynomial space on their total length.

In particular, if the number of formulas in Φ is bounded, satisfiability is PSPACE-complete, improving the EXPTIME lower bound for general PLTLf (Theorem 7). We are more interested in deducing (probabilistic) guarantees of a process that follows the constraints in a formula; and more importantly of traces being observed over them. Recall that in our semantics any execution is possible as long as there is no evidence to the contrary. In PLTL0f the uncertainty is stated at the beginning; that is, we do not know which of the formulas φ_1, …, φ_n are satisfied, but once a trace has chosen its path, it remains in it without further uncertainty arising later on.

As before, we follow an optimistic approach and try to find the most likely scenarios and traces that fit the constraints in the formula. Recall that the solution space of L_Φ yields the probability assignments that can be consistently given to the LTLf interpretations appearing in a probabilistic model according to the constraints that it satisfies. Thus, maximising the value of a variable x_i yields the maximum probability that the set S_i may be assigned in a model.

For each i, 0 ≤ i < 2^n, let max_i be the solution of maximising x_i subject to L_Φ. Note that each variable is maximised independently of the others, and hence the values max_i do not form a probability distribution per se; their sum may be greater than 1. The values max_i express the optimistic position of assigning the highest possible probability to the traces satisfying S_i. For the formula Φ_1 from Example 16, the answers to the maximisation problems for the different variables correspond precisely to the values in the solution presented; namely, max_00 = 0, max_01 = 0.2, max_10 = 0.3, and max_11 = 0.5.

The question of finding the mlts or simply the most likely scenario can be answered using the values max_i. Take the indices j where max_j is the largest among all variables; i.e., j ∈ argmax{max_i | 0 ≤ i < 2^n}. Each S_j is a most likely scenario, and every trace satisfying S_j is an mlt, with probability max_j. In our example, S_11 is the most likely scenario; i.e., we expect to observe a trace satisfying ♦a and (a → ♦b).

If Φ represents a process, the mlts are those that we would expect to observe in an execution of the process in the absence of other information. In general, it is more interesting to predict the future behaviour of the process, given some observation of its first steps. Given a (partial) trace t, corresponding to the prefix of a full process, we want to find the most likely scenario where t can happen, and predict the potential future evolution of the trace. Given a set S of LTLf formulas and a prefix t, S accepts t iff there is a suffix s such that S |= t·s. In words, S accepts a prefix if it can be extended into a trace that satisfies all the conditions in S. To find the most likely scenario accepting a prefix, and a suffix extending it to a successful trace, we generalise the idea described for the case without prefix.

Algorithm 1: Most likely scenario for t over Φ.
Data: Φ = {⊙_{⋈p_i} φ_i | 1 ≤ i ≤ n} PLTL0f formula, t prefix
Result: Index of the most likely scenario for t in Φ
mls ← −1
for 0 ≤ i < 2^n do
    compute max_i
    if S_i accepts t and max_i > max_mls then
        mls ← i
return mls

Let acc(t) := {i | S_i accepts t} be the set of indices of the scenarios that accept t, and let j ∈ argmax_{i ∈ acc(t)} {max_i} be an index with the maximum value in the set of solutions from the maximisation problems of L_Φ. Then, S_j is a most likely scenario given t, and any trace extending t accepted by S_j is an mlt. For Φ_1 in Example 16, the most likely scenario for any finite prefix t is always S_11, and t·ab is always a trace accepted by this set. In general, however, the most likely scenario may change as a prefix grows.

Example 19. Let $\Psi_1 := \{\diamondsuit_{\le 0.5}\Diamond a,\ \diamondsuit_{\le 0.6}\Box(a \to \Diamond b)\}$, which is very similar to $\Phi_1$ (Example 16), but with different probabilities. We get $\mathit{max}_{00} = 0$, $\mathit{max}_{01} = 0.5$, $\mathit{max}_{10} = 0.4$, and $\mathit{max}_{11} = 0.1$. For the empty prefix $\varepsilon$ and the prefix $\neg a$, the most likely scenario is $S_{01}$, which holds with probability 0.5, and a most likely continuation would append to them a finite number of observations of $\neg a$. If at the second point in time we observe $a$ (making the prefix $\neg a \cdot a$), then $S_{01}$ does not accept this prefix anymore, and the most likely scenario becomes $S_{10}$: we will eventually observe an $a$ after which $b$ will never be observed anymore.

The method for finding the most likely scenario is formalised in Algorithm 1, where $\mathit{max}_{-1} := 0$. Note that an expensive part of this algorithm is the computation of the values $\mathit{max}_i$. However, this computation can be done offline, as a preprocessing step before the algorithm is called, as these values remain invariant for any call. A second point to consider is that the set $J := acc(t)$ of indices of accepting scenarios monotonically decreases as the trace $t$ grows. More precisely, for every $t, s$, if $S \Vdash t \cdot s$, then $S \Vdash t$. Hence, if we are monitoring the evolution of a process, trying to find out the most likely continuation of the currently observed trace, then after every newly observed step, we only need to update the set $J$ to remove those $S_i$ that no longer accept the prefix. Finally, to avoid unnecessary tests, we can exclude from the for loop all indices $i$ where $\mathit{max}_i = 0$: $\mathit{max}_i = 0$ means that the system should not observe any trace satisfying $S_i$. If the most likely scenario is one that has probability 0, then the observed prefix is violating the conditions described by $\Phi$.
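The monitoring idea in the paragraph above (discard zero-valued scenarios up front, then only shrink the candidate set as the prefix grows) might look as follows. This is a sketch under assumptions: `monitor`, `example_accepts`, and the trace encoding are hypothetical names introduced here, and the acceptance oracle is hand-written for a toy case rather than derived by an LTL$_f$ reasoner.

```python
from typing import Callable, FrozenSet, Iterable, Iterator, List, Sequence

Step = FrozenSet[str]


def monitor(
    max_values: Sequence[float],
    accepts: Callable[[int, List[Step]], bool],
    events: Iterable[Step],
) -> Iterator[int]:
    """Yield, after each observed step, the index of the most likely scenario.

    Scenarios with max_i = 0 are discarded up front. The candidate set only
    shrinks as the prefix grows, so rejected scenarios are never re-tested.
    Yields -1 when no candidate is left (the prefix violates the model).
    """
    candidates = [i for i, value in enumerate(max_values) if value > 0]
    prefix: List[Step] = []
    for step in events:
        prefix.append(step)
        candidates = [i for i in candidates if accepts(i, prefix)]
        yield max(candidates, key=lambda i: max_values[i], default=-1)


def example_accepts(i: int, prefix: List[Step]) -> bool:
    """Hand-written oracle (assumption): scenario 1 forbids any occurrence of 'a'."""
    return i != 1 or all("a" not in step for step in prefix)


EXAMPLE_MAX = [0.0, 0.5, 0.4, 0.1]
```

Relying on monotonicity means each scenario is tested at most once per step while it is still a candidate, which is the saving the text describes.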

Proposition 20. Algorithm 1 returns the index of the most likely scenario, given a prefix.

Interestingly, assuming that all the values $\mathit{max}_i$ have been computed before, Algorithm 1 can be executed using only polynomial space. The information to control the for loop requires at most $n$ bits, and the two tests within this loop require polynomial space.

Note that finding the probability of the most likely scenario (and trace) is akin to monitoring agreement with a model. Analogously, one can extend the task to monitoring a complex PLTL$_f$ property $\psi$. For the maximum likelihood of accepting $\psi$, we use Algorithm 1, but now considering whether $S_i \cup \{\psi\} \Vdash t$; i.e., finding the scenarios where $\psi$ may still be satisfied given the knowledge of $t$. This is analogous to the notion of eventual satisfaction in monitoring. Other notions like current satisfaction or permanent satisfaction can be dealt with accordingly, modifying the notion of acceptability of a trace w.r.t. a set of LTL$_f$ constraints (De Giacomo et al. 2014).

Discovering PLTL$^0_f$ Patterns from Event Log Data

PLTL$^0_f$ formulas can be automatically mined from event log data using state-of-the-art declarative process discovery algorithms within process mining. Process mining focuses on the continuous improvement of business processes based on factual data (van der Aalst 2016). Such data are stored in a so-called event log, where each event refers to an activity (a well-defined step in a process) and is related to a case (a process instance). Events in a case are ordered and seen as an execution (or trace) of the process. A core process mining task is process discovery, which learns a process model that reproduces the traces contained in the log. In declarative process discovery, the target model is specified using rules/constraints, like the LTL$_f$ patterns adopted by the Declare process modelling language (Pesic, Schonenberg, and van der Aalst 2007).

Maggi, Chandra Bose, and van der Aalst (2012) developed a two-phase method to automatically infer Declare constraints from event logs. In the first phase, candidate constraints to be mined are generated by an algorithm called Apriori. This algorithm returns frequent activity sets indicating a high correlation between activities involved in an activity set. Highly correlated sets are used to instantiate, in all possible ways, the Declare patterns. For example, considering the frequent activity set $\{a, b\}$ and the Declare pattern of response, the two LTL$_f$ constraints $\Box(a \to \Diamond b)$ and $\Box(b \to \Diamond a)$ are generated. In the second phase, the set of so-generated constraints is filtered by retaining only "relevant" constraints, where relevance is measured using metrics such as that of support: the proportion of traces satisfying the constraint.

What makes Apriori interesting in our setting is that support can be interpreted as the constraint probability: the discovery of an LTL$_f$ formula $\varphi$ with support $p$ (that is, appearing in $100p\%$ of the traces) can be interpreted as the discovery of the PLTL$^0_f$ formula $\diamondsuit_{=p}\varphi$.
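To make the reading of support as a probability concrete, the following sketch computes the support of a Declare response constraint over a toy event log. It is an illustration only: the function names, the log, and the encoding of a trace as a list of activity names are assumptions made here, and the checker covers just the response pattern rather than general LTL$_f$.

```python
from typing import List, Sequence


def satisfies_response(trace: Sequence[str], a: str, b: str) -> bool:
    """Check the response pattern on a finite trace of activity names:
    every occurrence of `a` is followed, at the same or a later position, by `b`."""
    pending = False
    for activity in trace:
        if activity == a:
            pending = True  # an obligation to see `b` is now open
        if activity == b:
            pending = False  # all open obligations are discharged
    return not pending


def support(log: Sequence[Sequence[str]], a: str, b: str) -> float:
    """Proportion of traces in the log that satisfy response(a, b)."""
    if not log:
        raise ValueError("support is undefined for an empty log")
    return sum(satisfies_response(t, a, b) for t in log) / len(log)


# Hypothetical toy log with four cases.
LOG: List[List[str]] = [["a", "b"], ["a", "c"], ["c"], ["a", "b", "a"]]
P = support(LOG, "a", "b")
```

Here two of the four traces satisfy the constraint, so under the interpretation above the log would yield the response constraint with probability 0.5.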

Conclusions

We have introduced a new probabilistic temporal logic PLTL$_f$ based on a novel superposition semantics, and its sublogic PLTL$^0_f$ where this semantics collapses to the standard multiple-world approach. These logics are specifically crafted for predicating about uncertainty in dynamic systems whose executions eventually finish. We studied the main properties of the logics, and provided automata-theoretic methods for extracting relevant information from them.

In future work, we plan to implement the algorithms for PLTL$^0_f$ and apply them to the declarative modelling and analysis of business processes, considering in particular monitoring and conformance checking.

References

Baader, F.; Hladik, J.; and Peñaloza, R. 2008. Automata can show pspace results for description logics. Inf. Comput. 206(9-10):1045–1056.

Baier, C., and Katoen, J.-P. 2008. Principles of Model Checking (Representation and Mind Series). The MIT Press.

Brafman, R. I.; De Giacomo, G.; and Patrizi, F. 2018. Ltlf/ldlf non-markovian rewards. In Proc. of AAAI, 1771–1778. AAAI Press.

Carmona, J.; van Dongen, B. F.; Solti, A.; and Weidlich, M. 2018. Conformance Checking - Relating Processes and Models. Springer.

Comon, H.; Dauchet, M.; Gilleron, R.; Löding, C.; Jacquemard, F.; Lugiez, D.; Tison, S.; and Tommasi, M. 2007. Tree automata techniques and applications. Available on: http://www.grappa.univ-lille3.fr/tata. Release October, 12th 2007.

De Giacomo, G., and Rubin, S. 2018. Automata-theoretic foundations of FOND planning for ltlf and ldlf goals. 4729–4735. ijcai.org.

De Giacomo, G., and Vardi, M. Y. 2013. Linear temporal logic and linear dynamic logic on finite traces. In Proc. of IJCAI, 854–860. AAAI Press.

De Giacomo, G.; De Masellis, R.; Grasso, M.; Maggi, F. M.; and Montali, M. 2014. Monitoring business metaconstraints based on LTL & LDL for finite traces. In Proc. of BPM, volume 8659 of LNCS, 1–17. Springer.

De Giacomo, G.; Maggi, F. M.; Marrella, A.; and Patrizi, F. 2017. On the disruptive effectiveness of automated planning for ltlf-based trace alignment. In Proc. of AAAI, 3555–3561. AAAI Press.

De Giacomo, G.; De Masellis, R.; and Montali, M. 2014. Reasoning on LTL on finite traces: Insensitivity to infiniteness. In Proc. of AAAI, 1027–1033. AAAI Press.

Downey, R. G., and Fellows, M. R. 2012. Parameterized Complexity. Springer.

Droste, M.; Kuich, W.; and Vogler, H. 2009. Handbook of Weighted Automata. EATCS. Springer, 1st edition.

Gerevini, A.; Haslum, P.; Long, D.; Saetti, A.; and Dimopoulos, Y. 2009. Deterministic planning in the fifth international planning competition: Pddl3 and experimental evaluation of the planners. Artif. Intell. 173(5-6):619–668.

Konur, S. 2010. Real-time and probabilistic temporal logics: An overview. CoRR abs/1005.3200.

Kovtunova, A., and Peñaloza, R. 2018. Cutting diamonds: A temporal logic with probabilistic distributions. In Proc. of KR, 561–570. AAAI Press.

Maggi, F. M.; Montali, M.; Westergaard, M.; and van der Aalst, W. M. P. 2011. Monitoring business constraints with linear temporal logic: An approach based on colored automata. LNCS, 132–147. Springer.

Maggi, F. M.; Chandra Bose, R. P. J.; and van der Aalst, W. M. P. 2012. Efficient discovery of understandable declarative process models from event logs. In Proc. of CAiSE, LNCS, 270–285.

Maggi, F. M.; Montali, M.; and Peñaloza, R. 2020. Temporal logics over finite traces with uncertainty. In Proc. of AAAI. AAAI Press. To appear.

Montali, M. 2010. Specification and Verification of Declarative Open Interaction Models: a Logic-Based Approach, volume 56 of LNBIP. Springer.

Morão, L. 2011. Probabilistic distributed temporal logic. Master thesis, Universidade Técnica de Lisboa, Portugal.

Ognjanovic, Z. 2006. Discrete linear-time probabilistic logics: Completeness, decidability and complexity. J. Log. Comput. 16(2):257–285.

Pesic, M.; Schonenberg, H.; and van der Aalst, W. M. P. 2007. Declare: Full support for loosely-structured processes. In Proc. of EDOC, 287–300. IEEE Computer Society.

Savitch, W. J. 1970. Relationships between nondeterministic and deterministic tape complexities. Journal of Computer and System Sciences 4(2):177–192.

Seidl, H. 1994. Haskell overloading is dexptime-complete. Information Processing Letters 52(2):57–60.

Sistla, A. P., and Clarke, E. M. 1985. The complexity of propositional linear temporal logics. J. ACM 32(3):733–749.

van der Aalst, W. M. P. 2016. Process Mining - Data Science in Action, Second Edition. Springer.

Vardi, M. Y., and Wolper, P. 1986. Automata-theoretic techniques for modal logics of programs. J. Comput. Syst. Sci. 32(2):183–221.

Woltzenlogel Paleo, B. 2016. An expressive probabilistic temporal logic. CoRR abs/1603.07453.


Appendix A: Proofs

Theorem 5. The formula $\varphi$ is satisfiable iff $L(\mathcal{A}_\varphi) \neq \emptyset$.

Proof. [$\Leftarrow$] Suppose first that $L(\mathcal{A}) \neq \emptyset$, and let $T \in L(\mathcal{A})$. This means that there exists a successful run $\rho$ of $\mathcal{A}$ over $T$, which maps every node $w \in T$ to an atom $\rho(w)$. Let $A$ be the set of all propositional variables appearing in $\varphi$. Define the function $\cdot^{I} : T \to 2^{A}$ by $w^{I} := \rho(w) \cap A$. Moreover, for every node $w$ with $k$ successors, as $\rho$ is a successful run, it holds that $(\rho(w1), \ldots, \rho(wk)) \in T_S$ for some $S \in \mathcal{S}(\rho(w))$. The latter means that the system $I(S)$ has a solution for the (ordered) variables $x_1, \ldots, x_k$ in $[0,1]$. Hence, we define the function $P : T \setminus \{\varepsilon\} \to [0,1]$ where $P(w\ell)$ is the solution of the system in $\rho(w)$ for the variable $x_\ell$. Overall, this defines an interpretation $I = (T, \cdot^{I}, P)$. We show by induction on the structure of the formulas that for every $\psi \in csub(\varphi)$ and every $w \in T$, $I, w \models \psi$ iff $\psi \in \rho(w)$. In particular, since $\rho$ is such that $\varphi \in \rho(\varepsilon)$, this implies that $I$ is a model of $\varphi$.

For a propositional variable $a \in A$ the result holds trivially by construction, so we focus on the remaining constructors. Assume that the result holds for every formula in $csub(\psi) \cup csub(\psi_1) \cup csub(\psi_2)$.

[$\neg$] $I, w \models \neg\psi$ iff $I, w \not\models \psi$ iff (induction hypothesis) $\psi \notin \rho(w)$ iff (atom maximality) $\neg\psi \in \rho(w)$.

[$\wedge$] $I, w \models \psi_1 \wedge \psi_2$ iff $I, w \models \psi_1$ and $I, w \models \psi_2$ iff (induction hypothesis) $\{\psi_1, \psi_2\} \subseteq \rho(w)$ iff (atom condition (ii)) $\psi_1 \wedge \psi_2 \in \rho(w)$.

[$\bigcirc$] $I, w \models \bigcirc\psi$ iff $w$ is not a leaf and for every $wi \in T$, $I, wi \models \psi$ iff for every $wi \in T$, $\psi \in \rho(wi)$ iff (definition of the transition relation) $\bigcirc\psi \in \rho(w)$.

[$U$] $I, w \models \psi_1 U \psi_2$ iff (i) $I, w \models \psi_2$ or (ii) $I, w \models \psi_1$ and for all $wi \in T$, $I, wi \models \psi_1 U \psi_2$. We show that $\psi_1 U \psi_2 \in \rho(w)$ by induction on the subtree rooted at $w$. If $w$ is a leaf node, only case (i) is possible, and so $I, w \models \psi_1 U \psi_2$ iff $\psi_2 \in \rho(w)$, which means that $\psi_1 U \psi_2 \in \rho(w)$ by the definition of an atom. If $w$ is not a leaf node, case (i) is treated as for the leaf nodes. Case (ii) holds iff $\psi_1 \in \rho(w)$ and, by the second induction, for every $wi \in T$, $\psi_1 U \psi_2 \in \rho(wi)$, which implies by the definition of the transition relation that $\bigcirc(\psi_1 U \psi_2) \in \rho(w)$, and since $\rho(w)$ is an atom, $\psi_1 U \psi_2 \in \rho(w)$.

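[$\diamondsuit$] $I, w \models \diamondsuit_{\bowtie p}\psi$ iff $\sum_{wi \in T,\ I, wi \models \psi} P(wi) \bowtie p$. By construction and the induction hypothesis, the latter holds iff $\sum_{wi \in T,\ \psi \in \rho(wi)} x_i \bowtie p$, where the $x_i$ form the solution of the system in $\rho(w)$. But this is only possible if $\diamondsuit_{\bowtie p}\psi \in Q_i \subseteq \rho(w)$.

Hence, $I$ is a model of $\varphi$ and $\varphi$ is satisfiable. This finishes this direction of the proof.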

[$\Rightarrow$] Conversely, suppose that $\varphi$ is satisfiable, and let $I = (T, \cdot^{I}, P)$ be a model of $\varphi$. We will use this tree to construct a successful run of $\mathcal{A}$, but it needs to be adapted to a simplified form. Given a node $w \in T$, let $\mathcal{P}(w) \subseteq csub(\varphi)$ be the set of all probabilistic formulas $\diamondsuit_{\bowtie p}\psi \in csub(\varphi)$ such that $I, w \models \diamondsuit_{\bowtie p}\psi$, and define $\mathcal{S}(w) \subseteq 2^{\mathcal{P}(w)}$ to be the set of subsets $O \subseteq \mathcal{P}(w)$ such that there is a successor $wi \in T$ that satisfies $I, wi \models \psi$ for all $\diamondsuit_{\bowtie p}\psi \in O$ and $I, wi \not\models \psi$ for all $\diamondsuit_{\bowtie p}\psi \notin O$. Given an $O \in \mathcal{S}(w)$, if there are two successors $wi, wj \in T$ that satisfy the previous conditions, it is possible to prune the tree $T$ by removing all nodes of the form $wjv$, $v \in \mathbb{N}^*$, and setting $P'(wi) := P(wi) + P(wj)$. It is easy to see that $I' = (T', \cdot^{I'}, P')$, where $T'$ is the pruned tree and $\cdot^{I'}$ is $\cdot^{I}$ restricted to $T'$, is also a model of $\varphi$. Let $I' = (T', \cdot^{I'}, P')$ be the result of applying this pruning procedure to all nodes in the original model (in a top-down manner). Then, it is a simple exercise to verify that the labelling $\rho : T' \to At(\varphi)$ where

$\rho(w) = \{\psi \in csub(\varphi) \mid I', w \models \psi\}$

is in fact a successful run of $\mathcal{A}$, and hence $L(\mathcal{A}) \neq \emptyset$.

Theorem 7. PLTL$_f$ satisfiability is EXPTIME-hard.

Proof. We prove EXPTIME-hardness by a reduction from the intersection non-emptiness problem for deterministic automata over labelled trees. A deterministic tree automaton over labelled trees is a tuple $\mathcal{A} = (\mathcal{Q}, \Sigma, \Delta, I, F)$ where $\mathcal{Q}$, $I$, and $F$ are as in the preliminaries, $\Sigma$ is a finite alphabet, and $\Delta : \mathcal{Q} \times \Sigma \to \bigcup_{i \le k} \mathcal{Q}^{i}$ is a total transition function. Given a $\Sigma$-labelled tree, a run of this automaton is a $\mathcal{Q}$-labelling that is consistent with the transition function. All the associated notions are defined in the obvious way. The intersection non-emptiness problem for these automata consists in deciding, given $n$ such automata $\mathcal{A}_i$, $1 \le i \le n$, with disjoint sets of states, whether $\bigcap_{i=1}^{n} L(\mathcal{A}_i) \neq \emptyset$. This problem is known to be EXPTIME-hard (Seidl 1994).

Given a deterministic automaton $\mathcal{A} = (\mathcal{Q}, \Sigma, \Delta, I, F)$, we build the PLTL$_f$ formula $\varphi_{\mathcal{A}}$ as follows. The propositional variables appearing in the formula will be given by the elements of $\mathcal{Q} \cup \Sigma \cup \{1, \ldots, k\}$; that is, the states, the symbols, and the first $k$ natural numbers. Intuitively, an interpretation and node mapping $q \in \mathcal{Q}$ to true means that the state $q$ holds in that element, and analogously for $\sigma \in \Sigma$. The variables $1, \ldots, k$ are used to distinguish the different successors of a node in a tree. This intuition will become clearer after the construction of the formula. In addition, we have a new propositional variable $x_{END}$ that identifies the leaf nodes.

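For every $(q, \sigma) \in \mathcal{Q} \times \Sigma$, let $\Delta(q, \sigma) = (q_1^{q,\sigma}, \ldots, q_{k_{q,\sigma}}^{q,\sigma})$ and define the formulas $\psi_{q,\sigma}$, $\psi_{\mathcal{Q}}$, $\psi_\Sigma$, and $\psi_N$ as in Figure 3. Then, we set

$$\varphi_{\mathcal{A}} := \bigvee_{q \in I} q \wedge \Big( \psi_{\mathcal{Q}} \wedge \psi_\Sigma \wedge \bigwedge_{(q,\sigma) \in \mathcal{Q} \times \Sigma} \psi_{q,\sigma} \Big).$$

Note that the length of $\varphi_{\mathcal{A}}$ is polynomially bounded by the size of $\mathcal{A}$. We first show that every model of $\varphi_{\mathcal{A}}$ can be transformed into a tree accepted by $\mathcal{A}$, and conversely, every tree accepted by $\mathcal{A}$, together with a successful run, is a representation of a model of $\varphi_{\mathcal{A}}$.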

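$$\psi_{q,\sigma} := \begin{cases} q \wedge \sigma \to x_{END} \vee \bigwedge_{i=1}^{k_{q,\sigma}} \diamondsuit_{\ge 1/k_{q,\sigma}} (q_i^{q,\sigma} \wedge i) & q \in F \\ q \wedge \sigma \to \neg x_{END} \wedge \bigwedge_{i=1}^{k_{q,\sigma}} \diamondsuit_{\ge 1/k_{q,\sigma}} (q_i^{q,\sigma} \wedge i) & q \notin F \end{cases}$$

$$\psi_{\mathcal{Q}} := \bigvee_{q \in \mathcal{Q}} \Big( q \wedge \bigwedge_{q' \in \mathcal{Q} \setminus \{q\}} \neg q' \Big) \qquad \psi_\Sigma := \bigvee_{\sigma \in \Sigma} \Big( \sigma \wedge \bigwedge_{\sigma' \in \Sigma \setminus \{\sigma\}} \neg \sigma' \Big) \qquad \psi_N := \bigwedge_{i=1}^{k} \Big( i \to \bigwedge_{j \neq i} \neg j \Big)$$

Figure 3: Formulas describing the automaton for the proof of Theorem 7.

Let $J = (T, \cdot^{J}, P)$ be a model of $\varphi_{\mathcal{A}}$. By construction (see formulas $\psi_{\mathcal{Q}}$ and $\psi_\Sigma$), for every node $w \in T$ there is exactly one $\sigma \in \Sigma$ and one $q \in \mathcal{Q}$ such that $\sigma, q \in w^{J}$. Abusing the notation, we call these elements $\Sigma(w)$ and $Q(w)$, respectively. We can assume, w.l.o.g., that for every non-leaf node $wj \in T$, $j \in (wj)^{J}$ (footnote 4). We construct the labelled tree $T_J : T \to \Sigma$ where $T_J(w) = \Sigma(w)$ for all $w \in T$. We show that $T_J \in L(\mathcal{A})$ by using the function $Q : T \to \mathcal{Q}$ to build a successful run of $\mathcal{A}$ on this tree. Note that for the root node $\varepsilon$, $Q(\varepsilon) \in I$, because $J$ is a model of $\bigvee_{q \in I} q$. For every leaf node $w \in T$, $Q(w) \in F$ because otherwise the formula $\psi_{q,\sigma}$ guarantees that $w$ must have a successor node. Finally, given a non-leaf node $w \in T$, let $q = Q(w)$, $\sigma = \Sigma(w)$, and $\Delta(q, \sigma) = (q_1^{q,\sigma}, \ldots, q_{k_{q,\sigma}}^{q,\sigma})$. Since $J$ is a model of $\psi_{q,\sigma}$, and by the assumption stated before, $w$ must have $k_{q,\sigma}$ successors, and is such that $Q(wj) = q_j^{q,\sigma}$ for all $j$, $1 \le j \le k_{q,\sigma}$. Thus, the labelling of $T$ provided by the function $Q$ forms a successful run, and $T_J \in L(\mathcal{A})$.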

For the second claim, let $T$ be a $\Sigma$-labelled tree, and $Q : T \to \mathcal{Q}$ a successful run of $\mathcal{A}$ over $T$. We build a model $J = (T, \cdot^{J}, P)$ of $\varphi_{\mathcal{A}}$ as follows. For every $w \in T$, $\sigma \in \Sigma$, and $q \in \mathcal{Q}$, we have $\sigma \in w^{J}$ iff $T(w) = \sigma$ and $q \in w^{J}$ iff $Q(w) = q$. In addition, for every node $wj \in T$, $j \in (wj)^{J}$, and for every leaf node $w$, $x_{END} \in w^{J}$. Finally, if $w$ has $\ell$ successors, then for every $i$, $1 \le i \le \ell$, $P(wi) = 1/\ell$. It is easy to see that, since $Q$ is a successful run of $\mathcal{A}$ over $T$, this interpretation satisfies all the constraints of $\varphi_{\mathcal{A}}$, and hence it is a model of this formula.

Given $n$ deterministic tree automata $\mathcal{A}_1, \ldots, \mathcal{A}_n$, we construct the PLTL$_f$ formula $\varphi := \bigwedge_{i=1}^{n} \varphi_{\mathcal{A}_i}$, whose length is polynomially bounded by the total length of the $n$ automata. We show, making use of the previous arguments, that this formula is satisfiable iff the intersection of these automata is not empty.

If there is a tree $T$ in the intersection of these automata, then there is a successful run for $\mathcal{A}_i$ over $T$ for all $i$, $1 \le i \le n$. We can use these successful runs to build a model of $\varphi$ as done in the previous paragraph. Conversely, let $J$ be a model of $\varphi$. In particular, $J = (T, \cdot^{J}, P)$ is a model of $\varphi_{\mathcal{A}_i}$ for all $i$, $1 \le i \le n$. Thus, as we have shown already, the labelled tree $T_J$ (which only depends on the alphabet symbols $\Sigma$) is in $L(\mathcal{A}_i)$ for all $i$; i.e., $T_J \in \bigcap_{i=1}^{n} L(\mathcal{A}_i)$, and hence the intersection is not empty.

Footnote 4: Otherwise, one can merge different successors $w\ell$ such that $j \in (w\ell)^{J}$ and reorder the successors to satisfy the condition.

Theorem 11. Let $\mathcal{B}_\varphi$ be the weighted automaton constructed from $\varphi$ by Definition 10. The probability of the mlt w.r.t. $\varphi$ is $\|\mathcal{B}_\varphi\|$.

Proof. Notice first that the probability of the mlt is zero iff $\varphi$ is unsatisfiable. In this case, the automata $\dddot{\mathcal{A}}_\varphi$ and $\mathcal{B}_\varphi$ become empty, and hence the behaviour of the latter is zero as well. So we are only interested in cases where this probability is greater than 0.

Consider first a run $\rho = q_0, \ldots, q_n$ of $\mathcal{B}_\varphi$ such that $wt(\rho) > 0$. In particular this means that $wt(q_i, q_{i+1}) > 0$ for all $i$, $0 \le i < n$. By construction, this means that for every $i$, $0 \le i < n$, there is a transition of $\dddot{\mathcal{A}}_\varphi$ (i.e., a tuple $\delta \in \dddot{\Delta}$) of the form $(q_i, \ldots, q_{i+1}, \ldots, q_k)$ such that $wt(q_i, q_{i+1})$ is the maximum value that can be given to a successor of a node satisfying $q_i$ which satisfies $q_{i+1}$. Moreover, all states appearing in this transition are good states, which means that a successful run can still be constructed from them. Thus, there is a successful run of $\dddot{\mathcal{A}}_\varphi$ (and hence of $\mathcal{A}_\varphi$) which has $\rho$ as a branch. If $in(q_0) = 1$ (that is, if $q_0$ is an initial state of $\mathcal{A}_\varphi$), as in the proof of Theorem 5, we can build a model of $\varphi$ containing a branch $t = \rho(q_0) \cap A, \ldots, \rho(q_n) \cap A$, where $A$ is the set of all propositional variables in $\varphi$. Moreover, the probability of this trace in this model is exactly $wt(\rho)$. Thus, to summarise, for every successful run $\rho$ of $\mathcal{B}_\varphi$, there is a model $I$ and a trace $t$ such that $wt(\rho) = P_I(t)$. This implies that the probability of the mlt is greater than or equal to $\|\mathcal{B}_\varphi\|$.

Conversely, consider a model $I$ containing the trace $t$. As in the proof of Theorem 5, we can assume w.l.o.g. that this model translates into a successful run $\rho$ of $\dddot{\mathcal{A}}_\varphi$. In this model, it holds that for every non-root node $wi$, $P(wi) \le m_{S,\rho(w)}(Q)$, where $S$ and $Q$ are the ones obtained from the transition used in $\rho$. In particular, $P(wi) \le m_{\rho(w)}(Q)$, and hence $P_I(t) \le \|\mathcal{B}_\varphi\|$. Since this is true for all traces and all models, it follows that the probability of the most likely trace is at most $\|\mathcal{B}_\varphi\|$.


Theorem 14. $\mathcal{B}_\varphi$ accepts the most likely traces.

Proof. By construction, the initial states of $\mathcal{B}_\varphi$ are exactly those that maximise the likelihood of the trace, and likewise transitions are only allowed when they preserve the maximum possible probability. That is, for every successful run $\rho$ of $\mathcal{B}_\varphi$, if seen over the weighted automaton $\mathcal{B}_\varphi$ we get that $wt(\rho) = \|\mathcal{B}_\varphi\|$. Following the arguments from the proof of Theorem 11, such a run corresponds to a trace $t$ in a model $I$ such that $P_I(t) = \|\mathcal{B}_\varphi\|$, but since the latter is the probability of the most likely traces, $t$ must be an mlt as well.

Appendix B: Example

We now provide a fully developed example of the methods for reasoning with PLTL$_f$, over a slightly more complex formula. Consider the formula

many elements on the length of $\psi$, and enumerating them all is not very informative. So we present only a few relevant atoms that will be useful for highlighting the remaining properties. These are shown in Figure 4.

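Note that $\mathbf{a}_1$ and $\mathbf{a}_2$ differ only on the last element, and in particular contain the same probabilistic formulas, hence $\mathcal{P}(\mathbf{a}_1) = \mathcal{P}(\mathbf{a}_2) = \{\diamondsuit_{\le 0.7}(a\,U\,b),\ \diamondsuit_{\le 0.6}(\neg a \wedge \neg b)\}$. Define the elements of $2^{\mathcal{P}(\mathbf{a}_1)}$ to be

$Q_{00} := \emptyset$, $Q_{01} := \{\diamondsuit_{\le 0.7}(a\,U\,b)\}$, $Q_{10} := \{\diamondsuit_{\le 0.6}(\neg a \wedge \neg b)\}$, $Q_{11} := \mathcal{P}(\mathbf{a}_1)$.

There are in total eight subsets of $2^{\mathcal{P}(\mathbf{a}_1)}$; in particular, we consider $S_0 = \{Q_{01}, Q_{10}, Q_{11}\}$, $S_1 = \{Q_{01}, Q_{10}\}$, and $S_2 = \{Q_{01}, Q_{11}\}$. Each of these sets defines a system of inequalities, which we now analyse in detail.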

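Consider first $I(S_0)$, which is defined by (footnote 6)

$$\begin{aligned} x_{01} \ge 0, \quad x_{10} \ge 0, \quad x_{11} &\ge 0 \\ x_{01} + x_{10} + x_{11} &= 1 \\ x_{01} + x_{11} &\le 0.7 \\ x_{10} + x_{11} &\le 0.6 \end{aligned}$$

Footnote 5: We slightly simplify removing three irrelevant conjunctions, and already use the equivalence $\neg\diamondsuit_{\le p}\varphi \equiv \diamondsuit_{> p}\varphi$.

Footnote 6: To improve readability, we abuse the notation and use $x_i$ to represent the variable $x_{Q_i}$.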

This system is satisfiable; for instance, one solution is given by $x_{01} = 0.4$ and $x_{10} = x_{11} = 0.3$.

For $S_1$, we obtain the system $I(S_1)$

$$\begin{aligned} x_{01} \ge 0, \quad x_{10} &\ge 0 \\ x_{01} + x_{10} &= 1 \\ x_{01} &\le 0.7 \\ x_{10} &\le 0.6 \end{aligned}$$

which is also satisfiable; e.g., $x_{01} = x_{10} = 0.5$. Finally, consider the system $I(S_2)$

$$\begin{aligned} x_{01} \ge 0, \quad x_{11} &\ge 0 \\ x_{01} + x_{11} &= 1 \\ x_{01} + x_{11} &\le 0.7 \\ x_{11} &\le 0.6 \end{aligned}$$

Clearly, this system is unsatisfiable because the constraints $x_{01} + x_{11} = 1$ and $x_{01} + x_{11} \le 0.7$ are in conflict with each other. Overall, this means that $\mathcal{S}(\mathbf{a}_1)$ contains $S_0$ and $S_1$ but not $S_2$; and since $\mathcal{P}(\mathbf{a}_1) = \mathcal{P}(\mathbf{a}_2)$, the same is true for $\mathcal{S}(\mathbf{a}_2)$. Intuitively, what this means is that from a node satisfying the formulas in $\mathcal{P}(\mathbf{a}_1)$ (namely, $\diamondsuit_{\le 0.7}(a\,U\,b)$ and $\diamondsuit_{\le 0.6}(\neg a \wedge \neg b)$) there are models that have successors satisfying either the first formula alone, or the second formula alone, but not both, nor none ($S_1$), but there are no models that have successors satisfying only the first formula, or both formulas, but not only the second nor none ($S_2$). This conclusion is based on the probabilistic constraints alone; the logical interpretation of these formulas is considered within the automata construction as in classical LTL$_f$.
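The three systems above are small enough to check mechanically. The sketch below enumerates candidate solutions on a grid of tenths using exact fractions. It is a coarse grid search for illustration, not a linear-programming solver, and all names (`BOUNDS`, `grid_points`, `solutions`) as well as the encoding of each $Q_{ij}$ by the string `"ij"` are assumptions made here. A non-empty result witnesses satisfiability; an empty result on the grid does not by itself prove unsatisfiability, which for $I(S_2)$ follows from the conflicting constraints noted in the text.

```python
from fractions import Fraction
from itertools import product
from typing import Dict, Iterator, List, Sequence

# Upper bounds from the example: the right bit of a label marks the formula
# bounded by 0.7, the left bit marks the formula bounded by 0.6.
BOUNDS = {1: Fraction(7, 10), 0: Fraction(6, 10)}


def grid_points(labels: Sequence[str], steps: int = 10) -> Iterator[Dict[str, Fraction]]:
    """Enumerate non-negative assignments on a 1/steps grid that sum to 1."""
    for combo in product(range(steps + 1), repeat=len(labels)):
        if sum(combo) == steps:
            yield {q: Fraction(k, steps) for q, k in zip(labels, combo)}


def satisfies_bounds(point: Dict[str, Fraction]) -> bool:
    """Check, for each formula, that the total mass of the subsets containing
    it does not exceed the formula's upper bound."""
    return all(
        sum(v for q, v in point.items() if q[pos] == "1") <= bound
        for pos, bound in BOUNDS.items()
    )


def solutions(labels: Sequence[str]) -> List[Dict[str, Fraction]]:
    """All grid points that satisfy the system induced by the given subsets."""
    return [p for p in grid_points(labels) if satisfies_bounds(p)]


S0, S1, S2 = ["01", "10", "11"], ["01", "10"], ["01", "11"]
```

On this grid, `solutions(S0)` and `solutions(S1)` contain the solutions quoted in the text, `solutions(S2)` is empty, and the largest grid values of $x_{01}$ and $x_{10}$ in $I(S_1)$ are 0.7 and 0.6.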

Ordering the sets $Q_i$ in the standard numerical order over

Figure 4: Some atoms for the formula $\varphi$.

$(\neg a \wedge \neg b)$. This means that for every $S \in \mathcal{S}(\mathbf{a}_5)$ and every atom $\mathbf{a}$ appearing in a tuple from $T_S(\mathbf{a}_5)$ it must hold that $\{a\,U\,b,\ \neg a \wedge \neg b\} \subseteq \mathbf{a}$, but this contradicts the definition of an atom. Thus, $\bigcup_{S \in \mathcal{S}(\mathbf{a}_5)} T_S(\mathbf{a}_5) = \emptyset$. Note that this is not specific to $\mathbf{a}_5$, but in fact any atom $\mathbf{a}$ such that $(\cdot, \cdot, \mathbf{a}) \in T_{S_0}(\mathbf{a}_1)$ behaves like this. To see why, note that any such atom must contain the formulas $a\,U\,b$ and $(\neg a \wedge \neg b)$, arising from the fact that $\mathbf{a}$ is the successor representing the set $Q_{11}$ and witnesses the satisfiability of both probabilistic constraints. Moreover, $\mathbf{a}$ being a successor of $\mathbf{a}_1$ means that $\neg b \in \mathbf{a}$. Thus, by the properties of atoms, $\neg b$ must belong to $\mathbf{a}$ as well, and the previous argument developed for $\mathbf{a}_5$ applies again here.

On a similar note, observe that $\bigcup_{S \in \mathcal{S}(\mathbf{a}_3)} T_S(\mathbf{a}_3) = \emptyset$. The reason for this is that $\mathbf{a}_3$ contains $\bigcirc(a\,U\,b)$, which means that every successor node must satisfy $a\,U\,b$, which implies that we cannot use any tuple containing $Q_{00}$ or $Q_{10}$, as those must violate this formula. But as we have seen already, no subset of $S_2 = \{Q_{01}, Q_{11}\}$ belongs to $\mathcal{S}(\mathbf{a}_3)$. Finally, $\mathbf{a}_4$ does not have any successors either. Indeed, every successor of $\mathbf{a}_4$ must contain $\neg a \wedge \neg b$, but in every transition there must be an element which satisfies $b$.

From all this information, we can construct the automaton $\mathcal{A}_\psi$. A part of this automaton is depicted in Figure 5. As can be seen, the atom $\mathbf{a}_5$ is in fact a bad state, and hence it, along with all transitions leading to it, is removed from the reduced automaton $\dddot{\mathcal{A}}_\psi$. On the other hand, the language accepted by this automaton is not empty, and hence $\psi$ is satisfiable. Moreover, we can follow transitions from an initial state to final states to construct an accepted tree, which will serve as a model of the formula $\psi$. For example, Figure 6 shows an abstract template of some models obtained this way. For any values satisfying $p_1 + p_2 = q_1 + q_2 = 1$, $p_1, q_1 \le 0.6$, and $p_2, q_2 \le 0.7$, the figure represents a model. This model is obtained starting from the initial state $\mathbf{a}_1$ (which, as a valuation, makes $a$ and $b$ false), and is allowed to make the transition $(\mathbf{a}_1, \mathbf{a}_8, \mathbf{a}_2)$. The upper branch in the model represents the execution from $\mathbf{a}_2$, which makes a transition back to $\mathbf{a}_1$, and then reuses the same transition $(\mathbf{a}_1, \mathbf{a}_8, \mathbf{a}_2)$, but this time, $\mathbf{a}_2$ makes a transition to $\mathbf{a}_6$, which is a final state. The lower branches show some iterations remaining in $\mathbf{a}_8$ before making a transition to $\mathbf{a}_9$ and finally reaching $\mathbf{a}_{10}$, which is a final state. In models following this pattern, the optimistic view on the probability of observing $a$ is 0.7, obtained by traversing the lower branch. However, if we know that after one timestep $a$ was not observed, then the probability becomes $0.6 \cdot 0.7 = 0.42$, as obtained from the middle branch. Notice that this discussion is limited to this class of models only.

Figure 5: A part of the automaton $\mathcal{A}_\psi$ with boxes representing hyperedges. $\mathbf{a}_1$ and $\mathbf{a}_2$ are initial states. Double lines represent final states. Gray nodes and edges are those that are removed from the reduced automaton.

Figure 6: Some models of $\psi$.

We now transform this automaton into the weighted automaton $\mathcal{B}_\psi$ in order to handle the most likely traces. A portion of this automaton, corresponding to the fragment of $\mathcal{A}_\psi$ depicted in Figure 5, is shown in Figure 7. Note that, for example, the transition $(\mathbf{a}_1, \mathbf{a}_8, \mathbf{a}_2)$ from $\mathcal{A}_\psi$ gives rise to the transitions $(\mathbf{a}_1, \mathbf{a}_8)$ and $(\mathbf{a}_1, \mathbf{a}_2)$ with weights 0.7 and 0.6 respectively, which are obtained by maximising the potential values of the systems of inequalities that produce them. However, as mentioned before, these maximisations do not define a probability: there is no model that makes both transitions reach these maxima; indeed, in any model of $\mathcal{A}_\psi$ the sum of the probabilities of these transitions always needs to be 1.

Figure 7: $\mathcal{B}_\psi$.

Figure 8: $\mathcal{B}_\psi$.

Just by observing this fragment of $\mathcal{B}_\psi$, we can immediately see that $\|\mathcal{B}_\psi\| = 1$. Indeed, the runs $\mathbf{a}_1, \mathbf{a}_7$ and $\mathbf{a}_2, \mathbf{a}_6$ both have weight 1. These runs define traces with a probability 1 of occurring; i.e., $(\emptyset, \{a\})$ and $(\emptyset, \emptyset)$, respectively. Indeed, notice that both probabilistic constraints in the formula $\psi$ give only an upper bound on the probability of observing some behaviour. Hence, there are models that assign a probability 0 to both of them (i.e., a probability 1 of none occurring) as witnessed by these traces. For a prefix $(\emptyset, \{a\}, \{a\}, \{a, b\})$ one must satisfy the probabilistic constraint $(a\,U\,b)$, and so the highest probability possible is 0.7, witnessed by the run $\mathbf{a}_1, \mathbf{a}_8, \mathbf{a}_9, \mathbf{a}_{10}$ (with the mlt being exactly that prefix).
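The weights discussed above can be replayed on a small max-product computation. The sketch below is an illustration under explicit assumptions: the edge list, the choice of final states, and unit initial and final weights are pieced together from the prose and are not a faithful copy of Figure 7; `run_weight` and `behaviour` are hypothetical names. The weight of a run is taken to be the product of its transition weights, and the behaviour the largest weight of a run from an initial to a final state.

```python
from fractions import Fraction
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]

# Assumed fragment: edges and weights reconstructed from the text (hypothetical).
WEIGHTS: Dict[Edge, Fraction] = {
    ("a1", "a7"): Fraction(1), ("a2", "a6"): Fraction(1),
    ("a1", "a8"): Fraction(7, 10), ("a1", "a2"): Fraction(3, 5),
    ("a8", "a8"): Fraction(1), ("a8", "a9"): Fraction(1),
    ("a9", "a10"): Fraction(1), ("a2", "a1"): Fraction(1),
}
INITIAL: Set[str] = {"a1", "a2"}
FINAL: Set[str] = {"a6", "a7", "a10"}


def run_weight(run: List[str]) -> Fraction:
    """Weight of a run: the product of the weights of its transitions.
    Missing transitions contribute weight 0."""
    weight = Fraction(1)
    for edge in zip(run, run[1:]):
        weight *= WEIGHTS.get(edge, Fraction(0))
    return weight


def behaviour() -> Fraction:
    """Largest weight of a run from an initial state to a final state.

    All weights lie in [0, 1], so cycles never increase a run's weight and
    |states| rounds of relaxation are enough.
    """
    states = {s for edge in WEIGHTS for s in edge}
    best = {s: Fraction(1 if s in FINAL else 0) for s in states}
    for _ in range(len(states)):
        for (src, dst), w in WEIGHTS.items():
            best[src] = max(best[src], w * best[dst])
    return max(best[s] for s in INITIAL)
```

Under these assumptions `behaviour()` evaluates to 1, and `run_weight(["a1", "a8", "a9", "a10"])` to 7/10, in line with the values stated in the text.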

To conclude this example, we depict the flattened automaton obtained from $\mathcal{B}_\psi$ in Figure 8.