When are description logic knowledge bases indistinguishable?

(1)

BIROn - Birkbeck Institutional Research Online

Botoeva, E. and Kontchakov, Roman and Ryzhikov, V. and Wolter, F. and

Zakharyaschev, Michael (2015) When are description logic knowledge bases

indistinguishable?

In: UNSPECIFIED (ed.)

Proceedings of the

Twenty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2015,

Buenos Aires, Argentina, July 25-31, 2015. Palo Alto, U.S.: AAAI Press, pp.

4240-4246. ISBN 9781577357384.

Downloaded from:

Usage Guidelines:

Please refer to usage guidelines at

or alternatively

(2)

When Are Description Logic Knowledge Bases Indistinguishable?

∗

E. Botoeva,

1

R. Kontchakov,

3

V. Ryzhikov,

1

F. Wolter

2

and M. Zakharyaschev

3

1

_{Faculty of Computer Science}

2

_{Dept. of Computer Science}

3

_{Dept. of Computer Science}

Free University of Bozen-Bolzano, Italy

University of Liverpool, UK

Birkbeck, London, UK

{

botoeva,ryzhikov

}

@inf.unibz.it

[email protected]

{

roman,michael

}

@dcs.bbk.ac.uk

Abstract

Deciding inseparability of description logic knowl-edge bases (KBs) with respect to conjunctive queries is fundamental for many KB engineering and maintenance tasks including versioning, mod-ule extraction, knowledge exchange and forget-ting. We study the combined and data complex-ity of this inseparabilcomplex-ity problem for fragments of

Horn-ALCHI, including the description logics un-derpinningOWL 2 QLandOWL 2 EL.

1 Introduction

A description logic (DL) knowledge base (KB) consists of a terminological box (TBox) and an assertion box (ABox). The TBox represents conceptual knowledge by providing a vocab-ulary for a domain of interest together with axioms that de-scribe semantic relationships between the vocabulary items. To illustrate, the following toy TBox,Ta, defines a

vocabu-lary for the automotive industry:

MinivanvAutomobile, HybridvAutomobile,

Automobilev ∃poweredBy.Engine,

Hybridv ∃poweredBy.EEngineu ∃poweredBy.ICEngine,

EEnginevEngine, ICEnginevEngine.

For example, the first two axioms say that minivans and hy-brids are automobiles; the third axiom claims that every auto-mobile is powered by an engine. Thus, the TBox introduces, among others,concept names(sets)Minivan,Automobileand

Engine, states that the concept Minivanis subsumed by the conceptAutomobileand uses therole name(binary relation)

poweredByto say that automobiles are powered by engines. The last two axioms state that electric and internal combus-tion engines are engines. TBoxes are often calledontologies

and presented in applications in terms of the Web Ontology Language OWL 2, which is underpinned by DLs.

The ABox of a KB is a set of facts storing data about the concept and role names introduced in the TBox. An example

∗

This paper was invited for submission to the Best Papers From Sister Conferences Track, based on a paper that appeared in the Proc. of the 14th Int. Conf. on Principles of Knowledge Representa-tion and Reasoning (KR 2014).

ABox,Aa, in the automotive domain is given by

Hybrid(toyota highlander),

Minivan(toyota highlander), Minivan(nissan note).

Typical applications of KBs in modern information systems use the semantics of concepts and roles in the TBox to enable the user to query the data in the ABox. This is particularly useful if the data is incomplete or comes from heterogenous data sources which is the case, for example, in linked data applications [Pollereset al., 2013] and large scale data inte-gration projects [Poggiet al., 2008; Gieseet al., 2013], or if the data comprises web content gathered by search engines using semantic markup [Hitzleret al., 2009].

As the data may be incomplete, the open world assumption is made when querying a KBK: a tupleaof individuals from

Kis a (certain)answerto a queryqoverKiffq(a)is true in every modelIofK. As general first-order queries are unde-cidable under the open-world semantics, the basic and most important querying instrument is conjunctive queries (CQs), which are ubiquitous in relational database systems and form the core of the Semantic Web query language SPARQL. A CQq(x)is a first-order formula∃yϕ(x,y), whereϕ(x,y)

is a conjunction of atoms of the formA(z1)orP(z1, z2)for

a concept nameA, a role nameP, and variablesz1, z2from

x,y.1 _{For instance, to find minivans powered by electric} en-gines, one can use the following CQ:

q(x) =∃y(Minivan(x)∧poweredBy(x, y)∧EEngine(y)). Thentoyota highlanderis its only certain answer in(Ta,Aa).

The problem of answering CQs over KBs has been the focus of significant research in the DL community with deep complexity results for a large variety of DLs (see be-low), the introduction of new DLs for which query answer-ing is tractable for data complexity [Hustadt et al., 2005; Calvaneseet al., 2007], the invention of various query an-swering techniques [Calvaneseet al., 2007; Lutzet al., 2009] and the development of powerful implemented systems; see, e.g., [Kontchakov and Zakharyaschev, 2014] and references therein.

Apart from developing query answering techniques, a ma-jor research problem is KB engineering and maintenance. In

1

Since we consider Horn DLs, the results of this paper actually apply to disjunctions (or unions) of CQs (UCQs). For simplicity, however, we concentrate on CQs only.

(3)

fact, with typically large data and often complex and tangled ontologies, tool support for transforming and comparing KBs is becoming indispensable for applications. To begin with, KBs are never static entities. Like most software artefacts, they are updated to incorporate new information, and dis-tinct versions are introduced for different applications. Thus, developing support for KBversioning has become a major research problem [Jim´enez-Ruizet al., 2011; Konev et al., 2012]. As dealing with a large and semantically tangled KB can be costly, one may want toextractfrom it a smaller mod-ulethat is indistinguishable from the whole KB as far as the given application is concerned [Stuckenschmidtet al., 2009]. Another technique for extracting relevant information is for-getting, where the task is to replace a given KB by a new KB using only those concept and role names that are needed by the application but still providing the same information about those names as the original KB [Konevet al., 2009; Koopmann and Schmidt, 2014b]. Finally, the vocabulary of a given KB may not be convenient for a new application. In this case, similarly to data exchange in databases [Arenas

et al., 2014]—where data structured under a source schema is converted to data under a target schema—one may want to transform a KB in a source signature to a KB given in a more useful target signature and representing the original KB in an accurate way. This task is known asknowledge ex-change[Arenaset al., 2012; 2013].

In this paper, we investigate a relationship between KBs that is fundamental for all such tasks if querying the data via CQs is the main application. Let Σbe a signature consist-ing of a set of concept and role names. We say that KBsK1

andK2areΣ-query inseparableand writeK1 ≡ΣK2if any

CQ formulated inΣhas the same answers overK1andK2.

Note that even forΣcontaining all concept and role names in the KBs,Σ-query inseparability does not necessarily imply logical equivalence: e.g.,(∅,{A(a)})is{A, B}-query insep-arable from({BvA},{A(a)})but the two KBs are clearly not logically equivalent. Thus, if KBs are used for purposes other than querying data via CQs, then different notions of inseparability are required. We now discuss the applications ofΣ-query inseparability for the tasks above in more detail.

Versioning.Version control systems for KBs provide a range of operations including, for example, computing the relevant differences between KBs, merging KBs and recovering KBs. All these operations rely on checking whether two versions,

K1andK2, of a KB are indistinguishable from the application

point of view. If that application is querying the data using CQs over a given signature Σ, then K1 and K2 should be

regarded as indistinguishable just in case they give the same answers to CQs inΣ. Thus, the basic task for a query-centric approach to KB versioning is to check whetherK1≡ΣK2. Modularisation. Modularisation and module extraction are major research topics in ontology engineering and mainte-nance. In module extraction, the problem is to find a (small) subset of the axioms of a given large KB that is indistinguish-able from it with respect to the intended application. If that application is querying a KBKusing CQs over a signatureΣ, then the problem is to find a smallΣ-query module ofK, that is, a KBK0 _{⊆ K}_with_K0 _≡

ΣK. Note that one can extract a

minimalΣ-query module from a KB using a polynomial-time algorithm with theΣ-query inseparability check as an oracle. To illustrate the notion ofΣ-query module, consider the au-tomotive ontologyKa = (Ta,Aa)described above and the

signatureΣm = {Automobile,Engine,poweredBy}. Then

Km= (Tm,Aa)is aΣm-query module ofKa, whereTmis

MinivanvAutomobile, Automobilev ∃poweredBy.Engine.

Knowledge Exchange. In knowledge exchange, we want to transform a KB K1 in a signature Σ1 to a KB K2

in a new signature Σ2 connected to Σ1 via a declarative

mapping specification given by a TBox T12. Such

map-ping specifications between KBs are also known as ontol-ogy alignments or ontolontol-ogy matchings and have been stud-ied extensively [Shvaiko and Euzenat, 2013]. If, as above, we are interested in querying data via CQs, then the tar-get KB K2 should be a sound and complete

representa-tion of K1 w.r.t. querying data, and so satisfy the

condi-tionK1∪ T12 ≡Σ2 K2, in which case it is called a univer-sal CQ-solution. To illustrate, consider again the ontology

Ka = (Ta,Aa), and letTaerelate the signatureΣa ofKato Σe={Car,HybridCar,ElectricMotor,Motor,hasMotor}:

AutomobilevCar, HybridvHybridCar,

EnginevMotor, EEnginevElectricMotor,

poweredByvhasMotor.

ThenKe= (Te,Ae)is a universal CQ-solution, where

Te={ElectricMotorvMotor, Carv ∃hasMotor.Motor,

HybridCarvCaru ∃hasMotor.ElectricMotor},

Ae={HybridCar(toyota highlander),Car(nissan note)}.

Forgetting. A KBK0 _{results from} _forgetting_{a signature}_Σ

in a KBK ifK0 _≡

sig(K)\Σ K andsig(K0) ⊆ sig(K)\Σ,

wheresig(K)is the signature ofK. Thus, the result of for-gettingΣdoes not useΣand gives the same answers to CQs without symbols inΣasK. The result of forgetting is also called auniform interpolantforKw.r.t.sig(K)\Σ. Forget-ting is of interest for a number of applications. Typically, when reusing an existing KB in a new application, only a small number of its symbols is relevant, and so instead of reusing the whole KB, one can take the potentially smaller KB resulting from forgetting the extraneous symbols. For-getting can also be used forpredicate hiding: if a KB is to be published, but some part of it has to be concealed from the public, then this part can be removed by forgetting its sym-bols [Cuenca Grau and Motik, 2012]. Finally, forgetting can be used forKB summary: the result of forgetting often pro-vides a smaller and more focused ontology that summarises what the original ontology says about the retained symbols, potentially facilitating KB comprehension. To illustrate, for

Σf ={Automobile,Engine,poweredBy}, the KB(Tf,Af),

Tf ={Automobilev ∃poweredBy.Engine},

Af ={Automobile(toyota highlander),

Automobile(nissan note)},

is a result of forgettingsig(Ka)\Σf inKa.

(4)

in-cludeDL-LiteH_core[Calvaneseet al., 2007; Artaleet al., 2009],

ELandELH [Baader et al., 2005] underlying the OWL 2 profiles OWL 2 QL andOWL 2 EL. For all of these DLs,

Σ-query inseparability turns out to be P-complete for data complexity, which matches the data complexity of CQ evalu-ation in all of our DLs lying outside theDL-Litefamily. The obtained tight combined complexity results are summarised in the diagram below:

Horn-ALCHI

Horn-ALCI

Horn-ALCH

Horn-ALC ELH

EL

DL-LiteHhorn

DL-Litehorn

DL-LiteHcore

DL-Litecore

P EXPTIME

EXPTIME

P 2EXPTIME

forward strategy

general strategy

backward+forward strategy

Most interesting are EXPTIME- and 2EXPTIME-complete-ness of DL-LiteH_core and Horn-ALCI, respectively, which contrast with NP- and EXPTIME-completeness of CQ eval-uation in these logics. ForDL-Lite without role inclusions andELH, Σ-query inseparability is P-complete, while CQ evaluation is NP-complete. In general, it is the combined presence of inverse roles and qualified existential restric-tions (or role inclusions) that makesΣ-query inseparability hard. To establish the upper complexity bounds, we develop a uniform game-theoretic framework for checking finite Σ -homomorphic embeddability between (possibly infinite) ma-terialisations of KBs. All omitted proofs can be found in the full version at http://tinyurl.com/poa49vf.

2 Horn-

ALCHI

and its Fragments

All the DLs considered in this paper are Horn fragments of

ALCHI. To define them, we fix lists ofindividual namesai,

concept namesAi, androle namesPi, fori < ω. Aroleis a

role namePior aninverse rolePi−; we assume(P

−

i )−=Pi.

ALCI-concepts,C, are defined by the grammar C ::= Ai | > | ¬C | C1uC2 | ∃R.C,

whereRis a role. We use⊥,C1tC2and∀R.Cas

abbrevi-ations for¬>, ¬(¬C1 u ¬C2)and ¬∃R.¬C, respectively. ALC-concepts are ALCI-concepts without inverse roles;

EL-concepts are ALC-concepts without ¬. DL-Litehorn

-concepts are ALCI-concepts without ¬, in which C = >

in every occurrence of∃R.C. Finally,DL-Litecore-concepts areDL-Litehorn-concepts withoutu; in other words, they are

basic conceptsof the form>,Aior∃R(a shortcut for∃R.>).

For a DLL, anL-concept inclusion(CI) takes the form C v D, whereC andD are L-concepts. AnL-TBox, T, contains a finite set ofL-CIs. AnALCHI,DL-LiteH_hornand

DL-LiteH_core TBox can also contain a finite set ofrole inclu-sions(RIs)R1vR2, where theRiare roles. InALCHand

ELH, TBoxes have RIs but without inverse roles. DL-Lite

TBoxes also containdisjointness constraintsB1uB2 v ⊥

andR1uR2v ⊥, for basic conceptsBiand rolesRi.2

To introduce the Horn fragments of these DLs, we require the following (standard) recursive definition [Hustadtet al.,

2

Although role disjointness constraints are not in the syntax of

ALCHI, they play no essential part in our constructions, and the techniques we develop forALCHIare also applicable toDL-Lite.

2005; Kazakov, 2009]: a concept Coccurs positively inC; ifCoccurs positively (respectively, negatively) inC0 thenC occurs positively (negatively) inC0_u_D,_∃_R.C0_,_D_v_C0_{, and}

it occurs negatively (positively) in¬C0andC0vD. A TBox

T isHornif no concept of the form¬Coccurs negatively in

T and no∃R.¬Coccurs positively inT. In the DLHorn-L, whereLis one of our DLs, onlyHorn-L-TBoxes are allowed. Clearly,EL- andDL-Lite-TBoxes are Horn by definition.

AnABox,A, is a finite set ofassertionsof the formAk(ai)

orPk(ai, aj). AnL-TBoxT and an ABoxAform anL-KB

K= (T,A);ind(K)is the set of individual names inK. The semantics for the DLs is defined in the usual way based on interpretationsI = (∆I,·I₎_{that comply with the}_unique

name assumption: aI_i 6=a_jI fori6=j [Baaderet al., 2003]. We writeI |=αin case an inclusion or assertionαis true in

I. IfI |=α, for allα∈ T ∪ A, thenIis amodelof a KB

K = (T,A); in symbols: I |=K. Kisconsistentif it has a model.K |=αmeans thatI |=αfor allI |=K.

Aconjunctive query(CQ)q(x)is a formula∃yϕ(x,y), where ϕ is a conjunction of atoms of the form Ak(z1) or

Pk(z1, z2)withziinx,y. A tupleainind(K)(of the same

length asx) is acertain answertoq(x)overK= (T,A)if

I |=q(a)for allI |=K; in this case we writeK |=q(a). If

x=∅, the answer toqis ‘yes’ ifK |=qand ‘no’ otherwise. For combined complexity, the problem ‘K |= q(a)?’ is NP-complete for theDL-Litelogics [Calvaneseet al., 2007],

ELandELH[Rosati, 2007], and EXPTIME-complete for the remaining Horn DLs above [Eiteret al., 2008]. For data com-plexity (with fixedT andq), this problem is in AC0 _{for the} DL-Lite logics [Calvaneseet al., 2007] and P-complete for the remaining DLs [Rosati, 2007; Eiteret al., 2008].

Asignature,Σ, is a set of concept and role names. By a

Σ-concept, Σ-role, Σ-CQ, etc. we understand any concept, role, CQ, etc. constructed using the names from Σ. Given an interpretationI and a signatureΣ, we define theΣ-types

tI_Σ(u)andr_ΣI(u, v)ofu, v ∈∆Iby taking:

tI_Σ(u) ={Σ-concept nameA|u∈AI},

rI_Σ(u, v) ={Σ-roleR|(u, v)∈RI}.

3 Σ

-Query Entailment and Inseparability

Now we define the central notions of the paper,Σ-query en-tailment and inseparability, and establish their semantic char-acterisation based on the notion of materialisation. We then show how to construct materialisations by developing a the-ory of finitely generated materialisations.

Definition 1. Let K1 and K2 be KBs and Σ a signature.

We say thatK1 Σ-query entailsK2 ifK2 |= q(a)implies K1|=q(a), forallΣ-CQsq(x)and all tuplesainind(K2).

Knowledge basesK1andK2areΣ-query inseparableif they

Σ-query entail each other; in this case we writeK1≡ΣK2.

Checking Σ-query inseparability can be trivially reduced to twoΣ-query entailment checks. Conversely, for most lan-guages we have a semantically transparent reduction ofΣ -query entailment toΣ-query inseparability:

Theorem 1. LetLbe any of our DLs containingELor hav-ing role inclusions. ThenΣ-query entailment of L-KBs is

(5)

a A

u

P

R

S T S T

Q

Q Q Q

aA σ

T , Q

R

S, Q

R

T , Q

R

S, Q

M2

M1

a)

a

A

u

P

R−

S−

T−

S−

Q−

a σ43

σ42

σ14

σ04

T , Q

R

S, Q

R

T , Q

R

S, Q

0 ₁ ₂

2 3

3

4 4

GΣ 2

MΣ 1

b)

Figure 1: a) materialisationsM1andM2ofK1andK2and b) a 4-winning (backward) strategy inGΣ(G2,M1).

To prove the upper complexity bounds for both problems, we give a semantic characterisation ofΣ-query entailment in terms of materialisations and finiteΣ-homomorphisms. An interpretationMis amaterialisationof a KBKifK |=q(a)

iffM |=q(a), for all CQsq(x)and all tuplesainind(K). We say thatKismaterialisableif it has a materialisation.

Suppose Mi is a materialisation of Ki, for i = 1,2.

A function h: ∆M2 _→ _∆M1 _{is a} _Σ_{-homomorphism} _if

tM2

Σ (u) ⊆ tMΣ1(h(u))andrΣM2(u, v) ⊆ rMΣ1(h(u), h(v)),

for allu, v∈∆M2_{, and}_h₍_aM2_{) =}_aM1_{for all}_a_∈_ind₍_K 2)

that belong to aΣ-concept or aΣ-role. As answers toΣ-CQs are preserved underΣ-homomorphisms,K1Σ-query entails K2 if there is aΣ-homomorphism fromM2 toM1.

How-ever, the converse does not necessarily hold:

Example 1. ConsiderKi= (Ti,{A(a)}), fori= 1,2, where

T1={Av ∃S, ∃S−v ∃T, ∃T−v ∃S,

SvQ, T vQ, ∃Q− v ∃R},

T2={Av ∃P, ∃P− v ∃R−, ∃Rv ∃S−u ∃Q−, ∃Qv ∃Q−, ∃Sv ∃T−, ∃T v ∃S−}.

MaterialisationsMiofKi, fori= 1,2, are shown in Fig. 1a.

LetΣ ={Q, R, S, T}. Then there is noΣ-homomorphism fromM2 toM1(asrΣM2(a, u) =∅, we can maputo, say,

σ but then only the shaded part ofM2 can be mapped Σ

-homomorphically toM1). However, for anyΣ-queryq(x), M2|=q(a)impliesM1|=q(a)as any finite

subinterpreta-tion ofM2can beΣ-homomorphically mapped toM1.

We say that M2 is finitely Σ-homomorphically embed-dable into M1 if, for every finite subinterpretation M02 of M2, there exists aΣ-homomorphism fromM02toM1.

Intu-itively, finiteM0

2can be regarded as a CQ whose variables are

elements of∆M2_{and the answer variables are the}_ind₍_K 2). Theorem 2. SupposeKiis a consistent KB with a

material-isationMi,i= 1,2. ThenK1Σ-query entailsK2iffM2is finitelyΣ-homomorphically embeddable intoM1.

One problem with applying Theorem 2 is that material-isations are in general infinite. We address this problem by introducing finite representations of materialisations, and show thatHorn-ALCHI and all of its fragments enjoy hav-ing such representations. LetK = (T,A)be a KB and let

G= (∆G_,_·G_,_;₎_{be a finite structure such that}

– ∆G =ind(K)∪Ω, for someΩwithind(K)∩Ω =∅,

– (∆G,·G₎_{is an interpretation with}_PG

i ⊆ind(K)×ind(K),

for all role namesPi,

– (∆G,;) is a directed graph (possibly containing loops) with nodes∆Gand arcs;⊆∆G×Ω, where eachu;v is labelled with a set (u, v)G 6= ∅ of roles satisfying the condition: ifui ;vandu2;vthen(u1, v)G = (u2, v)G.

ApathσinGis a sequenceu0. . . unwithu0∈ind(K)and

ui;ui+1fori < n. Denote the last element ofσbytail(σ).

The unravelling M of G is an interpretation (∆M_,_·M₎_,

where∆Mis the set of paths inGand·M_{is given by:}

aM=a, for eacha∈ind(K), AM={σ|tail(σ)∈AG},

PM=PG∪ {(σ, σu)|tail(σ);u, P ∈(tail(σ), u)G} ∪ {(σu, σ)|tail(σ);u, P− ∈(tail(σ), u)G},

for concept and role namesAandP. We callGagenerating structure forK if its unravelling is a materialisation of K. We say that a DLLhasfinitely generated materialisationsif everyL-KB has a generating structure. In Example 1,M2is

generated by the structureG2in Fig. 1b.

Theorem 3. Horn-ALCHI and all its fragments defined above have finitely generated materialisations. Moreover,

– for any L ∈ {ALCHI,ALCI,ALCH,ALC} and any Horn-L KB (T,A), a generating structure can be con-structed in time|A| ·2p(|T |)_,_p_{a polynomial}_;

– for anyLin theELandDL-Lite families introduced above and anyL-KB(T,A), a generating structure can be con-structed in time|A| ·p(|T |),pa polynomial.

4 Σ

-Query Entailment by Games

Suppose a DLLhas finitely generated materialisations. We now show that the problem of checking finite Σ -homo-morphic embeddability between materialisations ofL-KBs can be reduced to the problem of finding a winning strategy in a game played on the generating structures for these KBs.

For a generating structureGfor Kand a signatureΣ, the

Σ-typestG_Σ(u)andrG

Σ(u, v)ofu, v∈∆G are defined by:

tG_Σ(u) ={Σ-concept nameA|u∈AG},

rG_Σ(u, v) =  



{Σ-roleR|(u, v)∈RG}, ifu, v∈ind(K),

{Σ-roleR|R∈(u, v)G}, ifu;v,

∅, otherwise,

where(P−)G is the converse ofPG. We writeu ;Σ _v _if

u;vandrG_Σ(u, v)6=∅.

SupposeKi is a consistentL-KB, fori = 1,2, andΣa

[image:5.612.47.554.58.140.2]

(6)

forKiand letMibe its unravelling;GiΣandM

Σ

i denote the

restrictions ofGiandMitoΣ. We begin with a very simple

game on the finite generating structureGΣ

2 and the possibly

infinite materialisationMΣ 1.

Infinite Game. This game is played by two players. Intu-itively, player 1 tries to construct a homomorphism, while player 2 wants to impede him by choosing a path in M2

to which player 1 cannot find a homomorphic image given his previous choices. Thestatesof the game are of the form si= (ui 7→σi), fori≥0, whereui ∈∆G2andσi ∈∆M1

satisfy the following condition:

(s1) tGΣ2(ui)⊆tMΣ1(σi).

The game starts in a states0 = (u0 7→ σ0)withσ0 = u0

in caseu0 ∈ind(K2)belongs to aΣ-concept or aΣ-role. In

each roundi > 0, player 2 challenges player 1 with some ui ∈ ∆G2 such thatui−1 ;Σ2 ui. Player 1 has to respond

with aσi∈∆M1satisfying(s1)and (s2) rGΣ2(ui−1, ui)⊆rMΣ1(σi−1, σi).

This gives the next statesi= (ui7→σi). Note that of all the

uionlyu0may be an ABox individual; however, there is no

such a restriction on theσi. Aplay of lengthn≥0starting

froms0is any sequences0, . . . ,sn of states obtained as

de-scribed above. For an ordinalλ≤ω, we say that player 1 has aλ-winning strategy in the gameGΣ(G2,M1)starting from a states0if, for any play of lengthi < λ, which starts froms0

and conforms with this strategy, and any challenge of player 2 in roundi+1, player 1 has a response. The next theorem gives a game-theoretic flavour to the criterion of Theorem 2.

Theorem 4. M2is finitelyΣ-homomorphically embeddable intoM1iff the following conditions hold:

(abox) tM2 Σ (a)⊆t

M1

Σ (a),rMΣ2(a, b)⊆rMΣ 1(a, b), for any

a, b∈ind(K2)that belong to aΣ-concept or aΣ-role; (win) for anyu0∈∆G2andn < ω, there existsσ0∈∆M1

such that player 1 has ann-winning strategy in the game

GΣ(G2,M1)starting from(u07→σ0). Example 2. ConsiderGΣ

2 forK2,MΣ1 forK1andΣfrom

Ex-ample 1. For eachn < ω, player 1 has ann-winning strategy from(u7→σn

0): forn= 4, it is shown in Fig. 1b by dotted

lines (in round 2, player 2 has two possible challenges). The criterion of Theorem 4 does not seem to be a big im-provement on Theorem 2 as we still have to deal with an in-finite materialisation. Our aim now is to replace(win)by a more complex game on thefinitegenerating structuresG2and G1. We consider four types,τ, of strategies inGΣ(G2,M1)

and for each of them we define a gameGτ_Σ(G2,G1)such that,

for anyu0∈∆G2, the following conditions are equivalent: (<ωτ) for every n < ω, player 1 has an n-winning τ

-strategy inGΣ(G2,M1)starting from some(u07→σn0); (ωτ₎ _{player 1 has an}_{ω-winning strategy in}_Gτ

Σ(G2,G1)

start-ing from some state dependstart-ing onu0andτ.

We begin with simplest ‘forward’ winning strategies.

Forward Strategies. We say that aλ-strategy (λ ≤ ω) for player 1 in the game GΣ(G2,M1) is forward if, for any

play of lengthi−1 < λ, which conforms with this strat-egy, and any challenge ui−1 ;Σ2 ui by player 2, the

re-sponseσiof player 1 is such that eitherσi−1, σi∈ind(K1)or

σi=σi−1w, for somew∈∆G1. If theGi,i= 1,2, are such

that theΣ-labels on;i-edges contain no inverse roles, then

everystrategy inGΣ(G2,M1)is forward. This is the case for

DLs without inverse roles:Horn-ALCH,EL, etc.

The existence of a forwardλ-winning strategy for player 1 inGΣ(G2,M1)is equivalent to the existence of such a

strat-egy in the game Gf_Σ(G2,G1), which is defined similarly to

GΣ(G2,M1)except that it is played onG2andG1, and the

re-sponsewi∈∆G1of player 1 to a challengeui−1;Σ2 uimust

be such that eitherwi−1, wi∈ind(K1)orwi−1;1wi. This

game is a standard reachability game on finite graphs, where the existence of ω-winning strategies for player 1 can be checked in polynomial time in the size ofG1andG2[Mazala,

2001]. By Theorem 3, we obtain the P and EXPTIMEupper complexity bounds forELHandHorn-ALCH, respectively. In contrast to forward strategies, the winning strategies of Ex-ample 2 can be described as ‘backward.’

Backward Strategies. A λ-strategy for player 1 in GΣ(G2,M1)isbackwardif, for any play of lengthi−1< λ,

which conforms with this strategy, and any challenge ui−1 ;Σ2 ui by player 2, the response σi of player 1 is

the immediate predecessorofσi−1 inM1in the sense that

σi−1 = σiw, for somew ∈ ∆G1; player 1 loses if σi−1 ∈ ind(K1). Note that, sinceM1is tree-shaped, the response of

player 1 to any other challengeui−1;Σ2 u0imust be the same

σi. That is why the states of the gameGbΣ(G2,G1)are of the

form (Ξi 7→ wi), whereΞi is thesetof all ;Σ2-successors

of elements of Ξi−1(forward strategies need only one

suc-cessor). The more complex structure of the states leads to an increase in the complexity: checking whether player 1 has an ω-winning strategy inGb

Σ(G2,G1)isCONP-hard.

Observe that in the case of DL-Litecore and DL-Litehorn (which have inverse roles but no RIs), generating structures

G = (∆G,·G_,_;₎_{are so that, for any}_u _∈ _∆G _and_{R, there}

isat most onevwithu;vandR ∈rG(u, v)[Kontchakov

et al., 2010a]. As a result, anyn-winning strategy consists of a (possibly empty) backward part followed by a (possibly empty) forward part. Moreover, in the backward games for these DLs, the setsΞi are singletons. Thus, the number of

states in the combined backward/forward games is polyno-mial, and the existence of winning strategies is in P.

In general, however, the forward strategy in the combina-tion is not enough, and we require start-bounded strategies.

Start-Bounded Strategies. A strategy for player 1 in the gameGΣ(G2,M1)starting from(u07→σ0)isstart-bounded

if it never leads to (ui 7→ σi) with σ0 = σiw, for some

w∈∆G1_and_{i >}₀_{. In other words, player 1 cannot use}

ele-ments ofM1that are located closer to the ABox thanσ0; the

ABox individuals inM1can only be used ifσ0 ∈ ind(K1).

ConsiderGΣ

2 andMΣ1 in Fig. 2a. Player 1 has a winning

start-bounded strategy from (u2 7→ σ1)as shown in Fig. 2a by

dashed lines (indices indicate rounds). Observe that player 1 moves forwards and backwards along;1inM1:σ4=σ3w

is visited in round 2 between visits toσ3in rounds 1 and 3.

Start-bounded finite game Gs

Σ(G2,G1) ensures that

player 1 moves only forwards along ;1 in G1 and so, he

has to guessallelements ofG2that are mapped to the same

(7)

u2 _T u6 _P u7 P₁− u8 T₁− u9

σ1 σ3

T , T1

σ4

P, P1 a

0

4 1

3 2

GΣ 2

MΣ

1 a)

u1 _u

2

R−

u3

u6

S−

T P u7 P₁− u8 T₁− u9

u10

S− 1

σ2

σ1

R

σ3 T , T1

σ4

P, P1

a

S, S₁ 0

1 1

2 2

GΣ 2

MΣ

1 b)

Figure 2: a) start-bounded strategy and b) general strategy as a composition of a backward and start-bounded strategies.

si= (Γi,Ξi 7→wi), whereΞiis the guess andΓiis required

to ensure that only forward moves are possible. ConsiderGΣ

2 andMΣ1 in Fig. 2a. Suppose thatG1is

iso-morphic to M1 and denote by wi the element of G1 that

corresponds toσi inM1. Then player 1 has anω-winning

strategy inGs

Σ(G2,G1)from(∅,{u2, u9} 7→ w1). Player 2

challenges with u2 ;Σ2 u6, and player 1 responds with

({u2, u9},{u6, u8} 7→w3). Then player 2 picksu6 ;Σ2 u7

and player 1 responds with({u6, u8},{u7} 7→ w4), where

the game ends. Note the crucial guesses{u2, u9} 7→w1and {u6, u8} 7→w3made by player 1. If player 1 responded with

({u2, u9},{u6} 7→w3)(and failed to guess thatu8must also

be mapped tow3), then after the challengeu6;Σ2 u7and the

only possible response({u6},{u7} 7→w4)), player 2 would

picku7;Σ2 u8, to which player 1 could not respond. General Strategies. A general winning strategy in the game GΣ(G2,M1)is composed of one backward and a number of

start-bounded strategies. Consider, for example,GΣ

2 andMΣ1

in Fig. 2b. Starting from(u1 7→ σ2), player 1 can respond

to the challengesu1 ;Σ2 u2 ;Σ2 u3 according to the

back-ward strategy; to u2 ;Σ2 u6 ;Σ2 u7 ;Σ2 u8 ;Σ2 u9

ac-cording to the start-bounded strategy as above, see Fig. 2a; whileu9;Σ2 u10is again responded according to the

back-ward strategy. We can combine the two backback-ward strategies into a single one, but keep the start-bounded one separate.

In general, the finite gameGg_Σ(G2,G1)begins as the

back-ward game but with states of the form(Ξi7→wi,Ψi), where Ξi andwi are as above and Ψi indicates initial challenges

in start-bounded games. In each roundi > 0, player 2 can choose from two options. First, ifwi−1 ∈/ ind(K1), he can

challenge player 1 with the setΨi−1 (that is, similar to the

backward game but with a possibly smallerΨi−1instead of

the set of all successors). Second, player 2 can launch the start-bounded game from(∅,Ξi−1 7→wi−1), where his first

challenge cannot be picked fromΨi−1.

The full game graph is exponential in the size of the gener-ating structures, and so checking whether player 1 has an ω-winning strategy can also be done in exponential time. This matches the EXPTIME-hardness of checking whether player 1 has anω-winning strategy in thestart-boundedgame.

Theorem 5. For combined complexity, both KBΣ-query in-separability and KBΣ-query entailment are

(f) P-complete in ELand ELHand EXPTIME-complete in Horn-ALCandHorn-ALCH;

(b+f) P-complete inDL-LitecoreandDL-Litehorn;

(b+s) EXPTIME-complete in DL-LiteH_horn and DL-LiteH_core;

2EXPTIME-complete inHorn-ALCHI,Horn-ALCI. For data complexity, all these problems areP-complete.

5 Related Work

Σ-query inseparability of KBs has not been investigated sys-tematically before. However, the polynomial upper bound for

ELwas established as a preliminary step to study TBox query inseparability [Lutz and Wolter, 2010]. This notion was also used to study forgetting inDL-LiteNbool [Wanget al., 2010].

Σ-query inseparability of KBs is closely related to knowl-edge exchange between KBs and inseparability between TBoxes. SupposeK1andK2are KBs given in disjoint

sig-naturesΣ1andΣ2, andT12consists of inclusions of the form

S1 v S2, where theSi are concept (or role) names inΣi.

Then deciding whetherK1∪ T12≡Σ2 K2is called the mem-bership problem for universal CQ-solutions. For DLsLwith role inclusions, this problem is in fact aΣ2-query

insepara-bility problem inL, and so the complexity upper bounds for

Σ-query inseparability can be applied directly to obtain upper bounds for the membership problem. Conversely, Σ-query entailment for any of our DLsLis LOGSPACE-reducible to the membership problem for universal CQ-solutions inL, and so we can also apply complexity lower bounds for query in-separability of KBs.

As for TBox inseparability, recall that T1 andT2 are Σ

-query inseparable if, for allΣ-ABoxesA, the KBs(T1,A)

and(T2,A)are Σ-query inseparable. This notion has been

extensively studied [Kontchakov et al., 2010b; Lutz and Wolter, 2010; Konevet al., 2011; 2012]. TBox and KB insep-arabilities have different applications. The former supports ontology engineering when data is unknown or changes fre-quently: one can equivalently replace one TBox with another only if they return the same answers to queries overeveryΣ -ABox. In contrast, KB inseparability is useful in applications where data is stable—such as knowledge exchange or vari-ants of module extraction and forgetting with fixed data—in order to use the KB in a new application or as a compilation step to make CQ answering more efficient. For many DLs, TBoxΣ-query inseparability is harder than KB query insep-arability: in DL-Litehorn, the space of relevant ABox coun-terexamples is exponential and, in fact, TBox inseparability is NP-hard, while KB inseparability is in P. Similarly,Σ-query inseparability ofELKBs is tractable, whileΣ-query insep-arability of TBoxes is EXPTIME-complete. The complexity of TBox inseparability for Horn-DLs extending Horn-ALC

[image:7.612.62.552.59.123.2]

(8)

References

[Artaleet al., 2009] A. Artale, D. Calvanese, R. Kontchakov, and M. Zakharyaschev. The DL-Lite family and relations. JAIR, 39:1–69, 2009.

[Arenaset al., 2012] M. Arenas, E. Botoeva, D. Calvanese, V. Ryzhikov, and E. Sherkhonov. Exchanging description logic knowledge bases. InKR, 2012.

[Arenaset al., 2013] M. Arenas, J. P´erez, and J. L. Reutter. Data exchange beyond complete data.JACM, 60:28, 2013.

[Arenaset al., 2014] M. Arenas, P. Barcel´o, L. Libkin, and F. Murlak.Foundations of Data Exchange. CUP, 2014. [Baaderet al., 2003] F. Baader, D. Calvanese, D. McGuinness,

D. Nardi, and P. F. Patel-Schneider, editors. The Description Logic Handbook: Theory, Implementation and Applications. CUP, 2003 (2nd edition, 2007).

[Baaderet al., 2005] F. Baader, S. Brandt, and C. Lutz. Pushing the

ELenvelope. InIJCAI, 2005.

[Calvaneseet al., 2007] D. Calvanese, G. De Giacomo, D. Lembo, M. Lenzerini, and R. Rosati. Tractable reasoning and effi-cient query answering in description logics: The DL-Lite family. J. Aut. Reasoning, 39:385–429, 2007.

[Cuenca Grau and Motik, 2012] B. Cuenca Grau and B. Motik. Reasoning over ontologies with hidden content: The import-by-query approach.JAIR, 45:197–255, 2012.

[Cuenca Grauet al., 2008] B. Cuenca Grau, I. Horrocks, Y. Kaza-kov, and U. Sattler. Modular reuse of ontologies: Theory and practice.JAIR, 31:273–318, 2008.

[Del Vescovoet al., 2011] C. Del Vescovo, B. Parsia, U. Sattler, and T. Schneider. The modular structure of an ontology: Atomic decomposition. InIJCAI, 2011.

[Eiteret al., 2008] T. Eiter, G. Gottlob, M. Ortiz, and M. Simkus. Query answering in the description logic Horn-SHIQ. In JELIA, 2008.

[Gieseet al., 2013] M. Giese, D. Calvanese, P. Haase, I. Hor-rocks, Y. Ioannidis, H. Kllapi, M. Koubarakis, M. Lenzerini, R. Möller, M. Rodriguez-Muro, Ö. Özcep, R. Rosati, R. Schlatte, M. Schmidt, A. Soylu, A. Waaler. Scalable end-user access to big data. InBig Data Computing, 2013.

[Hitzleret al., 2009] P. Hitzler, M. Kr¨otzsch, S. Rudolph. Founda-tions of Semantic Web Technologies. CRC, 2009.

[Hustadtet al., 2005] U. Hustadt, B. Motik, and U. Sattler. Data complexity of reasoning in very expressive description logics. In IJCAI, 2005.

[Jim´enez-Ruizet al., 2011] E. Jim´enez-Ruiz, B. Cuenca Grau, I. Horrocks, and R. Berlanga Llavori. Supporting concurrent ontology development: Framework, algorithms and tool. Data Knowl. Eng., 70(1):146–164, 2011.

[Kazakov, 2009] Y. Kazakov. Consequence-driven reasoning for Horn-SHIQontologies. InIJCAI, 2009.

[Konevet al., 2009] B. Konev, D. Walther, and F. Wolter. Forget-ting and uniform interpolation in large-scale description logic ter-minologies. InIJCAI, 2009.

[Konevet al., 2011] B. Konev, R. Kontchakov, M. Ludwig, T. Schneider, F. Wolter, M. Zakharyaschev. Conjunctive query inseparability of OWL 2 QL TBoxes. InAAAI, 2011.

[Konevet al., 2012] B. Konev, M. Ludwig, D. Walther, and F. Wolter. The logical difference for the lightweight description logicEL.JAIR, 44:633–708, 2012.

[Kontchakov and Zakharyaschev, 2014] R. Kontchakov and M. Za-kharyaschev. An introduction to description logics and query rewriting. InRW, 2014.

[Kontchakovet al., 2010a] R. Kontchakov, C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev. The combined approach to query answering in DL-Lite. InKR, 2010.

[Kontchakovet al., 2010b] R. Kontchakov, F. Wolter, and M. Za-kharyaschev. Logic-based ontology comparison and module ex-traction, with an application to DL-Lite.Artif. Intell., 174:1093– 1141, 2010.

[Koopmann and Schmidt, 2014a] P. Koopmann and R. A. Schmidt. Count and forget: Uniform interpolation ofSHQ-ontologies. In IJCAR, 2014.

[Koopmann and Schmidt, 2014b] P. Koopmann and R. A. Schmidt. Forgetting and uniform interpolation forALC-ontologies with ABoxes. InDL, 2014.

[Kr¨otzschet al., 2013] M. Kr¨otzsch, S. Rudolph, and P. Hitzler. Complexities of Horn description logics. ACM Trans. Comput. Log., 14(1):2, 2013.

[Lutz and Wolter, 2010] C. Lutz and F. Wolter. Deciding insepa-rability and conservative extensions in the description logicEL. J. Symbolic Computation, 45(2):194–228, 2010.

[Lutzet al., 2009] C. Lutz, D. Toman, and F. Wolter. Conjunctive query answering in the description logicELusing a relational database system. InIJCAI, 2009.

[Lutzet al., 2012] C. Lutz, I. Seylan, and F. Wolter. An automata-theoretic approach to uniform interpolation and approximation in the description logicEL. InKR, 2012.

[Mazala, 2001] R. Mazala. Infinite games. InAutomata, Logics, and Infinite Games, pages 23–42, 2001.

[Nikitina and Glimm, 2012] N. Nikitina and B. Glimm. Hitting the sweetspot: Economic rewriting of knowledge bases. InISWC, 2012.

[Nikitina and Rudolph, 2014] N. Nikitina and S. Rudolph. (Non-)Succinctness of uniform interpolants of general termi-nologies in the description logicEL. Artif. Intell., 215:120–140, 2014.

[Nortjeet al., 2013] R. Nortje, K. Britz, and T. Meyer. Reachability modules for the description logicSRIQ. InLPAR, 2013. [Poggiet al., 2008] A. Poggi, D. Lembo, D. Calvanese, G. De

Gia-como, M. Lenzerini, and R. Rosati. Linking data to ontologies. J. Data Semantics, X:133–173, 2008.

[Pollereset al., 2013] A. Polleres, A. Hogan, R. Delbru, and J. Um-brich. RDFS and OWL reasoning for Linked Data. InRW, 2013. [Rosati, 2007] R. Rosati. On conjunctive query answering in EL.

InDL, 2007.

[Shvaiko and Euzenat, 2013] P. Shvaiko and J. Euzenat. Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng., 25(1):158–176, 2013.

[Stuckenschmidtet al., 2009] H. Stuckenschmidt, C. Parent, and S. Spaccapietra, editors. Modular Ontologies: Concepts, The-ories and Techniques for Knowledge Modularization. Springer, 2009.