k Delegating oracle - Modeling Learning Tasks

CHAPTER 6. RELATIONSHIP BETWEEN LEARNING CLASSIFIERS

6.2 Modeling Learning Tasks

6.2.2 k Delegating oracle

A kDelegating oracle is a procedure which when invoked by a learner in turn invokes subordinate oracles EX1hf₁, X_s1, D₁i . . . EXkhf_k, X_sk, D_ki with probabilities p₁. . . p_krespectively. The

kDelegating oracle has access to a mapping set M = {m1, m2. . . mk} where mi = {mx_i, mc_i};

mx_i : Xsi −→ X is an attribute mapping function; and mc_i : C_i −→ C is a class mapping func-

tion where Ci = Range(fi) and C = Range(f ). It uses the mapping mi to convert an instance

hx_si, f_i(x_si)i received form the ithsubordinate oracle to hm_ix(x_si), mc_i(f_i(x_si))i before passing it

to the learner. A mapping aware k-delegating oracle is represented by EXhφ, X+, D+, M, X i, where X+ = {X1, X2. . . Xk} is the set of instance spaces, D+ = {D1, D2. . . Dk} is the set

of distributions, M is a mapping set and φ is the labeling function with which instances are returned to the learner. A schematic representation of learning using a kDelegating oracle is shown in Figure 6.1.

∀x_si ∈ X_si, mx_i(x_si) ∈ X ; ∀l ∈ Range(f_i), mc_i(l) ∈ Range(f ) and whenever x ∈ X_si, X_sj,

mx_i(x) = mx_j(x). A mapping set M is said to be admissible if ∀mi∈ M, mi is admissible. An

admissible mapping set M ensures that the examples returned by the k -delegating oracle are of the form hx, l(x)i where x ∈ X and l(x) ∈ C. We denote the set of all possible admissible mappings by M∗ and assume in the rest of the chapter that a given mapping is admissible. Consider the case of learning a function f using a k Delegating oracle kEXhφ, X+, D+, M, X i. A straight-forward analysis shows that the total number of admissible mappings is |M ∗ | = Qk

i |X ||Xsi|(we assume two mappings are different if they differ on mapping atleast one instance

from the instance space of any one of the subordinate oracles).

Let the ideal mapping set Mtrue= {m1,true, m2,true. . . Mk,true,} where mi,true=

{mx

i,true, mci,true}. The ideal mapping set is assumed to satisfy the following conditions: (a)

Mtrue ∈ M∗ and (b) hx, l(x)i the labeled instance returned to L (obtained after applying

relevant mappings in Mtrue to the labeled instance sampled from the subordinate oracle) is

the same as hx, f (x)i. The condition (b) requires that instances returned to the learner are labeled according to the target function f .

Definition 2 Semantics Preserving Class Mapping : A class mapping mc_i is said to be semantics preserving if ∀l ∈ Ci mci(l) = mci,true(l).

Definition 3 Semantics Preserving Attribute Mapping: An attribute mapping mx_i is said to be semantics preserving whenever ∀x ∈ X_si, f_i(x) = l and mc_i,true(l) = l₁ implies

f (mx

i(x)) = l1

Definition 4 Semantics Preserving Mapping Set : A mapping set M = {m1, m2. . . mk}

is said to be correct if ∀i ∈ {1, 2, . . . , k} mx_i and mc_i are semantics preserving.

Definition 5 Identity Mapping A mapping set M is said to be an identity mapping if ∀mi = {mxi, mci} ∈ M, mxi(x) = x and mci(l) = l.

A semantics preserving mapping guarantees that the instances returned to the learner are labeled with the target function f . Hence, given a k Delegating oracle kEXhφ, X+, D+, M, X i

Instances with Label 0 Instances with Label 1 Instances with Label 0 Instances with Label 1 Instances with Label 0 Instances with Label 1 Instances with Label 0 Instances with Label 1

Figure 6.2 An example of a mapping with errors and a semantics preserving mapping.

with semantics preserving mapping M , it is the case that φ = f where f is the target function. It is the case that Mtrue is semantics preserving. A mapping set M is said to have errors if it

is not semantics preserving. Errors in a mapping set can either be due to errors in attribute mappings or errors in class mappings. Since the class mappings are fairly straightforward (being between the binary class labels 0 and 1), we will assume throughout the chapter that only possible errors are in attribute mappings. Figure 6.2 shows an example mapping with errors and a semantics preserving mapping.

Remark A straight-forward analysis shows that the total number of semantics preserving mappings isQk i |X+| |X_si+_| × |X−_||Xsi −_| .

The remark above shows that, in general, there are multiple available semantics preserving mappings each of which returns examples to the learner that are labeled with the target function. We now show that difference between using different semantics preserving mappings man- ifests itself in terms of difference in distribution with which examples are returned to the learner. Given a mapping mx_i, let [mx_i(y 7→ z)] be a indicator function that returns 1 when mx_i(y) = z (for y ∈ X_si) and returns 0 otherwise. Given a k Delegating oracle kEXhφ, X+, D+, M, X i the

distribution over X with which instances are returned to the learner L is P r[z] = k X i=1 pi X y∈X_si [mx_i(y 7→ z)]pry∈Di[y]

stances are returned to the learner L depends on the set P = {p1. . . pk} (the probabilities of

selecting the subordinate oracles), the distribution set D+ and the mapping set M (through dependence on indicator function [mx_i(y 7→ z]) and is denoted by MP(D+). For easing clutter

of notation we drop the parameter P in the notation and denote it by M (D+).

While a learner L with access to a kDelegating oracle can easily model learning from semantically disparate data (including modeling errors in mappings), it is the case that a learner with access to a kDelegating oracle cannot model domain adaptation. We now introduce, in the next section, an approach to model domain adaptation.

In document Learning predictive models from massive, semantically disparate data (Page 97-100)