Maintenance Algorithms - Maintaining the Semantic Information Network

5.4 Maintaining the Semantic Information Network

5.4.3 Maintenance Algorithms

The add function inserts a vertex vaddand its properties in a SIN. Further, it determines

which relationships exist between vadd and the already existing vertices. As mentioned,

properties, being both mandatory and static, are the minimum input for the function. Function add(SIN, vadd)

Input: SIN = (V, E, L, W, fl, fw) a SIN,

vadd: vertex to be added and its properties P (vadd);

Output: SIN is updated;

1 begin

2 V := V ∪ {v_add}; 3 foreach v ∈ V do

4 if uri(v) 6= uri(vadd) then

5 E := E ∪ {new edge/s between v and v_add};

The delete function removes a vertex vdel and its properties from a SIN including its

relationships with neighbored vertices. Note that all quadrants of the property classifi- cation are relevant for the delete function (cf. Figure 5.18B).

Function delete(SIN, vdel)

Input: SIN = (V, E, L, W, fl, fw) a SIN,

vdel: vertex to be deleted and its properties P (vdel);

Output: SIN is updated;

1 begin

2 v := get v ∈ V so that uri(v) = uri(vdel); 3 E := E \ {(v, γ), (γ, v) | γ ∈ Γ(v)};

4 V := V \ {v};

The update function takes a vertex vupd as input, which is used to update the vertex

v in the SIN that is identified by the same uri as vupd. The function also adds, deletes

and updates relationships between the updated vertex v and existing vertices.

Function update(SIN, vupd)

Input: SIN = (V, E, L, W, fl, fw) a SIN,

vupd: vertex used to update the SIN and its properties P (vupd);

Output: SIN is updated;

1 begin

2 P (v_upd) := {p ∈ P (v_upd) | p is not mandatory + static}; 3 v := get v ∈ V so that uri(v) = uri(v_upd);

4 foreach (key, val) ∈ P (v) do 5 if (key, val_upd) ∈ P (v_upd) then 6 val := val_upd;

7 P (vupd) := P (vupd) \ {(key, valupd)};

8 else

9 P (v) := P (v) \ {(key, val)}; 10 P (v) := P (v) ∪ P (v_upd);

11 foreach v0 ∈ V do 12 if v0 ∈ Γ(v) then

13 E := update edge/s between v0 and v;

14 E := E \ {obsolete edge/s between v0 and v}; 15 if uri(v0) 6= uri(v) then

16 E := E ∪ {new edge/s between v0 and v};

Based on these functions, we propose three algorithms for maintaining SINs. The maintenance is based on two main principles: (1) push principle and (2) pull principle.

Regarding the push principle, the data source pushes process information and business processes automatically to the SIN when they are added, updated or deleted in the data source. Regarding exogenous changes, however, as a prerequisite the data source must

be able to send notifications if process information or business processes are changed. Regarding endogenous changes, the prerequisite is that the SIN detects changes automatically and triggers respective actions (cf. Definition 18).

Regarding the pull principle, the SIN gathers process information and business processes from a data source. Such a maintenance process is triggered by time-based schedulers; i.e., the SIN is maintained at certain points in time. The pull principle is used for data sources not capable of sending change notifications to the SIN.

For each of these two principles, we introduce a corresponding algorithm. As a prerequisite for both algorithms, the SIN must have access to underlying data sources. In case of exogenous changes, the SIN transforms process information and business processes into a uniform format. In case of endogenous changes no transformation is necessary.

Push Algorithm. The push algorithm deals with changes of a SIN according to the push principle, e.g., a policy may no longer be valid in 2015 and the corresponding SIN object has to be maintained accordingly. Thus, SIN maintenance is triggered by an action applied to the SIN by the push algorithm.

Algorithm 1: Push Algorithm.

Input: SIN = (V, E, L, W, fl, fw) a SIN, a an action;

Output: SIN is updated;

1 begin

2 switch func(a) do

3 case add

4 v := create a vertex and its properties from

the data source affected by the uri of a;

5 add(SIN, v);

6 break;

7 case update

8 v := create a vertex and its properties from

the data source affected by the uri of a;

9 update(SIN, v);

10 break;

11 case delete

12 v := get v ∈ V so that uri(v) = uri(a);

13 delete(SIN, v);

The push algorithm works as follows: In the add and update case, we create a vertex v and its properties from the data source affected by the action a (i.e., based on the “uri” of the action). After that, we call the corresponding add or update function. In the delete case, we identify the corresponding vertex v ∈ V based on the “uri” of the action and call the according delete function.

Pull Algorithm. The pull algorithm deals with changes of a SIN based on the pull principle; i.e., data has changed in the data source and needs to be gathered by the SIN. For example, documents on a shared drive are updated and, therefore, respective changes must be made in the SIN. The maintenance of the SIN is triggered by a scheduler.

Algorithm 2: Pull Algorithm.

Input: SIN = (V, E, L, W, fl, fw) a SIN, ds the data source;

Output: SIN is updated;

1 begin

2 V_ds := create a set of vertices and their properties from the data source ds; 3 foreach v ∈ V so that source(v) = ds do

4 v_ds := get v_ds ∈ V_ds so that uri(v_ds) = uri(v); 5 if v_ds6= null then

6 if vds is newer than v then

7 update(SIN, v_ds); 8 Vds := Vds \ {vds}; 9 else 10 delete(SIN, v); 11 foreach vds∈ Vds do 12 add(SIN, v_ds);

The pull algorithm works as follows: First, we create a set of vertices Vds from a data

source ds. Then, for each vertex v ∈ V that was created from ds (property “source”), we check whether a corresponding vertex vds ∈ Vds exists. If this is the case, we further

check whether vdsis newer than v (e.g., by comparing the creation or modification dates).

If v is outdated, it is updated with the properties of vds by calling the update function.

Then, vds is removed from Vds. If no corresponding vertex exists in the data source, we

delete vertex v in the SIN using the delete function. Finally, we add each remaining vertex vds ∈ Vds from the data source to the SIN by calling the add function. Hence,

the pull algorithm allows maintaining the SIN at a certain point in time. Note that this procedure has to be repeated for each data source of the SIN.

Partial-Pull Algorithm. In practice, a SIN may comprise a large number of objects and relationships. Maintaining the SIN based on the pull principle, therefore, can be a time- consuming task. In a specific work context, however, a user might only be interested in a selected part of the SIN. During a review, for example, review templates, existing reviews, or results of a real-time evaluation (e.g., prioritization of projects in a workshop) are of great importance, while checklists and best practices for performing effective project management are less interesting. Thus, it is sufficient to maintain only those objects and relationships relevant for the user when querying the SIN. To reflect this, we introduce another principle, called the partial-pull principle, where the SIN gathers only process information and business processes from data sources as requested by a user.

Regarding the partial-pull principle, we introduce a third algorithm (i.e., the partial- pull algorithm) as a lightweight version of the pull algorithm. It does not maintain the entire SIN, but only those parts relevant for a given request.

Algorithm 3: Partial-Pull Algorithm.

Input: SIN = (V, E, L, W, fl, fw) a SIN, req the request to a SIN;

Output: SIN is partially updated, Vreq contains the requested vertices; 1 begin

2 Vds := create a set of vertices from the data sources affected by req; 3 foreach v ∈ V affected by req do

4 v_ds:= get v_ds ∈ V_ds so that uri(v_ds) = uri(v); 5 if vds 6= null then

6 if v_ds is newer than v then

7 update(SIN, v_ds);

8 vupd := get v0 ∈ V so that uri(v0) = uri(vds);

9 Vreq:= Vreq∪ {vupd};

10 else

11 Vreq:= Vreq∪ {v};

12 else

13 delete(SIN, v);

The partial-pull algorithm works as follows: First, we create a set of vertices Vds from

the data sources affected by the user request req. Then, for each vertex v ∈ V affected by req we retrieve the corresponding vertex vds from the affected data sources. Thus, if

a corresponding vertex vds is in the data source, we check whether vds is newer than v

(e.g., by comparing the creation or modification dates). If v is outdated, it is updated with vds by calling the update function. If there is no corresponding vertex in the data

source, we delete vertex v in the SIN by calling the delete function. The partial-pull algorithm allows maintaining parts of a SIN based on a request and ensures that all requested objects are synchronized with affected data sources.

As opposed to the other principles, the partial-pull principle is completely user-driven as it is solely triggered by a user request (e.g., a search). In turn, the push and pull principle are machine-driven, e.g., triggered through notifications from schedulers.

Validation (Maintenance Algorithms). In order to demonstrate and validate the feasibility and applicability of the maintenance algorithms, we conducted a case study in the automotive domain. This section only provides a short summary of the case study results. The entire results can be found in Section 9.3. In the case study, we have shown that automatic maintenance of SINs with regard to both exogenous and endogenous changes is feasible with acceptable costs. Furthermore, we investigated the effect of depth and breadth on the runtime of the proposed algorithms. The algorithms

performed satisfactorily in terms of adding, updating and deleting objects. The cost of detecting relationships, however, varies widely when using expensive linguistic or statis- tical algorithms. Moreover, the case study results have shown that it is crucial to provide up-to-date, integrated and homogeneous views on process information during business process execution. The empirical validation confirms that there is a high demand for a single point of access to process information in knowledge-intensive business processes.

In document Process-Oriented Information Logistics: Aligning Process Information with Business Processes (Page 87-92)