SQL-to-OCL transformation - SQL, PL/SQL and OCL basic concepts

6.7.2 SQL-to-OCL transformation

This section describes the mapping between SQL SELECT statements and the equivalent OCL expressions. In particular, we describe the mappings for SQL projections, joins, functions, group by and having clauses. These mappings are needed to create the derivation rules for views as described above and to be able to extract constraints implemented as part of trigger definitions as explained in the next subsection.

In the following, we present the list of mappings.

Projection.

A SQL projection is composed of a SELECT, a FROM and, optionally, a WHERE clause. The mapping of a generic projection is shown in Fig. 6.13.

SQL:

    SELECT DISTINCT col_select
    FROM table
    WHERE col_where <Boolean expr>;

OCL:

    classtable_name.allInstances()                                        -- (1)
    ->select(iteratortable_name |
             iteratortable_name.attrcol_where <Boolean expr>)             -- (2)
    ->collect(iteratortable_name |
              Tuple{col_nameselect = iteratortable_name.attrcol_select})  -- (3)
    ->asSet()                                                             -- (4)

Figure 6.13: Projection mapping

1. The mapping starts by translating the FROM clause. This is done by selecting all instances (i.e., method allInstances()) of the class in the CS that corresponds to the table in the FROM clause.

2. The WHERE clause, if defined, is translated into an OCL select iterator. The condition in the select iterator is created by translating the conditions in the WHERE clause. The mapping is essentially direct once the references to column names in the WHERE clause are replaced by the corresponding attribute or association end names. SQL functions are translated into their OCL counterparts (if they exist; otherwise new OCL operations must be defined beforehand, e.g., see [54]).

3. The SELECT clause is translated into an OCL collect iterator that creates a collection of objects according to the structure defined in the Tuple definition. Each field in the Tuple corresponds to a column in the SELECT clause. Fields are initialized with the value of the corresponding attributes.

4. Finally, the DISTINCT clause might be used in conjunction with a SELECT statement to return only the different (i.e., distinct) values in a given table. This clause is mapped by adding the operation asSet() after the OCL mapping of the SELECT clause.
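The four steps above can be mimicked outside OCL to see how they compose. The following is a minimal Python sketch, not part of the original transformation: the helper name project, the Employee dataclass and the sample population are hypothetical, a plain list stands in for allInstances(), and Python tuples stand in for OCL Tuple values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Employee:
    first_name: str
    last_name: str
    salary: float


def project(instances, where, columns, distinct=False):
    """Emulate allInstances()->select(...)->collect(Tuple{...})[->asSet()]."""
    # Step 2: the WHERE clause becomes a filter (OCL select iterator).
    selected = [obj for obj in instances if where(obj)]
    # Step 3: the SELECT clause becomes one tuple per remaining instance (OCL collect).
    collected = [tuple(getattr(obj, col) for col in columns) for obj in selected]
    # Step 4: DISTINCT removes duplicate tuples (OCL asSet()).
    return set(collected) if distinct else collected


# Step 1: the FROM clause yields the whole population of the class
# (a hypothetical in-memory list plays the role of allInstances()).
employees = [
    Employee("Ada", "Lovelace", 7000.0),
    Employee("Ada", "Lovelace", 6500.0),
    Employee("Alan", "Turing", 4000.0),
]

rows = project(employees, lambda e: e.salary > 5000, ["first_name", "last_name"])
distinct_rows = project(
    employees, lambda e: e.salary > 5000, ["first_name", "last_name"], distinct=True
)
```

Without the final step the two matching instances produce two identical tuples; with distinct=True the duplicates collapse, which is the effect asSet() has in the OCL pattern.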

SQL:

    SELECT first_name, last_name
    FROM Employee
    WHERE salary > 5000;

OCL:

    Employee.allInstances()
    ->select(employee | employee.salary > 5000)
    ->collect(employee | Tuple{first_name = employee.first_name,
                               last_name = employee.last_name})

Figure 6.14: Example of a projection mapping

In Fig. 6.14, an example of projection mapping is shown. The FROM clause containing the table Employee is translated into an OCL operation Employee.allInstances() that retrieves all the instances of the UML class Employee. Later, a select operation that represents the WHERE clause is applied on those instances, such that the instances of Employee with a salary greater than 5000 are selected. Finally, the SELECT clause is mapped into a collect, such that for each remaining instance of Employee, its attributes first_name and last_name are collected into a tuple.
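One way to gain confidence in such a mapping is to run the SQL query and an emulation of the OCL pipeline over the same data and compare the results. The sketch below does this with Python's built-in sqlite3 module; the sample rows are invented for illustration, and the list comprehension is only a stand-in for the OCL select and collect operations, not an OCL evaluation.

```python
import sqlite3

# Hypothetical sample population shared by both sides of the comparison.
data = [
    ("Ada", "Lovelace", 7000.0),
    ("Alan", "Turing", 4000.0),
    ("Grace", "Hopper", 9000.0),
]

# SQL side: the query of the projection example, run on an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Employee (first_name TEXT, last_name TEXT, salary REAL)")
conn.executemany("INSERT INTO Employee VALUES (?, ?, ?)", data)
sql_rows = conn.execute(
    "SELECT first_name, last_name FROM Employee WHERE salary > 5000"
).fetchall()
conn.close()

# OCL side (emulated): select on salary, then collect the two attributes.
ocl_rows = [(first, last) for (first, last, salary) in data if salary > 5000]

# Row order is not guaranteed by SQL, so compare as sorted lists.
same_result = sorted(sql_rows) == sorted(ocl_rows)
```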

Join.

In SQL, the JOIN operation combines the values of two or more tables. Our transformation covers both inner and outer joins.

– Inner joins.

The inner join is by far the most common case of joining tables. Given two tables a and b, it returns the combinations of rows from the two tables that satisfy the join conditions (i.e., the intersection of the two tables). The mapping of a generic inner join is shown in Fig. 6.15.

1. First, we compute the Cartesian product of the populations of all tables by retrieving all the instances of the corresponding classes in the CS and applying the Cartesian product to them (named product in OCL).

SQL:

    SELECT colAselect, colBselect
    FROM tableA aliasA JOIN tableB aliasB
    ON aliasA.colAJoin = aliasB.colBJoin
    WHERE cond_where

OCL:

    classtableA_name.allInstances()->product(classtableB_name.allInstances())
    ->select(iteratorproduct | iteratorproduct.first.colAJoin = iteratorproduct.second)
    ->select(iteratorproduct | iteratorproduct.cond_where)
    ->collect(iteratorproduct | Tuple{colA_nameselect = iteratorproduct.first.colAselect,
                                      colB_nameselect = iteratorproduct.second.colBselect})

Figure 6.15: Inner Join mapping pattern

2. The join conditions are mapped to the body of a select operation that iterates over the tuples of the Cartesian product (i.e., iteratorproduct), selecting those that satisfy the conditions. first and second are used to identify the classes in the Cartesian product (i.e., classtableA_name and classtableB_name). Note that colAJoin is compared with second (and not second.colBJoin) since in the pattern we assume that colAJoin is the name of the role of the association linking the two classes (i.e., the type of colAJoin in the CS is classtableB_name).

3. In case the WHERE clause is defined (i.e., Implicit Join or other conditions), a new select operation is created following the procedure described in the previous pattern.

4. If not all columns are selected, the collect operation is used as described in the previous pattern.

SQL:

    SELECT employee_id, department_name
    FROM Employee e JOIN Department d
    ON e.department_id = d.department_id
    WHERE e.salary > 5000

OCL:

    Employee.allInstances()->product(Department.allInstances())
    ->select(it | it.first.emp_dept_fk = it.second)
    ->select(it | it.first.salary > 5000)
    ->collect(it | Tuple{employee_id = it.first.employee_id,
                         department_name = it.second.department_name})

Figure 6.16: Example of explicit inner join mapping

In Fig. 6.16, an example of join mapping is shown. As a first step, the Cartesian product is calculated between the instances of Employee and Department. From this set we select the tuples where the value of the department association end of the employee (first.emp_dept_fk) is equal to the second element of the tuple (i.e., an instance of the class Department). Here, first refers to an employee instance since Employee is the first class referenced in the creation of the Cartesian product. Then, the WHERE clause is mapped into a select operation that selects the employees who earn more than 5000. Finally, the selected elements are projected using the collect operation to return a collection of tuples with only two fields, employee_id and department_name.
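The same sequence of operations can be traced step by step in ordinary code. The following Python sketch is an illustration under stated assumptions rather than the thesis's implementation: itertools.product plays the role of the OCL product operation, each pair (e, d) corresponds to a tuple with fields first and second, and the dataclasses and sample instances are hypothetical.

```python
from dataclasses import dataclass
from itertools import product
from typing import Optional


@dataclass(frozen=True)
class Department:
    department_id: int
    department_name: str


@dataclass(frozen=True)
class Employee:
    employee_id: int
    salary: float
    emp_dept_fk: Optional[Department]  # association end pointing to a Department


sales = Department(1, "Sales")
research = Department(2, "Research")
departments = [sales, research]
employees = [
    Employee(10, 6000.0, sales),
    Employee(11, 4500.0, sales),
    Employee(12, 8000.0, research),
]

# Step 1: Cartesian product of the two populations (OCL product).
pairs = list(product(employees, departments))
# Step 2: join condition, first.emp_dept_fk = second.
joined = [(e, d) for (e, d) in pairs if e.emp_dept_fk == d]
# Step 3: WHERE clause, first.salary > 5000.
filtered = [(e, d) for (e, d) in joined if e.salary > 5000]
# Step 4: projection of the selected columns (OCL collect).
result = [(e.employee_id, d.department_name) for (e, d) in filtered]
```

Note that the product is materialized in full (3 x 2 = 6 pairs here) before any filtering, which mirrors the pattern but grows quadratically with the population sizes.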

– Outer joins.

The other type of join is the outer join, which returns the same result as the inner join plus the rows from one or both tables that do not match any row in the other table. Outer joins are divided into left, right and full, which return all the rows from the left table, the right table or both tables, respectively, even if they have no match in the other table. The OCL mapping of an outer join consists of a union between the OCL expression derived from the inner join and the instances of the class representing the table to which the outer clause applies. These instances are retrieved with the operation allInstances(), and only those with an undefined value for the attributes corresponding to the join columns are selected.

SQL:

    SELECT colAselect, colBselect
    FROM tableA aliasA
    LEFT OUTER JOIN tableB aliasB
    ON aliasA.colAJoin = aliasB.colBJoin

OCL:

    let join : Sequence(Tuple(colA_nameselect : colA_typeselect,
                              colB_nameselect : colB_typeselect)) = ... in
    let left_outer_join : Sequence(Tuple(colA_nameselect : colA_typeselect,
                                         colB_nameselect : colB_typeselect)) =
        classtableA.allInstances()->select(iteratortableA_name |
                                           iteratortableA_name.colAJoin.oclIsUndefined())
        ->collect(iteratortableA_name | Tuple{colA_nameselect = iteratortableA_name,
                                              colB_nameselect = null}) in
    join->union(left_outer_join)

Figure 6.17: Left outer join mapping

In Fig. 6.17, a generic example of a left outer join is presented. The result of the inner join is stored in the variable join, initialized in the first let expression. The additional rows of the left outer join are defined by the variable left_outer_join. Its type is the same as that of the variable join, while it is initialized with a subset of the instances of the class that represents the left table (i.e., table A). The instances in this subset have a null value for the attribute colAJoin, which represents the column on which the join is performed. Finally, the result of the left outer join is given by the union of the two variables join and left_outer_join.
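The union-based pattern can be illustrated with a small Python sketch. It is only an approximation under explicit assumptions: None stands in for an undefined OCL value, list concatenation stands in for union on sequences, and the classes and instances are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Department:
    department_name: str


@dataclass(frozen=True)
class Employee:
    employee_id: int
    emp_dept_fk: Optional[Department]  # None plays the role of an undefined value


sales = Department("Sales")
departments = [sales]
employees = [Employee(10, sales), Employee(11, None)]

# Variable "join": result of the inner join mapping.
join = [
    (e.employee_id, d.department_name)
    for e in employees
    for d in departments
    if e.emp_dept_fk == d
]

# Variable "left_outer_join": left-table instances whose join attribute is
# undefined, padded with a null value for the right-table column.
left_outer_join = [
    (e.employee_id, None) for e in employees if e.emp_dept_fk is None
]

# join->union(left_outer_join)
result = join + left_outer_join
```

The employee without a department appears once in the result, paired with a null department name, as a SQL LEFT OUTER JOIN would return it.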

Group By, Having and Aggregate Functions.

The GROUP BY clause is used to group the result-set of a given SELECT statement. Groups can be filtered by means of the HAVING clause. In Fig. 6.18, a generic mapping of GROUP BY and HAVING clauses is shown.

1. We first translate the SQL FROM, JOIN and WHERE clauses according to the previous mapping patterns. The result is used to initialize the sel variable.

2. The tuples in sel are then processed to create the grouped result set group, which maps the GROUP BY clause. In short, group is created by iterating on the sel tuples and collecting together those tuples with an identical value on the attributes corresponding to the columns in the GROUP BY clause.

SQL:

    SELECT colA, function(colB)
    FROM table
    GROUP BY colA
    HAVING Function(colB) <Boolean expr>

OCL:

    let sel : Sequence(Tuple(colA_name : colA_type, colB_name : colB_type)) =
        classtable_name.allInstances()->collect(iteratortable_name |
            Tuple{colA_name = iteratortable_name.attrcolA_name,
                  colB_name = iteratortable_name.attrcolB_name})->asSequence() in      -- (1)
    let group : Set(Tuple(colA_name : colA_type)) =
        sel->collect(iteratorsel | Tuple{colA_name = iteratorsel.colA_name})->asSet() in  -- (2)
    group->collect(g | Tuple{colA_name = g.colA_name,
        colB_name = sel->select(iteratorsel | iteratorsel.colA_name = g.colA_name)
                       ->collect(iteratorsel | iteratorsel.colB_name).function() })   -- (3)
    ->select(g | g.colB_name <Boolean expr>)                                           -- (4)

Figure 6.18: Mapping of group by and having clauses

3. If an aggregate function is applied on one of the columns in the SELECT clause, an equivalent OCL operation (see [54] for more on aggregate functions in OCL) is added at the end of the GROUP BY clause mapping.

4. Finally, if the HAVING clause has been defined, it is translated into a final select operation, where the body of the select expression is created by mapping the HAVING conditions.

SQL:

    SELECT department_id, SUM(salary)
    FROM Employee e, Department d
    WHERE e.department_id = d.department_id
    GROUP BY e.department_id
    HAVING SUM(salary) > 5000

OCL:

    let sel : Sequence(Tuple(department_id : Integer, salary : Real)) =
        Employee.allInstances()->product(Department.allInstances())
        ->select(it | it.first.emp_dept_fk = it.second)
        ->collect(it | Tuple{department_id = it.second.department_id,
                             salary = it.first.salary})->asSequence() in
    let group : Set(Tuple(department_id : Integer)) =
        sel->collect(it | Tuple{department_id = it.department_id})->asSet() in
    group->collect(g | Tuple{department_id = g.department_id,
                             salary = sel->select(it | it.department_id = g.department_id)
                                         ->collect(it | it.salary)->sum() })
    ->select(g | g.salary > 5000)

Figure 6.19: Example of a group by and having clauses mapping

In Fig. 6.19, the mapping of a query with GROUP BY, HAVING and SUM is shown. To begin with, the projection of the department ids and employee salaries, derived from the Cartesian product of all the Employee and Department instances, is stored in the sequence sel. The set group then holds the distinct department_id values found in sel. For each of these values, the salaries of the tuples in sel with the same department_id are collected, and the OCL operation sum is applied on the result to get the sum of salaries per department. Finally, the HAVING clause is mapped into a select operation that keeps the departments whose sum of salaries is greater than 5000.
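The three stages (sel, group, aggregation followed by the HAVING filter) can be reproduced on a small data set. The Python sketch below is illustrative only: the contents of sel are invented and stand for the result of the join and projection steps, a Python set stands in for asSet(), and sorting the groups is an addition made purely to obtain a deterministic order.

```python
# "sel": (department_id, salary) tuples, assumed to be the output of the
# FROM/JOIN/WHERE mapping (hypothetical values).
sel = [(1, 3000.0), (1, 2500.0), (2, 4000.0), (3, 6000.0)]

# "group": the distinct values of the GROUP BY column (OCL asSet()).
group = {department_id for (department_id, _) in sel}

# For each group, select the matching tuples of sel, collect their salaries
# and apply the aggregate function (OCL sum()).
aggregated = [
    (g, sum(salary for (department_id, salary) in sel if department_id == g))
    for g in sorted(group)  # sorted only to make the order deterministic
]

# HAVING SUM(salary) > 5000: final select on the aggregated tuples.
result = [(g, total) for (g, total) in aggregated if total > 5000]
```

Department 1 passes the filter only because its two salaries are summed before the comparison, which is why the select for HAVING must come after the aggregation step.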

In document Thèse de Doctorat. Valerio COSENTINO. A Model-Based Approach for Extracting Business Rules out of Legacy Information Systems (Page 108-112)