• No results found

Two Phase Ontology Mapping Approach

N/A
N/A
Protected

Academic year: 2020

Share "Two Phase Ontology Mapping Approach"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

1

www.ssijmar.in

Two Phase Ontology Mapping Approach

Rajiv Mishra

1

Rajiv Munjal

2

1,2: CBS Group of Institutions, Jhajjar

SHIV SHAKTI

International Journal in Multidisciplinary and Academic Research (SSIJMAR)

Vol. 2, No. 3, May 2103 (ISSN 2278 – 5973)

ABSTRACT:

(2)

2

www.ssijmar.in

INTRODUCTION:

As the quantity and complication of available information are getting increased day by day,there comes an extreme need for tools and techniques to handle data semantically. As the traditional way of information retrieval is based on keywords and therefore provide limited capabilities to capture the conceptualizations associated with user needs and contents,

ontology-based architecture becomes a great requirements these days.

To deal with this issue, ontologies are proposed for knowledge representation, which are nowadays the backbone of semantic web applications. Both the information extraction and retrieval processes can benefit from such metadata, which gives semantics to plain text.

Ontology-based approaches are characterized by the use of exhaustive conceptualizations in the form of ontologies. They offer formal descriptions of the meanings involved in user needs and contents. Therefore, ontology based approach has better chances to achieve the so-called semantic search model. Ontologies are used in order to specify the knowledge that is exchanged and shared between the different systems, and within the systems by the various components. Ontologies define the formal semantics of the terms used for describing data, and the relations between these terms. Ontologies enable all participants „to speak a common language'.

Having obtained the semantic knowledge and represented them via ontologies, the next step is mapping these ontologies and finally querying the semantic data, also known as semantic search. From the query interface form all the fields are taken and represented through a graph data structure which is utilized to in for mapping purposes. Here we have taken an example of book-domain ontologies.

In the following figures two query interface form are represented and subsequently they are represented through graph data structure.

Author

Title

Price

Subject

ISBN

(3)

3

www.ssijmar.in

Fig.1 Query Interface Form 1

Fig.2 Query Interface Form 2

In a graph representation technique there are two lists: node-list & edge-list. Node-list includes all the nodes (vertices) in the graph whereas in the edge –list each node holds addresses of their adjacent nodes in the graph via a linked structure.

Fig.3: Hierarchical representation of Ontology 1

Fig.4: Hierarchical representation of Ontology2

This work focuses on devising a new and effective approach to map different ontologies belonging to same domain in an efficient manner.

Writer

Name

Cost

Field

ISBN

Price Book

ISBN Subject

Title Author

LastName FirstName

Currency Thing

Cost Field ISBN

(4)

4

www.ssijmar.in

LITERATURE SURVEY

Different approaches to the matching problem have been proposed in the literature, see for example [1][6] and [9] for a taxonomy of the past approaches.

The main task of this work is to develop an efficient mapping technique. In recent years we have seen a range of research work on methods proposing such mappings.

Ying Wang[9] has done a very appreciable work in his paper on ontology matching, describing the concepts of Linguistic-based matching & Structure-based matching.

The effort of Marc Ehrig and Steffen Staab[1](QOM - Quick Ontology Mapping)on mapping is considered very valuable for proposed mapping approach .

Apart from this, the work in [6] (Survey of Ontology Mapping Techniques) were extremely essential for this work.

PROBLEM STATEMENT

Based on the literature survey, existing ontology mapping system has the improper identification of matching concept pair as well as the lack of an effective mapping approach when matching between two ontology. As a result, we propose an effective framework of ontology matching techniques to address the above problems.

PROPOSED WORK:

The proposed work comprises two major phases to map different ontology‟s. In first phase linguistic comparisons are performed and during the second phase structural level things are exploited.

The proposed system takes as input an ontology of query interface page termed as ontology1. A domain specific ontology database referred as ontology 2 is also utilized to make pair of concept (nodes) from both ontology. All nodes of ontology 1 are selected for pairing with only a few concepts from ontology 2 depending on the levels of ontology1. Once a set of pairs are selected than concepts name (label) from ontology1 is compared with the predefined synonyms of corresponding concept in the pair from ontology2. If a match is found than the pair under examination is said to be matched and their sibling & child concepts (nodes) are also paired. If match is not found for the pair than next phase of structural matching is starts. In structural matching if pair is found to be equivalent than again their sibling & child concepts are paired otherwise next pair from the already created set is evaluated for equivalency.

(5)

5

www.ssijmar.in

Fig.5: Architecture of proposed model

This works consists of 4 components n-length concept pairing, synonym matcher, child & sibling pair, structure matcher and two data bases ontology 2, mapping vector and one input ontology 1. These components are described here in detail:

Ontology 1:

It is the ontology corresponding to query interface form whose entries has to filled at the end of the mapping process. Ontology is represented using graph data structure.

Ontology 2:

It is a predefined ontology from the same domain and also referred as ontology database. In addition to the structural things it also store domain, range, cardinality and synonyms for each concept( node) from the ontology. This data base get updated every time a new matching pair is found during the process.

Mapping vector:

A small database only consists of those pair of nodes which are found to be equivalent with a sample value. It comprises three fields: one for concept from ontology1, second for concept from ontology 2 and the last one for a sample data of concept from ontology 2.

n-Length Concept Pairing:

At the very beginning it receives two sets S1 & S2 from ontology 1 & Ontology 2 respectively. From these two sets this component makes a resulting set S which is nothing but the Cartesian product of the set S1 & S2. Thus set S comprises pair of concepts from the both ontologies which are further investigated for equivalency. Once some concept pair is found to be equivalent than their sibling & child concepts are also added in this set for investigation.

Synonym Matcher:

It receives a pair of node from previous module and checks whether there is a corresponding entry for Synonym in the ontology 2 or not. If entry is found in the data base than the pair is said to be

Ontology 1

Ontology 2

n-length Concept

pairing Synonym matcher

Mapping Vector

Child & Sibling pair

(6)

6

www.ssijmar.in

equivalent and added into both databases. Subsequently, their child & sibling nodes are also paired and added to the set S. If entry is not found in the corresponding field than next phase of matching is invoked.

Structure Matcher:

This component is called when first phase of matching is fails. On receiving a pair of nodes this component do a number of things. It checks whether these two nodes are associated with their parent with the same relation or not. After performing this step, the no of relations as well as their types with which these two nodes are connected are analyzed.

Eventually the weight associated with each pair is compared with a threshold value. If this weight is qualified for the threshold than that pair of node is considered as equivalent and this pair is added to the Mapping-Vecor (database), if the weight is not qualified for threshold then this pair is discarded. Once the pair is qualified with the specified threshold than its sibling & child nodes are also added to the set S for their evaluation.

Above mentioned processes are executed for each pair of nodes in the set S. Finally to make entries into the query interface form mapping table is exploited. Value of concept from ontology 2 is filled into the field of corresponding concept from ontology 1. After filling all fields form page is submitted and in return a reponse page is obtained.

Conclusion:

An attempt has been made to provide solutions for mapping between concepts belonging to two ontologies of the same domain. The problem of mapping two ontologies effectively and efficiently arises in many application Scenarios. The mapping between concepts exploits various characteristics of concepts, and if they are properly used they can significantly improves both accuracy & efficiency of mapping. Another potentially valuable information source is the set of properties one class may have, because similar concepts not only shares similar texts but similar properties. How to incorporate these and additional information sources into our mapping process is very essential. In this work additional information sources pertaining to ontology are exploited which makes the mapping process very effective and efficient.

REFERENCES:

[1] Marc Ehrig and Steffen Staab:QOM - Quick Ontology Mapping

[2] Graciela Brusa,Laura,Omar:A Process for Building a Domain Ontology: an Experience in Developing a Government Budgetary Ontology

(7)

7

www.ssijmar.in

[4] A. K. Sharma, Komal Kumar Bhatia: “Automated Discovery of Task Oriented Search Interfaces through Augmented Hypertext Documents” Proc. First International Conference on Web Engineering & Application (ICWA2006).

[5] A. K. Sharma, Komal Kumar Bhatia “A Framework for Domain-Specific Interface Mapper (DSIM)”, International Journal of Computer Science and Network Security, VOL.8 No.12

[6] Savithri Godugula:Survey of Ontology Mapping Techniques

[7] A. K. Sharma, Komal Kumar Bhatia.Merging Query Interfaces In Domain -Specific Hidden Web Databases ,2008.

[8] F.Weisman, N.Roos and P.Vogt:Automatic Ontology Mapping for Agent Communication.

[9] Ying, Liu and Bell:A Concept hierarchy based Ontology Mtaching Approach

[10] P.C.Weinstein and WP Birmingham:Agent Communication with Differentiated Ontologies.

[11] http://www.cs.cmu.edu/~mccallum/bow/rainbow/

[12] Ian Horrocks.: Ontologies and the Semantic Web

[13] http://www.semanticweb.org/

[14] http://www.crisp-dm.org

[15] B. T. Le, R. Dieng-Kuntz, F. Gandon. On Ontology Matching Problems for Building a Corporate Semantic Web in a Multi-communities Organization. ICEIS (4) (2004) 236-243

[16] A. Maedche, S. Staab. Measuring Similarity between Ontologies. EKAW (2002) 251-263

[17] S. Melnik, H. Garcia-Molina, E. Rahm. Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching. In Proc.of ICDE (2002) 117-128

[18] A. G. Miller. Wordnet: A Lexical Database for English. Communications of the ACM 38(11) (1995) 39-41

References

Related documents

author concludes that “the actual waste level of the fi rm does not directly depend upon the size of the fi ne or the probability of discovery of the violation.” That is, increases

Some of the antibiotics are metabolized or excreted by other channels at a rate so rapid that the contribution of the renal clear- ance may be almost negligible (chloramphenicol

In this paper, we mean by a complex topological algebra a complex Hausdor ff topological vector space which is an associative algebra with separately continuous multiplication..

Comparison of the gain normalized leakage current in our InAs e-APDs with other recently reported small area high gain-bandwidth APDs.. Also shown is the

The performance of a novel industrial membrane bioreactor (MBR) comprising denitrification, nitrification and ultrafiltration for the secondary treatment of primary treated

To find a bit of steganographic data in the image file, we select a range block from the range region in the same order as in the hiding process and