• No results found

Module I: Overview of Semantic Technologies and the Semantic Web

N/A
N/A
Protected

Academic year: 2021

Share "Module I: Overview of Semantic Technologies and the Semantic Web"

Copied!
22
0
0

Loading.... (view fulltext now)

Full text

(1)

!"#$%&'()(!"#*+,-.,/0(1(234(56/-76.(

!"#$%&'(#)*+',-.'/0#'1#2)%34'5#67'

!"#$%&'(!)*+,"-"./(!012,2,.3(4(5"063+"#37

Module I:

Overview of Semantic Technologies and the

Semantic Web

© Copyright 2007-2008 TopQuadrant Inc. Slide 2

Module I - Executive Briefing on Semantic Technologies

and the Semantic Web

[Course Day 1]

!

!

!"#$%&'()%"*+"&*,$-."#+)%"*#%*#/.*0%'$1.*

!

!

2/+#*-1*3.4+")(*5.(/"%6%789*2/+#*-1*-#*:%%&*;%$9*

!

!

5/.*<==$.(-+)"7*>+6'.*%?*@A.+"-"7B*

!

!

C.4%*!*

!

!

A+==-"7*#/.*3.4+")(*5.$$+-"*

DDDDD**E'"(/)4.*DDDDD*

!

!

0%4=+$-"7*3.4+")(*F-#/*0%"G.")%"+6*5.(/"%6%7-.1*

!

!

C.4%*!!*

!

!

3#%(H-"7*I%'$*3.4+")(*5%%6J%K*

!

!

L"%F6.&7.*A+"+7.4."#*+"&*#/.*3.4+")(*2.J*

(2)

© Copyright 2007-2008 TopQuadrant Inc. Slide 3

The Semantic Wave is NOT one thing … there are

differing

major streams

within it

!!

The Semantic Web

"

!

Information sharing on a global scale

"

!

Intranets vs. Internet

!!

Semantic Technology

"

!

Enhanced knowledge access and search

"

!

Semantic Interoperability

"

!

Information syndication

"

!

and so forth

Semantic Web: Make web content machine-readable!

The Semantic Web is a vision: the idea of having data on the Web defined

and linked in a way that it can be

used by machines

not just for display

purposes, but for automation, integration and reuse of data across various

applications.

[W3C 2001] ”

The Semantic Web is an extension of the current Web in which information is

given well-defined

meaning

, better enabling computers and people to work in

cooperation.

” [Tim Berners-Lee

et al

2001]

(3)

© Copyright 2007-2008 TopQuadrant Inc. Slide 5

What could the Web do?

Can this sort of

interaction

become part of

the Web itself?

© Copyright 2007-2008 TopQuadrant Inc. Slide 6

How could the Web do it?

Built-in by the Webmaster

(4)

© Copyright 2007-2008 TopQuadrant Inc. Slide 7

What about XML? Doesn’t it support semantics?

!

!

M5AE*7+G.*'1*?%$4+N"7*#+71*

!

!

OAE*7+G.*'1*('1#%4*#+71*

"

!

I%'*7.#*#%*=-(H*8%'$*#+71P+Q$-J'#.1*

"

!

5+71*(+"*/+G.*B4.+"-"7R*1=.(-S(*#%*8%'$*+==6-(+)%"*

!

!

A+"8*&-+6.(#1*/+G.*J6%11%4.&*

!

!

OAE*+"&*OAE*3(/.4+*J.(+4.*2T0*1#+"&+$&1*

!

!

*3#+"&+$&*&-+6.(#1*+$.*J.-"7*&.G.6%=.&*J8*4+"8*

-"&'1#$8*7$%'=1*U*OVWEX%$7Y*;=AEX%$7Y*5+KAEX%$7Y*D**

!

!

ZG.$8*6+$7.*%$7+"-[+)%"*/+1*#/.-$*%F"*OAE*3(/.4+1*

(5)

© Copyright 2007-2008 TopQuadrant Inc. Slide 9

Gartner: All Tied Up with XML: 2001

From 2001 through 2004 enterprises spent $3 billion on modeling activities with no

return on investment from $ 2 billion of it.

© Copyright 2007-2008 TopQuadrant Inc. Slide 10

A new Web of terminology

Use the same technology for mapping web pages

to terminology

to map terminology to one another

(6)

© Copyright 2007-2008 TopQuadrant Inc. Slide 11

AAA Slogan -

Anyone

can say

Anything

about Any topic

Non-unique naming

“Java”

public String getContextPath() { try {

Method getContextPathMethod =

delegate.getClass().getMethod("getContextPath", null); //$NON-NLS-1$

return (String) getContextPathMethod.invoke(delegate, null); } catch (Exception e) { // ignore } return null; }

“Java”?

Programming language?

Hot Beverage?

(7)

© Copyright 2007-2008 TopQuadrant Inc. Slide 13

What is Semantic Technology?

“Semantic technology (software) allows the meaning

of and associations between information to be known

and processed

at execution time

.

For a semantic technology to be truly at work within a

system, there must be a knowledge model of some

part of the world (an active

ontology

) that is used by

one or more applications at execution time.”

(((

--

TopQuadrant

© Copyright 2007-2008 TopQuadrant Inc. Slide 14

Semantic Technology and the

Internet

1995

2000

2005

+ RDF,

OWL

?

+ XML

J2EE, .NET, …

Encoding

Paradigm

Creation

+ RDBMS

JSP, ASP, Java, …

Generated applying

specific templates,

used by people

Killer

Apps

!

Search

!

Content Mgmt

!

Web Application

Servers

!

Portals

!

Process

Integration

!

Web Services

Generated by

applications based on

fixed schemas, used

by applications and

people

!

Advisors

!

Personal Agents

!

IP Apps

!

Cognitive Engines

Generated by

applications based

on models, used by

applications,

devices and people

Static

Dynamic

Transactional

Semantic

HTML

CGI, Perl, ...

Hand crafted by

people for people

!

Browser

(8)

© Copyright 2007-2008 TopQuadrant Inc. Slide 15

Contrasting the Semantic Web and Semantic

Technology Solutions

!

!

5/.8*

-386.

*-"*#/.-$*

"

!

:%+61*U*

9:,0

*#/.8*F+"#*#%*+((%4=6-1/*

"

!

>+6'.*=$%=%1-)%"1*U*

9:;

*J%#/.$*F-#/*#/.4*

"

!

W.+&-".11*?%$*(%44.$(-+6*+==6-(+)%"*U*

9:6/

*#%*7.#*%"*#/.*#$+-"**

!

!

5/.8*/+G.*

<"46("=6.7,#

*-"*#/.-$*'1.*%?*

"

!

5%%61*\.X7XY*,"#%6%78*.&-#%$1]*

"

!

>."&%$1*+"&*^$%&'(#1*

"

!

3%6')%"*+$(/-#.(#'$.1*

!

!

5/.8*

<:,.6(3/(>"44"/

**

"

!

3.4+")(*6+"7'+7.1*+"&*1#+"&+$&1**

*\&.G.6%=.&*J8*#/.*2T0__2%$6&*2-&.*2.J*0%"1%$)'4]*

"

!

;%$*A%$.*!"?%$4+)%"*1..`*

!

+88#9::;;;<;=<"0.:(7

!

+88#9::;;;<;=<"0.:>??@:3;7

W3C standards for semantic models

!

!

2T0*3.4+")(*1#+(H*-1*J'-6#*%"*OAE*

!

!

OAE_J+1.&*,"#%6%78*6+"7'+7.1*+$.*J.-"7*&.G.6%=.&*#%*

1'==%$#*1.4+")(*-"#.$%=.$+J-6-#8X*

“Semantic Web is stimulating a

whole new class of applications

at individual, enterprise and web

scales”

– Eric Miller, W3C, Semantic

Technologies for eGOV’2003

www.w3.org/

www.w3.org/2001/sw

(9)

© Copyright 2007-2008 TopQuadrant Inc. Slide 17

Application integration today

I+/%%a*;-"+"(.*

MySimon

© Copyright 2007-2008 TopQuadrant Inc. Slide 18

How do they do it?

!

!

I+/%%a*+"&*A83-4%"*+$.*(%66.()"7*1-4=6.Y*F.66_

'"&.$1#%%&*&+#+*\=.$1%"+6*S"+"(-+6*$.(%$&1Y*$.#+-6*

=$-(.1]X**

!

!

^$%7$+44.$1*J'-6#*+*1-"76.*=$%7$+4*?%$*+66*&+#+*1%'$(.1*

!

!

C._?+(#%*1#+"&+$&1*\6-H.*b'-(H."]*+$.*+6$.+&8*-"*=6+(.*

Simplest kind of Application Integration – everyone

agrees on a simple representation. Just use it!

(10)

© Copyright 2007-2008 TopQuadrant Inc. Slide 19

It’s supposed to be a web, after all!

Mash-ups are not the

responsibility of some

service on the web …

They are the

responsibility of the

web infrastructure!!

How does it work?

RDF

RDF – the Ultimate Mash-up

Language !!

(11)

© Copyright 2007-2008 TopQuadrant Inc. Slide 21

Capability Case: Model-enabled Application Integrator

Solution Story: Geospatial Mashup in TopBraid Composer

A view of TopBraid Composer being used to connect a real estate

ontology and other RDF resources with geospatial ontologies.

© Copyright 2007-2008 TopQuadrant Inc. Slide 22

CapabilityCase: Semantic Multi-Faceted Search

(12)

© Copyright 2007-2008 TopQuadrant Inc. Slide 23

CapabilityCase: Semantic Multi-Faceted Search

SolutionStory: Executive Search Company

Semantic Model-driven Applications

3.4+")(*5.(/"%6%78*1'==%$#1*".F*#8=.1*%?*

';/,43>(

?+<3/6<<(@##73>,A"/<(

(13)

© Copyright 2007-2008 TopQuadrant Inc. Slide 25

Capability Case: Semantic Portal

CapabilityCase:

Semantic Portal

http://del.icio.us/CapabilityCases/SemanticPortal

Intellidimension’s Semantic

Portal

Nokia’s Mobile Phones

Forum

Ontference

© Copyright 2007-2008 TopQuadrant Inc. Slide 26

CapabilityCase: Semantic Portal

SolutionStory: Ontference

Integrating multiple sources of information – talk submissions,

attendee registrations, user profiles:

(14)

© Copyright 2007-2008 TopQuadrant Inc. Slide 27

CapabilityCase: Semantic Data Integrator

SolutionStory: FAA Passenger Threat Analysis

Data for passenger threat analysis comes from

a wide range of heterogeneous, structured and

unstructured sources, including the FBI most

wanted list, flight details, news, public records,

and biometrics.

A solution built using Semagix Freedom allows

security personnel to assess passenger threats

while maintaining a high rate of passenger flow.

Semagix Freedom interfaces with diverse

information sources, extracts relevant

information in near real-time, and then organizes

and normalizes them based upon the ontology. It

co-relates the information from different sources

to determine possible threats. by discovering

hidden relationships between seemingly

unrelated pieces

of information.

Systems developed in different work practice settings have different

semantic structures for their data. Time-critical access to data is made

difficult by these different semantics. Semantic Data Integration allows

data to be shared and understood across these settings.

Aviation Security – Passenger Threat Analysis

Passenger Threat Analysis Console (Ontology-based Analysis)

Customer Story: Major retailer deploys customer site in

12 weeks

A web portal for consumers to maintain information about

their homes and belongings. Many different product – all

have different types of information.

JENA SDB

RDF Store

(User Data)

MySQL

JENA in-memory RDF Store

(domain model)

JENA in-memory RDF Store

(form models)

JENA in-memory RDF Store

(USER 1 data)

.

.

Generated Web UI

(15)

© Copyright 2007-2008 TopQuadrant Inc. Slide 29

TopBraid Live – has an open Architecture

© Copyright 2007-2008 TopQuadrant Inc. Slide 30

Graph stores

!

!

ZK+4=6.*=$%&'(#1`*3."#."(.1Y*3.4+7-K*;$..&%4cY*

d<W:Y*0%7-#%*

!

!

W%6.*-"*3.4+")(*2.J`**

"

!

^$._&+#.*3.4+")(*2.J*3#+"&+$&1*

"

!

3#$-(#68*1=.+H-"7Y*"%#*1.4+")(*F.Je**

"

!

#8=-(+668*"%#*2T0*(%"?%$4+"#*

!

!

M%F*#%*.G+6'+#.*%".`*

"

!

3(+6+J-6-#8*\"'4J.$*%?*"%&.1Y*#/$%'7/='#]*

"

!

b'.$8*6+"7'+7.*

(16)

© Copyright 2007-2008 TopQuadrant Inc. Slide 31

Comparing Semantic and Relational

RDB

Semantic Model (Ontology)

!"#$#%&'%('

!)*+,-'

./,*0()*'

#

!

Database must be designed

to answer the questions

#

!

Specific, typically complex,

queries must be developed

#

!

Ontology must be designed to

answer the questions

#

!

Queries can be generic and

very simple

!"#$#%&'%('

!11(22(34%,'

564)7,'

#

!

Inflexible:

"

!

Database structure must be

modified so it can continue

to answer the questions

"

!

Queries must be re-written

"

!

Data must be ported

#

!

Flexible:

"

!

Ontology can be easily extended

so it can continue to answer new

questions

"

!

No data porting required

8-(1,**#)7'

9:,,3'

#

!

Can be very fast with proper

tuning – mature technology:

"

!

Known optimization

approaches

#

!

Certain queries, such as multi

table joins and self joins are

known to cause problems

#

!

Not as fast, but improving,

tuning does not affect

flexibility:

"

!

Adding more processing

power and distributed

computation helps

#

!

Performs better than RDBMS

for certain query types

Key differences in the representation of relationships

RDB

Semantic Model (Ontology)

8).*9%):9/+'-,'

.#:)3-%;09<;'

#

!

Relationships are either 1:1,

many:1 or many:many

#

!

Many:many relationships must be

broken into many:1 relationships

by creating join tables

#

!

By default all relationships are

many:many

#

!

Functional properties and

cardinality restrictions are used to

specify 1:1, many:1 as well as other

cardinalities

#

!

It is possible to specify, for example, 1:4

or min 2, etc.

=%,-.2)3-%'

6#).9%&'

.#:)3-%;09<;'

#

!

Additional information about the

relationship is represented by the

extra columns in the join table

#

!

Relationship is reified (made into a

class)

#

!

Additional information is

represented as properties of the

class

>0#'%)/?.#'-,'

#

!

Implicit

#

!

Embedded in the name of the

join table or in the name of the

column

#

!

Explicit

#

!

Care is taken to name a relationship

in a way that its nature and

intentions are well understood

(17)

© Copyright 2007-2008 TopQuadrant Inc. Slide 33

Properties (ST) vs. Attributes and Relations (OO)

!

!

,2E*^$%=.$).1*$.=$.1."#*$.6+)%"1*J.#F.."*#F%*

-"&-G-&'+61*

G&"0(>7,<<6<H(

!

!

,2E*^$%=.$#8*#8=.1`*

"

!

IJK6>0(L."#6.A6<(

6-"H*+"*-"&-G-&'+6*#%*+"*-"&-G-&'+6*

"

!

',0,0;#6(L."#6.A6<

*6-"H*+"*-"&-G-&'+6*#%*1-4=6.*G+6'.1*

!

2,8).)03A(B-"183A(3802,.3A((C""-)1,3A(1,D(3"(E"08+7

!

1,(F$G(H*+)I1('1818/#)(#02I282J)(J1-K)("0(1,(L'M(-28)01-7

!

!

5/%1.*F-#/*,,*.K=.$-."(.P.K=.$)1.*4'1#*%G.$(%4.*

#/.*#8=-(+6*=$._(%"(.=)%"*#/+#*=$%=.$).1*J.6%"7*#%*#/.*

(6+11a*

© Copyright 2007-2008 TopQuadrant Inc. Slide 34

Properties are first-class constructs

5/-1*+66%F1*$.6+)%"1/-=1*J.#F.."*^$%=.$).1*

/+1^+$."#*

/+1A%#/.$

*

/+1;+#/.$

*

This is not a class diagram!

!!

In contrast to most OO paradigms, where properties are

“owned” or “contained in” Classes

… and for other modelers to

reuse properties

&(`($.+#%$

*

48`+'#/%$

*

BF/.$.G.$*!*'1.*#/.*=$%=.$#8*

f+'#/%$gY*#.66*#/.*F%$6&*#/+#*#/.8*

(+"*$.+&*f&(`($.+#%$gR*

5/-1*+66%F1*$.6+)%"1/-=1*

J.#F.."*^$%=.$).1*

(18)

© Copyright 2007-2008 TopQuadrant Inc. Slide 35

In OWL, Properties may have Sub Properties

!

!

!#*-1*=%11-J6.*#%*?%$4*/-.$+$(/-.1*%?*=$%=.$).1*\#/.1.*+$.*

"%#*06+11*/-.$+$(/-.1]*

!

!

5/.*

.-D<M<+JL."#6.0;ID

*(%"1#$'(#*+66%F1*$.6+)%"1/-=1*

#%*J.*+J1#$+(#.&*'=*#/.*1'J_=$%=.$#8*#$..X*

(

S

T) In OWL, Classes are

inferred

or

computed

!

!

,2E*(6+11.1*+$.*-"#.$=$.#.&*+1*

<60<

*#/+#*(%"#+-"*

-"&-G-&'+61*

"

!

<*(6+11*-1*

/"0

*+*H-"&*%?*#.4=6+#.*+1*-"*,,*#.(/"%6%78*

"

!

!"*,2EY*(6+11.1*+$.*J'-6#*'=*%?*&.1($-=)%"1*#/+#*1=.(-?8*#/.*(%"&-)%"1*#/+#*4'1#*J.*

1+)1S.&*J8*+"*-"&-G-&'+6*#%*J.*+*4.4J.$*%?*#/.*(6+11**

!

!

3'J(6+11.1*+$.*1'J1.#1*%?*#/.-$*=+$."#*(6+11.1X***

!

!

3'=.$(6+11_1'J(6+11*$.6+)%"1/-=1*(+"*J.*(%4='#.&*

+'#%4+)(+668*J8*+*

.6,<"/6.(

(19)

© Copyright 2007-2008 TopQuadrant Inc. Slide 37

Properties (ST) vs. Attributes and Relations (OO)

!

!

,2E*^$%=.$).1*$.=$.1."#*$.6+)%"1*J.#F.."*#F%*

-"&-G-&'+61*

G&"0(>7,<<6<H(

!

!

,2E*^$%=.$#8*#8=.1`*

"

!

IJK6>0(L."#6.A6<(

6-"H*+"*-"&-G-&'+6*#%*+"*-"&-G-&'+6*

"

!

',0,0;#6(L."#6.A6<

*6-"H*+"*-"&-G-&'+6*#%*1-4=6.*G+6'.1*

!

2,8).)03A(B-"183A(3802,.3A((C""-)1,3A(1,D(3"(E"08+7

!

1,(F$G(H*+)I1('1818/#)(#02I282J)(J1-K)("0(1,(L'M(-28)01-7

!

!

5/%1.*F-#/*,,*.K=.$-."(.P.K=.$)1.*4'1#*%G.$(%4.*

#/.*#8=-(+6*=$._(%"(.=)%"*#/+#*=$%=.$).1*J.6%"7*#%*#/.*

(6+11a*

© Copyright 2007-2008 TopQuadrant Inc. Slide 38

Semantic Web – OO Gotchas!

!"*#/.*3.4+")(*2.JY*8%'*-"?.$*#/.*(6+11*%?*+"*%Jh.(#X*

5/.*(6+11*%?*+"*%Jh.(#*(+"*(/+"7.`*

*%G.$*)4.*

*F-#/*F/+#*8%'*H"%FPJ.6-.G.*

*F-#/*F/%4*8%'*#$'1#*

^$%=.$).1*+$.*S$1#_(6+11*%Jh.(#1*\-"&.=."&."#*%?*(6+11.1a]*

*^$%=.$).1*?%$4*/-.$+$(/-.1*+1*F.66*+1*(6+11.1*

i%*J./+G-%$*-1*&.1($-J.&*+"8F/.$.*U*%"68*

3/D6.6/>3/C(

A'6)=6.*1.#*4.4J.$1/-=*-1*(%44%"=6+(.*

*i%*,,*-"/.$-#+"(.*

(20)

© Copyright 2007-2008 TopQuadrant Inc. Slide 39

How Semantic Languages Work

V$-"7*-"?%$4+)%"*#%7.#/.$*

C$+F*

3/D6.6/>6<

*?%$*?'$#/.$*=$%(.11-"7*

RDF

RDFS

OWL

What is RDF?

!

!

WC;*\

(

.1%'$(.*

@

.1($-=)%"*

A

$+4.F%$H]*-1*+"*

-"?$+1#$'(#'$.*?%$`*

"

!

Z"(%&-"7Y*

"

!

ZK(/+"7.*+"&**

"

!

C-1#$-J')"7*4.#+&+#+*

RDF Triple:

Subject

Object

Massage

Predicate

offers

Safety Harbor

(21)

© Copyright 2007-2008 TopQuadrant Inc. Slide 41

RDF: A distributed network of data!

offeredBy

Safety Harbor

offers

Massage

Safety Harbor offers Massage Facial offeredBy Safety Harbor

RDF Files: “bags of

triples”

Facial

© Copyright 2007-2008 TopQuadrant Inc. Slide 42

RDFS is a schema language for RDF

WC;3*+66%F1*'1*#%*($.+#.*G%(+J'6+$-.1*

offers

Spa

rdf:type

Resort

rdfs:subClassOf

Safety Harbor

Massage

Treatment

Activity

rdfs:subClassOf

rdf:type

offers

rdfs:domain

rdfs:range

(22)

© Copyright 2007-2008 TopQuadrant Inc. Slide 43 SafetyHarbor offers Massage offers rdfs:domain Safety Harbor rdfs:subClassOf Resort Spa Spa rdf:type

If the bags contain RDFS

key symbols, then RDFS

can infer certain

conclusions

RDFS is RDF, too!

offers

Spa

rdf:type

Resort

rdfs:subClassOf

Safety Harbor

Massage

Treatment

Activity

rdfs:subClassOf

rdf:type

offers

rdfs:domai

n

rdfs:range

References

Related documents

I used an optimization process to attain the total population weighted distance for the actual Kentucky Senate district plan that would be comparable to the measures from the

Figures 6 and 7 (see Appen- dix) show the secure communication time span dura- tions when the experimental client program runs on a Nokia 6600 mobile phone, the server program runs

Data related to corporate governance attributes (board size, board independence, board diversity and board meetings) and control variables (firm performance, firm

This forum is dedicated to the discussion on the social values of Second Language Acquisition (SLA) research and is motivated by talks organized in the Fall of 2018 by Columbia

(ii) A second set of variables is made up of indicators describing macroeconomic situation, namely, inflation and production growth rate observed over the previous

Hence, according to this discussion and having that the outcome and substitute models have opposite predictions about the relation between governance and dividend

COPD and other conditions requiring controlled or low dose oxygen therapy Administer the initial dose until a reliable SpO 2 measurement is available then adjust oxygen flow to