• No results found

Semantic Web Tool Landscape

N/A
N/A
Protected

Academic year: 2021

Share "Semantic Web Tool Landscape"

Copied!
29
0
0

Loading.... (view fulltext now)

Full text

(1)

UNCLASSIFIED

Copyright © 2009 The MITRE Corporation 1

Dr. Leo Obrst MITRE

Information Semantics Group

Information Discovery & Understanding Command and Control Center

Semantic Web

Tool Landscape

CENDI-NFAIS-FLICC Conference

National Archives Building

(2)

2

Overview

• Semantic Web

• Prefatory Comments and Caveats • Here is What We Did 5 Years Ago • Semantic Web Context

• Recent Semantic Web (and Linked Data) Tool Surveys • Semantic Web Tool Landscape

• A Vertical View • Some Tools

• Rest for Another Day • Some Examples

• Trends and Indications • Linked Data

(3)

3

Machines partially understand what humans mean

The Semantic Web

• Current Web

is a

collection of links and

resources

– Is syntactic & structural only – Excludes semantic

interoperability at high levels. – Google of today is string

based (keyword) & has no notion of the semantics (meaning) of your query

• Semantic Web

extends

the Current Web so

information is given

well-defined meaning

– Enables semantic

interoperability at high levels – Google of tomorrow will be

concept based

– Able to evaluate knowledge in context

Humans have to do the understanding Web today Semantic Web tomorrow ? ? ? ? ? ? ? ? ? Force Structure As Is Deployed Force Home base In Transit Capabilitiies Locations Logistics Units Theater Terrain Marsh

(4)

Prefatory Comments & Caveats

• Some tools span multiple functions

– Example: some ontology development environments assist with full ontology lifecycle management

– Example: some data integration, search tools provide ontology development environments

• Some tools blur the distinctions between vocabularies and ontologies (e.g., SKOS vs. OWL)

– Vocabularies: focus on terms (natural language words & phrases, their lexical semantic relations)

– Ontologies: focus on concepts (real world referents and categories of those), represent what the terms mean

• Many commercial tools are based on semantic technologies, may import/export SW content: SmartLogic, OntologyWorks, etc.

• Some tools are based on more expressive standards, such as ISO Common Logic, Horn Logic (logic programming), etc.

• For every tool I mention, there are 100 more that go unmentioned!

(5)

5

State of

Research

Practice

A Maturity Continuum

Draft Standards Standards Interest Groups & Consortia Professional Services & Consultants Early

Adoption ApplicationsFielded MainstreamUsage

Proven Commercialization Proven Commercial Tools Applied

Research CommercializationEarly

Abstract Research State of Practice State of Research

Here is what

we did 5 years

ago at MITRE!

We gauged the

state of the

Semantic Web

(6)

6

State of

Research

Practice

Languages, Ontologies, Methodologies

Draft

Standards Standards Early

Adoption ApplicationsFielded MainstreamUsage

Proven Commercialization Proven Commercial Tools Applied

Research CommercializationEarly

Abstract Research State of Practice State of Research RDF as triple, i.e., predicate, relation RDF as graph, i.e., connected predicates RDF as inference closure over related predicates

OWL-S (Semantic Web Services), SWSL RDF/S OWL OWL-Lite OWL-DL OWL-FULL Upper Ontologies:

SUMO, Upper Cyc, DOLCE, IFF Description Logic Programming MethOntology+OntoClean OWL-FOL

Here is what

we did 5 years

ago at MITRE!

(7)

7

State of

Research

Practice

Rules / Query Languages

Draft Standards Standards Proven Commercialization Proven Commercial Tools Applied

Research CommercializationEarly

Abstract Research State of Practice State of Research RuleML Semantic Web Rule Language (SWRL) RDQL RQL DAML Query Language (DQL) RDFQL SeRQL SPARQL Triple

Here is what

we did 5 years

ago at MITRE!

(8)

8

Semantic Web Context

Anyone, anywhere can add to an evolving, decentralized “global

database”

Explicit semantics enable looser coupling, flexible composition of

services and data

S

e

ma

nt

ic

W

e

b

“Digital Dial Tone”, Global Addressing HTTP, Unicode, URIs

Syntax, Transmission XML

Structure XML Schema

Expose Data & Service Semantics RDF/RDF Schema

Enable Reasoning: Proof, Logic SWRL, RIF, FOL, Inference

Curre

nt

W

e

b

S ec u rit y , T ru st OWL

Add Full Ontology Language so Machines can Interpret the Semantics

(9)

Recent Semantic Web (and

Linked Data) Tool Surveys 1

1. Semantic Web Development Tools: Introduction

2. General Development Environments, Editors, Content Management Systems, ...

3. RDF Triple Store Systems

4. Programming Environments

5. Reasoners (OWL or rule based) and related tools

6. Modularization and diffing tools

7. RDF Generators

8. On-line Validators

9. SPARQL "Endpoints"

10. Special Browsers

1. RDF and/or OWL Browsers

2. Semantic Web Browsers

11. Search Engines

12. Tools for Converting RDF Data between different Serializations

13. Tools for Converting Classifications and Other KOS into RDF-S or OWL Ontologies

14. Tagging and Semantic Web Vocabularies

9

(10)

Recent Semantic Web (and

Linked Data) Tool Surveys 2

• Lee Feigenbaum’s 2009 Semantic Web Landscape: Technologies Tools, and Projects:

http://www.slideshare.net/LeeFeigenbaum/semantic-web-landscape-2009

• Semantic Web Tools: http://semanticweb.org/wiki/Tools. About 120 tools.

• W3C Semantic Web Frequently Asked Questions: http://www.w3.org/RDF/FAQ

• These provide extensive references!

(11)
(12)

Semantic Web Tool Landscape

(13)

13

A Vertical View

Ontologies RDF Triple Stores Access, Storage, Persistence Services Services S e rv ic e s

Knowledge Consumer Applications

Knowledge Producer Applications

Au to ma ted R easo n er s O n to lo g y D ev E n vs Social Networking Instances, Facts Classes, Relations,

Properties, Axioms, Rules RDF, OWL, SWRL. RIF, LP Relational Databases Files XML, HTML Docs

Information Extraction Data Mining

Information Extraction Taggers, Annotators

Decision Support Electronic Commerce Converters Validators Generators Query Engines Semantic Search Taggers, Annotators

(14)

Tool Categories

• Ontology Development Environments • Ontology Reasoners

– Description Logic Reasoners – Rule Reasoners

– Probabilistic Reasoners – Hypothetical Reasoners

• Triple Stores (generally RDF instances in a graph), Ontology Repositories • Query Languages, Engines, Semantic Search Tools

• Semantic Web Programmatic Interfaces • Ontology Modularity Tools

• Ontology Versioning Tools • Ontology Mapping Tools

• Controlled Vocabularies to Ontologies Linking Tools • Ontology/Semantics/Metadata Tagging Tools

• RDF Generation Tools

• Ontologies to Relational Databases Linking Tools • Ontology Lifecycle Management Tools

• Validators • Converters

• Indexing, Taggers, Annotators

(15)

Ontology Development

Environments

• Protégé 4.0, Collaborative Protégé • TopQuadrant’s TopBraid Suite

• NeOn: http://www.neon-toolkit.org/

– Reference implementation of the NeOn architecture – Supports ontology engineering and management – Supports complete ontology lifecycle

– Supports different ontology languages (OWL, F-Logic) – Supports networked ontologies (modules, mappings) – Free academic; commercial; eval license

• Altova’s SemanticWorks

• KAON2 (KArlsruhe ONtology)

(16)

Ontology Reasoners

• Description Logic Reasoners

– FaCT – FaCT++ – Racer – RacerPro

– Description Logic Programming (DLP) – Pellet

• Rule-based, Horn Logic, and First-Order Logic Reasoners

– Jena

– JESS (Java Expert System Shell)

– SILK (Semantic Inferencing on Large Knowledge): http://silk.semwebcentral.org/

(17)

Storage: Triple Stores, Ontology

Repositories

• Triple Stores – AllegroGraph – Oracle Spatial 11g – OWLIM – SESAME – Mulgara – Drupal – Garlic 4Store • Ontology Repositories – Swoogle – BioPortal – XMDR

– Open Ontology Repository (OOR)

• Federation of registries/repositories: BioPortal, XMDR, Semantic MediaWiki, etc.

• Other (vocabulary, etc.)

(18)

Query Tools, Semantic Search

• SPARQL

• Twinkle

• TopBraid’s SPIN (SPARQL Inference Notation): constraint checking, user-defined SPARQL functions, rules

– SPARQLMotion: a scripting language for importing, exporting, processing data in RDF

• Sindice

• Many tools listed under “Trends and Indications”

(19)

Semantic Web Programmatic

Interfaces

• See Semantic Web Tools:

http://esw.w3.org/topic/SemanticWebTools

1. Multi-language environments 2. Java Developers 3. Python Developers 4. C Developers 5. C++ Developers

6. C# and .Net Developers

7. Flex/ActionScript Developers 8. JavaScript Developers 9. Tcl/Tk Developers 10.PHP Developers 11.Lisp Developers 12.Obj-C Developers 13.Prolog Developers 14.Perl Developers 15.Ruby Developers 16.Haskell Developers 19

(20)

The Rest: For Another Day

• Ontology Modularity Tools

• Ontology Versioning Tools • Ontology Mapping Tools

• Vocabularies to Ontologies Linking Tools • RDF Generation Tools

• Ontologies to Relational Database Linking Tools • Validators

• Converters

– UML-OWL Generator

• Semantic Wikis, Social Networking Tools

– Revelytix’s Knoodl – Semantic MediaWiki – OntoWiki

• Other

– Large Knowledge Collider (LarKc) – Good Relations Annotator

(21)

Example: TopQuadrant’s

TopBraid Suite

(22)

Example: Open Calais Architecture

(23)

Example: OpenLink’s Virtuoso

(24)

Trends and Indications

• Linked Data: strong movement to expose structured and

unstructured data to Web using RDF and URIs

• RDF Generators

• Semantic Web Scripting & Mash-up Languages (or

scripting languages with Semantic Web APIs, Libraries)

• Wider Semantic Technology applications

– E.g., Infrastructure, Semantic Search, Computed Answers, etc. – UIMA, Wolfram Alpha, Google Fusion Tables, Open Calais, – OpenLink’s Virtuoso, Zepheira, Freebase, Talis Group

– Radar Network’s Twine, Siri,

– Metatomix, Hakia, True Knowledge, SmartLogic’s Semaphore, – ThoughtWeb, Cambridge Semantics, Collibra, Inbenta

– Semantic Discovery Systems, Structured Dynamics

(25)

Linked Data

25

In May, 2009, 4.7 billion RDF triples, interlinked by around 142 million RDF links, reported by W3C’s Linking Open Data Project

(26)

Linked Data

• Exposing data structurally using dereferenceable

URIs

• One might call the core of “Linked Data” the

Structural Web, being constituted by RDF graphs

• Semantic Web ontologies describe those RDF

graphs

• Need to expose the data + its semantics!

• Large organizations exposing data: NY Times,

BBC, etc.

– NY TIMES: http://open.blogs.nytimes.com/2009/10/29/first-5000-tags-released-to-the-linked-data-cloud/

(27)

27

Are We There Yet?

• RDF/s & OWL became W3C Recommendations, Feb.

2004

• Commercial vendors investing heavily

• Huge growth in number of conferences: ISWCs,

SemTechs, etc.

• More ontologies being posted

• Research trend: SOA, Web services and Semantic Web

convergence

• Tools & methodologies are maturing

– Proof/trust/pragmatic web: still early

• Are we there yet?

• According to Tim Berners-Lee, not yet, but getting there!

(28)

The Real Semantic Web Landscape

28 Copyright: http://crafttutorials.files. wordpress.com/2008/04/ earth.jpg

(29)

Thanks!

• Questions?

[email protected]

References

Related documents

This Network is led by the Institut National Polytechnique de Lorraine (INPL) and reports to the Interuniversity Consortium for Agricultural and Related Sciences in Europe and

Figure 25—United States Alaska yellow-cedar lumber exports to East Asia (Japan Customs Bureau 2009, Warren 2008); e = estimate.. Figure 27—Japanese Alaska yellow-cedar and other

Fixed Spot Thermometers MaIN PRODUCTs MWIR-B-640 Process Imaging Process Imaging additioNAL PRODUCT NIR-B NIR-B-2K Combustion & Emission Monitoring Combustion &

methodology was developed for creating MultiSpeak equivalents of IEC 61968-9 profiles 3) a proof of concept mapping to satisfy the On Demand Meter Read was accomplished

Overall, considering all types of interventions in response to the financial crisis, and both monetary policy and recovery measures for the banking industry, we can observe a

Validation of the estimated global horizontal and direct normal irradiance values has been performed by comparison with high-quality ground station measurements.. Due to the low

But with such an unrestricted form for the time variation, if the system of equations is large or the lag length is long, the number of free parameters in the model

The Lead Partner – Municipality of Vigo, (Spain) with the contribution of all the Partners of SUM Project: AREAL - Regional Agency for Energy and Environment in the