• No results found

Creating the Thomson Reuters knowledge graph and open permID - ODI Summit 2015

N/A
N/A
Protected

Academic year: 2021

Share "Creating the Thomson Reuters knowledge graph and open permID - ODI Summit 2015"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be

Thomson Reuters, The Answer Company

ODI SUMMIT

3 NOVEMBER 2015

@DrDanielASmith

DR. DANIEL A. SMITH, SENIOR DEVELOPER, CORPORATE TECHNOLOGY

CREATING THE THOMSON REUTERS

(2)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

ABOUT THOMSON REUTERS

2

FINANCIAL & RISK

INTELLECTUAL PROPERTY & SCIENCE

LEGAL

Comprehensive IP & scientific information,

decision support tools & services to enable

governments, academia, publishers,

corporations & law firms.

Critical information, decision support tools,

software & services to legal, investigation,

business and government professionals.

Critical news, information & analytics,

enables transactions, and connects

trading, investing, financial and corporate

professionals.

TAX & ACCOUNTING

Integrated tax compliance and accounting

information, software & services for

professionals in accounting firms,

corporations, law firms and government.

REUTERS NEWS

Powered by more than 2,800 journalists reporting in 20

languages from bureaux around the world, Reuters is

the world’s largest international news organisation.

(3)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

3

(4)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

4

(5)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

5

ABOUT THOMSON REUTERS

Due to growth by acqusition, we

are working with siloed data

Segregation of content by

(6)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

6

ABOUT THOMSON REUTERS

Benefits: Designed, content

controlled, edited and published

by each business

(7)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

7

(8)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

8

(9)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

9

(10)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

10

KNOWLEDGE GRAPH

Company

name

Thomson Reuters

TRI.TO

primaryQuote

Quote

RIC

1977-12-28

incorporated

http://tr.com

website

TRI

ticker

TOR

exchange

(11)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

11

KNOWLEDGE GRAPH

Thomson Reuters

name

granted

name

Eikon

US20140173400A1

application

2013-10-23

filed

2014-06-19

published

uses

Patent

makes

Product

Company

(12)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

12

KNOWLEDGE GRAPH

Thomson Reuters

name

granted

name

Eikon

US20140173400A1

application

2013-10-23

filed

2014-06-19

published

uses

Patent

makes

Product

Company

Company

name

Thomson Reuters

TRI.TO

primaryQuote

Quote

RIC

1977-12-28

incorporated

http://tr.com

website

TRI

ticker

TOR

exchange

(13)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

13

(14)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

IDENTITY CHALLENGES

Entities can have multiple identifiers

e.g. Organisations have IDs all areas:

Finance and Risk

Tax and Accounting

Legal

IP and Science

News

(15)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

ORGANISATION IDENTIFIERS IN FINANCE AND RISK

MXID

NDGSymbol

DBSTicker

SDCCusip

SDCID

SEDARIssuer

EdCoID

VEFirmID

VentureEconomicsID

TMTCompanyID

CIK

DisclosureID

EedbID

GemAlphaNumericID

15

RegistrationNumber

DunsNumber

SinotrustNumber

DatastreamFiId

RegulatoryId

Cusip6

TaxId

RcpId

EfxId

EjvExchangeCode

Lei

DataStreamId

AllCode

InvestextId

(16)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

CONTENT MARKETPLACE

Thomson Reuters addressed the problem by introducing an information

governance framework, which:

-

Created authorities to maintain common reference points of identity for major objects

(organisations, people, instruments, quotes etc...)

-

Used PermIDs as a common method for identity

Our customers have the same problems, as does the open data community

-

Introducing rules is not practical. Instead, offer a compelling solution that each group or

organisation can make use of independently: Open PermID

-

Grow a network effect – an ecosystem – around this de-facto standard

(17)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

PERMID AS A USEFUL COMMON REFERENCE POINT

Specifically maintained for identity reference & not as a side-effect

-

Use / context independent – focus is on getting community support & network mass

-

Unambiguous, consistent interface, doesn’t need interpretation

-

Well-described & maintained relative to the real world

-

Stable meaning, persistent, temporal

-

Coverage & granularity reflect community needs

-

Dependable support over time

Everyone knows that everyone else can freely access and use it

-

Open licensed

-

Known quantity to plan against

-

Creates a network effect

(18)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

TECHNOLOGY STACK

Content Marketplace

Data Item Registry (i.e., ISO/IEC 11179)

XML

Knowledge Graph

Semantic Web

RDF, OWL, SPARQL, SPIN, Jena, Sesame

Big Data

Apache Big Data Ecosystem - Hadoop, Spark, Kafka, Oozie, Cassandra, Elastic Search

(19)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

BUILDING THE KNOWLEDGE GRAPH

The Content Marketplace work gave us the linkage through PermIDs

Semantic Web and Big Data technologies give a strong starting point to building a

knowledge graph

Take those technologies and scale them to:

Query or manipulate at scale

Provide lots of data and lots of perspectives on data

(20)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

BUILDING THE KNOWLEDGE GRAPH

Build a minimal set of tools to put and get data into the graph

Determine the minimum viable set of data to bootstrap the graph

Retain federation of data internally

Data authorities keep editorial and publishing control as before

If we can prove out a knowledge graph of federated data internally, we can use the

same approach to link to customers data and open data

(21)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

THOMSON REUTERS KNOWLEDGE GRAPH STATS

Knowledge Graph: 2.35B triples

-

Metadata, Organisations, People: 2.27B triples

-

Inferred Data, generated with SPIN rules [reverse predicates etc.]: 78.3M triples

Compared to other large open data sets:

-

Wikidata: 367M triples

-

DBPedia: 474M triples

-

Freebase: 2B triples

-

UniProt: 17B triples

(22)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

KNOWLEDGE GRAPHS PROVIDE

MANY VIEWS TO ANSWER MANY QUESTIONS

Gives us the ability to provide many lenses over the graph

Query for absolute facts

Patents issued by a company, litigation history, market capitalisation history

Also make inferred and abstract connections

Sort by litigation history within an industry sector weighted by market capitalisation

Combine absolute facts with inferred/abstract connections

(23)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

KNOWLEDGE GRAPHS PROVIDE

MANY VIEWS TO ANSWER MANY QUESTIONS

Iterate and build layers of queries of increasing sophisticated/complexity to infer

new facts

Handle relative truth of facts and data - according to their source

Ability to utilise the facts relevant to your product or question

Adding additional perspectives as relevant

(24)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

KNOWLEDGE GRAPH USE CASE

(25)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

KNOWLEDGE GRAPH USE CASE

(26)

A closer look: Alternate logo lock-ups

19

Primary logo lock-up

Alternate vertical logo lock-up (limited use)

For aesthetic reasons, the read-through is modified to be Thomson Reuters, The Answer Company

KNOWLEDGE GRAPH USE CASE

(27)

27

[email protected]

@DrDanielASmith

http://tr.com/site/data-identifiers

References

Related documents

The study showed that administration of 16 ml/kg Bwt of raw camel milk twice daily caused insignificant decrease in the levels of plasma total cholesterol, LDL-c, and

 Room nights in Hotels prior and post voyage  Airport to hotel transfer. 

The first aim of the present study was to examine the associations of Five-Factor personality traits with effort-reward imbalance and job strain. The Five-Factor Model of

It was our belief that by participating in the process designing an online course with their team, that the faculty members in this course would produce courses that were far

the pressre of the air throgh the gas trbine 'ore. The bla$es a''elerate the air in'reasing its $yna#i' pressre, an$ then the vanes $e'elerate the air transferring kineti'

Level 2 infractions levied by a Residence Life Coordinator or any other faculty or staff member may be appealed to the Dean of Students (or the Senior Vice President for

that the effect of floral resource availability will vary depending on the sociality group (ie. Solitary, native social, or managed social). The rainy season takes place from May

Transfer students who are interested in the Ath- letic Training major should inform the Admission Office upon application for admission to the University and contact the