• No results found

Semantic Data Modeling: The Key to Re-usable Data

N/A
N/A
Protected

Academic year: 2021

Share "Semantic Data Modeling: The Key to Re-usable Data"

Copied!
16
0
0

Loading.... (view fulltext now)

Full text

(1)

Semantic Data Modeling:

The Key to Re-usable Data

Stephen Brobst

Chief Technology Officer

Teradata Corporation

[email protected]

617-422-0800

(2)

2

Not just a collection

of subjects...

Activity

Party

Account

Product

Single, Integrated System

...but also their

relationships

Party

Product

Account

Activity

Don’t model subjects

individually!

Model your entire

business!

Enterprise Information Management

(3)

3

Functional Views

Sales

Marketing

Finance

Rates/

Regulatory

Customer

Service

Risk

Demographics

Pricing

General Ledger

Promotions

Products

Safety

Engineering

Production

HR

Contracts

Works OK for OLTP, but causes

data chaos for BI applications.

(4)

4

Business Intelligence Requires Data Integration

Product Data Customer Data Account Data Transaction Data G/L Data Market Data External Data

(5)

5 Copyright © 2005, Stephen A. Brobst. All rights reserved.

Data Modeling Techniques

Key observation: Practitioners in the data warehousing

industry frequently confuse construction of the semantic data

model, logical data model, and physical data model.

A semantic data model (SDM) captures the business

view of information for a specific knowledge worker

community or analytic application.

A logical data model (LDM) captures the business

relationships in the enterprise information independent

of a specific analytic application or departmental view.

A physical data model (PDM) captures the

(6)

6

Data Model Deployment

Conceptual Data Model

Project – A ‘ Project – B Project – C

Enterprise Data Standards

Subject Area ‘A’

Enterprise Logical Data Model(3NF)

xxxxxxx xxxxxxxxxx xxxxx xxxxxxx xxxxxxx xxxxxxxxxx xxxxx xxxxxxx xxxxxxx xxxxxxxxxx xxxxx xxxxxxx xxxxxxx xxxxxxxxxx xxxxx xxxxxxx xxxxxxx xxxxxxxxxx xxxxx xxxxxxx xxxxxxx xxxxx xx

Subject Area

‘A’

Physical Model Realization

Design Meta

Data

Semantic Model Views

Subject Area

‘B’

Subject Area

‘C’

Single Physical Data Model

(7)

7 Copyright © 2005, Stephen A. Brobst. All rights reserved.

Semantic Data Modeling

Semantic data modeling is a logical data

modeling technique; the semantic view of

information does not necessarily need to be

physicalized in the database.

There may be a different semantic data model for

each department/applications that uses the data

warehouse.

Dimensional modeling is a common technique for

constructing the semantic data model for an

analytic application, but is not the only viable

approach.

(8)

8

Dimensional

Physical Data Extensions

Different Semantic Model Designs are Appropriate

for Different Types of Knowledge Workers

Normalized Generic Structures

Index choices &

selective table

denormalizations

(9)

9 Copyright © 2005, Stephen A. Brobst. All rights reserved.

Physical Data Model

Physical data model represents the tables constructed in

the database.

Recommendations:

Use the (3NF) LDM as the starting point for the PDM

with selective denormalization when appropriate for

(primarily) performance reasons.

Overlay (dimensional) SDM on top of PDM using

views and/or semantic metadata in your BI tool.

Design LDM first, then use application-specific

business requirements to derive the SDMs and

performance considerations to map into the PDM.

(10)

10

Semantic Models Should be BI Tool Agnostic

MicroStrategy

Teradata OLAP Connector

Tableau

Tier 3

Access

Tier 2

Integrated

Tier 1

Acquisition

(11)

11

A collection of data modeling assets that help make database

design and development faster and easier for the access

layer:

>

Access layer provides path for data from the integrated data model

to end user consumption.

>

When this layer not well-designed, it can impact speed, security,

and simplicity in developing and delivering reports, BI applications.

Re-usable building blocks provide flexibility and consistency

to the development process:

>

SMBBs include pre-built semantic models.

Focuses on a specific analytic need

in a specific industry:

>

For example, Communications Mobile

Revenue Analytics.

SMBBs are to the semantic layer as

iLDMs are to the integrated layer of

a data warehouse implementation.

What is a Semantic Modeling

(12)

12

Dimensional Model

Dimension Building Blocks

Dimension Building Blocks Support a

Range of Analytical Needs

Fixed, Normalized Hierarchy

Fixed, Flattened Hierarchy

Variable Depth Hierarchy

(13)

13

What are SMBBs?

How are they related to an LDM?

Building from the Foundation for your

Data Warehouse:

An LDM is like a blueprint for a house

that you are building. It serves as the

foundation for your integrated data

warehouse.

The SMBBs are like room designs that

meet specific homeowner needs.

Different rooms need different designs

based on their purpose. Similarly, for

each new business application, new

semantic models are needed.

SMBBs provide different designs

(building blocks) for the modeler to

choose from in building the semantic

models.

These flexible, reusable building blocks

(14)

14

Q: Where does it all start?

A: Business requirements drive the process!

Relationships between the

Three Types of Data Models

The Logical Model is

used to drive

generalization and

support source data

leverage and reuse.

Logical Data Model

Physical Data Model

Semantic Data Models

Data

access

patterns

Support

data

re-use

The Semantic Model

captures data

access patterns that

must be supported

by the core physical

model.

The Physical Model

provides core

support for data

integration within

the information

architecture.

(15)

15

Semantic Layer Benefits

Efficient table joins can be encouraged

inside the SDM views.

Views are low maintenance objects.

Views do not consume database space.

Join indexes (JIs) and aggregate join

indexes (AJIs) can be created based on the

access paths embedded in the SDMs.

PDM is not compromised with new

application requirements.

(16)

16 Copyright © 2005, Stephen A. Brobst. All rights reserved.

Conclusions

Critical to distinguish between logical data modeling,

semantic data modeling, and physical data modeling.

Separate the implementation of the semantic model

from the physical data model (PDM) deployment for

maximum flexibility.

Selective use of PDM extensions to optimize

performance.

Either ANSI standard views of the semantic metadata

within your BI tool of choice can be used for creating

a semantic data layer without sacrificing flexibility of

the PDM.

References

Related documents

T h e second approximation is the narrowest; this is because for the present data the sample variance is substantially smaller than would be expected, given the mean

The phonetic properties of six Malay vowels are investigated using magnetic resonance imaging (MRI) to visualize the vocal tract in order to obtain dynamic articulatory

Whether grown as freestanding trees or wall- trained fans, established figs should be lightly pruned twice a year: once in spring to thin out old or damaged wood and to maintain

The main wall of the living room has been designated as a "Model Wall" of Delta Gamma girls -- ELLE smiles at us from a Hawaiian Tropic ad and a Miss June USC

Simulating clinical concentrations and delivery rates of a typical intravenous infusion, a variety of routinely used pharmaceutical drugs were tested for potential binding to

Saturday (hard day, 6-8 hours): dojo class conditioning hard stretching sparring weight training  bag work. running

Potential explanations for the large and seemingly random price variation are: (i) different cost pricing methods used by hospitals, (ii) uncertainty due to frequent changes in

Players can create characters and participate in any adventure allowed as a part of the D&D Adventurers League.. As they adventure, players track their characters’