• No results found

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

N/A
N/A
Protected

Academic year: 2021

Share "BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

B

IG

D

ATA

C

OURSE

1

D

ATA

Q

UALITY

S

TRATEGIES

-

C

USTOMIZED

T

RAINING

O

UTLINE

Cerulium Corporation has provided quality education and consulting expertise for over six years. We offer customized solutions to maximize your warehouse.

Prepared by:

Steve Wilmes, CEO Cerulium Corporation Phone: 803.719.6782

(2)

PURPOSE

This class, together with the Data Profiling Techniques class, provides a good beginning working knowledge of Data Quality. It equips students with a thorough understanding of all the issues surrounding data quality, and what can be done to improve it. The Data Quality Dimensions Methodology is presented as a

repeatable way to uncover and classify common data quality problems, and the classification scheme leads to the root causes and how they can be corrected. Metrics are presented which enable IT to partner and deliver to the business progress reports on data quality. Lastly, techniques on how and where to cleanse data are presented.

COURSE CURRICULUM:

This one-day class, together with the Data Profiling Techniques class, provides a good beginning working knowledge of Data Quality. The Data Architecture Boot Camp for Big Data (see below) is designed to address the critical need within the data warehousing industry for seasoned Data Architects.

BIG DATA – DATA QUALITY COURSE OUTLINE

Intended Audience: Business Analysts, Data Analysts, Data Administrators, Data Modelers, Data/System Architects, Database Administrators (DBA’s), Data Quality Managers, Application Development Managers, Data Warehouse

Managers, Project Managers

Course Agenda:

 What is Data Quality?

 The Role of Business Process in DQ  The DQ Dimensions

 How to Measure DQ & Metrics  How to Use Metrics

 How & Where to Cleanse Data  Data Cleansing Techniques

(3)

B

IG

D

ATA

C

OURSE

2

E

NTERPRISE

BI

S

TRATEGIES

U

SING

H

ADOOP

-C

USTOMIZED

T

RAINING

O

UTLINE

Cerulium Corporation has provided quality education and consulting expertise for over six years. We offer customized solutions to maximize your warehouse.

Prepared by:

Steve Wilmes, CEO Cerulium Corporation Phone: 803.719.6782

(4)

PURPOSE

Enterprise BI™ is committed to providing participants with two levels of a BI perspective. The first level is focused on the data and technical architecture requirements needed to implement a best-of-class infrastructure for BI. The second level concentrates on exposing participants to leading BI tools, their use, and their application.

The course begins with a best-of-class, detailed examination of data and technical architectures specific to BI and for applications such as Query and Reporting, OLAP, Data Mining, Spatial Analysis, Real-time Analytics, Metadata, and Portal applications. Mobility will be introduced as a quick easy effect mechanism for agile methodology

Participants are then led through discussions on the proper application of leading BI tools, with regard to the architectures defined. Through lecture and live

product demonstrations, features and functionality are compared between products such as Hyperion Essbase, Microsoft Analysis Services, Cognos, and others. We will look at Hive, which runs on Hadoop, as a BI tool that can easily be implemented in the Enterprise for low cost solution.

COURSE CURRICULUM:

This three-day Enterprise BI™ course is designed to provide participants with a non-biased view of leading BI tools and how those tools fit into an overall BI architecture.

(5)

ENTERPRISE BI STRATEGIES USING HADOOP COURSE OUTLINE

I. The BI Organization

o The Basis of Information o Components of a BI Strategy o Enterprise Deployment o Data Structure Dependency o A Conceptual Architecture o Applying the Right Tools o BI is an Iterative Process

o These Are Not Business Requirements II. Query & Reporting

o Definitions and Terms o Architecture

o Analysis, Design & Build Checklists o Tool Functionality

o Project Considerations III. Building an OLAP Solution

o OLAP & The BI Organization o OLAP Terms

o Dimensional Spectrum

o Business Question Components o Dimensional Diagrams

o A Tuple Cell o Hierarchies

o The Inflation Factor

o The Challenge With Sparsity IV. Meta Data

o Meta Data Terms o Benefits of Meta Data o Meta Data Impact

o The Meta Data Challenge o Common Model

o Point-to-point Bridges o Leading Approach o An Integrated View

o The Current State of Meta Data V. The BI Portal

o Portal Terms

o Decision Portal Definitions o Decision Portal Characteristics o Cross-Discipline Knowledge o Decision Based Design o Decision Factor Analysis o A Portal Implementation Method o The Architecture

(6)

VI. Data Mining

o Data Mining & The BI Organization o Data Mining Terminology

o Data Mining Opportunities

o Mining Drives Analytic Applications o Data Mining Scenarios

o Application of Mining Techniques o Effort Distribution

VII. Spatial Data & Analysis

o Spatial Analysis & The BI Organization o Spatial Terminology

o What is GIS?

o Spatial Relationships o Displaying Spatial Data

o What Spatial Data Means to You VIII. Real-time Warehousing

o Terms o Zero-latency

o Real-time Data Warehousing o Business Activity Monitoring o Network & Systems Management IX. BI Trends

(7)

B

IG

D

ATA

C

OURSE

3

B

IG

D

ATA

B

OOT

C

AMP

-

C

USTOMIZED

T

RAINING

O

UTLINE

Cerulium Corporation has provided quality education and consulting expertise for over six years. We offer customized solutions to maximize your warehouse.

Prepared by:

Steve Wilmes, CEO Cerulium Corporation Phone: 803.719.6782

(8)

Big Data Customized Education Outline

04/18/13

© 2013 Cerulium Corporation – All rights reserved. Confidential. 8/10

PURPOSE

This class is designed to expand on the skills of people that already have business analysis, systems analysis, and database design skills, and couple them with group facilitation skills, process modeling and data warehouse design skills. Upon completion of this course, the student will be able to facilitate the gathering of business information needs requirements and turn them into reliable and forward-thinking data warehouse designs complete with data models and specifications for ETL and Business Intelligence development.

COURSE CURRICULUM:

This five-day class is a hands-on, intense course that includes lecture, hands on labs, and nightly homework. The Big Data Boot Camp is designed to address the critical need within the data warehousing industry for seasoned Data Architects.

(9)

Big Data Customized Education Outline

04/18/13

© 2013 Cerulium Corporation – All rights reserved. Confidential. 9/10

BIG DATA BOOT CAMP COURSE OUTLINE

Day One

 Data Warehouse Basics

 Design Characteristics for Hadoop and MPP

 Architectural Considerations – Why go this way at all  Critical Success Factors

 Requirements Definition  Scope

 Process Modeling – The business way  Data Modeling

 Information Needs Modeling  Facilitation Management

 Purpose  Techniques

 Understanding Groups  Facilitator’s Tool Kit  LAB 1

Day Two

 Business Process Modeling  Context Diagramming

 Process and Flow Characteristics  Process Relationships

 Methodology  Validation Steps  LAB 2

 Enterprise Data Modeling  Definition and Use  Architecture  Mechanics  LAB 3

 Business Information Needs  Purpose and Use

 Focus on Business Objectives  LAB 4

Day Three

 Data Modeling Overview

 Logical vs Physical Models

 Logical Model Purpose and Components  Physical Model Purpose and Components  LAB 5

(10)

Big Data Customized Education Outline

04/18/13

© 2013 Cerulium Corporation – All rights reserved. Confidential. 10/10 Day Three - Continued

 Data Modeling Entity-Attribute-Relationship Characterization  Understanding Entities and Attributes

 Representing Relationships and Hierarchies  Unique Identifiers - Best Practices

 Naming Standards  LAB 6

 The Data Modeling Process  Techniques

 Understanding the Normal Forms  Data Profiling

 LAB 7 Day Four

 Data Architecture Data Acquisition

 Data Warehouse Architectural Layers  Source Layer Considerations

 Audit Layer Considerations  LAB 8

 Data Architecture ETL Considerations  Stage Layer Design

 Audit/Stage ETL Design  LAB 9

 Data Architecture Base-History-Dimensional Design  Base Layer Design Standards

 History Layer Design Standards  Dimensional Design Standards  LAB 10

Day Five

 Modeling for BI Tool Capabilities  Business Tool Considerations  Leveraging RDBMS Capabilities  Leveraging BI Tool Capabilities  LAB 11

 Conclusions

 Data Warehouse Layers  Presentation Components  Audit Components

 Business Objective Focus  FINAL EXAM

References

Related documents

We conclude that community smells can be used in the scope of: (1) agile retrospectives — to diagnose potential organisational and socio-technical issues encountered by agile teams;

The main wall of the living room has been designated as a "Model Wall" of Delta Gamma girls -- ELLE smiles at us from a Hawaiian Tropic ad and a Miss June USC

Players can create characters and participate in any adventure allowed as a part of the D&D Adventurers League.. As they adventure, players track their characters’

Simulating clinical concentrations and delivery rates of a typical intravenous infusion, a variety of routinely used pharmaceutical drugs were tested for potential binding to

Democritus distingnished between primary qualities based on the shape and characteristics of the atomon, and secondary or derived qualities, like sweet, bitter, warm,

This paper examines the lithology and raw material provenance of knapped stone artifacts recovered from prehistoric sites on Meganisi in the course of the

T h e second approximation is the narrowest; this is because for the present data the sample variance is substantially smaller than would be expected, given the mean

The graphs presented in Figure 6.8 and Figure 6.9 show the effects of shape factor and Poisson’s ratio on the stress distributions in axial direction in IS-shaped bonded elastic