• No results found

The Data Warehouse - Strategy

N/A
N/A
Protected

Academic year: 2021

Share "The Data Warehouse - Strategy"

Copied!
50
0
0

Loading.... (view fulltext now)

Full text

(1)

The Data Warehouse - Strategy

Service profile for

dispositive informationprocessing Diplom-Informatiker Peter K. Albrecht

Head of the division Systems-Consult (Version: 01.05.1999)

(2)

Content

• Mummert + Partner Consultancy AG

• Why Data Warehouse Projects with

Mummert + Partner

• The Mummert + Partner - Service profile

(3)

1960 Dr. Olaf Mummert founded the Mummert consultancy in Karlsruhe on April 1st. 1996 Transformation of the consultancy Ltd. into a PLC.

We accompany our customers up to the target.

Entwicklung und Umsetzung von Branchenlösungen 42% Strategie-, Management und Organisationsberatung 39%

IV-Beratung 15% Sonstiges 4% Öffentlicher Sektor Sonstige Industrie 42% 39% 15% 4% 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% Entwicklung und Umsetzung von

Branchenlösungen Strategie-, Management und

Organisationsberatung IV-Beratung Sonstiges revenue revenue 0 50 100 150 200 250 300 88 89 90 91 92 93 94 95 96 97 98 employees employees 0 100 200 300 400 500 600 700 800 900 1000 88 89 90 91 92 93 94 95 96 97 98

(4)

The target leading combination of business- and specialist

know-how.

Banks - Investment Banking - Retail Banking - Asset Management - Banking Services

Insurances - Customer Relation - Market/Product/Sales - Controlling - Insurance architecture

- Financial architecture - Asset Management - Business Solutions Federal and state

authorities

- Management and organizational consulting - Financial systems - Facility- and Assetmanagement

Public sector - Financial systems - Logistic - Facility- and

Assetmanagement Energy - Customer management - Logistic

- Finanzsysteme - Facility- und Assetmanagement Telecommunication - Customer Relation - Debitor management

Trade - Customer management – Logistic - Financial systems Industry - Customer management - Construction (CAX-Technologien)

- Logistic - Financial systems

- Data Warehousing - Document Management - CRM-Technology - Quality Management - Software Factory - Technology Management (Infrastructur) - Human Ressources Management

(5)

We are where our customers need us.

Berlin Frankfurt Hamburg Cologne Leipzig London Milan Munich Münster New York Vienna Zurich

(6)

Content

• Mummert + Partner Consultancy AG

• Why Data Warehouse Projects with

Mummert + Partner

• The Mummert + Partner - Service profile

(7)

Extensive initial solutions ensure the project success

1. Specialized procedural model

2. Proven methodology

3. Certified product know how

4. Data Warehouse laboratory

5. Skilled and experienced employees

6. Successful customer relationship

On the basis of these 6 components, Mummert + Partner is

the competent partner for Data Warehouse projects.

(8)

1. The M+P procedural model for Data Warehouse projects

Operation/

maintenance

Helpdesk Automation Verification

Scalability

Capacity Tuning Backup

Change-

Manage-ment

Modeling

Metadata Data Marts WarehouseData Dataflow Frontend Architecture Prototyping

Advance study

definitionTarget Is - analysis External Know-how

Data Focus Economic viability

Project definition

ROLAP vs. MOLAP 3-TIER vs. 2-TIER vs. 1-TIER Planing Team staffing Product choice Server FAT-Client THIN-Client Intranet

Implementation

Network Development Security /

Usergroups Testing Education Propagation

Customizing

Data sources

(9)

2.The proven methodology is the key to success.

Frontend

Data Marts

Data Warehouse

Data sources

Meta-Data

(10)

Relational

Multidimensional

OLAP-CLIENT

OLAP-SERVER

RDBMS

ETL

(11)

4. The Data Warehouse Laboratory - for developing and prototyping

Backupserver für alle NT-Server und Clients IBM RS6000 F40 1 GB RAM 60 GB 40 x Windows NT WS 64 MB RAM IBM SP2 SilverNode AIX 4 Prozessoren 3 GB RAM 360 GB Fünf NT-Server Pentium II 512/256 MB RAM 18/9 GB

(12)

5. Professional technical and methodical know -how.

Data Warehouse - Team

Data Warehouse - Experts

System analysts

Graduates

Technology management

Producer

24 MA

6 MA

Business consultants

6 MA

Composition of the team

Goal: 60 employees up to

the middle of 1999

More than 140 personal years

of

Data Warehouse experience

Mummert + Partner

1 1 2

4

9 10

17

2121

2527

293131

33

36

0 5 10 15 20 25 30 35 40 Mar-98 Apr-98 M ay-98 Jun-98 Jul-98 Sep-98 Oct-98 Nov-98 Dec-98 Jan-99 Feb-99 Mar-99 Apr-99 M ay-99 Jun-99 Jul-99

(13)
(14)

Content

• Mummert + Partner Consultancy AG

• Why Data Warehouse projects with

Mummert + Partner

• The Mummert + Partner - Service profile

(15)

Project handling or alternativly consulting single services

Operation/

maintenance

Modeling

Advance studies

Project definition

Implementation

Helpdesk Automation Verification

Scalability

Capacity Tuning Backup

Change- Manage-ment Data Warehouse Dataflow Data Marts

Metadata Frontend Architecture

Prototyping Target

definition Is - analysis Know-how

External Data Focus Economic viability ROLAP vs. MOLAP 3-TIER vs. 2-TIER vs. 1-TIER Planing Team staffing Product choice Server FAT-Client THIN-Client Intranet

Network Development Security /

Usergroups Testing Education Propagation

Customizing

Data sources

(16)

Targetdefinition

• Activities

• Evaluation of the strategic areas and their dimensions

• Definition of the expected targets from the economic point of view

• Definition of the necessary controlling instruments to reach the target

• Holding „strategy-workshops“ with the concerning departments

• Results

• Documentation of demand concerning the subject

• Model of a satisfactory information demand (high level)

• Prioritization of set targets

• Prioritization of controlling instruments that have to be created

• Hierarchical dimension of the business

(17)

IS- analysis

Activities

· Analyzing existing operational systems

· Analyzing existing dispositive systems

· Analyzing the current user architecture • Results

· Grade of covering the information demand by data of existing systems

· Expanding the targets by the integration of existing dispositive systems and reporting

· Consolidation of necessary controlling instruments to reach the target • Productcriteria

• Accessibility to: Flat Files, VSAM, IMS, DB2, Oracle, Informix, Sybase, Adabas, Ingres, SAP-R/2, SAP-R/3, Tandem-NonStop-SQL, NCR Teradata, MS-SQL-Server, MS-Excel, MS-Access, OBDC

Covering - preceding Systems

(18)

External Data

Covering

- prec.Systeme - external sources

(19)

Know-how

Covering: - prec. Systems - external Sources - Know-how (System-Input)

Advance study:

(20)

Advance study:

Focussing

(21)

Prototyping

Advance study:

(22)

Economic viability

Advance study:

(23)

ROLAP vs. MOLAP

Project definition:

Activities

• Working out the necessary dimensions and their hirarchical order (drill down, slicing and dicing, aggregation)

• Evaluating the expected data volumina by regarding the sparsity • Discussing the MOLAP – attempt (multidimensional OLAP)

• Discussing the ROLAP – attempt (relational OLAP) • Discussing the HOLAP – attempt (mixed use)

• Results

• Basic decision about the usuable data architecture for the data marts and for the data warehouse which will be build up in later steps

• Productcriteria

• Support of MOLAP, ROLAP and/ or HOLAP • Support of personal OLAP

• Export possibility to MS-Office • Add-in functionality for MS-Office

MOLAP ROLAP HOLAP

Operative Data Data Warehouse Data Marts

(24)

3-TIER vs. 2-TIER vs. 1-TIER

DB-Server OLAP-Server Client

(25)

FAT-Client vs. THIN-Client / Intranet

Project definition:

(26)

Server

Project definition:

(27)

Product choice

Project definition:

• Activities

· Assessment of the existing technologies on the market regarding the necessary data-and C/S-arcitecture to reach the target

· Consideration of the company’s demands regarding the product choice

· Holding user-workshops to find out about the acceptance regarding possible front-end design

• Results

· Confirmation of the data- and C/S-arcitecture in case it had to be revised

· Making a principle decision about the products that will be used

· Making a principle decision about the sort of user-surface • Productcriteria

· Possibilities of combining the products between each other

· Integration of the products/modules over common metadata

· Maximum amount of transformed dimensions and hierarchical levels

(28)

Team staffing

Project definition:

Data supply Front-end

Design Technique Team of experts Sponsor customer Quality safeguarding producer Support Projectmanagement

(29)

Planning

Project definition:

(30)

Metadata

Modeling:

Activities

• Determination/ fixing the repository to deposit metadata over datasources, data warehouse, data marts as well as the data flow between all levels (staging areas) • Determination of metadata with accessibility for the user

• Results

• A central active repository for every level of the system

• Productcriteria

• Interface for data modeling

• Storing of the transformation- aggregation and extraction rules • Storing of all the necessary operations

• Storing of information about the user and his authorization on the dimension level

(31)

Data Marts

Modeling:

Activities

• Modeling of the Data Mart that will be developed in consideration to the agreed data arcitecture

• Consideration of aggregation and denormalization

• Agreeing precisely on the data model with the help of the specialists team by holding a navigation workshop

• Documentation of the data model by using metadata from the arranged repository • Working out the cardinality and rates of compressions of the dimensions

• Results

• Data model of the data mart that will be created in a denormalized form • Productcriteria

• Modeling of data in the way of ER with support of the denormalization and hierarchical dimensions (star scheme, snowflake)

• Generating the physical data storage • Automatic partition and intensification

(32)

Data sources

Modeling:

(33)

Data Warehouse

Modeling:

(34)

Data-flow

Modeling:

Activities

• Modeling the dataflows all the way from the data-sources up into the data marts on field level • Documentation of aggregations, transformations, extractions and operations which are

necessary to fill the data marts via the data warehouse • Documentation of the conditions of integrity on field level

• Joint and detailed agreement upon the data flow modeling with the specialists team • Differentiation of additive, semi-additive and non-additive business figures

• Results

• Active repository to regulate the data supply

• Productcriteria

• Use of the modern structured analysis or similar methods • Possibility to deposit conditions of integrity on field level

(35)

Frontend

Modeling:

Activities

• Modeling the frontend design with the necessary details

• Agreeing upon the navigation within the system on the basis of the modeled data mart • Documentation of the data flow from the data mart to the Frontend

• Agreeing upon the frontend design between the specialists team and chosen users • Considerating the possibilities of the chosen product

• Considerating the possibilities of intelligent software-agents • Results

• Confirmed model of the dialog that will be developed and used • Productcriteria

• Possibilities of navigation (drill down, drill up, slicing and dicing) • Maximum amount of on- and off spreadsheet dimensions

• Possibilities of exception-reporting (traffic lights function?, balanced scorecard, sw-agents) • Graphical opportunities (2D, 3D, geographic analysis)

• Flexibility in the layout of reports (standard-reporting, free navigation, EIS-functionality) • Flexibility by analyzing (ranking, comparison of different periods, cumulation)

• User friendliness (Power-user functionality) • Export- and impressor functionality

(36)

Architecture

Modeling:

(37)

Network

Implementation:

(38)

Customizing

Implementation:

• Activities

• Implementation of the chosen product in every level of the data warehouse which is supported by the product

• Using the specific procedures and technics which are given by the chosen product • Agreeing upon further creations of system components due to the development

• Considerating the adopted standards which have been worked out by the internal and external quality assurance

• Considerating the referential integrity which is visible by modeling • Results

• Implementation of the chosen product

• Identification of further components of the system that will have to be developed • Productcriteria

• effort for the development of surface and data supply • effort for the integration of changes

• effort for refresh, append and update of the data deposit • seperating surface and data supply

(39)

Development

Implementation:

Activities

• Creating the data supply routine by using the information deposed on the repository dynamically • Programming the frontend that has to be developed

• Integration of data supply and surface by using standardized interfaces

• Considerating the adopted standards which have been worked out by the internal and external quality safeguarding

• Considerating the referential integrity which is visible by modeling • Results

• Completed system allowing nonrestricted access to all data • Productcriteria

• Effort For the development of surface and data supply • Effort for the integration of changes

• Effort for refresh, append and update of the data deposit • Seperating frontend and data supply

• Interfaces to higher programming languages over class librarys • Availability of an integrated software development location

Surface-Design

Data-supply

(40)

Security/ Usergroups

Implementation:

• Activities

• Defining all the dimensions which are relevant for security

• Defining the usergroups in collaboration with the specialists team • Implementation of security informaion in the repository

• Expanding the routines of data access by the consideration of security information in the repository

• Results

• Security administration system

• Completed system by considerating the aspects of security • Concept for different usergroups

• Productcriteria

• Considerating power-user as well as end-user

• Variable classification of users to different user groups

(41)

Testing

Implementation:

• Activities

• Definition of error categories (e.g. data supply, surface, technic)

• Definition of error classes /grades (e.g. 1= serious error...4= nice to have) • Defining equivalence classes to reach a high functional covering by the test • Carrying out the test to eliminate errors steadily

• Carrying out stress- and masstests above the equivalence classes

• Results

• Approving the system

• Productcriteria

• Debugging- functionality

(42)

Education

Implementation:

(43)

Propagation

Implementation:

(44)

Helpdesk

Operation/ maintenance:

(45)

Automation

Operation/ maintenance:

• Activities

• Creating automatical job-processes for the data supply of the data marts

• Creating automatical job-processes for the supply of data of the client-system (pull-principle) • Considerating available time frames

• Handing over the data supply routines to the productive system of the customer • Results

• Productive system

• Productcriteria

• Possibility of automation of the data supply- and data spreading routines • Considerating recommencement in case of an interruption of the data supply • Query automation of integrity conditions as well as free eligible filters

• Exception-handling by proactive presentation of critical raw data • Independent data supply of the clients

(46)

Change-Management

Operation/ maintenance:

(47)

Capacity

Operation/maintenance:

Activities

• Finding out about the expected need of store capacity • Considerating given key figures due to the modeling like

-amount of dimensions as well as their cardinality

-amount of hierarchies per dimension as well as their rates of compression -amount of different features per hierarchy

-in case of MOLAP solutions, the amount of non usable spaces (sparsity) -in case of ROLAP solutions, length of a line of the central entity-type • Consideration of the additional requirement due to the indication

• Consideration of the additional need due to aggregation • Results

• Qualified projection of the required capacity of the system • Productcriteria

(48)

Tuning

Operation/maintenance:

• Activities

• Recommendation of aggregations for standard analysis • Recommendation of indices for frequent used analysis

• In case of higher data volumina (>100 GB) implementation of parallel architectures • Moving data from the data mart to the client ( subsetting, personal OLAP)

• Dismanteling and parallelising the raw data supply • Partitioning of the data deposit

• Results

• Improvement of the running system • Productcriteria

• Automated logging on statement level

• Automated recommendation of aggregations and index • Log-management-system to evaluate the use of the system • Possibility of using bitmap-index

(49)

Backup

Operation/maintenance:

(50)

Verification/Scalability

Prediction Deviation Analysis Classification Segmentation Association

Operation/maintenance:

References

Related documents

Under the direction of Andrew Stagg, Bendigo Bank’s Manager of Learning and Development since 1998, Whole Brain ® Thinking has become an integral part of the bank’s culture

Elisabeth Gerarda de Waal, daughter of Nicolaas de Waal and Christina Kaljee, was born op 12 september 1891 te Amersfoort and was baptized.. She is waschmeisje,

Liu J., Gao Y., Cao D., Zhang L. and Guo Z. 2011 Nanoparticle dispersion and aggregation in 

This publication was commissioned by the NSW Mine Safety Advisory Council as a result of the NSW Mining Industry Health and Safety Action Plan to 2008.. The NSW Mine Safety Advisory

Phonetic analysis of unsupervised state-based interpolation between Regional Standard Austrian German (RSAG) and Bad Goisern dialect (GOI)..

Teaching at Escuela de Ingeniería, Pontificia Universidad Católica de Chile Courses: Air Transportation (second semester 2012-2013), Desafíos de la Ingeniería

A research model is developed that proposes both a direct effect of IT infrastructure on organizational performance and an indirect effect mediated by the effects of IT