• No results found

Data Mart/Warehouse: Progress and Vision

N/A
N/A
Protected

Academic year: 2021

Share "Data Mart/Warehouse: Progress and Vision"

Copied!
31
0
0

Loading.... (view fulltext now)

Full text

(1)

Data Mart/Warehouse: Progress and Vision

Institutional Research and Planning

University Information Systems

(2)

1/2/2007

What is data warehousing?

A data warehouse:

ƒ

is a single place that contains

complete,

accurate

data from multiple sources, making

data analysis easier.

ƒ

can streamline complex administrative

functions and support the decision-making

process.

ƒ

is specifically designed for querying and

reporting.

ƒ

is separate from operational/legacy systems

(e.g. SIS).

(3)

How data warehousing works

ƒ

Information is collected from a variety of

different sources and formats.

ƒ

The data is “cleaned” – transformed into a

common format.

ƒ

The cleaned data is added to the

warehouse, and made available through

data marts.

ƒ

Data is frozen in time for comparative and

forecasting purposes.

ƒ

Users can then write queries or make

(4)

1/2/2007

IRP Data Mart/Warehouse

SIS (completed) HRS data FRS data IRP data selection and design Data Warehouse UIS Software/hardware

support and tools

Queries Reports

(5)

Use of data warehousing

ƒ

Reporting

Consistency and standardization of data. Users only have to learn how to use one database to write queries and reports and display data in graphical form.

ƒ

Internal –

Enrollment, retention & graduation

rates, etc.

ƒ

External –

IPEDS, US News & World Report,

Princeton Review

(6)

1/2/2007

Data Driven Decision Making

ƒ

Monitor data

ƒ

Data warehousing helps monitor important

university processes.

ƒ

Rapid queries

ƒ

Users can generate ad-hoc reports “on the fly”

without needing to spend time writing and

re-writing the code needed to extract data from

multiple sources

ƒ

Trend Analysis

ƒ

Administrators can easily analyze admissions and

enrollment trends using historical data.

ƒ

Daily Decision Making

ƒ

Increase the efficiency of the daily functioning of

(7)

Benefits of Data Warehouse

ƒ

Competitive advantage

ƒ

Administrators can respond to changing

conditions based on data.

ƒ

Consistent public image

ƒ

Reputation for quality and excellence

ƒ

Enhance public image

ƒ

Facilitates accreditation

ƒ

By using the data warehouse to “drill down” into

the college level and department level, more

detailed reports can be generated for

(8)

1/2/2007

IRP’s Data Mart

Current data ƒ SIS ƒ Enrollment ƒ Degrees ƒ Admissions ƒ Graduation ƒ Retention

Databases to be added

ƒ Research expenditures (2 years)

ƒ Course distribution

(2 semesters w/ ability to go back further)

ƒ Faculty file to 1995 (in progress)

ƒ Staff file

(in process of being cleaned)

ƒ Budget file

(in conceptual design stage)

ƒ Athletics (in progress)

ƒ Faculty Load (in development)

ƒ Student services (in the future)

(9)

Data source analysis diagram

Student Information System (SIS)

Enrollment &

Degree Data Other Data

Commission of Higher Education

(CHE)

Student Information Data Warehouse

(SIDW) Official Data

(10)

1/2/2007

Architecture model

Source Systems

NJIT-SIS

Data Staging Area

T3 c on n ec ti on

Data Staging Services

- Load

- Transformation - Cleansing - Export

- Metadata exchange

CHE (Government) E-M ai l Staging Files

- Dimension sets - Facts Data Staging Administrator Client Back Room Base Level Presentation Repository Metadata Catalog Source Models Target Models

Source Target Mappings & Cleansing

Source Models

Target Models Code Definitions Official Data Definitions

Report Library Data Warehouse Database Administrator Client Backup / Replicate Application (Distributed) Front Room Security Service

-Log on Administration

- Authentication - Authorization

ROLAP Service

- Query Management - Data Analyze - Aggregate Navigate

Cleaned Data

Bulk Loader

Focus Program

- Enrollment Files - Degree Files

SURE File Connx Program

- Other Files

End User Application Client

Advanced Analysis Client

E-M

ai

(11)

Appendix

Current IRP Data Mart

Screens

(12)

1/2/2007

Screenshot: Predefined query

(13)
(14)

1/2/2007

Screenshot: Enrollment cross-tab

(15)
(16)

1/2/2007

Screenshot: Admissions cross-tab

(17)
(18)

1/2/2007

Screenshot: Graduation rate cross-tab

(19)
(20)

1/2/2007

Screenshot: Retention rate cross-tab

(21)
(22)

1/2/2007

Screenshot: Miscellaneous queries

(23)
(24)

1/2/2007

Extract, Transform and Load (ETL)

(25)

DW Summary: Characteristics

ƒ

Subject-orientation

ƒ

integrated

ƒ

non-volatile (i.e. not updated)

ƒ

time variant (kept for long periods, for

forecasting and trend analysis)

ƒ

summarized

ƒ

large volume

ƒ

not normalized

ƒ

metadata

(26)

1/2/2007

Data Warehouse Process

(27)

Data Warehouse Tools

ƒ

Intelligent Agents and Agencies - tools work

and think for user.

ƒ

Query Facilities and Managed Query

environments.

ƒ

Statistical Analysis - One of the biggest

surprises in the data warehousing

marketplace is the resurgence of interest in

traditional statistical analysis, and the

concomitant resurrection of the popularity to

products like SAS and SPSS.

(28)

1/2/2007

Data Warehouse Tools

ƒ

Data Discovery

A large class of tools formerly classified as

decision support, artificial intelligence and expert systems. They now make use of neural networks, fuzzy logic, decision trees, and other tools from advanced mathematics to allow a user to “sift” through massive amounts of raw data to

“discover” new, interesting, insightful, and in many cases useful things about the organization, its

operations, and its markets.

ƒ There are many different data discovery

(29)

Data Warehouse Tools

ƒ

OLAP -

On-line Analytical Processing:

ƒ

often uses multi-dimensional spreadsheet tools

allowing users to look at information from many different angles.

ƒ Users are able to “slice and dice” reports and to

look at the same kinds of information at different levels at the same time.

ƒ Typical OLAP application might allow a product

manager to view sales figures for a given product at the national level, see them broken down by

division, drill down to see territories within a

division, check sales numbers for each store within a territory, and then compare them against sales of stores from another territory.

(30)

1/2/2007

Data Warehouse Tools

Data Mining

Provides for:

ƒ

Knowledge discovery in databases

ƒ

Knowledge extraction

ƒ

Data archeology

ƒ

Data exploration

ƒ

Data pattern processing

ƒ

Data dredging

(31)

Data Mart Timeline:

Month1 Month2 Month3 Month4 Month5 Month6 Project Planning/Management/Policy

Development Environment Software Installation

Data Inventory & Quality Assessment Database Design

Application Design

Data Scrubbing & Loading Application Development

Testing Documentation

References

Related documents

Strategy #4: Review and update comprehensive park plan Action Step #1: Identify financial costs of plan components Action Step #2: Research funding options. Action Step #3:

Given an equilateral triangle ABC and a point M in the plane (ABC). Let D be the midpoint of the side AB. Let ABCD be a square, and C the circle whose diameter is AB. Let Q be

Und, wenn man den App-Begriff weiter fasst und sagt, dass muss jetzt nicht unbedingt eine native App sein, sondern es kann auch eine Web-App sein, dann kann das durch aus sein,

ODREĐIVANJE HIDRAULIČKE VODLJIVOSTI METODOM PERMEAMETRA SA STALNOM RAZINOM NA PODRUČJU VODOCRPILIŠTA BIKANA U VIROVITICI..

Advanced Papers Foundation Certificate Trust Creation: Law and Practice Company Law and Practice Trust Administration and Accounts Trustee Investment and Financial

In regard to Fact 3, when production based prices such as the GO deflator for traded goods or the PPI are used to measure P i T , we find that correlations, standard deviations,

Often, Spain sought to close these loopholes after each new treaty with Britain, However, constant shifting in Spanish governance and diplomacy often left an unclear foreign

Players can create characters and participate in any adventure allowed as a part of the D&D Adventurers League.. As they adventure, players track their characters’