• No results found

Enterprise Data Management

N/A
N/A
Protected

Academic year: 2021

Share "Enterprise Data Management"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

Enterprise Data Management

A Comprehensive Data Approach for CSPs

Sivaprakasam S.R.

Enterprise Data Management (EDM) helps Communication Service Providers (CSPs) address the challenges caused by convergence of technologies

and frequent mergers and acquisitions. It provides a single view of the truth, unique reference data and a unified data quality framework to integrate, validate and migrate data. In addition, it enables continuous monitoring of the quality of data and establishes standards across the enterprise data lifecycle. Our white paper discusses the need for EDM in the telecommunications sector, the benefits and challenges in implementing EDM, and the components of an effective enterprise data management solution.

(2)

Executive Summary

Mergers and Acquisitions (M&As), convergence of technologies and rapid changes in global regulations are revolutionizing the telecommunications industry. To address the dynamic market conditions as well as the competition from established companies and new entrants, Communications Service Providers (CSPs) must constantly provide new features and

opportunities to customers. In addition, they must foresee market changes and act decisively by capitalizing on their primary asset – operational and analytical data – for precise metrics and predictions. However, CSPs must implement and govern enterprise data effectively to meet their business intelligence needs.

Enterprise Data Management

Enterprise Data Management (EDM) helps CSPs manage heterogeneous data sources, validate the quality of data, devise a common data model by integrating information, build analytical and presentation layers, and manage end-to-end metadata in the analytical and presentation layers. EDM also guides and governs the data architecture, while managing data assets. A typical EDM architecture diagram is depicted in Figure 1.

Error Logging Oper at ion al Data Stor e (ODS ) Business Rules Metamodel Mapping Metamodel

ETL / EAI TOOL

DQ Rules Metamodel Excepon Audit Repository Data Profiling Data Cleansing Data Validation Data De-duplication Exception Investigation Data Profiling Data Cleansing Data Validation Data De-duplication Exception Investigation DQ / Exception Rule Engine

BPM Workflow Engine

Enterpri

se

Data Mode

l

Business Analytics, Score card and Dashboard Reporting Excepon OLAP DB Enterprise Data Wa re hous e Downst re am DB Downst re am DB Enterprise Service Bu s Shared Data Se rv ice s Shared Security Se rv ic es Shared Business Se rvic es

Source Data Data Integration Target Data Data Services End User Layer

Helpdesk End Users Enterprise Information Po rt al Intranet Customers Decision Makers Raonalized Systems

Enterprise Metadata Repository

Enterprise Data Governance

S1

S2

S3

Sn

Figure 1: Typical enterprise data management architecture

Benefits of EDM

EDM ensures consistency of information with a ‘single version of truth’ by providing reference data requirements on an integrated data platform. It supports operations and enhances decision-making capabilities by helping CSPs migrate from disparate data silos to an integrated, enterprise-wide data environment.

(3)

EDM delivers several benefits -

• Provides a single, accurate view of end-to-end enterprise data

• Consolidates, profiles and integrates data from disparate systems, thereby enhancing Data Quality (DQ) and eliminating exceptions. It increases the trust factor of data.

• ’Insulates’ systems and processes from change, enabling rationalization of legacy systems and facilitating mergers and acquisitions

• Improves the accuracy of decisions

• Enables on-demand extraction of ad hoc operational reports

• Facilitates rapid analysis of customer retention, customer satisfaction, customer lifetime value, and cross-sell and up-sell data

• Enables risk-benefit analysis of business opportunities

• Enables reuse of the EDM framework in new regions, markets and product categories

Dimensions of Enterprise Data Management

Data Architecture

Data Architecture

Data Stewardship

Data Stewardship

Enterprise Data Governance

Enterprise Data Governance Enterprise Metadata Management Enterprise Metadata Management

Enterprise Data Management Enterprise Data Management

En te rpr is e Da ta Mo de l En te rp ri se D at a Mo de l Da ta S ta ndar d Da ta St an da rd Da ta Q ua lit y Ma na ge me nt Da ta Qu al it y Ma nage me nt Da ta Secu ri ty D at a Se cu ri ty

Figure 2: Enterprise data management framework

The components of an enterprise data management framework are shown in Figure 2, and are discussed in subsequent sections. Enterprise data can be managed along various angles/ layers to meet the goals of EDM:

• Data stewardship

• Data governance

• Data standards

• Data definitions and taxonomies

• Technology standards

• Data retention

• Data quality management

• Data architecture

• Data integration

• Data migration

• Master/ reference data management

• Metadata management

• Data warehousing

• Data portal

(4)

Data Stewardship

EDM requires the owners of source data to manage data assets effectively. A data steward’s responsibilities include managing data standards, formats and trust factor, and establishing and enforcing data standards. Data quality management provides quality analysis reports that enable stewards to improve the quality of data, reduce data redundancy and improve data management capabilities across the enterprise.

Source data owners provide the overall governance and oversight to the data management activities within an enterprise, while data stewards

-• Define business terms and establish rules for quality and exception validation Identify business-critical attributes and ensure data validity

• Prioritize and ensure data quality of exceptions raised by the quality management team

• Coordinate between business, application and IT teams to enhance the trustworthiness of data

• Address target and downstream data requirements

• Create awareness about data management principles, data security and retention policy, and best practices across the enterprise

Data Governance

A data governance committee must be instituted to develop the principles of managing data-related processes and enforcing them across the enterprise. It must be responsible for nominating teams for programs such as data stewardship, data quality management, data modeling, data integration, data migration, data warehousing, master data management, data architecture, data security, and metadata management. A data governance framework that focuses on people, process and technology ensures accessibility, availability, quality, consistency, security, and audit-readiness of data.

Typically, senior managers form the steering committee, while the operational team of the data governance committee includes business users, data architects and data stewards. Data governance plans are framed by a working group and approved by the steering committee. The committee must ensure that data assets are managed effectively across the enterprise.

Senior

Senior

Management

Management

Steering Committee

Steering Committee

Working Group

Working Group

St

an

da

rd

s a

nd

Ru

le

s

St

an

da

rd

s a

nd

Ru

le

s

-

-

Ap

pr

ov

al

s

Ap

pr

ov

al

s

Data Architect

Data ArchitectData Architect Data StewardData StewardData Steward Business UserBusiness UserBusiness User

Da

ta

Go

ve

rn

an

ce

Ma

na

ge

m

en

t

Da

ta

Go

ve

rn

an

ce

Ma

na

ge

m

en

t

Figure 3: Data governance structure Key roles and responsibilities of the data governance committee:

• Steering committee – Articulate the vision and arrange funds for the data governance initiative

• Source data owners – Prioritize and execute data management

• Data stewards – Address issues in data quality and standards such as merger or deletion of data, data enrichment, etc. Data stewards must ideally be from the business side.

(5)

Data Standards

Data standards are framed by the data governance steering committee to ensure that all data elements of an enterprise comply with standard terms, definitions and values. The working group, in turn, ensures that all associated parties agree with the data values and adhere to the standards. It helps the data stewards frame validation rules to filter business exceptional data and improve its trustworthiness. DQ issues identified by validation against standards must be closely monitored and tracked by data stewards until resolution.

By ensuring that each and every data element of the enterprise data model adheres to the rules and definitions of the SID model1, CSPs can meet the data requirements of emerging technologies such as wireline, wireless, cable, and IPTV. In

addition, they can avoid redundant data and ensure consistency of data.

Data elements can be classified based on their logical and physical properties. Logical property refers to the definition, origin and data type. Physical properties of the data element include data length, validation rules, and how data is stored, presented to end users and labeled. For example, elements such as the date and currency fields have specific display formats, standards, regional settings, and validations.

Data Quality Management

Data Quality Management (DQM) is a key enterprise data management process that addresses issues in the quality of data, and identifies exceptions in data elements that can be classified into industry standard quality dimensions as shown in Figure 4. It requires strategic data profiling and data quality software to be a part of the data management process.

DB ConnecvityDB

Connecvity AnalysisAnalysisAnalysisDQ DQ DQ Configuring Validation Rules Configuring Validation Rules Configuring Validation Rules Configuring Validation Rules Monitor DQ MetricsMonitor DQ Metrics Data Steward Analysis Data Steward Analysis Data Steward Analysis Data Steward Analysis Completeness Does it provide all the information

required?

Completeness Does it provide all the information

required?

Does it provide all the information

required? Validity

Is it within acceptable parameters for the business?

Validity

Is it within acceptable parameters for the business?

Is it within acceptable parameters for the business?

Uniqueness

Are the values repeated?

Uniqueness

Are the values repeated? Are the values repeated?

Relevance

Is it relevant for its intended purpose?

Relevance

Is it relevant for its intended purpose? Is it relevant for its intended purpose?

Timeliness

Is it up-to-date and available whenever required?

Timeliness

Is it up-to-date and available whenever required?

Is it up-to-date and available whenever required?

Consistency

Is it consistent and easily understood?

Consistency

Is it consistent and easily understood? Is it consistent and easily understood?

Accuracy

Is it correct, objective and can it be validated?

Accuracy

Is it correct, objective and can it be validated?

Is it correct, objective and can it be validated?

Figure 4: Classification of exceptional data elements

(6)

The quality of data may be affected at various stages of the data element lifecycle such as data entry, data transformation, conversion from operational to analytical data, master data and reference data transformation, and migration of data from legacy systems. Some practical examples of anomalies in data quality:

• Accuracy

Mary Pierce, 1408 North Any Street, Germantown, MD, 418-734-1576

- Is this a valid postal address?

• Consistency

Alan Smith, 4200, Weston Park, Cary, NC

- Is there a suffix to the street? - Is there a postal code?

• Completeness Street => St, Str

Account => Acct, Accnt, A/c

- Are they business terms with variations in value?

• Correctness

Mary Brown, 1408 North Any street, Germantown, MD, 9AB-786

- Is this a valid zip code format?

• Uniqueness

’First Merit Bank‘ and ’1st Merit Bank’

- Are they the same or different

• Relevance Robert Smith,

123 Peach Tree lane, Pleasanton CA Julia Smith, 123 Peach lane, CA

- Are the two from the same household?

• Timeliness

Activation date and time: 12 May,

2009, 12:00 AM, posted into the system during day-end batch processing. However, the service was activated during business hours of the previous day.

• Interpretability

First name attribute contains: Tai-Tai Cheung Mei Lee Wang

- Is this a personal name or company name? - Is the gender male or female?

Data management tools can be used for profiling and standardizing data, matching and merging data, monitoring quality, and tracking and addressing issues in data quality.

Challenges in Data Quality Management

• Ownership of quality issues

Continuous monitoring of quality by the DQM process enables real-time reporting of issues. However, it is seen that units/ departments rarely take ownership to rectify them or approve automatic cleansing. A data governance committee is needed to enforce cross-functional collaboration and sensitize the organization about the importance of resolving DQ issues. It requires huge manpower, and the Return on Investment on data quality fixes is relatively low.

• Identifying master/ reference data

The lack of attribute-level standards for data quality, M&As without a data integration strategy and federated source data stores result in multiple versions of data, the lack of a ‘golden’ record and incomplete master attributes.

• Metadata-driven DQM

A DQM process driven by metadata management is recommended for CSPs with frequent Merger and Acquisitions, multiple source systems or strategic plans to integrate data from multiple technologies. The cleansing maps in the integrated process and its in-built mechanism to alert and track issues help refine DQ, exception and notification rules and resolution techniques. The DQ system notifies exceptions to quality standards and violation of data quality rules to data stewards, who can rectify the issues with the help of data architects. New DQ issues must be updated in the rule repository. Quality analysis reports contain the results of DQ measurements in a pre-determined format. The senior management normally requires reports to be presented with color codes akin to the traffic signal – usable data (green), partly usable (yellow) and not usable (red).

(7)

Customer

Billing

Source Systems

Data

Consolidation Data Profiling, Quality Analysis Data Cleansing

Service

Faults

Excepon Repository

Continuous DQ Monitoring by SME

ODS Network Contract Data Profiler

EDM

EDM

DQ Rules ProductUsage ETL Processing Corrective Acon

DQ Engine

Enterprise Data Model

Figure 5: Data quality management

• Data quality-driven cleansing

The DQ process meets business requirements by cleansing and standardizing the database through continuous monitoring. The data governance committee and data stewards frame the rules for cleansing, exception management and standardization of data. Data quality scorecards help data stewards monitor and track DQ issues, and refine data cleansing rules. Cleansing occurs at various instances of the data management lifecycle:

• Front-office (real-time) cleansing

• Back-office (batch cleansing)

• Cross-office (cleansing during data transfer between businesses)

• Data collaboration between cross-functions

The DQM process is burdened with ambiguity and risk when an attribute has multiple owners or a change in an attribute influences other attributes and downstream applications. It can be avoided by storing details of the creation of data in the taxonomy of data items. The information to be stored include author, date and time of creation, application details, domain values, derivation details, derivation logic (if required), dependant fields, data item hierarchy, and propagation of changes.

As a DQM best practice, CSPs must start with limited data profiling and data cleansing activities, and incrementally develop a robust and scalable data quality management platform for a cross-organizational, 360-degree view of business.

(8)

Data Architecture

Data architecture is a fundamental aspect of enterprise data management. It is a multi-layered set of models that defines the enterprise data strategy and management policy on data collection, and identifies the need for improvement in business decisions.

Billing and Payments

Service Provisioning

Contracts

STATISTICS FORECASTING &BUILDINGBANKSOCIETY

PENSION INSURANCE GPS CUSTOMER INVOICING SALES_LEDGER GL(SE & SAL)

CCPS PROCUREMENT EDI-FAX-EMAIL GATEWAY MDS STATS_FORECASTING EIS GPS_PSION PSION_UPAY GPS_EIS GPS_BANK Share Save Eligibility

Personnel Detail GPS_INS

D&CPS_CCPS Cost plans TELEWARE PAYROLL ACS EDI-FAX-EMAILGATEWAY TACT-FCST GRD BACS BUDGET MODELLING SYSTEM KEL

ACS_STATS GWAY_ACSAll media

ACS_GWAYFORECASTING_TACT-FCST ACS_GRD TELE_GRD GRD_TELE PA-SE-SAL AICC CORPORATION TAX AICC_CTAX PA_AICC Final Accounts PA_CCPS CCPS_PROC PROC_CCPS PROC_EDIEDI_PROC

PROC_FAXPROC_EMAILPROC_MDS AP(SE)_PROC PROC_AP(SAL) AP(SAL)_PROC INV_SL MEN_INV PAY_AP AP_BACS INV_CUST PAY_BACS FCS_GRD GRD_FCS INV_GRD FCS_GL GL_FCS GRD_GL(SE) GL(SE)_GRD GRD_GL(SAL) GL(SAL)_GRD BANK_UPAY UPAY_PSION ACS_INV PAS_PROC PA_PR OC PROC_PAS PROC_PA GL_PROC PROC-GL * * * * * * * * * * * * * * * * * * * * * * * BMS AUTOCAD D&CPS MMS D&CPS_AUTOCAD Drawings PROPERTY (attack) AP-SE-SAL PM 3DMOD&VIS DES OPT FOCUS GIS MENTOR PM_CCPS Cost Phasing PM_PROC CCPS_PM Cost Plan MDS_PROC BMS_MMS DES-OPT_3DMOD&VIS 3DMOD&VIS D&CCPS Models D&CPS_GIS Drawings and object data MMS_GIS Maintenance Costs PROP_MMS MMS_PROP Maintenance Costs GIS_PROP PROP_GIS FOCUS_PROP Surveys GIS_MEN MEN_GIS PROP_EIS MMS_EIS PROC_EIS MEN_KELKEL_MEN AP_MMS PROC_AP(SE) PROP_SL * * * * * * * * * * * * * *

STATISTICS FORECASTING &BUILDINGBANKSOCIETY

PENSION INSURANCE GPS CUSTOMER INVOICING SALES_LEDGER GL(SE & SAL)

CCPS PROCUREMENT EDI-FAX-EMAIL GATEWAY MDS STATS_FORECASTING EIS GPS_PSION PSION_UPAY GPS_EIS GPS_BANK Share Save Eligibility

Personnel Detail GPS_INS

D&CPS_CCPS Cost plans TELEWARE PAYROLL ACS EDI-FAX-EMAILGATEWAY TACT-FCST GRD BACS BUDGET MODELLING SYSTEM KEL

ACS_STATS GWAY_ACSAll media

ACS_GWAYFORECASTING_TACT-FCST ACS_GRD TELE_GRD GRD_TELE PA-SE-SAL AICC CORPORATION TAX AICC_CTAX PA_AICC Final Accounts PA_CCPS CCPS_PROC PROC_CCPS PROC_EDIEDI_PROC

PROC_FAXPROC_EMAILPROC_MDS AP(SE)_PROC PROC_AP(SAL) AP(SAL)_PROC INV_SL MEN_INV PAY_AP AP_BACS INV_CUST PAY_BACS FCS_GRD GRD_FCS INV_GRD FCS_GL GL_FCS GRD_GL(SE) GL(SE)_GRD GRD_GL(SAL) GL(SAL)_GRD BANK_UPAY UPAY_PSION ACS_INV PAS_PROC PA_PR OC PROC_PAS PROC_PA GL_PROC PROC-GL * * * * * * * * * * * * * * * * * * * * * * * BMS AUTOCAD D&CPS MMS D&CPS_AUTOCAD Drawings PROPERTY (attack) AP-SE-SAL PM 3DMOD&VIS DES OPT FOCUS GIS MENTOR PM_CCPS Cost Phasing PM_PROC CCPS_PM Cost Plan MDS_PROC BMS_MMS DES-OPT_3DMOD&VIS 3DMOD&VIS D&CCPS Models D&CPS_GIS Drawings and object data MMS_GIS Maintenance Costs PROP_MMS MMS_PROP Maintenance Costs GIS_PROP PROP_GIS FOCUS_PROP Surveys GIS_MEN MEN_GIS PROP_EIS MMS_EIS PROC_EIS MEN_KELKEL_MEN AP_MMS PROC_AP(SE) PROP_SL * * * * * * * * * * * * * BMS AUTOCAD D&CPS MMS D&CPS_AUTOCAD Drawings PROPERTY (attack) AP-SE-SAL PM 3DMOD&VIS DES OPT FOCUS GIS MENTOR PM_CCPS Cost Phasing PM_PROC CCPS_PM Cost Plan MDS_PROC BMS_MMS DES-OPT_3DMOD&VIS 3DMOD&VIS D&CCPS Models D&CPS_GIS Drawings and object data MMS_GIS Maintenance Costs PROP_MMS MMS_PROP Maintenance Costs GIS_PROP PROP_GIS FOCUS_PROP Surveys GIS_MEN MEN_GIS PROP_EIS MMS_EIS PROC_EIS MEN_KELKEL_MEN AP_MMS PROC_AP(SE) PROP_SL * * * * * * * * * * * * * *

Customers and Sales Channels

Products and Services

Service Assurance Corporate

Marketing

Figure 6: Multi-layered data architecture

The enterprise data strategy comprises strategic initiatives such as data integration, legacy data migration, legacy system rationalization, master data management, metadata management, and business intelligence and reporting.

CSPs can integrate independent applications into an ‘enterprise data model’ to address challenges such as dynamic business processes, convergence of technology, changes in regulations, and increased competition. In addition, they can view and monitor enterprise performance by integrating and migrating data into the enterprise model, enabling rationalization of legacy systems.

Data architects design the enterprise data model, automate data capture and validation, and implement an audit trail mechanism for business-critical data. They are also responsible for:

• Entity relationship diagrams that depict the relationship of business entities across subject areas

• Data flow diagrams that display data flow between enterprise applications and databases

• Identifying data stewards across subject areas, associated business units, business processes of each business function, and enterprise applications

• Setting standards and best practices for naming conventions to define data elements in the enterprise data model

• Designing and managing a metadata repository across the enterprise. The repository stores attribute name, type, length, owner of the data element, business unit, valid values, last updated user, last updated date and time, etc. Data profiling; data modeling; Extract, Transform and Load (ETL); and reporting tools exchange metadata with the enterprise metadata repository.

(9)

Network Elements

Sales Channels Marketing Campaigns

Service Provider Third Party Service

Provider

Content Store Party Role Map

Sales / Opportunity

Credit Risk Scoring SCM Organizaon Individual Address Customer Site Customer Accounts Users Internal Transaction Finance Accounting Inter-Carrier Accounts Interaction Location Party Customer Order Trouble Ticket Physical Resource

Class of Service Partner and Profile

Product Product Catalog Agreement Service Orchestration / Authorizaon / Metadata Service Logical Resource Presence Info Contact Channel Customer

Usage and Billing Customer Profile

Billing Account Network Equipment

Sample Data Model

Figure 7: Entity relationship diagram

The aim of the enterprise data strategy must be to provide cleansed, consistent, integrated, and well-managed reference or master data. Since master data is a core component, any issue with it will affect the entire enterprise, exposing risks due to data inconsistency.

Data Security

CSPs need a robust strategy to ensure security of sensitive, business-critical data such as customer profile, payment details, contact, contracts, subscription, and product details. The data governance team must frame rules and regulations to –

• Manage the change control board that authorizes changes to the data structure for sensitive data. Frequent changes lead to an unstable business and multiple versions of the business entities.

• Enhance the confidentiality and availability of data in hard and soft copy

• Protect data from unauthorized access, modification and destruction

• Prevent improper disclosure of data

• Avoid security breach of information and related loss to business, legal implications, etc.

The data security strategy assigns stewards for all data sources, and authorizes them to grant access rights, maintain the list of authorized users and ensure accuracy of data. Stewards must also ensure that data is not duplicated in any format unless there is a business process requirement and copies are controlled across the enterprise.

(10)

Challenges in Enterprise Data Management

To manage terabytes of enterprise data in a complex landscape of legacy applications, CSPs require an enterprise data model that is compliant with the SID framework. They also need well-defined strategies for data integration, modernization of applications, migration of data from legacy applications, rationalization of legacy systems, and presenting data in the enterprise portal. It involves the following challenges:

• Duplication of data due to heterogeneous applications with independent data systems

• Data impurity due to the lack of data quality principles across the enterprise

• Non-standard data and exceptions in data range, type and length due to the absence of a metadata platform

• Multiple owners maintain data pertaining to different technologies in various data structures

• Lack of a data governance committee to enforce data quality principles and quality standards, and create awareness among source system owners about the importance of data as corporate assets

• Multiple versions of operational reports with different sub-functions and duplication of data in different formats across upstream/ downstream data systems due to uncontrolled data distribution, resulting in legal and regulatory non-compliance

Conclusion

Enterprise data management helps CSPs improve the quality of data, prevent revenue leakage and roll out methodologies for data governance, metadata management, master data management, data architecture, and data security. It also enables informed decision making through customer behavior analysis, market behavior analysis, competitor analysis, single view of enterprise data, and converged billing. However, the success of an EDM implementation depends on effective collaboration between business sub-functional heads, data architects, the CIO and CEO.

About the Author

Sivaprakasam S.R. is a Principal Technology Architect and mentors the database and business intelligence track at Infosys’ Communication, Media and Entertainment business unit. His interests include enterprise data modeling, enterprise data integration, enterprise data warehousing, enterprise data quality management, and semantic data integration. He can be reached at [email protected]

References

Related documents

Whether grown as freestanding trees or wall- trained fans, established figs should be lightly pruned twice a year: once in spring to thin out old or damaged wood and to maintain

Understand data as expected vendor their product a spreadsheet and any enterprise ample time saving can help give the amazon.. Comprehensive and their product spreadsheet

5 CDDC functions transferred into DCC, clearly branded as a business focused team, but arms length company retained to maximise any future trading opportunities 

Our approach helps your organisation deliver real performance improvement and sustainable change by helping you develop and execute a specific business performance

Potential explanations for the large and seemingly random price variation are: (i) different cost pricing methods used by hospitals, (ii) uncertainty due to frequent changes in

(char *buf, int start, int size, char *dest); Copies a data string from a source buffer to a destination buffer starting at an offset specified.. by start up to a fixed

T h e second approximation is the narrowest; this is because for the present data the sample variance is substantially smaller than would be expected, given the mean

We assume that the court maximizes ex-ante expected gains from contracting, and characterize the optimal policy, which is to void contracts in events which are deemed ex-post to