• No results found

SAS : The Complete Solution For Medical Data Analysis

N/A
N/A
Protected

Academic year: 2021

Share "SAS : The Complete Solution For Medical Data Analysis"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

SAS®: The Complete Solution For Medical Data Analysis

Diane E. Brown

Athena of North America, Indianapolis, IN

ABSTRACT

With health care costs out of control, the importance of health care information is crucial to making apropriate revisions in the health care system. A wealth of information is held by large insurance companies, but it is often stored in large databases in a format that is hard to use for analysis and reporting. This paper discusses an application that was built for employer groups, physicianlhospital organizations, and insurance companies to analyze health care data for purposes of fraud and abuse, benefit reengineering, identification of preventive health care programs, identification of case management areas, competitive pricing, and provider profiling. Technical features of this system include a relational SAS data base, SAS reporting and analysis, and a SAS/AF front end. The SAS data base contains claims tables which are indexed and several look up files to support medical coding systems and provider information. Standard SAS reports are generated by the user through parameter input including sub setting claims data prior to report generation.· SAS/ASSIST and SASIINSIGHT are available through the system for more sophisticated users. The

SASI AF front end, developed with the help of SAS Consulting Services, is entirely data driven enabling new users and new reports to be added by adding a new observation in a SAS data set. The system is Windows based and can run on a standalone PC, PC LAN, or client/server, depending on data base sizing.

INTRODUCTION

Athena of North America is a subsidiary of Associated Insurance Companies, a large health insurer which holds the Blue Cross Blue Shield license in the states ofIndiana and Kentucky and has over 150 subsidiary companies. This division of Athena of North America provides health data analysis through system and consulting services. These services are available for Medicare A, Medicare B and the private insurance market. The product being presented in this paper is OUf private product, AMDS (Athena Medical Data Systems). The market for AMDS includes employer groups, physicianlhospital groups, insurance companies, and provider networks. Employer groups can utilize the system for benefit reengineering, fraud and abuse, claims payment auditing, and the identification of preventive health care programs. Physicianlhospital groups use the system for competitive pricing and market research. Insurance companies and provider networks use the system for provider profiling. In addition, insurance companies can use the system for fraud and abuse, benefit design, case management identification, and to improve data coding and collection.

SAS INFORMATION DATA BASE

The AMDS system is supported by a SAS information data base. This data base provides a common format for all health claims regardless of the claims processing system. It

(2)

is designed to protect the user from misinterpreting or misusing the data through data reformatting, data cleansing, and the creation of analytical and categorical variables.

The SAS information data base is built from flat files and DB2 tables within Associated Insurance Companies for clients for which we currently process their health insurance claims. We also have a foreign flat file design to accept data from outside sources. Descriptive data such as medical procedure descriptions and diagnosis descriptions have been purchased or data entered. Categorical data such as procedure groupings and diagnosis groupings have been created by our medical staff. Medical intelligence such as John Hopkins case-mix relative values and diagnosis related groups (DRGS) have been purchased.

Operational data was converted to informational SAS data sets. The informational data base required complex logic to reformat the data, including selecting appropriate data, resolving data inconsistencies, transforming data values. Preprocessing logic included building a fina! claim from adjusted claims, combining claims which had been split, and the creation of analytical variables not found in the operational data. The SAS data sets are relational and indexed, but not fully normalized.

In

cases where the data required little storage and was used often, it saved response time and simplified the system to put this information on the claims files rather than normalizing.

SAS data sets were selected as the data base of choice because:

• Features of a conventional DBMS such as rollback and recovery, audit trails, transaction processing, and maintenance of referential integrity are not required. • Inserts into tables in order to maintain

referential integrity are not desired because this creates rows with missing values for non-keyed variables. It is our desire to resolve these mismatch situations.

• Complex data manipulation is required to build the SAS files from the operational data and SAS is a very good choice for this function.

• File conversion is not neccessary every time a report needs to be generated. • System performance is increased with a

SAS data base.

• It was not neccessary to hire a data base administrator.

The SAS information data base contains the following files:

1. Master Data Files

These files are used by all clients and include the following:

• Procedure Codes: This file translates CPTIHCPC procedure codes into English descriptions for reporting. • ICD9 Procedure Codes: This file

translates ICD9 pr.ocedure codes into English descriptions for reporting.

(3)

• Diagnosis Codes: This file translates ICD9 diagnosis codes into English descriptions for reporting.

• Comprehensive Procedure Groups: These are medically assigned groups of CPTIHCPC procedures for physician claims and collapse approximately 12,000 codes into 100+ groups. For institutional claims, these same groups are formed using revenue codes.

• Comprehensive Diagnosis Groups: These are medically assigned groups of ICD9 diagnosis codes which collapse approximately 15,000 codes into 100

+

groups.

• Format Library: This is a SAS format library contammg descriptions for variables such as provider specialty, place of service, type of service, relationship code, rate structure, and submission type. • Non-institutional Normative

Summaries: These files are built annually from Associated Insurance Company internal claims files and can accommidate external customer data as foreign system data is made available with sales to customers for which we do not process their health insurance claims. These files have been adjusted for John Hopkins ACGS(Arnbulatory Care Groups). These files are used for Provider Profiling to compare providers to their peers and can also be used for independent analysis.

2. Client Specific Data Files

These files contain client specific data and are built for each client separately.

Claims files typically contain a rolling twelve months of data refreshed quarterly, but can be adjusted based on client requirements. Claims files contain 40 to 50 variables, including the John Hopkins ACG relative value for case-mix adjustment. The three client specific claims files are indexed by Comprehensive Procedure Group, Comprehensive Diagnosis Group, Provider Specialty, and Provider Id. These variables are the most often used for sub setting the claims files prior to reporting. Run time is much faster when subsetting by an indexed variable.

• Institutional Inpatient Claims: This file contains hospital inpatient claims.

• Insitutional Outpatient Claims: This file contains hospital outpatient claims.

• Non-Institutional Claims: This file contains physician and supplier claims.

• Provider Master: This file contains provider information such as specialty, name, and address for those providers which had at least one claim during the period covered by the claims files.

• Group: This file contains the groups within a client's business, such as active/retiree or separate locations,

(4)

along with a description of each group.

• Membership: This file contains a record of each member and their dependents, along with demographic information for each.

REPORTS

Approximately 150 standard reports have been developed to support user data analysis.Different levels of reporting are available as shown in the following diagram:

Medical

Institutional Non-institutional

Inpatient Outpatient

Medical reporting includes all medical claims; Institutional Inpatient, Institutional Outpatient, and Non-Institutional. Institutional reporting includes both Inpatient and Outpatient. Institutional Inpatient, Institutional Outpatient, and Non-Institutional levels include only those specific type of claims.

Reports fall into the following categories: • Comprehensive Procedure Groups • ICD9 Procedure Groups

• CPTIHCPC Procedures • Revenue Codes • Provider Ranking • Provider Profiling • Diagnosis • Employer • Operational

• Detail Claim Listings

Only those report categories which are appropriate for each report level are available. This is another way in which the system directs the user in the appropriate use of the data. Many of the reports exist at multiple report levels.

The standard reports can be browsed by clicking on the BROWSE button. These are stored reports that are ran on the mainframe when the data base was created. The standard reports can be ran based on user subsetting. For each report, the user can click on SUBSET in the SAS/AF system and

a list of appropriate sub setting variables will be displayed. Note that only those variable which are appropriate to subset for a specific report will be displayed to the user. This is another way in which the system protects the user from misuse of the data. This opens up the report possibilities available to the user in a standard reporting mode to millions of combinations.

SAS/AF FRONT END

SAS/AF was used to build a user front end

to allow the user to:

(5)

• Generate standard reports through sub setting,

• Access SASfASSIST and SASfINSIGHT for custom reporting, graphics, and analysis.

SAS Consulting Services was used to develop the SASf AF system due to limited resources and availability of local SCL programmers. Their data driven approach is a unique and superior approach for a front end system. It has made system maintenance very straightforward.

The system is Windows based and incorporates features which are typical of Windows applications. It was designed to take advantage of GUI point and click features including pull downs, icons, scroll bars.

PCfLAN SYSTEM REQUIREMENTS The standalone PC or LAN approach can easily support 500,000 claim line SAS files with a report response time of 2 minutes on the average when subsetted by indexed variables. Larger data bases can be built depending on disk availability and user satisfaction with response time ..

PC recommendations for the PC or LAN server include a 486 machine, the faster the better, 12+ meg memory, VGA monitor, mouse,40-70 meg hard disk for the SAS System under Windows (depending on the SAS products selected), and 80 meg hard disk for the client with 300,000 claim lines. Software requirements include Windows 3.1, DOS 5.0, Base SAS, SASfGRAPH, and

SASfFSP. SASfASSIST is recommended and all other SAS products are optional.

CLIENTfSERVER

SYSTEM REQUIREMENTS

For the client/server environment, PC recommendations include everything shown for the PCfLAN version with the exception of the hard disk space required for the client data. In addition a modem may be required, depending on the user environment. Software for the PC include all shown for the PCILAN version, but in addition, SASfCONNECT is required and Extra for Wmdows is required for access through a modem. 'On the server side, SAS software requirements include Base SAS and SASfCONNECT. Our server is currently an IBM 3090 MVSfESA machine.

(6)

ACKNOWLEDGEMENTS

This system would not have been possible without the contributions of the following people:

• Lisa Aberson's expertise in the raw data files and knowledge of health insurance claims processing was crucial in the data base design and identification of data issues.

• Trina Houston wrote the complex code required to build the SAS files from the raw data including the data reformatting • Debbie Tinsley wrote the majority of the

reports and designed the technical infrastructure required to implement customers.

• Jim Schleusner wrote the Provider Profiling subsystem, working with very complex mathematical routines.

• Jim Owens implemented several new customers, taking the system from the mainframe enviroment to the LAN.

• Diane Zaccardelli spent many hours testing system and wrote reports.

• JeffKubea prepared the methodology to install the system in customer sites • Monica Pulos manages customer

implementation for our Medicare system.

• AI Toney and others from SAS Consulting Services, who built and taught us how to build a data driven SAS/ AF application. The system

development was on a tight schedule and AI was my body guard!

You can contact the author at: Diane E. Brown

Athena of North America 120 Monument Circle Indianapolis, IN 46204-4903 (317) 488-6105

SAS, SAS/AF, SASIFSP, SAS/CONNECT, SAS/GRAPH, SAS/ASSIST, and SASIINSIGHT are registered trademarks or trademarks of SAS Institute, Inc, Cary, NC, USA in the USA and other countries.

References

Related documents

22 The basic concept was to develop a model for increasing private investment in local companies to deploy pay-as-you-go (PAYG) solar models that would provide basic energy

In recent years, a subset of early-stage Internet companies (companies whose primary product is a website or Internet application) have been following different principles –

combination of appropriate policy instruments and show that an optimal pollution tax should be used for the negative externality and output restrictions in final production, and

However, in automatic speech recogni- tion (ASR), RNNLMs cannot be used to directly decode a speech signal since they rely on continuous representations of word histories while

Ifyou need assistance securing a contractor, please contact me for information regarding our Contractor Approved Repair Estimating Service( CARES).. You may see your current

For after school "at risk" programs to participate in the CACFP the program site must be located in a geographical area served by a school (i.e., elementary, middle, or

:3 Algernon asks Jack about the name 44444 on the inside of the

Ergo, this report discusses the major components that comprise the Android operating environment, elaborating on the Android design and architecture (the building blocks), as well