• No results found

The continuous challenge for test data management.

N/A
N/A
Protected

Academic year: 2021

Share "The continuous challenge for test data management."

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

The continuous challenge

for test data management

.

Accelerating delivery of high-quality testing data

for CI/CD pipelines

The rise of agile development and CI/CD

Testing needs to continuously keep up

The challenges of test data management

Steps to resolving testing data challenges

Summary

About K2View test data management

(2)

The shift to agile development, with Continuous Integration and Continuous Development (CI/CD) pipelines, is

accelerating the pace of innovation with increasing efficiency. For testing teams, this shift, means that test data design and provisioning need to keep up with the faster pace.

Shift-left testing works in tandem with agile software delivery in raising the standards of quality testing. With this

method, testing begins at the earliest stages of the development process rather than saving it for last. To implement this approach, test data must be prepared even earlier, and faster.

With production data fragmented across multiple enterprise systems, ensuring complete and harmonized data for testing is a constant struggle. The need to anonymize data, as required by privacy regulations, and create synthetic data to augment the existing data set, add an additional layer of complexity.

For Development Operations (DevOps) leaders and testing teams, delivering high-quality test data environments, at a rapid pace, is critical. This paper reviews the challenges faced on this journey, and the steps required to get there.

Catalyzed by the rapid growth in applications, software development has shifted gears, delivering segments in fast sprints. This agile approach – managed by DevOps teams – is characterized by much smaller-scope development project segments, which go live in weeks, as opposed to months.

Introduction

The rise of agile development and CI/CD

Agile development methodology divides long projects into smaller “sprints”. The logic is to get software increments into production as soon as possible – instead of waiting long periods for full deployment. Not only does agile development give companies a competitive edge, by getting innovative apps out to the user, sooner, but it also mitigates risk, because

(3)

Preparing quality test data has always been a challenge, especially with agile development and CI/CD. And with the growing popularity of microservices, and the integration of multiple applications, the provisioning of valid test data has become even more complex.

For a test cycle to be effective, whether it is manual or automated, availability of stable test data, that is as close to production as possible, is critical. Further, DevOps requires automation to be successful, and that automation needs data to run smoothly, and be able to access qualitative, consistent, and predictable sets of data. Adequate and on-demand test data should be available for running fully automated test suites. This test data should not constrain automated testing, to ensure seamless test data management processes.

Shift-left testing requires shift-left test data

Shift-left testing is an approach to software testing in which testing is performed earlier in the software development process. To test earlier, however, requires the earlier availability of realistic test data.

Testing needs to continuously keep up

Shift-left testing is common in agile development, where software development is divided into short-duration “sprints”. Each sprint requires its own testing cycles, and so, creating realistic test data often becomes a bottleneck – cancelling the gains of agile productivity.

(4)

Testing teams must contend with many data constraints, which typically slow down software delivery, while hindering quality, and agility.

Data accessibility

Testing teams sometimes lack access to the necessary data, or the tools to extract it. For example, data

is typically split between different data sources. A “customer” may be broken up between customer care,

billing, ordering, ticketing, and collection systems. To run functional tests on a customer in an integrative testing environment, their data must be extracted from all relevant source systems.

Data quality

In many cases, the data may be available, but it fails to meet the required quality standards for the following reasons:

Corrupted data:

Reused data often suffers from corruption problems. Relying on corrupted data could lead to severe problems that are only detected deep into the process, affecting every result.

Unmasked data:

With data privacy headlining today’s news, there’s no doubt that any production data used in testing environments must be completely masked (made unidentifiable). Any

Irrelevant or incomplete data

Applications must be tested against specific data that simulates all required scenarios. Complete data with referential integrity is paramount in preventing test scenarios from breaking down due to bad data.

Data availability

Gathering enough production data to cover the required testing scenarios is often extremely challenging. For example, testers may have a population requirement of 300, but only 50 production samples are actually available. The right TDM tools need to be able to synthesize (generate) 250 data samples, based on the real-life samples, while maintaining data integrity across all systems.

The challenges of test data management

Data synthesis is an invaluable capability, when artificially seeding a designated test population.

(5)

Steps to resolving testing data challenges

There are a number of steps companies should follow to achieve the right test data management strategy, and yield top test data sources, to create better products.

Define

Start by determining clear criteria upon which the test data collection procedure will be based. These define the data subsets that should be used in testing the use cases, including specific business entities, the volume of data required for testing, its creation date, and more. Teams can make use of an automated data catalog to

inventory and classify test data assets, and visually map information supply chains. While these criteria must be granular, they should also leave room for modification and updating moving forward.

Extract

Having established which test data is needed, it’s time to extract it from the organization’s production systems. When the required data is dispersed across many different systems and data sources, a test data management tool – that can integrate with the production systems, and extract test data according to predefined rules – comes in handy.

Refresh and sync

Testing is an iterative process. When bugs are discovered and fixed, testing should be repeated to ensure quality. A test data management strategy should provide for the means to quickly roll back the test data that was previously used – by the specific tester, for the specific use case – without impacting the test data currently being used for other tests. Companies need a test data

Mask

Any test data management strategy discussion is incomplete without ensuring adequate privacy and security measures. When dealing with production data, the challenge is to ensure data privacy, while maintaining the data’s integrity and keeping it secure. It’s critical to meet privacy compliance regulations and protect the data from breaches. Centralizing the test data from multiple sources into a test data warehouse, masking it, and securing it along the way, creates a simple and efficient process for meeting data compliance and security requirements.

Synthesize

When test teams can’t extract a sufficient volume of test data from production, they need a data synthesizing solution for artificial data creation. A test data management strategy should include the means to generate synthetic data based on real production data, while maintaining referential integrity of the data across all the systems.

Provision

After acquiring the necessary test data, generating missing data, and masking it as required, it’s time to move it to the target test environments. Test data management solutions should offer a fast and seamless path from multiple systems to multiple environments. Companies should be able to upload, adjust, and remove data scenarios and business entities at any stage throughout the process.

Generate synthetic data based on real production data, while maintaining referential integrity across all systems.

(6)

About K2View test data management

K2View Test Data Management extracts and organizes data from production systems into a test data warehouse using Digital Entity™ data schemas. The digital entity unifies all the data elements for the business entity, from all production systems, regardless of the number of underlying source systems, their technologies, and data formats. It can be a customer, location, party, device, or anyone, or anything else that’s business-critical.

The K2View test data management solution automates the process of data extraction, cleansing, and masking, as well as data encryption, and movement to target test environments – and all this is performed in minutes, instead of days or weeks.

Global 2000 companies – including AT&T, Vodafone, Sky, and Hertz – deploy K2View in weeks to deliver massive-scale data-intensive use cases.

Copyright 2021 K2View Ltd. All rights reserved. The terms digital entity, micro-database, and micro-DB are trademarks of K2View Ltd. The contents of this document are subject to change without prior warning.

Summary

At the end of the day, the move to agile development, requiring high-performance test data environments, saves enterprises millions of dollars. Test data management can improve compliance, optimize storage, reduce spending, and enhance the end-user experience.

A robust test data management strategy allows enterprises to efficiently satisfy its test data needs by supporting data profiling, analysis, governance, provisioning, generation, and privacy. Such a solution will empower development, QA and DevOps teams to provision test data themselves, in a simple automated process, on a single platform – regardless of the number systems to be tested, their respective technologies, and the number of testing environments.

DevOps methodology and moving to agile is great in theory, but you hit a roadblock if you don’t have realistic data to test against. K2View test data management provides a self-service approach for our teams to access realistic data when and where it’s needed Ward Chewning

References

Related documents

The productivity practical management model evaluation of automotive mechanical skill program in state vocational school Semarang consist of: (1) productivity practical

While former empirical work with German data has extensively analyzed the wage effect of training, none of them has accounted for the likely possibility that worker selection into

Таблиця 7 Матриця «категорія дилера – напрям збутових зусиль» ЗАТ Машзавод [розроблено авторами] Напрям збутових зусиль Пріори- тет Категорія

WHO and World Bank (2010), in their co-produced World Report on Disability, reviewed research on the use of telehealth to provide services for mental health, cardiac

In this randomized, blinded, sham-controlled study, use of an external abdominal compression device did not lower overall cecal intubation times in all patients compared with

10.6 Switch the mode selector to the desired position and set the room thermostat to the desired temperature. 10.7 Your boiler is now ready

→ Takeaway: The effect of credit crunch on exports was due to decline in general credit to finance working capital for

Assessment of parasitic mites ( Varroa spp ) infesting the colonies of African honeybees ( Apis mellifera scutellata ) was conducted in 25 districts of Tanzania