• No results found

Is the Cost of Reliability, Maintainability, and Availability Affordable for Software Intensive Systems?

N/A
N/A
Protected

Academic year: 2021

Share "Is the Cost of Reliability, Maintainability, and Availability Affordable for Software Intensive Systems?"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

Is the Cost of Reliability,

Maintainability, and Availability

Affordable for Software Intensive

Systems?

Presented to

IEEE Software Technology Conference Long Beach, CA

Myron Hecht April, 2014

(2)

Outline

Consideration of RMA in Systems Acquisition is Required by Law

New Policies Have Been Created

Investment in RMA Pays Off

Reliability Best Practices Have Been Established

Methods for Calculating Operation and Sustainment Costs Have Been Created

Tools for Reliability Analysis, Evaluation, and Tracking Exist

– And advanced tools are being developed

So What’s the Problem?

Conclusions

(3)

Consideration of RMA in Systems Acquisition is

Required by Law

Public Law 111-123 “Weapons System Acquisition Reform Act of 2009,” Section 102 (codified as 10 USC 4 Section 139d)

– (b) Director of Systems

Engineering-– (1) APPOINTMENT- There is a Director of Systems Engineering, who shall be

appointed by the Secretary of Defense from among individuals with an expertise in systems engineering and development planning.

– . . .

– (5) DUTIES- The Director shall—

``(A) develop policies and guidance for—

``(i) the use of systems engineering principles and best practices, generally;

``(ii) the use of systems engineering approaches to enhance reliability,

availability, and maintainability on major defense acquisition programs;

``(iii) the development of systems engineering master plans for major defense acquisition programs including systems engineering considerations in support of lifecycle management and sustainability; and

``(iv) the inclusion of provisions relating to systems engineering and

(4)

New Policies Have Been Created

CJCSI 3170.01, “Operation Of The Joint Capabilities Integration And Development System,” May, 2007

– Materiel Availability and Operational Availability as Key Performance

Parameters (KPPs)

– Reliability and Ownership Costs as Key System Attributes

DoDI 5000.02 interim*, “Operation of the Defense Acquisition System,” November, 2013

– PMs for all programs shall formulate a viable Reliability, Availability, and

Maintainability (RAM) strategy….. RAM

shall be integrated within the Systems Engineering processes,

documented in the program’s Systems Engineering Plan (SEP) and Life-Cycle Sustainment Plan (LCSP), and

assessed during technical reviews, test and evaluation (T&E), and Program Support Reviews (PSRs).

(5)

Growth of Software Content in DoD Weapons Systems

Source: Scott Lucero, Office of the Deputy Undersecretary of Defense (Acquisition and Technology), “Software Sustainment Challenges in Defense Acquisition”, AIAA Engineering Conference, April 2009, http://www.acq.osd.mil/se/webinars/2009-04-21-SECIE-SW-Sustainment-Lucero-brief.pdf

(6)

Growth of Software Importance in USAF Aircraft

Platform

Year

% Functions

Controlled by

Software

F-4

1960

8%

A-7

1964

10%

F-111

1970

20%

F-15

1975

35%

F-16

1982

45%

B-2

1990

65%

F-22

2000

80%

Source: Scott Lucero, Office of the Deputy Undersecretary of Defense (Acquisition and Technology), “Software Sustainment Challenges in Defense Acquisition”, AIAA Engineering Conference, April 2009, available online at http://www.acq.osd.mil/se/webinars/2009-04-21-SECIE-SW-Sustainment-Lucero-brief.pdf

(7)

Investment in RMA Pays Off

Operations and Support

make up 65-80% of the lifecycle cost

(Predator, Global Hawk, FBCB2) cited as showing Return on Investment of Reliability Improvement of 5:1 to 128:1

DLA Study of 10-year returns of 15.5:1

Source: Ernest Seglie, (Science Advisor, DOT&E), “Investing in Reliability, Availability, and Maintainability and the Effect on Logistics, Operational Support, and Lifecycle Cost”,

(8)

8

Reliability Best Practices Have Been Established

Program plan Reliability Modeling, Allocation, Assessment FRACAS Methodology Data Contents Database Description Calculation of rates Relationship with FMEA/CA Metholodogies

Relationship with FRACAS, FMEA/CA Failure Data FMEA/CA Predicted Failure Modes Actual Failure Modes

Reliability Test Plan

Predicted Failure Modes Priorities Procedures, Methods Test Data Failure Data Methodologies Test Strategies Interrelationship with other tasks

Unanticipated Failure Modes discovered during test

Security Systems Engineering Security Systems Engineering Security System Safety Security System Safety Reliability Test Report

Source: Myron Hecht, Karen Owens, and Joanne Tagami, “Reliability-Related Requirements in Software-Intensive Systems”, Proc. 2007 Reliability and Maintainability Symposium

(9)

9

Methods for Calculating Operation and

Sustainment Costs Have Been Created

Expected number of failed items

                                          

i c i c i i c i o μ λ c i)! i!(n n! i c! μ λ i)! i!(n n! i) (c p L 1 1 1 0 1 0

 Time waiting to start repair

) ( q q L n L W                        

  i c i o q μ λ )! i!(n n! i) (c p c L L 1 0 1 q unsched q W MTTR MTBO W MTTR U      Unavailability

Source: Myron Hecht, “A Model for Determining the Cost of National Airspace System Maintenance Service Levels”, 2005 AIAA Aviation Technology, Integration, and Operations Conference

L= number of items waiting to be fixed

Wq = waiting time for items to be fixed

po = probability of an empty queue

U = unavailability

MTTR = mean time to repair

MTBO = mean time between outages

(10)

Tools for Reliability Analysis, Evaluation, and Tracking Exist

• General Math / Statistics • Maintainability

Design

• Human Factors – Maintenance Concept

Not Otherwise Categorized

Other Prediction • Detection • MTTR • Not Otherwise Categorized • Other – Reliability Centered Maintenance • Condition Monitoring • Other – Safety Testing • BIT • Other • Testability • Reliability – Availability Components • Applications – Mechanical • Data – Discrete Microcircuit Other Passive Design / Analysis • De-rating • Failure Analysis • Finite Element Analysis • FME(C)A – Design FME(C)A Process FME(C)A • FRACAS • FTA • Markov • Mechanical • Modeling / Simulation – RBD • Other • Parts Control 10 • Quality – ISO-9000 Manufacturing Not Otherwise Categorized Other Statistical Process Control TQM • Risk • Supportability

Life Cycle Cost

Logistics • Data • Maintenance Concept • Other • Spares – Referral Warranty • Thermal Analysis • Weibull – EOS / ESD • Susceptibility – Non-operating Prediction • Bayesian • CNET • Mechanical • MIL-HDBK-217 • Non-operating • Other • 217 • Telcordia (Bellcore) – Screening • Burn-in • ESS • Other – Software Reliability System • Data – Testing • Accelerated Life –

Source: The Reliability Information Analysis Center, Software Tools Page

(11)

…and advanced tool sets are being developed

Model Driven Design and Analysis Tool Set

(Aerospace Corp)

Architecture and Error Models

Stochastic Network Model

(12)

…and advanced tool sets are being developed

ASSERT* (European Space Agency)

• Source – European Space Agency and Collaborating Tool Developers • ESA Autonomous Transfer Vehicle Control System Development Example – Specify arbitration logic

– Define finite state

machine (using SDL)

– Verify (check for

exceptions, deadlocks, stop conditions) – Autogenerate code – Deploy on runtime environment

System Design Language

Finite State

Machine

E. Conquet, M. Perrotin, C. Colombo, M.-A.Esteve, “Results of ASSERT and follow-up studies: technical status and exploitation activities”, AADL User Day, 27 May 2009

http://www.assert-project.net 12

* Automated proof-based System and Software Engineering for Real-Time applications

(13)

So What’s the Problem?

Programs not managing to requirements

Lack of incentives

– RMA Efforts have costs

Impact on design

Impact on cost and schedule

Addition of uncertainty – particularly if RMA group is less competent than Government oversight

– No discernible benefit

Insufficient EVM and cost accounting techniques

Programmatic Attention Elsewhere

Poor Systems Engineering

*Source: Ernest Seglie, (Science Advisor, DOT&E), “Investing in Reliability, Availability, and Maintainability and the Effect on Logistics, Operational Support, and Lifecycle Cost”, http://www.gardenstatesole.org/events/symposium08/seglie.pdf

(14)

What to do

Accurate Valuation Of RMA-related Benefits As Well As Costs In DoD Affordability Initiatives

Effective Earned Value Tracking Of RMA Engineering Tasks

Early and Frequent Feedback From RMA Into The Design And Implementation Activities

(15)

Valuation Of RMA-related Benefits As Well As Costs In

DoD Affordability Initiatives

Cost:

– NRE in engineering and analysis

Benefits

– Greater operational or mission effectiveness

– Fewer units required (fleet availability is higher when unit availability is

higher)

– Smaller sustainment organization

(16)

Accurate Earned Value Tracking Of RMA Engineering Tasks

Task Program Milestone Review Proportion of Earned Value*

Reliability Program Plan System Requirements

Review (SRR) 5%

Reliability/Availability Allocations System Design Review

(SDR) 5%

Preliminary Reliability Predictions and Failure Modes and Effects Analysis (FMEA)

Preliminary Design Review

(PDR) 5%

Final Reliability Models Predictions and FMEA

Critical Design Review

(CDR) 20%

Failure Reporting and Corrective Action System (FRACAS)

Test Readiness Review

(TRR) 10%

Life, Stability and Failure Recovery testing(may be part of Day in the Life Testing) and Maintainability Testing Complete

Functional Configuration Audit (FCA)

30%

Requirements Verification Complete Physical Configuration Audit

(PCA) 25%

16

(17)

Recommended RMA Activities in a Software Intensive

Development Program

17 Definition of Degraded Modes, Acceptable Outage Durations, Maintenance Concepts

R&M Testing (incl. SW)

Redundancy, Failure Containment, Degraded Modes, FMEA* (initial) R&M Requirements Allocation

(incl. SW)

Failure/Recovery Testing

FD&R** Design, FMEA, predictions, FMEA (detailed)

FD&R Implementation, Static Analysis (coding standard conformance), FMEA revisions

DR Data tracking, ID of unanticipated failure modes,

FD&R verification * Failure Modes and Effects Analysis

(18)

18

Next Steps

Valuation techniques to link between RMA engineering to affordability

– Methods of measurement and accounting of operational suitability,

effectiveness, and sustainment engineering for acquisition managers

– Is the link really as strong as congress, the GAO, and RMA advocates

believe?

Methods to assess the RMA-related Key Performance Parameters (KPPs) and Key System Attributes (KSAs) during development

– Metrics – Models

Methods to identify operations and sustainment costs

– Access and analysis of operations and maintenance databases – Methods to improve data recording and collection

Policies and metrics linked to award fees for incentivizing contractors to invest in RMA engineering

– Linking operational and sustainment costs to award fees and past

References

Related documents

This white paper describes an example of the configuration of a client, a proxy service, and a business service communicating over firewalls and a DMZ, using inbound and

From the results of the study, it can be concluded that the organizational culture has a positive and statistically significant effect on the performance of employees of

Abstract In this paper we report results of a series of simulation experiments aimed at comparing the behavior of different similarity indexes proposed in the literature for

I Amandeep Kaur (Research Scholar, English) had taken the interviews of the author Dalip Kaur Tiwana on 4 th. Feb 2015 and 17 th Feb 2019 respectively at her

Asylum law in the EU after the Amsterdam Trea- Asylum law in the EU after the Amsterdam Trea- ty includes the following legal norms: Council Reg- ulation 343/2003 establishing

By arranging opportunities for collaboration, TERENA offers a forum in which people from all organisations can gather together, share knowledge, and develop new services,

DQ issues for Data Warehouses (5) Role of Semantics and Ontologies for DQ (6) Data Quality Assessment and Error Detection (7) DQ Rules and Constraints (8) Record Linkage,

– a quasi-equity injection in the form of a EUR 300m Perpetual Subordinated PIK Convertible Bond fully subscribed by Etihad Airways to be funded in 3 tranches of EUR 100m in Q2, Q3