• No results found

Unsupervised Anomaly Detection in High-Dimensional Flight Data Using Convolutional Variational Auto-Encoder

N/A
N/A
Protected

Academic year: 2021

Share "Unsupervised Anomaly Detection in High-Dimensional Flight Data Using Convolutional Variational Auto-Encoder"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

Unsupervised anomaly detection in high-dimensional

flight data using Convolutional V

ariational Auto-Encoder

NASA Jet Propulsion Laboratory, March 25th, 2020

Enc ode r D ec ode r Nominal

Low reconstruction error

Anomalous

High reconstruction error

(2)

2

Motivation for unsupervised anomaly detection

Source: National Transportation Safety Board

(3)

Motivation for unsupervised anomaly detection

Accident rate of commercial flights has been cut in half since 1999.

Passenger load factor has increased to 82.5% in 2017.

ss en ge r e np la ne m en ts [1 0 M ] De pa rtur es [1 00 K ]

(4)

4

Motivation for unsupervised anomaly detection

Accident rate of commercial flights has been cut in half since 1999.

Passenger load factor has increased to 82.5% in 2017.

Creating labels for data in the aviation domain:

o requires huge effort from subject-matter experts.

o is largely expensive and impractical.

(5)

Aviation anomaly detection literature

Exceedance detection:

Comparing against the pre-defined thresholds, which are fixed by subject-matter experts.

Cons:

o complete reliance on

domain knowledge.

o can only identify

known anomalies.

Drop in airspeed > threshold

(6)

6

Aviation anomaly detection literature

Distance-based anomaly detection:

Anomalies are defined as a point in the feature space whose nearest neighbors are far

from it.

Cons:

o poor performance when dealing

with high-dimensional data

o subject to error depending on

percentage of anomalous patterns existing in the training data

(7)

Aviation anomaly detection literature

Kernel-based anomaly detection:

One-Class SVM: Finds a maximal gap hyperplane that separates data from the origin (as the only data point of the none-existent class).

Cons:

o computational complexity of the

kernel building step.

o poor performance in dealing with

high-dimensional data.

gap

(8)

8

Building upon recent advancements in machine learning

Variational Auto-encoder:

Finding a trade-off between reconstruction quality and disentanglement of the latent

space (i.e., balancing the bias and variance of the model):

𝐿 𝜃, 𝜙; 𝑥 = 𝔼)* 𝑧 𝑥 log 𝑝0 𝑥 𝑧 − 𝛽 KL 𝑞6 𝑧 𝑥 ‖𝑝(𝑧) Enc ode r Decode r Re co ns tr uc te d da ta Latent Space ! "#(% ∣ ') ) *+(' ∣ %) !, Ori gi na lda ta

Higgins et al. (2017), ICLR.

represents the

(9)

The visual illustration of the method

Enc ode r D ec ode r Nominal

Low reconstruction error

Anomalous

(10)

10

A convolutional network for anomaly detection

Convolutional Variational Auto-encoder:

Using convolutional layers (instead of recurrent) to

speed up the training process.

Using multiple filter sizes to capture local and global

temporal dependence in the time series.

Memarzadeh et al. (2020), under review.

Original time series window

Concatenate Series of Conv1D & MaxPool filter = 3 Series of Conv1D & MaxPool filter = 5 Series of Conv1D & MaxPool filter = 7 Latent Space Series of Conv1DT & UpSamp filter = 3 Series of Conv1DT & UpSamp filter = 5 Series of Conv1DT & UpSamp filter = 7 Concatenate Conv2D, filter = 1

(11)

Validation on Yahoo!’s benchmark dataset

Data set consists of real (A1) and synthetic (A2)

univariate time series as well as synthetic

multivariate time series without (A3) and with (A4)

change points. A1

(12)

12

Validation on Yahoo!’s benchmark dataset

Data set consists of real (A1) and synthetic (A2) univariate time series as well as synthetic

multivariate time series without (A3) and with (A4) change points.

We compare performance to:

o ADOPT: which is a supervised learning

approach using recurrent neural network

o Kmeans ++

o One-Class SVM (OC-SVM)

On average:

o CVAE outperforms Kmeans++ and OC-SVM

with 62% higher precision and 30% higher recall.

o CVAE has 10% lower precision and 17%

lower recall compared to ADOPT.

(13)

Flight operational quality assurance (FOQA) data

Drop in airspeed case study:

o Our goal is to identify anomalies in the first 60 seconds of commercial flight’s take-off.

o Data consists of 30,000 nominal take-offs and 1000 anomalous ones.

o Each time series consists of 17 variables measuring the roll attitude, altitude

(14)

15

Flight operational quality assurance (FOQA) data

Drop in airspeed case study:

o Our goal is to identify anomalies in the first 60 seconds of commercial flight’s take-off.

o Data consists of 30,000 nominal take-offs and 1000 anomalous ones.

o Surrogate labels for anomalies by subject matter experts: if drop in air speed is more

(15)

Anomaly detection performance on the FOQA data

CVAE outperforms Kmeans++ and OC-SVM with 24% higher precision and 26% higher

recall.

CVAE-Semi improves the performance of CVAE by 25%:

o is the semi-supervised

approach of CVAE, where

the training data is only comprised of nominal data.

(16)

17

Interpretability and explain-ability of the findings

Effect of hyper-parameter 𝛽:

Higher values of 𝛽 result in a model

that is more robust to existing of

anomalies in the training data, but

decreases the interpretability of the latent space. (A) (B) (C) (D) ! = 0.001 ! = 0.5

(17)

Interpretability and explain-ability of the findings

Effect of hyper-parameter 𝛽:

Higher values of 𝛽 result in a model

that is more robust to existing of

anomalies in the training data, but

decreases the interpretability of the latent space.

Higher values of 𝛽 also result in higher

reconstruction error.

This means that a model with very low

𝛽 can over-fit to reconstruct the anomalous patterns.

(18)

19

Concluding remarks and future work

We developed an approach to detect anomalies in an unsupervised fashion for

high-dimensional heterogenous time-series data.

Our approach significantly outperforms clustering/distance/kernel-based methods that

are common in the domain. Future works:

Develop an architecture to accommodate the binary channels of FOQA data:

o State-based anomaly detection by building the state space based on binary

variables.

Improve the explain-ability of the findings in the latent space.

References

Related documents

A large variety of cell types have been used for regenerative medicine, including adult cells, resident tissue specific stem cells, bone marrow stem cells, embryonic stem cells

[r]

Check Brake Chamber Push Rod Stroke 14 Brake Slack Adjuster Position Method 15 Automatic Slack Adjuster Templates 15 Measure the Slack Adjuster 15 Install a Threaded Clevis

• Step 3: Using a lemma about k -regular subgraph covers of r -colored unbalanced complete bipartite graphs we combine the leftover vertices with some vertices of the

*Winner of the Flight Competition. In addition to the qualification data listed in Sections 2.1.4.1 and 2.1.4.2, the winner of the Flight Competition is expected to support

lems when confronted with observations, namely the prediction of dark matter cusps in all galaxies, and in particular in dwarf irregulars, dom- inated by dark matter, or the low

The coverage of this standard package (basisverzekering) is determined by the government and includes medical care by specialists, GPs and midwives, hospital care,

The main purpose of this work is to evaluate the influen- ce of additives such as deflocculating agents on the flow and setting behavior in a system composed of alumina,