Illumination Normalization - BACKGROUND TECHNIQUES FOR FACE RECOGNITION

CHAPTER 2 BACKGROUND TECHNIQUES FOR FACE RECOGNITION

2.3 Illumination Normalization

Changes in ambient illumination, and the resulting variations to facial appearance, are known to signiﬁcantly deteriorate the performance of FR systems. Accordingly, several techniques have been proposed for illumination invariant FR (Sharma et al., 2014). Zou et al. (Zou et al., 2007b) presented a survey of techniques according to passive and active approaches. Passive

approaches focus on the visible spectrum images where face appearance has been altered by

illumination variations. They include illumination variation modeling, illumination invariant features, photometric normalisation, and 3D morphable model techniques. Active approaches employ active imaging techniques to obtain face images captured under consistent illumination condition, or images of illumination invariant modalities. Additional devices (optical ﬁlters, active illumination sources or speciﬁc sensors) are usually involved to actively obtain different modalities of face images that are insensitive to or independent of illumination change. Those modalities include 3D face information and face images in those spectra other than visible spectra, such as thermal infrared image and near-infrared hyper-spatial image.

There are three main types of techniques to produce illumination invariant facial images under the passive approaches: those applied at the pre-processing, feature extraction and classification levels (Struc and Pavesic, 2011). Pre-processing techniques seek to produce facial images that are free of illumination induced facial variations prior to feature extraction. They can be applied within any FR system, since they make no prior assumption that influences feature extraction or classification procedures. Feature extraction techniques seek to compensate for appearance variations in facial images using descriptors or representations that are stable under different illumination conditions. However, different empirical studies with LBP, Gabor wavelet-based features, and other descriptors have shown (Marcel et al., 2006) that none of

these can ensure illumination invariant FR given severe illumination changes. Classification- level techniques compensate for illumination changes according to the type of face model or classifier employed in the FR system. First, some assumptions regarding the effects of illumination on face models or classification procedure are made, and then based on these assumptions, counter measures are undertaken to obtain illumination invariant face models or illumination insensitive classification procedures. Managing the effects of illumination at the feature extraction level is debatable. Classification level techniques may impose difficult requirements on the design data. They may provide the most efficient approach to illumination invariant FR. However, a large training set must usually be acquired under a number of lighting conditions and is computationally expensive. In this section a brief overview on some of the passive illumination normalization techniques is explained.

Most of the existing passive illumination normalization techniques are related to the retinex theory (Land and McCann, 1971). The latter is based in comprehending the main process of the image formation and perception. Mathematically speaking, an image I(x,y) can be presented as a product of the reﬂectance R(x,y) and the luminance L(x,y)

I(x,y) = R(x,y).L(x,y) (2.19)

R(x,y) consists in the characteristic of the object in the image and it is based on the reﬂectivity.

Meanwhile, L(x,y) is based on the amount of illumination in the image. By this, R(x,y) is the representation of the original image that is not dependent to the illumination (i.e invariant). The luminance is considered to vary slowly with the spatial position and can therefore be estimated as a smoother version of the image I(x,y).

R(x,y) = I(x,y)

L(x,y) (2.21)

For example the Single Scale Retinex (SSR) algorithm calculates the luminance factor L(x,y) by performing smoothing with a single Gaussian ﬁlter into the image I(x,y). The reﬂectance

R(x,y) is calculated with the expression using algorithm in equation . However this technique

seems to be insufﬁcient. As a fact, images having large illumination discontinuities may have visible halo on the reﬂectance. Then a newer extension is the Adaptive Single Scale Retinex algorithm (ASSR) which is based on adapting the process of smoothing using iterative convo- lution that applies two discontinuity measures: local inhomogeneity and spatial discontinuity (Park et al., 2008). Another extension of this work was the Multi-Scale Retinex (MSR) (Jobson

et al., 1997) where in this case multiple Gaussian ﬁlters are implemented in terms of widths

and the different values of the reﬂectance for each case are then combined providing a global reﬂectance R(x,y).

Self Quotient image (Wang et al., 2004a) was proposed as another technique for combating illumination issues. It combines the image processing technique of edge preservation filtering with the theory of retinex. A newer version of this approach was introduced in the same work called Multi-Scale Image (SQI) where instead of using Gaussian filter the use of anisotropic filter was used for smoothing.

The Non-local mean (NL) consists in smoothing an image by computing at each pixel value a weighted average of surrounding pixels. The weight is a similarity function that calculates the tendency of similarity between the neighbouring pixels and the target one having constant variables. As for the newer version of this approach called Adaptive Non-local means (ANL), the smoothing parameter found in the weight function is dependent on local contrast instead of having preselected values In this case, more smoothing is performed when contrast is low at a region and vice versa. Details can be found in (Gross and Brajovic, 2003).

In the past few years recent normalization techniques have appeared such as the "Tan and Triggs" (TT) technique (Tan and Triggs, 2010). It merges multiple approaches: robust illumination normalization that contains series of stages in order to cope up with variations such as local shadowing and highlights. The stages can be resumed in applying Gamma correction as a start then ﬁltering with Difference of Gaussian (DoG), ﬁnally performing masking in order to mask out irrelevant variables. Afterwards, local textured base feature extraction Local Ternary Pattern (LTP) which is based on the principle of LBP is employed. Finally, a feature fusion technique is applied. The TT technique shows to perform well when applied to FR especially using LBP feature representation.

In document Local quality-based matching of faces for watchlist screening applications (Page 54-57)