• No results found

Pixel classification by morphological granulometric features

N/A
N/A
Protected

Academic year: 2019

Share "Pixel classification by morphological granulometric features"

Copied!
101
0
0

Loading.... (view fulltext now)

Full text

(1)

Rochester Institute of Technology

RIT Scholar Works

Theses

Thesis/Dissertation Collections

5-1-1991

Pixel classification by morphological granulometric

features

John T. Newell

Follow this and additional works at:

http://scholarworks.rit.edu/theses

This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please [email protected].

Recommended Citation

(2)

Pixel Classification by Morphological Granulometric Features

by

John T. Newell, ill

Rochester Institute of Technology

Center for Imaging Science

May 1, 1991

A thesis submitted in partial fulfillment of the requirements for the degree of Master of

Science in the Center for Imaging Science in the College of Graphic Arts and Photography

of the Rochester Institute of Technology

Signature of Author _

Center for Imaging Science

Approved by

Mendi Vaez-Pravani

(3)

THESIS RELEASE PERMISSION FORM

ROCHESTER INSTITUTE OF TECHNOLOGY

COLLEGE OF GRAPHIC ARTS AND PHOTOGRAPHY

CENTER FOR IMAGING SCIENCE

Pixel Classification by Morphological Granulometric Features

I, JohnT. Newell, III, hereby grant permission to the Wallace Memorial Library of R.I.T.

to reproduce my thesis in whole orinpan. Any reproduction will not befor conunercial

use or profit.

(4)

Abstract

Pixelclassificationsystemsrely on a certain set offeaturesthatare sufficientto

classify a givenpixelintoa classdefinedwithinadatabase. Unlike brightnessand spectral signaturefeatures commonlyusedinremotesensingapplications,texture-basedfeatures

cannotbe definedfora single pixel andmustbe derived froman area or window

surroundingthatpixel. Inthis research,allfeaturesarederived frombinarymorphological granulometries. Oncegenerated, thesefeaturescomprise adatabasewhich can beusedto

classify images. A Gaussian Maximum Likelihood Classifier is trainedwiththisdatabase forsubsequentclassification ofboth dependentandindependent data. Severalaspects of

thesetexture-basefeaturesrequireinvestigation inordertodeterminetheirabilityto

distinguish imagetextures. Three importantaspectsareaddressedinthis study; the effects

of maximumnoise,theoptimalsize ofthelocalizedwindow, andtheminimum number of optimalfeaturesrequiredforaccurate classification. Astatistical approachhas beentaken todetermine theclassificationaccuracywithvaryingwindow size,varyingnumberof

features,andvaryingamounts offourtypesof maximumnoise usinggranulometric

features. Analysisoftheseinvestigationsindicate fourmainresults. First,classification

accuracy in theabsence of noiseisextremely high. Second,forthese texturesatthespatial resolutionof75dpi,classificationaccuracy decreasesdramaticallybelowawindowsizeof

11x11 pixels. Third,thenumberoffeaturesneededfor highclassificationaccuracycanbe

reducedtoafairlysmall number ontheorderof6features. Finally,thesefeaturesare

generallyrobustinthepresenceofmaximumnoiseifthetypeand amount of noise canbe

(5)

Acknowledgements

I wouldliketo thankJeff B. Pelz for his helpand computer programswhich were of

tremendoushelpincompletingthis thesiseventhoughtheywere writtenin Pascal.

I wouldalsoliketo thankWendyRosenblumfor heralgorithms,code and afineexample

of a well writtenthesis.

Finally, Iwishto thank Kaleen Moriartyfor her immeasurablefriendship which hasseen

me throughtheheavenandhellofRIT. Thanks forkeepingme smilingthrough thetough

(6)

Dedication

Thisthesisis dedicatedtoJohn T. Newell,Jr.andAnne W. Newell fortheirlove,

confidence andsupportthroughoutmy collegeeducationandforinstillinginmethepride

(7)

Table of Contents

TableofContents vii

ListofFigures ix

ListofTables xi

1.0 Introduction 1

1. 1 MorphologicalGranulometries 1

1.1.1 Opening 1

1.1.2 Granulometries 3

1.1.3 LocalGranulometries 7

1.2 Image Texture 9

1.3 Image SegmentationUsingGranulometric Features 10

1.3.1 Segmentation 10

1.3.2 UseofGranulometricFeature for Segmentation 10

1.3.3 Higher OrderMoment Features 1 1

1.3.4 StructuringElementsandDerivationofOther

Granulometric Features 1 1

1.4 Image ClassificationandDiscriminant Analysis 13

1.4.1 Classification 13

1.4.2 FeatureProbabilityDistributions 1 4

1.4.3 Maximum Likelihood Classification 1 5

1.4.4 Gaussian Maximum LikelihoodClassification 16

1.5 Minimal WindowSize 19

1.6 Optimal Feature Selection 20

1.6.1 FeatureReduction 20

1.6.2 Mahalanobis-LikeDistance Measure 20

1.6.3 Divergence Measure 22

(8)

1.7 Noise 27

1.7.1 MaximumNoise 27

1.7.2 Point Noise 27

1.7.3 OcclusionNoise 28

1.7.4 Scratch Noise 28

1.7.5 Spaghetti Noise 29

2.0 Statement of Work 30

2.1 SelectionofTexture Images 30

2.2 ThresholdingofTexture Images 33

2.3 GenerationofNoise 36

2.4 GenerationandSelectionofLocalGranulometric Features 42 2.5 ClassificationofDependentandIndependent Data 44

3.0 Analysis of Results 46

3.1 Dependent Classification 46

3.2 Independent Classification 47

3.3 Minimal Window Size Determination 50

3.4 Optimal Feature Selection 52

3.5 ClassificationwithMaximumNoise 56

3.5.1 Dependent Classification 56

3.5.2 Independent Classification 57

3.5.3 CombinationsofNoise Models 63

3.6 Noise Estimation 75

4.0 Conclusions 79

4. 1 Suggestions for Future Work 81

5.0 References 83

Appendix A 86

(9)

List of Figures

Figure 1: ImageS andstructuringelementE 2

Figure 2: Open (S,E); Openingofimage S by

structuringelementE 2 Figure 3: Simulatedbinarygranulometryresultantimages 5 Figure 4: ^(k), <P(k)andd<D(k)fromthesimulatedimagegranulometry 6

Figure 5: ExampledOx(k) probability distribution 8

Figure 6: Feature Zvaluedistribution fortwoclasses 14

Figure 7: Maximum likelihooddecisionboundary 16

Figure 8: Feature setsforclass separability 24

Figure9: Textureimages 31

Figure 9: Binarytextureimages 34

Figure 10: Examplesofbinary noiseimages 40

Figure 12: ClassificationAccuracyvs.Window Size 50 Figure 13: ClassificationAccuracyvs. NumberofOptimalFeatures 53

Figure 14: ClassificationAccuracyvs. % Point Noise 58 Figure 15: ClassificationAccuracyvs. % Spaghetti Noise 58

Figure 16: ClassificationAccuracyvs. % OcclusionNoise 59

Figure 17: ClassificationAccuracyvs. % Scratch Noise 59

Figure 18: ClassificationAccuracyinthepresence ofHorizontal,Fixedand

Random Scratch Noise 62

Figure 19: ClassificationAccuracy withcombinations ofnoise models 64

Figure 20: Feature distributions for Circular PSSD 67

(10)

Figure 22: Feature distributions forPositiveDiagonal PSM 68

Figure 23: Probabilitydistributions for Circular PSSD 69

Figure 24: Probabilitydistributions for Negative Diagonal PSSD 69

Figure 25: Probabilitydistributions for Positive Diagonal PSM 70

Figure 26: Optimal Feature Classification in Point Noise 73

Figure27: Optimal Feature Classification in Spaghetti Noise 73

Figure 28: Optimal Feature Classification in Occlusion Noise 74

Figure 29: Optimal Feature Classification in Scratch Noise 74

(11)

List of Tables

Table 1: Classificationofdependentdata 46

Table 2: Classificationofindependentdata 48

Table 3: Classificationofindependentdata usingpooled covariance 49

Table 4: OptimalFeature Sets using Rosenblum Optimization 55

Table 5: Classificationofdependenttexture-plus-noisedata 56

Table 6: Classificationofindependent data in 5%pointnoise 66

Table 7: Classificationofindependent data in 10% pointnoise 66

Table8: Classificationofdatawith 10%point noise aftertrainingwith5%

point noise 76

Table 9: Classificationofdatawith5% point noise aftertrainingwith 10%

(12)

1.0 Introduction

1.1 Morphological Granulometries

Morphologicalgranulometries wereconceivedbyMatheron [1975] asatypeof"sieving"

operation forbinaryimagesinwhichparticlesinthe imagestructurearefiltered according

to theirsize. Quantificationoftherate atwhichanimageisalteredin thesievingprocess

producesa numericalsizedistribution containing image textureinformation. Binary

granulometriesaregeneratedbysuccessively openingabinary imagebyan increasing

sequence of convexbinarystructuringelements. The imageswhich makeupa setthe

structuringelementsequence are of a specific shape (i.e. linecircle, square,etc.)andthe

texturalinformationwhich can begatheredfromagranulometry isspecific to theshape of

thestructuringelement sequence.

1.1.1 Opening

The openingofabinaryimage S byabinary structuringelementE is definedtobethe

unionof all translationsofEwhich are subsets ofS. Rigorously,x e OPEN(S,E)ifand

only ifthereis sometranslate(E+z)ofEsuchthatx e (E+z) c S. Considertheexample

ofabinarydigitalimageSandthe threepixelhorizontal structuringelementErepresented

(13)

Image S StructuringElement E

* \ \ \ * * *

***** i i

11111111 111

* 1 1 * 1 1 *

1111*1*

11*11*1

Figure 1: Image S and structuringelementE.

Theones represent activated pixelsandthestars areconsideredundefined or non-activated

pixels. Allpixels outside animageare also considered non-activated. Toopenimage S by

structuringelementE, theorigin ofE istranslated toeach pixelin S. Wherever E entirely

fitsoveractivated pixelsinS,all pixelsintheresulting image Open(SE)areactivated. See

Figure 2.

Open(S^)

* 1 i |****

*****jjl 11111111

********

********

Figure 2: Open(S,E); OpeningofimageSbystructuringelementE

Since Ewillfitover allpixelsinthe third row, theentire rowisactivatedinOpen(S).

Noticethat thelastpixelinthefirstrow isactivatedin imageS butnotin imageOpen(SE)

(14)

rows. Becauseofthe size andthe shapeofthe structuringelementinthisparticular

example, any horizontalrunlengthof3or more pixelswillbeactivated.

1.1.2 Granulometries

From thedefinition of anopening, itfollowsthatwhen OPEN(F,E)=F,

OPEN(SE) is a

subimageof OPEN(S,E). Asaresult, ifEi, E2, E3,... isan increasingsequenceof

structuringelements suchthatOPEN(Ek+1, E^=

Ek+1 , then thefiltered imagesforma

decreasingsequence

OPEN(S,Ei) z> OPEN(S,E2) 3 ...

Countingthenumber ofpixelsremaining in eachsucceeding openingresultsina

decreasingfunction^(k),such thatforsome K,*F(k)=0 for k>K.

Dependingon the

shapeofthestructuringelements,varioustexturalinformation isrevealedbystudyingthe function^(k). Theimage sequence {OPEN(S,Ek)} iscalled agranulometryandthe

resulting function^(k) iscalledthesizedistribution. Inpractice,Ex consistsof asingle pixel sothat(1)givesthe totalnumberof activatedpixelsin S.

Since4*(k) isdecreasing,thenormalizationof^(k)isaprobability distribution function

givenbyEquation 1.

(15)

Thediscretederivative, dO(k), isadiscreteprobabilitydensityfunction. It has become

populartorefertothisnormalizedgranulometric-sizedistribution

densityasthepattern

spectrumoftheimages. Thisdistributionrevealstheparticle sizedistributionoftheimage fromwhichit iscalculatedand canbedescribedbyitsmoments. Themoments ofthe

patternspectrum canthen beusedtodescribetextureinformation.

Figure 3ashows asimulatedbinary imagemadeupoffoursizedisksofdiameters 4, 7,

15,and31 pixels. Whentheimageisopenedwitha seriesof circularstructuringelements

Ekofdiameter 1 through4, theresultantimage isunchanged. However, whentheimage is

opened with acircularstructuringelement ofdiameter5,thedisksofdiameter4arefiltered

out ofthe image, leavingtheimage shownin Figure 3b. Opening thisimagewith elements

ofdiameters 6and7 produce nofurtherchangeintheoutputimages. Whenthe image is

opened withdiameter8,thedisksofdiameter 7arefilteredout oftheimage resulting in

Figure 3c. Again,there isnochangeintheoutputimageuntilthestructuringelement

diameterreaches 16pixelsandthedisksofdiameter 15arefilteredoutasshownin Figure

3d. Finally, when thestructuringelement sequencereaches32, allthedisks have been

filteredoutresulting inanullimage. (Itshouldbenotedthat theremay be somedigitization

(16)

Figure3: Simulatedbinarygranulometryresultantimages

a) originalandOPEN(S^i) throughOPEN(S34)

b) OPEN(S^5) throughOPEN(S7)

c) OPEN(S,E8) throughOPEN(S,Ei5)

(17)

6000

<D(k)

dtD(k)

(18)

The^(k),O(k)anddO(k)distributionsfromthe simulatedimage

granulometryare shown

in Figure 4. Allthree parametersarefunctionsofthediameter, k,ofthecircularstructuring

elements. It isimportanttonotethat thesedistributionsarebasedon a pixel count ofthe filteredimage,ratherthana particle count.

1.1.3 LocalGranulometries

A local granulometry isan extension ofthisconceptdescribingtheparticlesizedistribution in a given neighborhood or window aboutsomepixel x. ^(k)is thenthepixel count

withina windowcenteredon pixelx, ratherthan thepixelcountovertheentireimage. In

ordertomaintainlarge-scaletexturalinformation, theimageisopenedgloballyandthe pixel countisperformedlocally. Inthe samemannerdescribed fora globalgranulometry, thenormalizedprobability distribution <Dx(k) iscalculatedfromthelocalsizedistribution

<Dx(k) = l-x(k)/x(l) (2)

foreach point x intheimage. Thediscretederivative,dOx(k),definestheprobability

densityaboutthepixel x. d$x(k)isthen thelocalpatternspectrum atx. Theresultofthe

binarylocal granulometrywith a given windowsizeisa one-dimensionalprobability

densityateach(ij)pixellocation intheimage. These probability densitiesserve as

(19)

0.20-1

0.15

-dO

0.10-Figure 5: Exampled<Dx(k)probabiUty distribution

This distributionisarobust,but impractical descriptorofthe localtexture. However,the

moments canbe usedtodescribe thedistributionand canbeused as a much more practical

descriptorofthelocalimagetexture. The localgranulometricmean,standarddeviation,

varianceand skewness canbeusedasvaluable texturedescriptorsforimagesegmentation

and classification[Doughertyetal, 1990]. Since thesemoments arederived fromrandom

(20)

1.2 Image Texture

Image texture andtexturalinformation have been studiedfor manyyears. Lewis [1971]

illustrated howtexturerelatestogeomorphology using K-bandradarimageryofplains,

lowhills,highhills, and mountainsinthe PanamaandColumbiaarea. Haralickand

Anderson[1971] illustratedhow texturerelatestolandusecategories. Suttonand Hall

[1976] usedtexturemeasuresforautomatedclassification ofpulmonary disease.

Rosenblum[1990] demonstratedtheclassificationaccuracy increaseof aerialimageryby

the addition oftextural featurestoa multi-spectral classificationdata base.

Textureisadescriptionofthespatialdistributionand spatialdependence amongthegrey

tones [Rosenblum, 1990]. Itcan be describedbyperceptualdescriptorssuch as "fine",

"smooth", "coarse", "mottled", "lineated"or "irregular". It may also be described interms

of a pattern madeupof repeatedtextureprimitives[Nevatia, 1982]. Atexture image Jcan

therefore be thoughtofas atransformfromonebandof a spectralimage I inwhichJ(i,j) is

afunctionofI(i,j)andneighboringpixels[Haralick, 1979]. Atexturemeasureata point of

animageissome functionoftheobserved values within alocalneighborhood aboutthe

point[Ahuja, 1983]. Granulometriesusea structural approachtoanalyze visual scenesin

termsof organizationandrelationshipsamong its substructures[Haralick, 1986].

Granulometric features describe imagetexturesintermsofthesizedistributionsofthe

(21)

1.3 Image Segmentation Using Granulometric Features

1.3.1 Segmentation

Oneofthereasonsbehindthedevelopmentofimage processing has beentheneedto

identifydifferentobjectsorregionswithina givenimage. Withinthe studyofimage

texturehas beenthedevelopmentofalgorithmsforsegmentationbasedonimagetexture.

The intuitive ideabehindimagesegmentationistodividetheimageintosegmentssuchthat

each segmentis homogeneous insomesense andtwoneighboring segmentsdiffer from

one anotherin thesamesense [Kashyap,1986]. Segmentation isaccomplishedby

separatingtwoor morehomogeneousregions whichhaveasignificantstatistical

difference. Sincethepixel valuesof abinarytextureregions areinherently

non-homogeneous, texture measures needtobeassignedtoeach pixel forsubsequent

segmentation.

1.3.2 UseofGranulometric Feature for Segmentation

DoughertyandPelz [1989]developed bothadeterministic andanondeterministic model of

image segmentationusingtexturemeasuresderived frommorphological granulometries.

Usingthedeterministicmodel, an imagecomprisedoftwodifferentsizediscswas

segmentedbyusingthemeanofthelocalcircular granulometry. Agranulometric-mean

imagewas generatedbyassigning thislocalcircular pattern spectrum mean(PSM) toeach

pointx of animage. Eachpixelintheresulting imagewasthereforea measureofthelocal

(22)

Thisgrey-scaleimageofmean values wasthen

successfullysegmentedbythresholdingthe

image.

1.3.3 Higher Order MomentFeatures

Ifthelocal PSMoftwo textureregionsisnotsufficientlydifferenttoallowsegmentation,

higherorder momentsofthelocalpattern spectrummay beemployed. The localpattern

spectrumstandarddeviation(PSSD), variance(PSV),and skewness(PSS)can beusedas

texture measurestosegmentanimage. DoughertyandPelz[1989] used thepattern

spectrumvariance(PSV)tosegmentanimageinwhichtwo textureregionshadsimilar

PSMs. Byviewingahomogeneoustextureas a population ofpixels, alllocal

granulometric moments can be interpretedasrealizationsof random variables which are

characteristic oftheimage texture. Theserandom variablespossessprobability

distributions indicativeof an imagetexture.

1.3.4 StructuringElementsandDerivationofOtherGranulometric Features

In additiontobeing texturedependent,all moments of a patternspectrum are specificto the

structuringelements usedtogeneratethespectrum. Manydifferenttypesofstructuring

elementshave beenusedtogenerateapattern spectrum ofanimage. Circular,elliptical and

linear structuringelementshave commonly beenemployedtogenerate granulometric

texturemeasures. Linearandnon-linear combinationsofthemomentscanalsobeusedas

localtexture measures. Dougherty, Kraus, andPelz[1989] introducedthreesuch

(23)

four lineargranulometries; horizontal,vertical,positive-diagonal(45)and

negative-diagonal(135),andMaxLinasthemaximumPSMofthe samefour lineargranulometries.

AveLinisan exampleof alinearcombination whereasMaxLinisanexample ofanon

linearcombination. Linearityis ascale-invariantfeature definedasMaxLindividedbythe

PSMofthecirculargranulometry. Thisratio will resultina value of1 for anycircular

imageelement regardless ofdiameter. Elongated imageelements will producehigher

(24)

1.4 Image Classification and Discriminant Analysis

1.4.1 Classification

Imageclassificationistheprocessofassigningapixeltooneof a number ofpossible classes onthebasisofsomeobservationsmade onfeaturesofthatpixeland/orits

surround. It isadecision makingprocess which uses statisticaldecision theorytomake an intelligentestimateoftheclass towhich a pixelbelongs [Schowengerdt,1983]. In

supervisedclassification,asampleof eachclassis takenforeach observationandthe

statisticaldistributionofthe elementsineach class areanalyzed. Fromthatinformation,the

classification algorithm reaches adecision abouthowtoassignpixelsnotinthesampleto

theappropriateclasses. Schowengerdt[1983] recommendsthat 10to 100pixels be includedpertrainingclasswithmorepixelsforthoseclasses with highervariability.

Pixelclassificationhaslongbeenanintegralpart of remote sensingandotherimage

processingapplications. Spectraland radiometricdata fromaerial and satelliteimages have

been usedasfeaturestoclassify specificregionsoftheseimagesaccordingtosome

predescribedcriterion. Linearcombinations ofthisdata, suchastheredtogreenbandratio

inmultispectralaerial images,have alsobeenfoundtoemphasizedifferences inground

cover typesandcharacteristics of particularinterest Manyapproachestoclassification

basedontexturehave been developedovertheyears. Someofthese approachesinclude

the useoffeatures derived from firstorderstatistics,spectral powerdensityfunctions,

autocorrelationfunctions,and grey-tonerun-lengthdistributions. Oneofthemost

(25)

measuretherelativefrequencieswith which twopixelvalues,withacertain separation,

occurinanimage [HaralickandAnderson,1971].

1.4.2 FeatureProbabilityDistributions

Anynumber oftheseor otherfeaturescanbeusedtoclassifyan image intosomepreset

number ofclasses. Eachclasswillhavethe samenumberoffeatures inafeaturevector.

The distributionof valuesfor anyonefeaturefora givenclasshasacertaindistribution

which can beusedtodecidetheclassification of an unknown pixel. Considerthe two

distributionsof somefeature Z in Figure 6. Each distributioncorrespondstoa separate

class.

class 1

P(xI i)

featureZ

Figure6: Feature Zvaluedistributionfortwoclasses

Thearea under eachofthesedistributioncurvesisnormalizedto 1.0andtheyare assumed

toapproximatethefeatureprobabilitydensityfunctionsof each class. Thesefunctionscan

(26)

belongstoclassi. The probabilityoffindingafeaturevalue of xgiventhatweare

sampling fromclassi is givenas p(x Ii). Thediscriminant functionisdefinedasthe

probabilityof a pixel belongingto classigiventhatithasafeaturevalue x or p(ilx).

p(ilx) =

p(xli)p(i)/p(x) (3)

wherep(i) isthea prioriprobabilitythatclassiexistsintheimageandp(x)isthe

probabilityoffindingapixelfrom anyclass.

Assumingthateachclasshasan equalprobabilityofoccurring, p(i)willbeequalforeach

class. Thevalueforp(x) is simplyanormalizing factorandthereforeaconstantforeach

class. TheaUscriminantfunction isthensimply a calculation of p(xIi)foreach class.

1.4.3 Maximum Likelihood Classification

Maximum likelihoodclassification comparesthediscriminantfunctionvalueforeach

featurevalue xcalculatedforeachclass and assignsthepixelto theclasswhichproduces

thehighest probabilityvalue. Forexample,considerapixel with afeatureZvalue ofx, as

shownin Figure6. Sincethecalculated value ofthediscriminantfunction is greaterfor

class 1 thanforclass 2, thepixel wouldbeassignedorclassifiedintoclass 1.

The decision boundaryforclassification,d,liesatthepointatwhichthe twodistributions

(27)

class 1

P(xI i)

feature Z

Figure 7: Maximum likelihood decisionboundary

Anypixel with a feature Zvalueless thandwouldbeassignedtoclass 1, sincethereisa

higher probabilityofthisvaluecoming fromclass 1 thancoming fromclass2. Likewise,

anypixel with afeature Zvalue greaterthandwouldbeassignedtoclass2. Thetotalerror

inthisclassification isrepresentedbytheoverlapofthe twodistributionswhichis shown

astheshaded region. Thiserroris minimizedbyplacingthedecision boundaryat thepoint

at which p(xI 1)isequaltop(xI 2),This decisionboundaryisrepresentedbyd in Figure

7. Notice thatifthedecisionboundary wasmovedineitherdirection,theerror would

increase.

1.4.4 Gaussian Maximum Likelihood Classification

This is thesimplest exampleofclassificationsincethereis onlyonefeature andonly two

classes. However, thesame principles canbeextendedtomore complicated classification

modelsin whichthereareanynumberoffeaturesand classes. Themostcommonlyused

(28)

mean vectors and covariancematricesoftheclasses are requiredtocompute the

class-conditionaldensityfunctions. Thisclassifier requires thedistributionoffeatureswithin

eachclass tobe approximatelymultivariatenormal. However,theclassifieris"relatively

tolerant"

ofdeviations from normality [seeSwain, 1986].

Adiscriminant function is developedusingameasure ofthegeneralizedsquareddistance

fromthemean vector. Theclassificationcriterioncanbe basedon eithertheindividual

within-class covariance matrices or a pooledcovariancematrix. Aswiththesinglevariable

case,each unknownpixelisclassifiedintotheclassfromwhichit hasthesmallest

generalizedsquareddistance.

Thegeneralized squareddistance fromx toclass tis

Dt2(x) =

gi(x,t) + g2(t) (4)

where xis thevectorcontainingthefeaturevalues of an unknown pixel and

tisasubscripttodistinguishtheclasses.

Ifthewithin-class covariance matricesareusedthen

gl(x,t) = (x

-mO' Sf1 (x

-mt) + lnlStl (5)

where mt isthevectorcontaining themeans ofthefeaturesof an unknownpixel

(29)

Stisthe covariance matrixwithin-class t.

Ifthe pooled covariance matrixisusedthen

gl(x,t) = (x

-mO' S-1 (x

-mt) (6)

where S isthepooledcovariancematrix.

Ifthea priori probabilitiesareallequal,g2(t)iszero. However,iftheyare not all equal

g2(t) =

-21n(qt) (7)

(30)

1.5 Minimal Window Size

The sizeofthewindowforthelocalgranulometriescan havea significant effectonthe

distributionsofthe granulometricfeatures. Dougherty,PelzandNewell[ 1990]

demonstratedthat thevarianceofthe granulometricfeature distributions decreaseswith

increasingwindow size. Decreasing thevariancedecreasestheamount ofprobability

overlap betweenclasses soclassificationaccuracycanbeimprovedby increasingthe

windowsize. However, increasingthewindowsizecan also makeit hardertodetermine

theborder between adjacenttextureregions andleadtomisclassificationofpixelswhose

surroundincludes 2ormoretextureclasses. Generallyspeaking,largerwindowsdecrease

variabilityofthegranulometricfeaturesatthecost ofless detailedsegmentation.

Afeaturevaluefromalocal granulometry is onlyan accurate representation of a given

imagetextureiftheentire window usedtogeneratethatfeaturelieswithinthe texture

region. Otherwise,thefeaturevalue canbeaffectedbyotherimagetexturesandtherefore

represent a combination of anumber ofimagetextures. Apixellying neartheedgeoftwo

textureregions shouldthereforebeunclassified. Giventwoadjacenttextureregionsinan

image, and awindowan oddnumber,x,pixelsinlength,the number of unclassified pixels

(31)

1.6 Optimal Feature Selection

1.6.1 Feature Reduction

Thekeystep in anyclassification problemis thechoiceofaset offeatureswhichreduces

thedimensionofdatatoacomputationallytractablelevel whilepreservingmuch ofthe

classifying information presentintheactualdata[Kashyap, 1986]. Thenumber offeatures

used intheclassification shouldstillgive aminimalprobabilityofmis-classification [Fu,

1976]. Featureswhichdonotaddtoclassificationaccuracyrepresent a costsince,witha

maximumlikelihoodclassifier, the timeneededtomake a calculationincreases quadratically

with theaddition offeatures [Richards, 1986].

Inrecent yearstherehasbeen much attention paidtodeterminingan optimal set of m

features outof atotal set ofNfeatureswithoutsignificantlydegradingtheclassification

abilityofthealgorithm. Thesetechniquesfor featureselection attempttomeasurethe

separabilityoftheclassesforeach combination of mfeaturesout ofthetotal set offeatures.

The subset withthemostpotentialforcorrect classificationissubsequentlyselectedforuse

intheclassifier.

1.6.2 Mahalanobis-LikeDistance Measure

Thesimplesttechniquesoffeatureselectionusetheseparationofthefeaturemeansin

multidimensionalspace. However,thisapproachmayresultina set offeatureswhichare

(32)

developedby Schott, Salvaggio,andKraus [1988], usesthe Mahalanobis-Likedistanceas

a measure ofthe separationofclasses. TheMahalanobis-Likedistancemeasureis defined

as

dst =

(ms

-mO' S"1

(ms

-mt) (8)

where msandmtarethemean vectors ofclasses s andtand

S isthepooledcovariancematrix.

Comparedtocalculatingtheexactprobability overlap betweentwo classes, thismethodis

very fastsinceitrequirestheinversionofonlythepooledcovariancematrix. Usingthe

pooledcovariance hastwomajordrawbacks: 1)theassumption of equalcovariance

matrices isnotusuallytrueand2) it doesnot accountforthevariabilityoftheindividual

classes. Onesolutiontothisproblemistousetheindividualcovariance matricesinplace

ofthepooled covariance matrix [Robert, 1989]. TheMahalanobis-Like distance between

classes correctedfortheindividualcovariancematricestakes theformofEquation (9)

[Richards, 1986].

dst = [(ms

-mt)' St"1

(ms

-m,)] + lnlStl (9)

where Stisthecovariance matrixwithin-classt

However, thisintroducesother problemssincetheresultdependsupon whichcovariance

(33)

1.6.3 DivergenceMeasure

Richards [1986] describes away toquantifythe separation betweentwoclassesbythe

degreeofoverlapoftheclassdistributions. Theoptimal setoffeaturescanthen be found

by findingthe setwiththeleastamountofprobabilityoverlap. The divergence between

two classes, dvst,is definedinEquation(10)asaseparability measurewhichtakesinto

account thevariabilityofboth classes.

-Jt

dvst = J[ p(X I s)

-p(X I t) ] In [ p(X I s)/p(X I t) ] dX (10)

X

where dvst isthe divergence betweenclass sandclasst,

p(X I s) isthe probabilityoffindingthefeaturevectorXwhen

sampling fromclass s and

p(X I t) istheprobabilityoffindingthe featurevectorXwhen

sampling fromclasst.

Iftheclassesare assumedtocomefrommultidimensional normaldistributions,the

divergence becomes

dvst=(l/2)Tr[(Ss

(34)

where msandmt are themeanvectors ofclassess andtand

SsandSt arethe covariance matrixwithin-classs andt respectively.

UnliketheMahalanobis-Like distancemeasureinEquation(9),dvstissymmetric (i.e. dvst

=

dvts)because bothclassdistributionsaretakenintoaccount.

Thisdivergencecan thenbesummedover allclass pairstogivea measure oftheoverall

divergence. Thesetoffeatureswhichresultsinthegreatestoveralldivergenceshouldgive

the greatest classificationaccuracywhen aGaussianmaximumlikelihoodclassifierisused.

Mausel,KramberandLee. [1990]transformed thedivergence betweentwoclassesin the formofEquation(12)toemphasize smallchangesinthedivergenceresulting from

significant changesintheclass separability.

tdvst= 2000[l-exp(-dvst/8)] (12)

Thisvaluehasalimitof2000which wasdesignedtolimit extremely highdivergence

values whichdonotnecessarilycorrespondtocompleteclassseparation.

1.6.4 SeparabilityMeasure

Rosenblum[1990]developedasimilar methodtoenhancetheaccuracyofthe

Mahalanobis-Like distanceseparationmeasuregivenin Equation (9). Theoverall separabilitymeasure

(35)

otherclasses. Figure 8shows two feature setsofthe same threeclasses. In feature setA,

classesx andy arepoorly separated and class zis greatlyseparatedfrom these two.In

feature setB, allthreeclasses arefairlywellseparated.

featuresetA

/\

feature setB

Figure 8: Featuresetsforclassseparability

It isobviousthatfeaturesetB doesabetter jobofseparatingthe threeclassesthanfeature

setA. However,the separability measurefor feature setAwillbegreaterthanthe

separabilitymeasure for featuresetB. Since alarge separation ofanyoneclassfroma

groupofotherscaninflatetheoverallseparabilitymeasurefora set offeatures,adistance

thresholdwasdevelopedtonormalize andlimitthevalues. Withoutthatinflation,a set of

featureswhichseparate all oftheclasseswillbechoseninsteadof a setwhichseparates one

classverywell.

Theprobabilityoffindingthemean ofclass tinasamplefromclass sisdefined as

P(mls) =

[

1/ (ISSI1/2 2iW)] e-^^mi

"

"V Ss'1

<mt

'

(36)

where k isthenumberoffeatureswhich aretobeoptimized.

UsingEquation(13) andassumingmultivariate normal class

distributions,itcanbeshown

that

[(mt

-ms)' Ss"1

(mt

-ms)] + lnlSsl = -2ln[P/

2tc^2]. (14)

(See Appendix A). The left handsideofEquation(14)representstheMahalanobis-Like

distance betweentwoclassmeans,dts,when theindividualclass covarianceforclasssis

used. The right hand side oftheequationistheminimumdistance, dthresh,which must

separatethe twoclassmeansfortheprobabilityofmisclassification tobe P.

dthxesh = -2 1n[P/2K^].

(15)

ThevalueofPshouldbesettoasufficientlysmall valuetoassure nearcompleteseparation

oftheclasses.

Thereare now twodistancemeasures which needtobecalculated: theactualdistanceas

calculatedbyequation (9),

dSt= t(ms

-mO' St"1

(ms

-mO] + lnlStl (16)

andthe thresholddistancefora presetprobability Pgivenbyequation(15). After bothdts

anddthreshhave beencalculated, theratiooftheMahalanobis-Likedistancetothe threshold

(37)

dratio = dst/dthresh

(17)

Sincetheseparabilitymeasureisnotsymmetric,theratiomustbecalculated twiceforeach

pair of classes. Asa resultofcalculatingtwoseparabilitymeasuresforeach pair of

classes, a matrix mustbe usedtorepresentallrelativedistancemeasures. Thesum ofthis

matrix can thenbeusedas a measure oftheoverallseparabilityoftheclasses. Before this

summationhowever, any dratiovalues whichexceed 1.0are setto 1.0topreventthe

inflationoftheoverallseparabilitymeasure. Theoverallseparability isthencomputedfor

all permutations offeaturesubsetsfromthewholeset. Thesubset withthehighestoverall

(38)

1.7 Noise

1.7.1 MaximumNoise

Everyaspect ofimage processingandclassificationisaffectedbynoise. Althoughthere

aremanytypesofnoise associatedwithimages,becauseofthenatureofgranulometric

basedfeatures, we willconcern ourselvesonlywith maximum additivenoise. Sincethis

typeofnoise adds tothe activatedpixel countofathresholdedimageneededforbinary

granulometries,it is readily apparentthatadditivemaximumnoisewill skewthe

granulometricdistributions usedtogeneratethesefeatures.

Thereare manytypesof maximum noise which are inherenttodigital images. Ofthese,we

will examinetheeffects offour basiccategories: pointnoise,occlusion noise,scratch noise

and spaghetti noise. Dougherty, PelzandNewell [1990] brieflyexaminedtheeffect of

maximum point noise and spaghetti noise on granulometricbased features. Although it

was concludedthatthefeaturesweregenerallyrobust, furtherexaminationisrequiredfora

deterministic analysis oftheeffect of additional noise onimagetextureclassification.

1.7.2 PointNoise

Point noiseis definedas singlerandom activated pixels. It may becausedbyflaws

inherentto thedetector,bydustanddirton anydigitallyscannedimage,orbyelectronic

noiseatany levelofthesystem Sinceuniform response ofarray detectors is virtually

(39)

"push"

some pixelvaluespast agiventhreshold Thesignalmay alsofluctuatefromother

electronicsinthedetectorsuch asphotomultipliersandamplifiers.

1.7.3 OcclusionNoise

Occlusionnoiseis definedas noisewhichoccludesorcoversan underlyingsignal. Inan

image, thiscanbethoughtof asparticleswhich arelargeenoughtoaltertheapparentshape

orboundaryofimage structures and substructures. Thiscanbecausedbylarger dirtand

dustparticles ondigitallyscannedimagesor anundesiredintersectingobjectwithinthe

originalimage.

1.7.4 Scratch Noise

Scratchor streak noiseischaracterizedbylong,thinstraightlineswhich propagateina

singledirectionina particularimage. Physical scratches on aphotographic negativecan

appear asmaximum scratch noiseonaphotographic print. These scratches arecommonly

causedbyphotographicequipment andprocessing machineryasthe negativeispulled

though. Singleelementflawsina"push

broom"

typedetectorcanhavea similar effecton

digitized images. Astheone-dimensionaldetector arraymoves acrossanimage,one

defectiveelement or an elementimpairedbydirtcancause a streakintheresultantdigital

(40)

1.7.5 SpaghettiNoise

Spaghetti noiseischaracterizedbyathin"curly" or

"windy"

lineof connected pixels.

Dependingontherelativesizeofthedetectorelements andimagesbeingscanned, this type

(41)

2. 0 Statement of Work

2.1 Selection of Texture Images

Ten textureimageswere chosenforthestudy. AlltenweretakenfromBrodatz'

collection

ofphotographictexture images[1966]. Thesephotographswere scannedat75 dpiintoan

8-bitgrey-scaledigitalformat.Thetenimageswereselectedtorepresent alargerange of

textural complexity. Thistexturalcomplexitycanbethoughtofastheamountofstructure

inthe underlyingtextureprimitives andthevariationinthatstructure. Figure 9showsall

tenimages alongwith

Brodatz'

originaldescriptions.

Theoriginaldigitalimageswere512x512pixels. However,the 132x 132pixelimages

in Figure9werecroppedfromtheoriginalimages before processingtolimitthe

computationtime. Acomplete descriptionofthereasonsforthisexact sizeare statedin

section2.4. Eachtextureimagerepresentsa separatetextureclass. Throughoutthis paper,

eachtextureclass willbereferred tobythedescriptionnumber givenbyBrodatz (i.e.

(42)

a) dl02 Cane

llllllllllllllllll

llllllllllllllllll

llllllllllllllllll

llllllllllllllllll

llllllllllllllllll

liiilliiiliiiniiii

liiiillllliiiiiini

limiiiiiiiiniU!

Iniiiiiiiiimiiii

c) d20 Frenchcanvas

b) d!03 Looseburlap

....J.%*-*'^ '

I*

IIIt^i| J|||J

....

v"^..., mfc

d) (152 Oriental Strawcloth

e) d64 Orientalrattan f) d65 Orientalrattan

(43)

g) d67 Plastic pellets

(inverted greyscale)

i) d75 Coffee beans

h) d68 Woodgrain

wm

lAit

jr-v^--" j^rtyr^T **>

*i

j) d84 Raffialooped

toahighpile

(44)

2.2 Thresholding of Texture Images

Before thelocalbinary granulometrieswerecalculated,these 8-bitgrey-scale imageswere

reducedto binaryimages. Theuse of athresholdprovidedthesimplestmethodforthe

gray levelcompression. Thechoice ofthethresholdvaluecould significantlychangethe

results ofthe granulometriesbychangingthe activatedpixelcountintheimage. In many

casesit isprofitable tochooseathresholdwhich resultsin approximately halfthepixels

being activated. However,whendealingwithimagetexture, maintainingtheunderlying

textural structureandsubstructuresis themostimportantaspect. Althoughthe textural

structure of eachimagecouldbe bestmaintainedbychoosinga separatethresholdvaluefor

each image,a singlethresholdvalue was chosen whichmaintainedamajorityofthe

underlyingstructureinall the imagesand moreclosely simulated reallifeconditionsfor

textureclassification. Figure 10showsthe thresholded versionsoftheimages in Figure 9.

Thesebinaryimagescouldbe greatlyaffectedbynonuniformityoftheimages. Intra-image

nonuniformityofthemean gray levelofthe 8-bitimagescould causethesize and shape of

the textureprimitivestovary significantlywithinasupposedly homogeneous image. This

may have inflatedthevariance and couldhaveskewedthe granulometricdistributions.

Skewingthedistributions may havecaused ashiftinthefeaturemeans. Theinflated

variance ofthedistributionsmay haveresultedinadecreaseofclassificationaccuracydue

(45)

J

>- 1 f' kIII II k4 ha.

> - ,. . ..

.,, ... . . .

a a a

a) dl02 Cane

HllllllTlllllUM lltilllllllDlllll i^iiifnirM^Mi tfipliipifiiiumi

^HUilili^ikii

iffijpipiiMliljiiltfl

^iilii|fifirii.i<

ifiMlii^iiin^i

fcplpfcpiiirliipiUpi

c) d20 Frenchcanvas

faifail^

Hft

llfiMJI

tSUiSG

CMrfll

nmrnrnm

fcMCM*

b) dl03 Looseburlap

.t! ""

^'>tw *-*

d) d52 Oriental Strawcloth

e) d64 Orientalrattan f) d65 Orientalrattan

(46)

g) d67 Plasticpellets

(inverted greyscale)

i) d75 Coffeebeans

h) d68 Woodgrain

iilPtl

^T7)~*^~l

iH

&1fl^l

j) d84 Raffialooped

toahighpile

Figure 10(g-j): Binarytextureimages

Inter-image nonuniformityofthemeangray levelcould cause a significantdifference

betweentheapparentimagetextureofthe8 bit imagesand the textureofresultantbinary

images. As aresult, the most significanttexturalinformationinagrayscaletextureimage

maynothave beenrepresentedinthebinarytextureimage. Thegranulometricfeatures

(47)

2.3 Generation of Noise

Aspreviouslymentioned,fourcategoriesofadditivemaximumnoise wereinvestigated.

Thefournoise modelssimulatedwere pointnoise,occlusion noise,scratch noise and

spaghetti noise. Thissimulation wasaccomplishedby directlyoverlayingnoise imageson

thetenbinarytexture images. Noiseimageswere createdbyplacinga number ofparticles

ofnoise, "noiseelements", ontoablank image. Eachnoiseelement was describedbyits

length, width, "straightness",andinitialangleofpropagation.

Eachofthefournoise models musthaveacertainmean andrangeforeach ofthefour

descriptiveparameters mentioned above. Foreachnoise model,appropriate valuesforthe

mean and range ofthe lengthand width weredefined. Beta distributions were usedto

determine thelengthand width of eachindividualnoise element. Sincethebeta distribution

is only defined between 0and 1.0, ascaling factorand/or a shiftfactorwasappliedtoalter

therange ofthedistributionsothat thepredefined mean number of pixels wouldliewithin

thisrange. The beta distributionparameters,r\andy, weresubsequentlysetto the

appropriatevalues todeterminetheshapeofthedistributionandtherebythevariationofthe

length and width of allthenoiseelementina noise image.

The initialangle of propagationcouldeitherbe set orrandomlychosenfroma uniform

distribution foreach noise element. A beta distributionwas usedtodeterminehowstraight

orcurlya noise element wouldbe. Themaximumchangeintheangleofpropagationwas

setbythemean ofascaledbeta distribution. Thevariationofthedirectionof alineof

(48)

wasactivatedintheimage. Anadjacent second pixelwasthenactivatedattheinitial

propagation angle. Achangeinthe angleofpropagationwascalculatedusingthebeta

distributionfortheangle. Thenew angleofpropagation thenbecame theinitialangle plus

the angular change.

Pointnoise was the simplest of all themodels. The lengthandthewidth of each noise

element was aconstantof 1 pixel. Thestraightness and angleof propagation ofthenoise

elementswerethereforeirrelevant. Therangescaling factor forthelengthand width

distributionswas set to2pixels. This forcedthemidpointoftherangeto 1 pixel. Both r\

andyweresetto 10E+15sothat the final lengthand widthdistributionswereeffectively

deltafunctions at 1 pixel.

Theocclusion noise model wasspecified so thatthelengthandwidth of each individual

noiseelement were equal. Again,thestraightness andangleof propagation ofthenoise

elements wereirrelevant. Therangescaling factor forthelengthandwidthdistributions

wassetto 8 pixels sothat themidpoint occurred at4pixels. Botht| andywere setto3.0

sothat thefinal lengthandwidthdistributionsweresymmetric,centered at4pixelswith a

standarddeviationofapproximately2pixels.

Inthescratchnoisemodel,thewidthwas againsetto 1 pixelforall noise elementsby

settingtherange to2pixels andt\andyto 10E+15. Therangeofthelength distribution

was setto40pixelstocenterthedistribution about20pixels. Parametersr\andywere set

to3.0and 1.5,respectively, inordertoskew thedistributiontohighernumbersofpixels

(49)

ordertoproducescratchesinthe samedirection, theinitialangleof propagationfor allthe

noiseelementsin asingle noiseimagewas settoarandom constant. Thiswas accomplishedbysettingtherangeoftheangledistributionto2n

radians, therangeshift constant toa random numberbetween0and2k,andther|and yofthedistributionto

10E+15toensure straight propagation.

The spaghetti noise model alsohadthewidthsetto 1 pixelforall noiseelements. The

range ofthelength distributionwas setto80pixelstocreate noise elements approximately twice thelengthofthescratch noise. Aswith thescratchnoise,r\ andy forthelength distributionwere setto3.0and 1.5,respectively. The initialangleof propagationforeach

noise elementwas settoarandomangle. Theappropriate valuesfor handgofthe angle distributionwerefoundbyvaryingtheseparameters untilthenoise elementshadthe

desiredcurliness.

The initial image position(i,j)of each noise element(i.e. theposition ofthe firstpixel of

theelement) was chosenfromatwo-dimensionaluniformdistributionthesame sizeasthe

originaltextureimages. A 132x 132non-activated pixelimagewas created asatemplate

forthe additionofthenoiseelements. Aftergeneration,each noise elementwas addedto

this image inordertocreatea noiseimage. Thisadditionoperationallowedforoverlapping ofthenoise elements. Athresholdof1 was thenappliedto thisimage inordertocreatea binarynoiseimage.

Sixnoise conditions were createdforeachnoise model. These noise conditions variedby

(50)

chosenforeach ofthefournoise models were0%,5%, 10%, 15%, 20%, and25%

activated noisepixels. Examplesofthenoisemodelsunderdifferentconditions are shown

(51)

a. 5% Point

^*

7<.* ."*;

''. *.:-'

* i

^^>.-J.v.--|i^i^V

b. 15% Point c. 25% Point

d. 5% Spaghetti e. 15% Spaghetti f. 25% Spaghetti

(52)

9

. : <

5

% '

g. 5% Occlusion h. 15% Occlusion i. 25% Occlusion

j. 5% Scratch k. 15% Scratch 1. 25% Scratch

Figure 1 l(g-l): Examplesofbinarynoiseimages

Foreach ofthe tenbinarytextureimages,anindependentrandom noiseimageof each

noise model andconditionwasgenerated. Thenoiseimages werethenaddedtoeach ofthe

binary textureimages. Thetexture-plus-noise imagesweresubsequentlythresholdedata

(53)

2.4 Generation and Selection of Local Granulometric Features

In ordertocreate the granulometricfeaturesneededforclassification, localgranulometries

were run withfivetypesofstructuringelements. Four linearelement sequence

granulometries: horizontal,vertical,positive-diagonal(+45) andnegative-diagonal(-45)

as well as asequenceofcircular elements were run on all240 images. Foreachpixel, the

local PSM, PSSD, andPSS were calculatedforallfivestructuringelement granulometries.

The PSMofthe MaxLinandLinearitymeasures werealso calculatedresulting inatotalof

17granulometric features foreachimage.

There weretwomain concernsaboutthe selection ofthefeature data: 1) theneedforgood

estimates oftheclassdistributionsand2) theneedtolimittheamountofdatatosome

computationally tractable amount Inaccordance withSchowengerdt's[1983]

recommendation, 100pixels fromeach class were usedinthestudy. Sinceeach ofthe

pixelsin aclass wastoberepresentedby 17 features,atotalof1700realdatavalues were

neededforeach ofthe240 binaryimages.

Toensurethesepixels wouldaccuratelyrepresentanentiretextureclasswithorwithout

noise,all pixels wererandomly selectedfrom 100x 100pixel "featureimages". The

feature imagesconsisted of real numbersrepresenting somelocalgranulometric statistic

about each pixelinthebinarytextureimage. Sinceeach binaryimagewas assumedto

representahomogeneous texture,thefeature images resulting fromthe localgranulometries

were assumedtobe wide sense stationary. The 100 datavaluesfromeachfeature image

(54)

Nopixel wasallowedtobechosena secondtimetoensureaccurate estimates ofthemean

and variance ofthedistributions.

A33 x 33pixel windowsizewas usedtogenerateeach ofthefeature images. Edge effects

may becausedwhenthiswindowdoesnotlie entirelywithina givenimage. Sincethe

local granulometric statisticsforareaslyingneartheedgeof animagecansignificantly

differ fromthosefor interior imageareas, a132x 132pixelbinarytextureimagewas

(55)

2.5 Classification of Dependent and Independent Data

The initial step inallclassification algorithmsis

trainingtheclassifier. Supervisedtraining

isusedtoidentifyanarearepresentativeof eachclass. Inmostcases,greatcare mustbe

takento includeonlypixelsordatawhichbelongtoagiven class. However,inthis case,

data fromeachtextureclasswaseasilyseparated sincethe granulometries wererun

separatelyon eachclass. Thissupervisedtrainingisconductedby inputtingthe 17 features

foreach ofthe 100 datapointsofeach class intothe classifier. Themean vectorand

covariance matrixforeachclassisthencalculated,and adiscriminant function is developed

fromthesemeans and covariances.

Dependent data is definedas thesetofdatausedtotrain theclassifier. Classificationofthe

dependent datacanbeused asaninitialmeasure ofthegoodnessoftheclassifier. A low

degreeof classificationaccuracyofthedependent datacan implyaninadequatestatistical

difference amongtheclasses. However,ahigh degreeofaccuracyofdependent data

merely impliesa reasonable statisticaldifference amongtheclassesinthetrainingdata.

Furtherexaminationisneededtodetermine theoverallgoodnessofthefeaturevectorsfor

classification ofdatanotincluded in thetrainingset.

Aftertheclassifierhas beentrained,independentdatacanbeclassifiedusingthemaximum

likelihood discriminant function developed fromthedependent data. This independentdata

typicallycontains some orallofthesameclasses asthedependentdata. Inthiscase,any

(56)

data. Anyotherset offeaturevaluesfrom 1 toall 10textureclassescanthenbeused as the

independent data.

The classificationaccuracywasdeterminedby dividingthenumber ofcorrectlyclassified

pixelsbythe total number ofpixels classified Thiscould thenbeused as a measure ofthe

abilityofthe granulometricfeaturestodiscriminate betweenthe textureclasses. The

minimum windowsize for generatingthegranulometricfeaturescouldbe foundby

determiningthepoint at whichtheclassificationaccuracy becameunacceptable. The

minimum number of optimalfeaturescouldbefound inasimilarmanner. The

classificationaccuracycould alsobeusedas a measure oftherobustness ofthefeatures in

(57)

3.0 Analysis of Results

3.1 Dependent Classification

The initial indicationofthepower ofthegranulometric featureswasfoundbyclassifying

thedependent datausedto train theclassifier. All 17featureswere employedinthefeature

vectorsforeach class. The granulometries were run ontheoriginal 10binarytextureclass

imageswithoutanyadditional noise. The 17 features from 100random pixelsfromeach of

the 10textureclasseswere usedtotrain aGaussianmaximumlikelihoodclassifier. These

same 1000pixels were subsequentlyclassifiedusingthediscriminant function developed.

Theresults ofthisdependentclassificationareintheformoftheconfusion matrixin

Table 1.

Table 1: Classificationofdependentdata

dl02 dl03 d20 d52 d64 d65 d67 d68 d75 d84

dl02 100 0 0 0 0 0 0 0 0 0

dl03 0 100 0 0 0 0 0 0 0 0

d20 0 0 100 0 0 0 0 0 0 0

d52 0 0 0 100 0 0 0 0 0 0

d64 0 0 0 0 100 0 0 0 0 0

d65 0 0 0 0 0 100 0 0 0 0

d67 0 0 0 0 0 0 100 0 0 0

d68 0 0 0 0 0 0 0 100 0 0

d75 0 0 0 0 0 0 0 0 100 0

d84 0 0 0 0 0 0 0 0 0 100

(58)

Thisconfusion matrixshows thatall ofthedependent datawerecorrectlyclassified. The

rowsofthe matrix representtheoriginal classofeach pixel. Thecolumnsrepresentthe

class intowhich each pixel was classified. Since therewere 100

pixelsfromeachclass, the

values inthematrix representboththe number andthepercentage ofpixelsclassifiedinto

theclassdesignatedbythecolumn.

Althoughthedata failedahomogeneitytestforequalcovarianceofthe classes, the

classifierwastrainedasecondtimewiththesamedata usingapooled covariance to test the

statistical separationofthemeans. Inordertoachieveahighclassification accuracy using

thepooledcovariance,thefeaturemeanshadtobesufficiently separatedtominimize the

probability distributionoverlap. Theresultsofclassifyingthe dependent data usinga

pooled covariancewere identicalto theresultsusingwithin-class covariance. This

demonstratesthat themean vectors of alltenclasses were wellseparated andindicatesthat

thegranulometricfeatures sufficientlyrepresentedthebasic texturaldifferences betweenthe

classes.

3.2 Independent Classification

Independentdatawasemployedtodetermine theoverallgoodnessoftheclassifier. After

trainingwithfeaturevaluesfromtheoriginalsetof 1000dependentpixels, a second setof

100pixels wasrandomly selectedusinga uniformdistribution. Again,all 17 featureswere

included in thefeature vectorsforeachclass. Thiswas considered anindependentsetof

datasincetheprobabilityofarepeat pixel wasonly0.01 usingtheuniformdistribution.

(59)

magnificationalrobustnessofthefeaturessincethemost ofthe granulometric features

inherently size anddirection dependent. Theresultsof

the classificationare giveninthe

confusionmatrixinTable2.

were

Table2: Classificationofindependentdata

dlUZ dl03 d20 d52 d64 d65 d67 d68 d75 d84

dl02 100 0 0 0 0 0 0 0 0 0

dl03 0 100 0 0 0 0 0 0 0 0

d20 0 0 100 0 0 0 0 0 0 0

d52 0 1 0 99 0 0 0 0 0 0

d64 0 0 0 0 100 0 0 0 0 0

d65 0 0 0 0 1 99 0 0 0 0

d67 0 0 0 0 0 0 100 0 0 0

d68 0 0 0 0 0 0 0 100 0 0

d75 0 0 0 0 0 0 0 0 100 0

d84 0 0 0 0 0 0 0 0 0 100

Overallclassificationaccuracy=99.8%

Table 2showsthatonlytwooftheindependentpixels weremisclassified. Theoverall

classificationaccuracyof99.8%indicatesthatassumptionof within-class homogeneityof

the 17featureswasjustified. Thisalsoindicatesthat the basictexturaldifferences between

theclasses were wellrepresentedbythesegranulometricfeatures.

Aswiththedependentclassification, theclassifier wastraineda secondtimeusingapooled

covariance matrixtoassurethat thefeaturemeans were well separated. Theresultsofthis

(60)

Table 3: Classificationofindependentdata usingpooled covariance

dl02 dl03 d20 d52 d64 d65 d67 d68 d75 d84

dl02 100 0 0 0 0 0 0 0 0 0

dl03 0 98 0 0 0 0 0 0 0 2

d20 0 0 100 0 0 0 0 0 0 0

d52 0 0 0 100 0 0 0 0 0 0

d64 0 2 0 0 98 0 0 0 0 0

d65 0 0 0 0 0 100 0 0 0 0

d67 0 0 0 0 0 0 100 0 0 0

d68 0 0 0 0 0 0 0 100 0 0

d75 0 0 0 0 0 0 0 0 100 0

d84 0 0 0 0 0 0 0 0 0 100

Overallclassificationaccuracy=99.6%

Notethat theoverall classificationaccuracy decreasedbyonly 0.2% when comparedto

classificationusingwithin-classcovariance. The difference inclassificationaccuracywas

dueto thedifferenceoftheestimatedfeaturedistributions foreachclass. Sincethepooled

covariance matrix was an estimate oftheaveragecovariance ofthe tenclasses,the estimates

ofwide within-class variancestendedtobenarrowerusingpooled covariance. Likewise,

theestimatesof narrow within-class variancestended tobewiderusingpooled covariance.

Ingeneral,thiscaused anincreaseoftheprobabilityoverlapandintroduced some

(61)

3.3 Minimal Window Size Determination

Overallclassificationaccuracywas usedtodeterminetheminimumlocalwindow size

neededforclassification. Six localgranulometrieswere run oneachoftheten texture

imagesusing square windowswith sides oflength7, 11, 15, 19, 25, and33 pixels. Two

sets offeaturedatawere collectedinordertodeterminetheeffect of window sizeonboth

dependentandindependent data.

Figure 12shows resultsof classification with the6 differentsize windows. The side

lengthofthewindow isreferredtoasthe window size. Notethattheclassification

accuracy axison thisgraph rangesonly from80% to 100%.

100-\ u S 95 < s o a u 90-5 85 80 10 1 15 WindowSize T 20 25 dependent independent -J 30 -1 35

(62)

Over 99%classificationaccuracyofthedependentdatawasachievedforall window sizes

greaterthan 1 1 pixels. Althoughtheclassificationaccuracyoftheindependent datawas

lessthan thatofthedependentdata,theclassification wasstill94.6% accurateusing a

window size of 1 1 pixels. It alsoshouldbenotedthat theclassificationaccuracy for both

thedependentandindependent data felldramaticallybelowthewindowsize of11 pixels.

This indicatesthatmostofthe underlyingtextureprimitiveswhichdistinguishtheseimage

textureswere no smallerthan 1 1 pixels. However,itshouldbe kept in mindthat the

(63)

3.4 Optimal Feature Selection

Anumber ofavailable methodsfordetermininganoptimalfeatureset were applied.

Richards'

[1986] methodfordeterminingan optimalfeaturesetbythe degreeofoverlapof

theclassdistributionsisconsideredthemostaccurate sinceitusesthecovariance ofall

classes and requiresonly theassumption ofGaussiannormaldistributedfeatures ineach

class. However,this methodisalsothemostcomputationallyintensive. The totalnumber

ofcalculations neededtodeterminethedivergence is determinedbythenumberof

permutationsofoptimalfeaturestochoose outofthe totalnumber offeatures. For

example,tochoosetheoptimal6 featuresout of atotalof 17 forall 10classes,thenumber

of calculations wouldbe:

[17!/3!

d7-3)l]

[10!/2!

dO-2)!] =30600 (17)

Foreach ofthe 30600divergencemeasures, 2matrixinversesmustbecomputed.

The next viable optionforoptimalfeature selection wastheclass separation method

developedbyRosenblum [1990]. Theresults shouldbesimilartoRichards'method since

thecovariancematrices of allclasses wereincorporatedintotheseparation measure. This

methodhadtheadvantage ofbeingfaster because onlyone matrixinversion isrequiredfor

each separationmeasurebetweentwoclasses.

Inordertodeterminetheoptimal number offeaturesneededforadequateclassification,the

(64)

optimalfeatureswasfound fromthe 17features,all otherfeatures

were removedfromthe

featurevectors. Theoptimalfeaturedata fromthe ten imagetextureswas then

usedtotrain

theclassifier. Bothdependentandindependent dataweresubsequentlyclassified andthe

overallclassificationaccuracy wasdeterminedforeachoptimalfeatureset. Theresults of

classification withtheoptimalfeature sets are giveninFigure 13.

S3

<

100-i

90

-2 80

60

8 10

#ofFeatures

-O

dependent

independent

-1

12

t

14 16

Figure 13: ClassificationAccuracyvs.NumberofOptimal Features

Notethat theclassificationaccuracyaxis onthisgraph rangesfrom60%to 100%. Over

99%classificationaccuracyofboth thedependentandindependent datawas achieved with

6optimalfeatures. Additional featurescontributedverylittletoimprovingthisaccuracy.

Thefirst6optimal featuresetsusedintheseclassifications canbe foundinTable 4. A

(65)

Noticethatthe classificationaccuracyusing 5optimal featureswasslightly less thanthe

classification accuracy

using 4optimalfeatures. Theaddition ofmorefeaturestothe

feature vectorsdoesnotnecessarilycorrespondtohigherclassificationaccuracy. The

classificationaccuracy mayevendecrease iftheprobability overlap betweenthe classesis

increasedbytheadditionof morefeatures. Inthiscase,therewas moreprobability overlap

betweenthe tenclasseswith any set offive featuresthantherewas withtheoptimalsetof4

features.

Although, aspreviously stated,most ofthese granulometricfeatures used weresizeand

directiondependent,it isinterestingtonotethat thecircularPSM,which is rotationally

invariantwasthemost significant of all 17featuresand appearedineach ofthefirst four

optimalfeature sets. LinearityPSM,whichis invarianttoboth directionandscale,also

appearedinthesetof3optimalfeatures. Althoughtheseoptimalfeaturesets aredependent

ontheimagetexture classes,giventhe diverserange ofimagetextureclasses inthis study,

an optimal set of6features for anygiven setoftextureclasses canbeexpectedtogive

(66)

Table 4: Optimal FeatureSets

usingRosenblumOptimization

1 feature: circularPSM

2features: circularPSM

horizontalPSSD

3 features: circularPSM

horizontalPSSD LinearityPSM

4features: circularPSM

horizontalPSM

negative-diagonalPSSD

negative-diagonalPSS

5features: horizontal PSM

negative-diagonalPSM

negative-diagonalPSSD

negative-diagonalPSS

positive-diagonalPSM

6 features: horizontal PSM

negative-diagonalPSM

negative-diagonalPSSD

negative-diagonalPSS

positive-diagonalPSM

verticalPSM

References

Related documents

یتخانش - شناد ،یراتفر زومآ.. ظن دوخ ،لیصحت م ییوج یم ار تاساسحا و تارکفت ناونع هب ناوت تیلاعف و یلیتتصحت فادها هب یبایتتتسد روظنم هب شوجدوخ یاه درک فیرعت ] 3 [ مظندوخ

Our results also revealed a negative association between MALAT1 expression and ATM-CHK2 pathway phosphor- ylation in tumor tissues, suggesting that up-regulation of MALAT1 may

This study attempt to investigate the influence of volatility of foreign exchange rate located on United States of America, England, Netherlands, Japan, China and

As the monitoring location is upstream to the domestic waste water discharge points to Koel River, the water quality at the monitoring location remains at the Class C level..

TCM medical cases are of vast amounts of data and strong relatedness, and a new and improved firefly algorithm based on the guide of normative knowledge has been proposed to

Methods: In this study, we investigated the performance of three state-of-the-art parsers: the Stanford parser, the Bikel parser, and the Charniak parser, using following two

This unique quality of HRV/HRC analyses has led to interest in its application in pre-hospital triage and emergency clinical decision making; interest which has been bol- stered by

LRP: Laparoscopic radical prostatectomy; LERP: Laparoscopic extraperitoneal radical prostatectomy; RRP: Radical retropubic prostatectomy; RALP: Robot-assisted