Rochester Institute of Technology
RIT Scholar Works
Theses
Thesis/Dissertation Collections
5-1-1991
Pixel classification by morphological granulometric
features
John T. Newell
Follow this and additional works at:
http://scholarworks.rit.edu/theses
This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please [email protected].
Recommended Citation
Pixel Classification by Morphological Granulometric Features
by
John T. Newell, ill
Rochester Institute of Technology
Center for Imaging Science
May 1, 1991
A thesis submitted in partial fulfillment of the requirements for the degree of Master of
Science in the Center for Imaging Science in the College of Graphic Arts and Photography
of the Rochester Institute of Technology
Signature of Author _
Center for Imaging Science
Approved by
Mendi Vaez-Pravani
THESIS RELEASE PERMISSION FORM
ROCHESTER INSTITUTE OF TECHNOLOGY
COLLEGE OF GRAPHIC ARTS AND PHOTOGRAPHY
CENTER FOR IMAGING SCIENCE
Pixel Classification by Morphological Granulometric Features
I, JohnT. Newell, III, hereby grant permission to the Wallace Memorial Library of R.I.T.
to reproduce my thesis in whole orinpan. Any reproduction will not befor conunercial
use or profit.
Abstract
Pixelclassificationsystemsrely on a certain set offeaturesthatare sufficientto
classify a givenpixelintoa classdefinedwithinadatabase. Unlike brightnessand spectral signaturefeatures commonlyusedinremotesensingapplications,texture-basedfeatures
cannotbe definedfora single pixel andmustbe derived froman area or window
surroundingthatpixel. Inthis research,allfeaturesarederived frombinarymorphological granulometries. Oncegenerated, thesefeaturescomprise adatabasewhich can beusedto
classify images. A Gaussian Maximum Likelihood Classifier is trainedwiththisdatabase forsubsequentclassification ofboth dependentandindependent data. Severalaspects of
thesetexture-basefeaturesrequireinvestigation inordertodeterminetheirabilityto
distinguish imagetextures. Three importantaspectsareaddressedinthis study; the effects
of maximumnoise,theoptimalsize ofthelocalizedwindow, andtheminimum number of optimalfeaturesrequiredforaccurate classification. Astatistical approachhas beentaken todetermine theclassificationaccuracywithvaryingwindow size,varyingnumberof
features,andvaryingamounts offourtypesof maximumnoise usinggranulometric
features. Analysisoftheseinvestigationsindicate fourmainresults. First,classification
accuracy in theabsence of noiseisextremely high. Second,forthese texturesatthespatial resolutionof75dpi,classificationaccuracy decreasesdramaticallybelowawindowsizeof
11x11 pixels. Third,thenumberoffeaturesneededfor highclassificationaccuracycanbe
reducedtoafairlysmall number ontheorderof6features. Finally,thesefeaturesare
generallyrobustinthepresenceofmaximumnoiseifthetypeand amount of noise canbe
Acknowledgements
I wouldliketo thankJeff B. Pelz for his helpand computer programswhich were of
tremendoushelpincompletingthis thesiseventhoughtheywere writtenin Pascal.
I wouldalsoliketo thankWendyRosenblumfor heralgorithms,code and afineexample
of a well writtenthesis.
Finally, Iwishto thank Kaleen Moriartyfor her immeasurablefriendship which hasseen
me throughtheheavenandhellofRIT. Thanks forkeepingme smilingthrough thetough
Dedication
Thisthesisis dedicatedtoJohn T. Newell,Jr.andAnne W. Newell fortheirlove,
confidence andsupportthroughoutmy collegeeducationandforinstillinginmethepride
Table of Contents
TableofContents vii
ListofFigures ix
ListofTables xi
1.0 Introduction 1
1. 1 MorphologicalGranulometries 1
1.1.1 Opening 1
1.1.2 Granulometries 3
1.1.3 LocalGranulometries 7
1.2 Image Texture 9
1.3 Image SegmentationUsingGranulometric Features 10
1.3.1 Segmentation 10
1.3.2 UseofGranulometricFeature for Segmentation 10
1.3.3 Higher OrderMoment Features 1 1
1.3.4 StructuringElementsandDerivationofOther
Granulometric Features 1 1
1.4 Image ClassificationandDiscriminant Analysis 13
1.4.1 Classification 13
1.4.2 FeatureProbabilityDistributions 1 4
1.4.3 Maximum Likelihood Classification 1 5
1.4.4 Gaussian Maximum LikelihoodClassification 16
1.5 Minimal WindowSize 19
1.6 Optimal Feature Selection 20
1.6.1 FeatureReduction 20
1.6.2 Mahalanobis-LikeDistance Measure 20
1.6.3 Divergence Measure 22
1.7 Noise 27
1.7.1 MaximumNoise 27
1.7.2 Point Noise 27
1.7.3 OcclusionNoise 28
1.7.4 Scratch Noise 28
1.7.5 Spaghetti Noise 29
2.0 Statement of Work 30
2.1 SelectionofTexture Images 30
2.2 ThresholdingofTexture Images 33
2.3 GenerationofNoise 36
2.4 GenerationandSelectionofLocalGranulometric Features 42 2.5 ClassificationofDependentandIndependent Data 44
3.0 Analysis of Results 46
3.1 Dependent Classification 46
3.2 Independent Classification 47
3.3 Minimal Window Size Determination 50
3.4 Optimal Feature Selection 52
3.5 ClassificationwithMaximumNoise 56
3.5.1 Dependent Classification 56
3.5.2 Independent Classification 57
3.5.3 CombinationsofNoise Models 63
3.6 Noise Estimation 75
4.0 Conclusions 79
4. 1 Suggestions for Future Work 81
5.0 References 83
Appendix A 86
List of Figures
Figure 1: ImageS andstructuringelementE 2
Figure 2: Open (S,E); Openingofimage S by
structuringelementE 2 Figure 3: Simulatedbinarygranulometryresultantimages 5 Figure 4: ^(k), <P(k)andd<D(k)fromthesimulatedimagegranulometry 6
Figure 5: ExampledOx(k) probability distribution 8
Figure 6: Feature Zvaluedistribution fortwoclasses 14
Figure 7: Maximum likelihooddecisionboundary 16
Figure 8: Feature setsforclass separability 24
Figure9: Textureimages 31
Figure 9: Binarytextureimages 34
Figure 10: Examplesofbinary noiseimages 40
Figure 12: ClassificationAccuracyvs.Window Size 50 Figure 13: ClassificationAccuracyvs. NumberofOptimalFeatures 53
Figure 14: ClassificationAccuracyvs. % Point Noise 58 Figure 15: ClassificationAccuracyvs. % Spaghetti Noise 58
Figure 16: ClassificationAccuracyvs. % OcclusionNoise 59
Figure 17: ClassificationAccuracyvs. % Scratch Noise 59
Figure 18: ClassificationAccuracyinthepresence ofHorizontal,Fixedand
Random Scratch Noise 62
Figure 19: ClassificationAccuracy withcombinations ofnoise models 64
Figure 20: Feature distributions for Circular PSSD 67
Figure 22: Feature distributions forPositiveDiagonal PSM 68
Figure 23: Probabilitydistributions for Circular PSSD 69
Figure 24: Probabilitydistributions for Negative Diagonal PSSD 69
Figure 25: Probabilitydistributions for Positive Diagonal PSM 70
Figure 26: Optimal Feature Classification in Point Noise 73
Figure27: Optimal Feature Classification in Spaghetti Noise 73
Figure 28: Optimal Feature Classification in Occlusion Noise 74
Figure 29: Optimal Feature Classification in Scratch Noise 74
List of Tables
Table 1: Classificationofdependentdata 46
Table 2: Classificationofindependentdata 48
Table 3: Classificationofindependentdata usingpooled covariance 49
Table 4: OptimalFeature Sets using Rosenblum Optimization 55
Table 5: Classificationofdependenttexture-plus-noisedata 56
Table 6: Classificationofindependent data in 5%pointnoise 66
Table 7: Classificationofindependent data in 10% pointnoise 66
Table8: Classificationofdatawith 10%point noise aftertrainingwith5%
point noise 76
Table 9: Classificationofdatawith5% point noise aftertrainingwith 10%
1.0 Introduction
1.1 Morphological Granulometries
Morphologicalgranulometries wereconceivedbyMatheron [1975] asatypeof"sieving"
operation forbinaryimagesinwhichparticlesinthe imagestructurearefiltered according
to theirsize. Quantificationoftherate atwhichanimageisalteredin thesievingprocess
producesa numericalsizedistribution containing image textureinformation. Binary
granulometriesaregeneratedbysuccessively openingabinary imagebyan increasing
sequence of convexbinarystructuringelements. The imageswhich makeupa setthe
structuringelementsequence are of a specific shape (i.e. linecircle, square,etc.)andthe
texturalinformationwhich can begatheredfromagranulometry isspecific to theshape of
thestructuringelement sequence.
1.1.1 Opening
The openingofabinaryimage S byabinary structuringelementE is definedtobethe
unionof all translationsofEwhich are subsets ofS. Rigorously,x e OPEN(S,E)ifand
only ifthereis sometranslate(E+z)ofEsuchthatx e (E+z) c S. Considertheexample
ofabinarydigitalimageSandthe threepixelhorizontal structuringelementErepresented
Image S StructuringElement E
* \ \ \ * * *
***** i i
11111111 111
* 1 1 * 1 1 *
1111*1*
11*11*1
Figure 1: Image S and structuringelementE.
Theones represent activated pixelsandthestars areconsideredundefined or non-activated
pixels. Allpixels outside animageare also considered non-activated. Toopenimage S by
structuringelementE, theorigin ofE istranslated toeach pixelin S. Wherever E entirely
fitsoveractivated pixelsinS,all pixelsintheresulting image Open(SE)areactivated. See
Figure 2.
Open(S^)
* 1 i |****
*****jjl 11111111
********
********
Figure 2: Open(S,E); OpeningofimageSbystructuringelementE
Since Ewillfitover allpixelsinthe third row, theentire rowisactivatedinOpen(S).
Noticethat thelastpixelinthefirstrow isactivatedin imageS butnotin imageOpen(SE)
rows. Becauseofthe size andthe shapeofthe structuringelementinthisparticular
example, any horizontalrunlengthof3or more pixelswillbeactivated.
1.1.2 Granulometries
From thedefinition of anopening, itfollowsthatwhen OPEN(F,E)=F,
OPEN(SE) is a
subimageof OPEN(S,E). Asaresult, ifEi, E2, E3,... isan increasingsequenceof
structuringelements suchthatOPEN(Ek+1, E^=
Ek+1 , then thefiltered imagesforma
decreasingsequence
OPEN(S,Ei) z> OPEN(S,E2) 3 ...
Countingthenumber ofpixelsremaining in eachsucceeding openingresultsina
decreasingfunction^(k),such thatforsome K,*F(k)=0 for k>K.
Dependingon the
shapeofthestructuringelements,varioustexturalinformation isrevealedbystudyingthe function^(k). Theimage sequence {OPEN(S,Ek)} iscalled agranulometryandthe
resulting function^(k) iscalledthesizedistribution. Inpractice,Ex consistsof asingle pixel sothat(1)givesthe totalnumberof activatedpixelsin S.
Since4*(k) isdecreasing,thenormalizationof^(k)isaprobability distribution function
givenbyEquation 1.
Thediscretederivative, dO(k), isadiscreteprobabilitydensityfunction. It has become
populartorefertothisnormalizedgranulometric-sizedistribution
densityasthepattern
spectrumoftheimages. Thisdistributionrevealstheparticle sizedistributionoftheimage fromwhichit iscalculatedand canbedescribedbyitsmoments. Themoments ofthe
patternspectrum canthen beusedtodescribetextureinformation.
Figure 3ashows asimulatedbinary imagemadeupoffoursizedisksofdiameters 4, 7,
15,and31 pixels. Whentheimageisopenedwitha seriesof circularstructuringelements
Ekofdiameter 1 through4, theresultantimage isunchanged. However, whentheimage is
opened with acircularstructuringelement ofdiameter5,thedisksofdiameter4arefiltered
out ofthe image, leavingtheimage shownin Figure 3b. Opening thisimagewith elements
ofdiameters 6and7 produce nofurtherchangeintheoutputimages. Whenthe image is
opened withdiameter8,thedisksofdiameter 7arefilteredout oftheimage resulting in
Figure 3c. Again,there isnochangeintheoutputimageuntilthestructuringelement
diameterreaches 16pixelsandthedisksofdiameter 15arefilteredoutasshownin Figure
3d. Finally, when thestructuringelement sequencereaches32, allthedisks have been
filteredoutresulting inanullimage. (Itshouldbenotedthat theremay be somedigitization
Figure3: Simulatedbinarygranulometryresultantimages
a) originalandOPEN(S^i) throughOPEN(S34)
b) OPEN(S^5) throughOPEN(S7)
c) OPEN(S,E8) throughOPEN(S,Ei5)
6000
<D(k)
dtD(k)
The^(k),O(k)anddO(k)distributionsfromthe simulatedimage
granulometryare shown
in Figure 4. Allthree parametersarefunctionsofthediameter, k,ofthecircularstructuring
elements. It isimportanttonotethat thesedistributionsarebasedon a pixel count ofthe filteredimage,ratherthana particle count.
1.1.3 LocalGranulometries
A local granulometry isan extension ofthisconceptdescribingtheparticlesizedistribution in a given neighborhood or window aboutsomepixel x. ^(k)is thenthepixel count
withina windowcenteredon pixelx, ratherthan thepixelcountovertheentireimage. In
ordertomaintainlarge-scaletexturalinformation, theimageisopenedgloballyandthe pixel countisperformedlocally. Inthe samemannerdescribed fora globalgranulometry, thenormalizedprobability distribution <Dx(k) iscalculatedfromthelocalsizedistribution
<Dx(k) = l-x(k)/x(l) (2)
foreach point x intheimage. Thediscretederivative,dOx(k),definestheprobability
densityaboutthepixel x. d$x(k)isthen thelocalpatternspectrum atx. Theresultofthe
binarylocal granulometrywith a given windowsizeisa one-dimensionalprobability
densityateach(ij)pixellocation intheimage. These probability densitiesserve as
0.20-1
0.15
-dO
0.10-Figure 5: Exampled<Dx(k)probabiUty distribution
This distributionisarobust,but impractical descriptorofthe localtexture. However,the
moments canbe usedtodescribe thedistributionand canbeused as a much more practical
descriptorofthelocalimagetexture. The localgranulometricmean,standarddeviation,
varianceand skewness canbeusedasvaluable texturedescriptorsforimagesegmentation
and classification[Doughertyetal, 1990]. Since thesemoments arederived fromrandom
1.2 Image Texture
Image texture andtexturalinformation have been studiedfor manyyears. Lewis [1971]
illustrated howtexturerelatestogeomorphology using K-bandradarimageryofplains,
lowhills,highhills, and mountainsinthe PanamaandColumbiaarea. Haralickand
Anderson[1971] illustratedhow texturerelatestolandusecategories. Suttonand Hall
[1976] usedtexturemeasuresforautomatedclassification ofpulmonary disease.
Rosenblum[1990] demonstratedtheclassificationaccuracy increaseof aerialimageryby
the addition oftextural featurestoa multi-spectral classificationdata base.
Textureisadescriptionofthespatialdistributionand spatialdependence amongthegrey
tones [Rosenblum, 1990]. Itcan be describedbyperceptualdescriptorssuch as "fine",
"smooth", "coarse", "mottled", "lineated"or "irregular". It may also be described interms
of a pattern madeupof repeatedtextureprimitives[Nevatia, 1982]. Atexture image Jcan
therefore be thoughtofas atransformfromonebandof a spectralimage I inwhichJ(i,j) is
afunctionofI(i,j)andneighboringpixels[Haralick, 1979]. Atexturemeasureata point of
animageissome functionoftheobserved values within alocalneighborhood aboutthe
point[Ahuja, 1983]. Granulometriesusea structural approachtoanalyze visual scenesin
termsof organizationandrelationshipsamong its substructures[Haralick, 1986].
Granulometric features describe imagetexturesintermsofthesizedistributionsofthe
1.3 Image Segmentation Using Granulometric Features
1.3.1 Segmentation
Oneofthereasonsbehindthedevelopmentofimage processing has beentheneedto
identifydifferentobjectsorregionswithina givenimage. Withinthe studyofimage
texturehas beenthedevelopmentofalgorithmsforsegmentationbasedonimagetexture.
The intuitive ideabehindimagesegmentationistodividetheimageintosegmentssuchthat
each segmentis homogeneous insomesense andtwoneighboring segmentsdiffer from
one anotherin thesamesense [Kashyap,1986]. Segmentation isaccomplishedby
separatingtwoor morehomogeneousregions whichhaveasignificantstatistical
difference. Sincethepixel valuesof abinarytextureregions areinherently
non-homogeneous, texture measures needtobeassignedtoeach pixel forsubsequent
segmentation.
1.3.2 UseofGranulometric Feature for Segmentation
DoughertyandPelz [1989]developed bothadeterministic andanondeterministic model of
image segmentationusingtexturemeasuresderived frommorphological granulometries.
Usingthedeterministicmodel, an imagecomprisedoftwodifferentsizediscswas
segmentedbyusingthemeanofthelocalcircular granulometry. Agranulometric-mean
imagewas generatedbyassigning thislocalcircular pattern spectrum mean(PSM) toeach
pointx of animage. Eachpixelintheresulting imagewasthereforea measureofthelocal
Thisgrey-scaleimageofmean values wasthen
successfullysegmentedbythresholdingthe
image.
1.3.3 Higher Order MomentFeatures
Ifthelocal PSMoftwo textureregionsisnotsufficientlydifferenttoallowsegmentation,
higherorder momentsofthelocalpattern spectrummay beemployed. The localpattern
spectrumstandarddeviation(PSSD), variance(PSV),and skewness(PSS)can beusedas
texture measurestosegmentanimage. DoughertyandPelz[1989] used thepattern
spectrumvariance(PSV)tosegmentanimageinwhichtwo textureregionshadsimilar
PSMs. Byviewingahomogeneoustextureas a population ofpixels, alllocal
granulometric moments can be interpretedasrealizationsof random variables which are
characteristic oftheimage texture. Theserandom variablespossessprobability
distributions indicativeof an imagetexture.
1.3.4 StructuringElementsandDerivationofOtherGranulometric Features
In additiontobeing texturedependent,all moments of a patternspectrum are specificto the
structuringelements usedtogeneratethespectrum. Manydifferenttypesofstructuring
elementshave beenusedtogenerateapattern spectrum ofanimage. Circular,elliptical and
linear structuringelementshave commonly beenemployedtogenerate granulometric
texturemeasures. Linearandnon-linear combinationsofthemomentscanalsobeusedas
localtexture measures. Dougherty, Kraus, andPelz[1989] introducedthreesuch
four lineargranulometries; horizontal,vertical,positive-diagonal(45)and
negative-diagonal(135),andMaxLinasthemaximumPSMofthe samefour lineargranulometries.
AveLinisan exampleof alinearcombination whereasMaxLinisanexample ofanon
linearcombination. Linearityis ascale-invariantfeature definedasMaxLindividedbythe
PSMofthecirculargranulometry. Thisratio will resultina value of1 for anycircular
imageelement regardless ofdiameter. Elongated imageelements will producehigher
1.4 Image Classification and Discriminant Analysis
1.4.1 Classification
Imageclassificationistheprocessofassigningapixeltooneof a number ofpossible classes onthebasisofsomeobservationsmade onfeaturesofthatpixeland/orits
surround. It isadecision makingprocess which uses statisticaldecision theorytomake an intelligentestimateoftheclass towhich a pixelbelongs [Schowengerdt,1983]. In
supervisedclassification,asampleof eachclassis takenforeach observationandthe
statisticaldistributionofthe elementsineach class areanalyzed. Fromthatinformation,the
classification algorithm reaches adecision abouthowtoassignpixelsnotinthesampleto
theappropriateclasses. Schowengerdt[1983] recommendsthat 10to 100pixels be includedpertrainingclasswithmorepixelsforthoseclasses with highervariability.
Pixelclassificationhaslongbeenanintegralpart of remote sensingandotherimage
processingapplications. Spectraland radiometricdata fromaerial and satelliteimages have
been usedasfeaturestoclassify specificregionsoftheseimagesaccordingtosome
predescribedcriterion. Linearcombinations ofthisdata, suchastheredtogreenbandratio
inmultispectralaerial images,have alsobeenfoundtoemphasizedifferences inground
cover typesandcharacteristics of particularinterest Manyapproachestoclassification
basedontexturehave been developedovertheyears. Someofthese approachesinclude
the useoffeatures derived from firstorderstatistics,spectral powerdensityfunctions,
autocorrelationfunctions,and grey-tonerun-lengthdistributions. Oneofthemost
measuretherelativefrequencieswith which twopixelvalues,withacertain separation,
occurinanimage [HaralickandAnderson,1971].
1.4.2 FeatureProbabilityDistributions
Anynumber oftheseor otherfeaturescanbeusedtoclassifyan image intosomepreset
number ofclasses. Eachclasswillhavethe samenumberoffeatures inafeaturevector.
The distributionof valuesfor anyonefeaturefora givenclasshasacertaindistribution
which can beusedtodecidetheclassification of an unknown pixel. Considerthe two
distributionsof somefeature Z in Figure 6. Each distributioncorrespondstoa separate
class.
class 1
P(xI i)
featureZ
Figure6: Feature Zvaluedistributionfortwoclasses
Thearea under eachofthesedistributioncurvesisnormalizedto 1.0andtheyare assumed
toapproximatethefeatureprobabilitydensityfunctionsof each class. Thesefunctionscan
belongstoclassi. The probabilityoffindingafeaturevalue of xgiventhatweare
sampling fromclassi is givenas p(x Ii). Thediscriminant functionisdefinedasthe
probabilityof a pixel belongingto classigiventhatithasafeaturevalue x or p(ilx).
p(ilx) =
p(xli)p(i)/p(x) (3)
wherep(i) isthea prioriprobabilitythatclassiexistsintheimageandp(x)isthe
probabilityoffindingapixelfrom anyclass.
Assumingthateachclasshasan equalprobabilityofoccurring, p(i)willbeequalforeach
class. Thevalueforp(x) is simplyanormalizing factorandthereforeaconstantforeach
class. TheaUscriminantfunction isthensimply a calculation of p(xIi)foreach class.
1.4.3 Maximum Likelihood Classification
Maximum likelihoodclassification comparesthediscriminantfunctionvalueforeach
featurevalue xcalculatedforeachclass and assignsthepixelto theclasswhichproduces
thehighest probabilityvalue. Forexample,considerapixel with afeatureZvalue ofx, as
shownin Figure6. Sincethecalculated value ofthediscriminantfunction is greaterfor
class 1 thanforclass 2, thepixel wouldbeassignedorclassifiedintoclass 1.
The decision boundaryforclassification,d,liesatthepointatwhichthe twodistributions
class 1
P(xI i)
feature Z
Figure 7: Maximum likelihood decisionboundary
Anypixel with a feature Zvalueless thandwouldbeassignedtoclass 1, sincethereisa
higher probabilityofthisvaluecoming fromclass 1 thancoming fromclass2. Likewise,
anypixel with afeature Zvalue greaterthandwouldbeassignedtoclass2. Thetotalerror
inthisclassification isrepresentedbytheoverlapofthe twodistributionswhichis shown
astheshaded region. Thiserroris minimizedbyplacingthedecision boundaryat thepoint
at which p(xI 1)isequaltop(xI 2),This decisionboundaryisrepresentedbyd in Figure
7. Notice thatifthedecisionboundary wasmovedineitherdirection,theerror would
increase.
1.4.4 Gaussian Maximum Likelihood Classification
This is thesimplest exampleofclassificationsincethereis onlyonefeature andonly two
classes. However, thesame principles canbeextendedtomore complicated classification
modelsin whichthereareanynumberoffeaturesand classes. Themostcommonlyused
mean vectors and covariancematricesoftheclasses are requiredtocompute the
class-conditionaldensityfunctions. Thisclassifier requires thedistributionoffeatureswithin
eachclass tobe approximatelymultivariatenormal. However,theclassifieris"relatively
tolerant"
ofdeviations from normality [seeSwain, 1986].
Adiscriminant function is developedusingameasure ofthegeneralizedsquareddistance
fromthemean vector. Theclassificationcriterioncanbe basedon eithertheindividual
within-class covariance matrices or a pooledcovariancematrix. Aswiththesinglevariable
case,each unknownpixelisclassifiedintotheclassfromwhichit hasthesmallest
generalizedsquareddistance.
Thegeneralized squareddistance fromx toclass tis
Dt2(x) =
gi(x,t) + g2(t) (4)
where xis thevectorcontainingthefeaturevalues of an unknown pixel and
tisasubscripttodistinguishtheclasses.
Ifthewithin-class covariance matricesareusedthen
gl(x,t) = (x
-mO' Sf1 (x
-mt) + lnlStl (5)
where mt isthevectorcontaining themeans ofthefeaturesof an unknownpixel
Stisthe covariance matrixwithin-class t.
Ifthe pooled covariance matrixisusedthen
gl(x,t) = (x
-mO' S-1 (x
-mt) (6)
where S isthepooledcovariancematrix.
Ifthea priori probabilitiesareallequal,g2(t)iszero. However,iftheyare not all equal
g2(t) =
-21n(qt) (7)
1.5 Minimal Window Size
The sizeofthewindowforthelocalgranulometriescan havea significant effectonthe
distributionsofthe granulometricfeatures. Dougherty,PelzandNewell[ 1990]
demonstratedthat thevarianceofthe granulometricfeature distributions decreaseswith
increasingwindow size. Decreasing thevariancedecreasestheamount ofprobability
overlap betweenclasses soclassificationaccuracycanbeimprovedby increasingthe
windowsize. However, increasingthewindowsizecan also makeit hardertodetermine
theborder between adjacenttextureregions andleadtomisclassificationofpixelswhose
surroundincludes 2ormoretextureclasses. Generallyspeaking,largerwindowsdecrease
variabilityofthegranulometricfeaturesatthecost ofless detailedsegmentation.
Afeaturevaluefromalocal granulometry is onlyan accurate representation of a given
imagetextureiftheentire window usedtogeneratethatfeaturelieswithinthe texture
region. Otherwise,thefeaturevalue canbeaffectedbyotherimagetexturesandtherefore
represent a combination of anumber ofimagetextures. Apixellying neartheedgeoftwo
textureregions shouldthereforebeunclassified. Giventwoadjacenttextureregionsinan
image, and awindowan oddnumber,x,pixelsinlength,the number of unclassified pixels
1.6 Optimal Feature Selection
1.6.1 Feature Reduction
Thekeystep in anyclassification problemis thechoiceofaset offeatureswhichreduces
thedimensionofdatatoacomputationallytractablelevel whilepreservingmuch ofthe
classifying information presentintheactualdata[Kashyap, 1986]. Thenumber offeatures
used intheclassification shouldstillgive aminimalprobabilityofmis-classification [Fu,
1976]. Featureswhichdonotaddtoclassificationaccuracyrepresent a costsince,witha
maximumlikelihoodclassifier, the timeneededtomake a calculationincreases quadratically
with theaddition offeatures [Richards, 1986].
Inrecent yearstherehasbeen much attention paidtodeterminingan optimal set of m
features outof atotal set ofNfeatureswithoutsignificantlydegradingtheclassification
abilityofthealgorithm. Thesetechniquesfor featureselection attempttomeasurethe
separabilityoftheclassesforeach combination of mfeaturesout ofthetotal set offeatures.
The subset withthemostpotentialforcorrect classificationissubsequentlyselectedforuse
intheclassifier.
1.6.2 Mahalanobis-LikeDistance Measure
Thesimplesttechniquesoffeatureselectionusetheseparationofthefeaturemeansin
multidimensionalspace. However,thisapproachmayresultina set offeatureswhichare
developedby Schott, Salvaggio,andKraus [1988], usesthe Mahalanobis-Likedistanceas
a measure ofthe separationofclasses. TheMahalanobis-Likedistancemeasureis defined
as
dst =
(ms
-mO' S"1
(ms
-mt) (8)
where msandmtarethemean vectors ofclasses s andtand
S isthepooledcovariancematrix.
Comparedtocalculatingtheexactprobability overlap betweentwo classes, thismethodis
very fastsinceitrequirestheinversionofonlythepooledcovariancematrix. Usingthe
pooledcovariance hastwomajordrawbacks: 1)theassumption of equalcovariance
matrices isnotusuallytrueand2) it doesnot accountforthevariabilityoftheindividual
classes. Onesolutiontothisproblemistousetheindividualcovariance matricesinplace
ofthepooled covariance matrix [Robert, 1989]. TheMahalanobis-Like distance between
classes correctedfortheindividualcovariancematricestakes theformofEquation (9)
[Richards, 1986].
dst = [(ms
-mt)' St"1
(ms
-m,)] + lnlStl (9)
where Stisthecovariance matrixwithin-classt
However, thisintroducesother problemssincetheresultdependsupon whichcovariance
1.6.3 DivergenceMeasure
Richards [1986] describes away toquantifythe separation betweentwoclassesbythe
degreeofoverlapoftheclassdistributions. Theoptimal setoffeaturescanthen be found
by findingthe setwiththeleastamountofprobabilityoverlap. The divergence between
two classes, dvst,is definedinEquation(10)asaseparability measurewhichtakesinto
account thevariabilityofboth classes.
-Jt
dvst = J[ p(X I s)
-p(X I t) ] In [ p(X I s)/p(X I t) ] dX (10)
X
where dvst isthe divergence betweenclass sandclasst,
p(X I s) isthe probabilityoffindingthefeaturevectorXwhen
sampling fromclass s and
p(X I t) istheprobabilityoffindingthe featurevectorXwhen
sampling fromclasst.
Iftheclassesare assumedtocomefrommultidimensional normaldistributions,the
divergence becomes
dvst=(l/2)Tr[(Ss
where msandmt are themeanvectors ofclassess andtand
SsandSt arethe covariance matrixwithin-classs andt respectively.
UnliketheMahalanobis-Like distancemeasureinEquation(9),dvstissymmetric (i.e. dvst
=
dvts)because bothclassdistributionsaretakenintoaccount.
Thisdivergencecan thenbesummedover allclass pairstogivea measure oftheoverall
divergence. Thesetoffeatureswhichresultsinthegreatestoveralldivergenceshouldgive
the greatest classificationaccuracywhen aGaussianmaximumlikelihoodclassifierisused.
Mausel,KramberandLee. [1990]transformed thedivergence betweentwoclassesin the formofEquation(12)toemphasize smallchangesinthedivergenceresulting from
significant changesintheclass separability.
tdvst= 2000[l-exp(-dvst/8)] (12)
Thisvaluehasalimitof2000which wasdesignedtolimit extremely highdivergence
values whichdonotnecessarilycorrespondtocompleteclassseparation.
1.6.4 SeparabilityMeasure
Rosenblum[1990]developedasimilar methodtoenhancetheaccuracyofthe
Mahalanobis-Like distanceseparationmeasuregivenin Equation (9). Theoverall separabilitymeasure
otherclasses. Figure 8shows two feature setsofthe same threeclasses. In feature setA,
classesx andy arepoorly separated and class zis greatlyseparatedfrom these two.In
feature setB, allthreeclasses arefairlywellseparated.
featuresetA
/\
feature setB
Figure 8: Featuresetsforclassseparability
It isobviousthatfeaturesetB doesabetter jobofseparatingthe threeclassesthanfeature
setA. However,the separability measurefor feature setAwillbegreaterthanthe
separabilitymeasure for featuresetB. Since alarge separation ofanyoneclassfroma
groupofotherscaninflatetheoverallseparabilitymeasurefora set offeatures,adistance
thresholdwasdevelopedtonormalize andlimitthevalues. Withoutthatinflation,a set of
featureswhichseparate all oftheclasseswillbechoseninsteadof a setwhichseparates one
classverywell.
Theprobabilityoffindingthemean ofclass tinasamplefromclass sisdefined as
P(mls) =
[
1/ (ISSI1/2 2iW)] e-^^mi"
"V Ss'1
<mt
'where k isthenumberoffeatureswhich aretobeoptimized.
UsingEquation(13) andassumingmultivariate normal class
distributions,itcanbeshown
that
[(mt
-ms)' Ss"1
(mt
-ms)] + lnlSsl = -2ln[P/
2tc^2]. (14)
(See Appendix A). The left handsideofEquation(14)representstheMahalanobis-Like
distance betweentwoclassmeans,dts,when theindividualclass covarianceforclasssis
used. The right hand side oftheequationistheminimumdistance, dthresh,which must
separatethe twoclassmeansfortheprobabilityofmisclassification tobe P.
dthxesh = -2 1n[P/2K^].
(15)
ThevalueofPshouldbesettoasufficientlysmall valuetoassure nearcompleteseparation
oftheclasses.
Thereare now twodistancemeasures which needtobecalculated: theactualdistanceas
calculatedbyequation (9),
dSt= t(ms
-mO' St"1
(ms
-mO] + lnlStl (16)
andthe thresholddistancefora presetprobability Pgivenbyequation(15). After bothdts
anddthreshhave beencalculated, theratiooftheMahalanobis-Likedistancetothe threshold
dratio = dst/dthresh
(17)
Sincetheseparabilitymeasureisnotsymmetric,theratiomustbecalculated twiceforeach
pair of classes. Asa resultofcalculatingtwoseparabilitymeasuresforeach pair of
classes, a matrix mustbe usedtorepresentallrelativedistancemeasures. Thesum ofthis
matrix can thenbeusedas a measure oftheoverallseparabilityoftheclasses. Before this
summationhowever, any dratiovalues whichexceed 1.0are setto 1.0topreventthe
inflationoftheoverallseparabilitymeasure. Theoverallseparability isthencomputedfor
all permutations offeaturesubsetsfromthewholeset. Thesubset withthehighestoverall
1.7 Noise
1.7.1 MaximumNoise
Everyaspect ofimage processingandclassificationisaffectedbynoise. Althoughthere
aremanytypesofnoise associatedwithimages,becauseofthenatureofgranulometric
basedfeatures, we willconcern ourselvesonlywith maximum additivenoise. Sincethis
typeofnoise adds tothe activatedpixel countofathresholdedimageneededforbinary
granulometries,it is readily apparentthatadditivemaximumnoisewill skewthe
granulometricdistributions usedtogeneratethesefeatures.
Thereare manytypesof maximum noise which are inherenttodigital images. Ofthese,we
will examinetheeffects offour basiccategories: pointnoise,occlusion noise,scratch noise
and spaghetti noise. Dougherty, PelzandNewell [1990] brieflyexaminedtheeffect of
maximum point noise and spaghetti noise on granulometricbased features. Although it
was concludedthatthefeaturesweregenerallyrobust, furtherexaminationisrequiredfora
deterministic analysis oftheeffect of additional noise onimagetextureclassification.
1.7.2 PointNoise
Point noiseis definedas singlerandom activated pixels. It may becausedbyflaws
inherentto thedetector,bydustanddirton anydigitallyscannedimage,orbyelectronic
noiseatany levelofthesystem Sinceuniform response ofarray detectors is virtually
"push"
some pixelvaluespast agiventhreshold Thesignalmay alsofluctuatefromother
electronicsinthedetectorsuch asphotomultipliersandamplifiers.
1.7.3 OcclusionNoise
Occlusionnoiseis definedas noisewhichoccludesorcoversan underlyingsignal. Inan
image, thiscanbethoughtof asparticleswhich arelargeenoughtoaltertheapparentshape
orboundaryofimage structures and substructures. Thiscanbecausedbylarger dirtand
dustparticles ondigitallyscannedimagesor anundesiredintersectingobjectwithinthe
originalimage.
1.7.4 Scratch Noise
Scratchor streak noiseischaracterizedbylong,thinstraightlineswhich propagateina
singledirectionina particularimage. Physical scratches on aphotographic negativecan
appear asmaximum scratch noiseonaphotographic print. These scratches arecommonly
causedbyphotographicequipment andprocessing machineryasthe negativeispulled
though. Singleelementflawsina"push
broom"
typedetectorcanhavea similar effecton
digitized images. Astheone-dimensionaldetector arraymoves acrossanimage,one
defectiveelement or an elementimpairedbydirtcancause a streakintheresultantdigital
1.7.5 SpaghettiNoise
Spaghetti noiseischaracterizedbyathin"curly" or
"windy"
lineof connected pixels.
Dependingontherelativesizeofthedetectorelements andimagesbeingscanned, this type
2. 0 Statement of Work
2.1 Selection of Texture Images
Ten textureimageswere chosenforthestudy. AlltenweretakenfromBrodatz'
collection
ofphotographictexture images[1966]. Thesephotographswere scannedat75 dpiintoan
8-bitgrey-scaledigitalformat.Thetenimageswereselectedtorepresent alargerange of
textural complexity. Thistexturalcomplexitycanbethoughtofastheamountofstructure
inthe underlyingtextureprimitives andthevariationinthatstructure. Figure 9showsall
tenimages alongwith
Brodatz'
originaldescriptions.
Theoriginaldigitalimageswere512x512pixels. However,the 132x 132pixelimages
in Figure9werecroppedfromtheoriginalimages before processingtolimitthe
computationtime. Acomplete descriptionofthereasonsforthisexact sizeare statedin
section2.4. Eachtextureimagerepresentsa separatetextureclass. Throughoutthis paper,
eachtextureclass willbereferred tobythedescriptionnumber givenbyBrodatz (i.e.
a) dl02 Cane
llllllllllllllllll
llllllllllllllllll
llllllllllllllllll
llllllllllllllllll
llllllllllllllllll
liiilliiiliiiniiii
liiiillllliiiiiini
limiiiiiiiiniU!
Iniiiiiiiiimiiii
c) d20 Frenchcanvas
b) d!03 Looseburlap
....J.%*-*'^ '
I*
IIIt^i| J|||J
....
v"^..., mfc
d) (152 Oriental Strawcloth
e) d64 Orientalrattan f) d65 Orientalrattan
g) d67 Plastic pellets
(inverted greyscale)
i) d75 Coffee beans
h) d68 Woodgrain
wm
lAit
jr-v^--" j^rtyr^T **>
*i
j) d84 Raffialooped
toahighpile
2.2 Thresholding of Texture Images
Before thelocalbinary granulometrieswerecalculated,these 8-bitgrey-scale imageswere
reducedto binaryimages. Theuse of athresholdprovidedthesimplestmethodforthe
gray levelcompression. Thechoice ofthethresholdvaluecould significantlychangethe
results ofthe granulometriesbychangingthe activatedpixelcountintheimage. In many
casesit isprofitable tochooseathresholdwhich resultsin approximately halfthepixels
being activated. However,whendealingwithimagetexture, maintainingtheunderlying
textural structureandsubstructuresis themostimportantaspect. Althoughthe textural
structure of eachimagecouldbe bestmaintainedbychoosinga separatethresholdvaluefor
each image,a singlethresholdvalue was chosen whichmaintainedamajorityofthe
underlyingstructureinall the imagesand moreclosely simulated reallifeconditionsfor
textureclassification. Figure 10showsthe thresholded versionsoftheimages in Figure 9.
Thesebinaryimagescouldbe greatlyaffectedbynonuniformityoftheimages. Intra-image
nonuniformityofthemean gray levelofthe 8-bitimagescould causethesize and shape of
the textureprimitivestovary significantlywithinasupposedly homogeneous image. This
may have inflatedthevariance and couldhaveskewedthe granulometricdistributions.
Skewingthedistributions may havecaused ashiftinthefeaturemeans. Theinflated
variance ofthedistributionsmay haveresultedinadecreaseofclassificationaccuracydue
J
>- 1 f' kIII II k4 ha.
> - ,. . ..
.,, ... . . .
a a a
a) dl02 Cane
HllllllTlllllUM lltilllllllDlllll i^iiifnirM^Mi tfipliipifiiiumi
^HUilili^ikii
iffijpipiiMliljiiltfl^iilii|fifirii.i<
ifiMlii^iiin^ifcplpfcpiiirliipiUpi
c) d20 Frenchcanvas
faifail^
Hft
llfiMJI
tSUiSG
CMrfll
nmrnrnm
fcMCM*
b) dl03 Looseburlap
.t! ""
^'>tw *-*
d) d52 Oriental Strawcloth
e) d64 Orientalrattan f) d65 Orientalrattan
g) d67 Plasticpellets
(inverted greyscale)
i) d75 Coffeebeans
h) d68 Woodgrain
iilPtl
^T7)~*^~l
iH
&1fl^l
j) d84 Raffialooped
toahighpile
Figure 10(g-j): Binarytextureimages
Inter-image nonuniformityofthemeangray levelcould cause a significantdifference
betweentheapparentimagetextureofthe8 bit imagesand the textureofresultantbinary
images. As aresult, the most significanttexturalinformationinagrayscaletextureimage
maynothave beenrepresentedinthebinarytextureimage. Thegranulometricfeatures
2.3 Generation of Noise
Aspreviouslymentioned,fourcategoriesofadditivemaximumnoise wereinvestigated.
Thefournoise modelssimulatedwere pointnoise,occlusion noise,scratch noise and
spaghetti noise. Thissimulation wasaccomplishedby directlyoverlayingnoise imageson
thetenbinarytexture images. Noiseimageswere createdbyplacinga number ofparticles
ofnoise, "noiseelements", ontoablank image. Eachnoiseelement was describedbyits
length, width, "straightness",andinitialangleofpropagation.
Eachofthefournoise models musthaveacertainmean andrangeforeach ofthefour
descriptiveparameters mentioned above. Foreachnoise model,appropriate valuesforthe
mean and range ofthe lengthand width weredefined. Beta distributions were usedto
determine thelengthand width of eachindividualnoise element. Sincethebeta distribution
is only defined between 0and 1.0, ascaling factorand/or a shiftfactorwasappliedtoalter
therange ofthedistributionsothat thepredefined mean number of pixels wouldliewithin
thisrange. The beta distributionparameters,r\andy, weresubsequentlysetto the
appropriatevalues todeterminetheshapeofthedistributionandtherebythevariationofthe
length and width of allthenoiseelementina noise image.
The initialangle of propagationcouldeitherbe set orrandomlychosenfroma uniform
distribution foreach noise element. A beta distributionwas usedtodeterminehowstraight
orcurlya noise element wouldbe. Themaximumchangeintheangleofpropagationwas
setbythemean ofascaledbeta distribution. Thevariationofthedirectionof alineof
wasactivatedintheimage. Anadjacent second pixelwasthenactivatedattheinitial
propagation angle. Achangeinthe angleofpropagationwascalculatedusingthebeta
distributionfortheangle. Thenew angleofpropagation thenbecame theinitialangle plus
the angular change.
Pointnoise was the simplest of all themodels. The lengthandthewidth of each noise
element was aconstantof 1 pixel. Thestraightness and angleof propagation ofthenoise
elementswerethereforeirrelevant. Therangescaling factor forthelengthand width
distributionswas set to2pixels. This forcedthemidpointoftherangeto 1 pixel. Both r\
andyweresetto 10E+15sothat the final lengthand widthdistributionswereeffectively
deltafunctions at 1 pixel.
Theocclusion noise model wasspecified so thatthelengthandwidth of each individual
noiseelement were equal. Again,thestraightness andangleof propagation ofthenoise
elements wereirrelevant. Therangescaling factor forthelengthandwidthdistributions
wassetto 8 pixels sothat themidpoint occurred at4pixels. Botht| andywere setto3.0
sothat thefinal lengthandwidthdistributionsweresymmetric,centered at4pixelswith a
standarddeviationofapproximately2pixels.
Inthescratchnoisemodel,thewidthwas againsetto 1 pixelforall noise elementsby
settingtherange to2pixels andt\andyto 10E+15. Therangeofthelength distribution
was setto40pixelstocenterthedistribution about20pixels. Parametersr\andywere set
to3.0and 1.5,respectively, inordertoskew thedistributiontohighernumbersofpixels
ordertoproducescratchesinthe samedirection, theinitialangleof propagationfor allthe
noiseelementsin asingle noiseimagewas settoarandom constant. Thiswas accomplishedbysettingtherangeoftheangledistributionto2n
radians, therangeshift constant toa random numberbetween0and2k,andther|and yofthedistributionto
10E+15toensure straight propagation.
The spaghetti noise model alsohadthewidthsetto 1 pixelforall noiseelements. The
range ofthelength distributionwas setto80pixelstocreate noise elements approximately twice thelengthofthescratch noise. Aswith thescratchnoise,r\ andy forthelength distributionwere setto3.0and 1.5,respectively. The initialangleof propagationforeach
noise elementwas settoarandomangle. Theappropriate valuesfor handgofthe angle distributionwerefoundbyvaryingtheseparameters untilthenoise elementshadthe
desiredcurliness.
The initial image position(i,j)of each noise element(i.e. theposition ofthe firstpixel of
theelement) was chosenfromatwo-dimensionaluniformdistributionthesame sizeasthe
originaltextureimages. A 132x 132non-activated pixelimagewas created asatemplate
forthe additionofthenoiseelements. Aftergeneration,each noise elementwas addedto
this image inordertocreatea noiseimage. Thisadditionoperationallowedforoverlapping ofthenoise elements. Athresholdof1 was thenappliedto thisimage inordertocreatea binarynoiseimage.
Sixnoise conditions were createdforeachnoise model. These noise conditions variedby
chosenforeach ofthefournoise models were0%,5%, 10%, 15%, 20%, and25%
activated noisepixels. Examplesofthenoisemodelsunderdifferentconditions are shown
a. 5% Point
^*
7<.* ."*;
''. *.:-'
* i
^^>.-J.v.--|i^i^V
b. 15% Point c. 25% Point
d. 5% Spaghetti e. 15% Spaghetti f. 25% Spaghetti
9
. : <
5
% 'g. 5% Occlusion h. 15% Occlusion i. 25% Occlusion
j. 5% Scratch k. 15% Scratch 1. 25% Scratch
Figure 1 l(g-l): Examplesofbinarynoiseimages
Foreach ofthe tenbinarytextureimages,anindependentrandom noiseimageof each
noise model andconditionwasgenerated. Thenoiseimages werethenaddedtoeach ofthe
binary textureimages. Thetexture-plus-noise imagesweresubsequentlythresholdedata
2.4 Generation and Selection of Local Granulometric Features
In ordertocreate the granulometricfeaturesneededforclassification, localgranulometries
were run withfivetypesofstructuringelements. Four linearelement sequence
granulometries: horizontal,vertical,positive-diagonal(+45) andnegative-diagonal(-45)
as well as asequenceofcircular elements were run on all240 images. Foreachpixel, the
local PSM, PSSD, andPSS were calculatedforallfivestructuringelement granulometries.
The PSMofthe MaxLinandLinearitymeasures werealso calculatedresulting inatotalof
17granulometric features foreachimage.
There weretwomain concernsaboutthe selection ofthefeature data: 1) theneedforgood
estimates oftheclassdistributionsand2) theneedtolimittheamountofdatatosome
computationally tractable amount Inaccordance withSchowengerdt's[1983]
recommendation, 100pixels fromeach class were usedinthestudy. Sinceeach ofthe
pixelsin aclass wastoberepresentedby 17 features,atotalof1700realdatavalues were
neededforeach ofthe240 binaryimages.
Toensurethesepixels wouldaccuratelyrepresentanentiretextureclasswithorwithout
noise,all pixels wererandomly selectedfrom 100x 100pixel "featureimages". The
feature imagesconsisted of real numbersrepresenting somelocalgranulometric statistic
about each pixelinthebinarytextureimage. Sinceeach binaryimagewas assumedto
representahomogeneous texture,thefeature images resulting fromthe localgranulometries
were assumedtobe wide sense stationary. The 100 datavaluesfromeachfeature image
Nopixel wasallowedtobechosena secondtimetoensureaccurate estimates ofthemean
and variance ofthedistributions.
A33 x 33pixel windowsizewas usedtogenerateeach ofthefeature images. Edge effects
may becausedwhenthiswindowdoesnotlie entirelywithina givenimage. Sincethe
local granulometric statisticsforareaslyingneartheedgeof animagecansignificantly
differ fromthosefor interior imageareas, a132x 132pixelbinarytextureimagewas
2.5 Classification of Dependent and Independent Data
The initial step inallclassification algorithmsis
trainingtheclassifier. Supervisedtraining
isusedtoidentifyanarearepresentativeof eachclass. Inmostcases,greatcare mustbe
takento includeonlypixelsordatawhichbelongtoagiven class. However,inthis case,
data fromeachtextureclasswaseasilyseparated sincethe granulometries wererun
separatelyon eachclass. Thissupervisedtrainingisconductedby inputtingthe 17 features
foreach ofthe 100 datapointsofeach class intothe classifier. Themean vectorand
covariance matrixforeachclassisthencalculated,and adiscriminant function is developed
fromthesemeans and covariances.
Dependent data is definedas thesetofdatausedtotrain theclassifier. Classificationofthe
dependent datacanbeused asaninitialmeasure ofthegoodnessoftheclassifier. A low
degreeof classificationaccuracyofthedependent datacan implyaninadequatestatistical
difference amongtheclasses. However,ahigh degreeofaccuracyofdependent data
merely impliesa reasonable statisticaldifference amongtheclassesinthetrainingdata.
Furtherexaminationisneededtodetermine theoverallgoodnessofthefeaturevectorsfor
classification ofdatanotincluded in thetrainingset.
Aftertheclassifierhas beentrained,independentdatacanbeclassifiedusingthemaximum
likelihood discriminant function developed fromthedependent data. This independentdata
typicallycontains some orallofthesameclasses asthedependentdata. Inthiscase,any
data. Anyotherset offeaturevaluesfrom 1 toall 10textureclassescanthenbeused as the
independent data.
The classificationaccuracywasdeterminedby dividingthenumber ofcorrectlyclassified
pixelsbythe total number ofpixels classified Thiscould thenbeused as a measure ofthe
abilityofthe granulometricfeaturestodiscriminate betweenthe textureclasses. The
minimum windowsize for generatingthegranulometricfeaturescouldbe foundby
determiningthepoint at whichtheclassificationaccuracy becameunacceptable. The
minimum number of optimalfeaturescouldbefound inasimilarmanner. The
classificationaccuracycould alsobeusedas a measure oftherobustness ofthefeatures in
3.0 Analysis of Results
3.1 Dependent Classification
The initial indicationofthepower ofthegranulometric featureswasfoundbyclassifying
thedependent datausedto train theclassifier. All 17featureswere employedinthefeature
vectorsforeach class. The granulometries were run ontheoriginal 10binarytextureclass
imageswithoutanyadditional noise. The 17 features from 100random pixelsfromeach of
the 10textureclasseswere usedtotrain aGaussianmaximumlikelihoodclassifier. These
same 1000pixels were subsequentlyclassifiedusingthediscriminant function developed.
Theresults ofthisdependentclassificationareintheformoftheconfusion matrixin
Table 1.
Table 1: Classificationofdependentdata
dl02 dl03 d20 d52 d64 d65 d67 d68 d75 d84
dl02 100 0 0 0 0 0 0 0 0 0
dl03 0 100 0 0 0 0 0 0 0 0
d20 0 0 100 0 0 0 0 0 0 0
d52 0 0 0 100 0 0 0 0 0 0
d64 0 0 0 0 100 0 0 0 0 0
d65 0 0 0 0 0 100 0 0 0 0
d67 0 0 0 0 0 0 100 0 0 0
d68 0 0 0 0 0 0 0 100 0 0
d75 0 0 0 0 0 0 0 0 100 0
d84 0 0 0 0 0 0 0 0 0 100
Thisconfusion matrixshows thatall ofthedependent datawerecorrectlyclassified. The
rowsofthe matrix representtheoriginal classofeach pixel. Thecolumnsrepresentthe
class intowhich each pixel was classified. Since therewere 100
pixelsfromeachclass, the
values inthematrix representboththe number andthepercentage ofpixelsclassifiedinto
theclassdesignatedbythecolumn.
Althoughthedata failedahomogeneitytestforequalcovarianceofthe classes, the
classifierwastrainedasecondtimewiththesamedata usingapooled covariance to test the
statistical separationofthemeans. Inordertoachieveahighclassification accuracy using
thepooledcovariance,thefeaturemeanshadtobesufficiently separatedtominimize the
probability distributionoverlap. Theresultsofclassifyingthe dependent data usinga
pooled covariancewere identicalto theresultsusingwithin-class covariance. This
demonstratesthat themean vectors of alltenclasses were wellseparated andindicatesthat
thegranulometricfeatures sufficientlyrepresentedthebasic texturaldifferences betweenthe
classes.
3.2 Independent Classification
Independentdatawasemployedtodetermine theoverallgoodnessoftheclassifier. After
trainingwithfeaturevaluesfromtheoriginalsetof 1000dependentpixels, a second setof
100pixels wasrandomly selectedusinga uniformdistribution. Again,all 17 featureswere
included in thefeature vectorsforeachclass. Thiswas considered anindependentsetof
datasincetheprobabilityofarepeat pixel wasonly0.01 usingtheuniformdistribution.
magnificationalrobustnessofthefeaturessincethemost ofthe granulometric features
inherently size anddirection dependent. Theresultsof
the classificationare giveninthe
confusionmatrixinTable2.
were
Table2: Classificationofindependentdata
dlUZ dl03 d20 d52 d64 d65 d67 d68 d75 d84
dl02 100 0 0 0 0 0 0 0 0 0
dl03 0 100 0 0 0 0 0 0 0 0
d20 0 0 100 0 0 0 0 0 0 0
d52 0 1 0 99 0 0 0 0 0 0
d64 0 0 0 0 100 0 0 0 0 0
d65 0 0 0 0 1 99 0 0 0 0
d67 0 0 0 0 0 0 100 0 0 0
d68 0 0 0 0 0 0 0 100 0 0
d75 0 0 0 0 0 0 0 0 100 0
d84 0 0 0 0 0 0 0 0 0 100
Overallclassificationaccuracy=99.8%
Table 2showsthatonlytwooftheindependentpixels weremisclassified. Theoverall
classificationaccuracyof99.8%indicatesthatassumptionof within-class homogeneityof
the 17featureswasjustified. Thisalsoindicatesthat the basictexturaldifferences between
theclasses were wellrepresentedbythesegranulometricfeatures.
Aswiththedependentclassification, theclassifier wastraineda secondtimeusingapooled
covariance matrixtoassurethat thefeaturemeans were well separated. Theresultsofthis
Table 3: Classificationofindependentdata usingpooled covariance
dl02 dl03 d20 d52 d64 d65 d67 d68 d75 d84
dl02 100 0 0 0 0 0 0 0 0 0
dl03 0 98 0 0 0 0 0 0 0 2
d20 0 0 100 0 0 0 0 0 0 0
d52 0 0 0 100 0 0 0 0 0 0
d64 0 2 0 0 98 0 0 0 0 0
d65 0 0 0 0 0 100 0 0 0 0
d67 0 0 0 0 0 0 100 0 0 0
d68 0 0 0 0 0 0 0 100 0 0
d75 0 0 0 0 0 0 0 0 100 0
d84 0 0 0 0 0 0 0 0 0 100
Overallclassificationaccuracy=99.6%
Notethat theoverall classificationaccuracy decreasedbyonly 0.2% when comparedto
classificationusingwithin-classcovariance. The difference inclassificationaccuracywas
dueto thedifferenceoftheestimatedfeaturedistributions foreachclass. Sincethepooled
covariance matrix was an estimate oftheaveragecovariance ofthe tenclasses,the estimates
ofwide within-class variancestendedtobenarrowerusingpooled covariance. Likewise,
theestimatesof narrow within-class variancestended tobewiderusingpooled covariance.
Ingeneral,thiscaused anincreaseoftheprobabilityoverlapandintroduced some
3.3 Minimal Window Size Determination
Overallclassificationaccuracywas usedtodeterminetheminimumlocalwindow size
neededforclassification. Six localgranulometrieswere run oneachoftheten texture
imagesusing square windowswith sides oflength7, 11, 15, 19, 25, and33 pixels. Two
sets offeaturedatawere collectedinordertodeterminetheeffect of window sizeonboth
dependentandindependent data.
Figure 12shows resultsof classification with the6 differentsize windows. The side
lengthofthewindow isreferredtoasthe window size. Notethattheclassification
accuracy axison thisgraph rangesonly from80% to 100%.
100-\ u S 95 < s o a u 90-5 85 80 10 1 15 WindowSize T 20 25 dependent independent -J 30 -1 35
Over 99%classificationaccuracyofthedependentdatawasachievedforall window sizes
greaterthan 1 1 pixels. Althoughtheclassificationaccuracyoftheindependent datawas
lessthan thatofthedependentdata,theclassification wasstill94.6% accurateusing a
window size of 1 1 pixels. It alsoshouldbenotedthat theclassificationaccuracy for both
thedependentandindependent data felldramaticallybelowthewindowsize of11 pixels.
This indicatesthatmostofthe underlyingtextureprimitiveswhichdistinguishtheseimage
textureswere no smallerthan 1 1 pixels. However,itshouldbe kept in mindthat the
3.4 Optimal Feature Selection
Anumber ofavailable methodsfordetermininganoptimalfeatureset were applied.
Richards'
[1986] methodfordeterminingan optimalfeaturesetbythe degreeofoverlapof
theclassdistributionsisconsideredthemostaccurate sinceitusesthecovariance ofall
classes and requiresonly theassumption ofGaussiannormaldistributedfeatures ineach
class. However,this methodisalsothemostcomputationallyintensive. The totalnumber
ofcalculations neededtodeterminethedivergence is determinedbythenumberof
permutationsofoptimalfeaturestochoose outofthe totalnumber offeatures. For
example,tochoosetheoptimal6 featuresout of atotalof 17 forall 10classes,thenumber
of calculations wouldbe:
[17!/3!
d7-3)l][10!/2!
dO-2)!] =30600 (17)Foreach ofthe 30600divergencemeasures, 2matrixinversesmustbecomputed.
The next viable optionforoptimalfeature selection wastheclass separation method
developedbyRosenblum [1990]. Theresults shouldbesimilartoRichards'method since
thecovariancematrices of allclasses wereincorporatedintotheseparation measure. This
methodhadtheadvantage ofbeingfaster because onlyone matrixinversion isrequiredfor
each separationmeasurebetweentwoclasses.
Inordertodeterminetheoptimal number offeaturesneededforadequateclassification,the
optimalfeatureswasfound fromthe 17features,all otherfeatures
were removedfromthe
featurevectors. Theoptimalfeaturedata fromthe ten imagetextureswas then
usedtotrain
theclassifier. Bothdependentandindependent dataweresubsequentlyclassified andthe
overallclassificationaccuracy wasdeterminedforeachoptimalfeatureset. Theresults of
classification withtheoptimalfeature sets are giveninFigure 13.
S3
<
100-i
90
-2 80
60
8 10
#ofFeatures
-O
dependent
independent
-1
12
t
14 16
Figure 13: ClassificationAccuracyvs.NumberofOptimal Features
Notethat theclassificationaccuracyaxis onthisgraph rangesfrom60%to 100%. Over
99%classificationaccuracyofboth thedependentandindependent datawas achieved with
6optimalfeatures. Additional featurescontributedverylittletoimprovingthisaccuracy.
Thefirst6optimal featuresetsusedintheseclassifications canbe foundinTable 4. A
Noticethatthe classificationaccuracyusing 5optimal featureswasslightly less thanthe
classification accuracy
using 4optimalfeatures. Theaddition ofmorefeaturestothe
feature vectorsdoesnotnecessarilycorrespondtohigherclassificationaccuracy. The
classificationaccuracy mayevendecrease iftheprobability overlap betweenthe classesis
increasedbytheadditionof morefeatures. Inthiscase,therewas moreprobability overlap
betweenthe tenclasseswith any set offive featuresthantherewas withtheoptimalsetof4
features.
Although, aspreviously stated,most ofthese granulometricfeatures used weresizeand
directiondependent,it isinterestingtonotethat thecircularPSM,which is rotationally
invariantwasthemost significant of all 17featuresand appearedineach ofthefirst four
optimalfeature sets. LinearityPSM,whichis invarianttoboth directionandscale,also
appearedinthesetof3optimalfeatures. Althoughtheseoptimalfeaturesets aredependent
ontheimagetexture classes,giventhe diverserange ofimagetextureclasses inthis study,
an optimal set of6features for anygiven setoftextureclasses canbeexpectedtogive
Table 4: Optimal FeatureSets
usingRosenblumOptimization
1 feature: circularPSM
2features: circularPSM
horizontalPSSD
3 features: circularPSM
horizontalPSSD LinearityPSM
4features: circularPSM
horizontalPSM
negative-diagonalPSSD
negative-diagonalPSS
5features: horizontal PSM
negative-diagonalPSM
negative-diagonalPSSD
negative-diagonalPSS
positive-diagonalPSM
6 features: horizontal PSM
negative-diagonalPSM
negative-diagonalPSSD
negative-diagonalPSS
positive-diagonalPSM
verticalPSM