• No results found

A novel automated spike sorting algorithm with adaptable feature extraction

N/A
N/A
Protected

Academic year: 2021

Share "A novel automated spike sorting algorithm with adaptable feature extraction"

Copied!
11
0
0

Loading.... (view fulltext now)

Full text

(1)

ContentslistsavailableatSciVerseScienceDirect

Journal

of

Neuroscience

Methods

j o u r n al ho m e p age :w w w . e l s e v i e r . c o m / l o c a t e / j n e u m e t h

Computational

Neuroscience

A

novel

automated

spike

sorting

algorithm

with

adaptable

feature

extraction

Robert

Bestel

, Andreas

W.

Daus, Christiane

Thielemann

BioMEMSLab,UniversityofAppliedSciencesAschaffenburg,63743Aschaffenburg,Germany

h

i

g

h

l

i

g

h

t

s

 Newfeatureextractionapproachbasedonreviewofcommonspikesortingmethods.  Variablefeaturederivationbyevaluatingthefeature’sprobabilitydistribution.  Validationofthealgorithmwithneuronalmonolayerand3Dspheroidsignaldata.  Comparisonoftheresultswithcommonfeatureextractiontechniques.

a

r

t

i

c

l

e

i

n

f

o

Articlehistory:

Received8May2012

Receivedinrevisedform13August2012 Accepted15August2012 Keywords: Spikesorting Microelectrodearray Neuron Actionpotential Biosensor

a

b

s

t

r

a

c

t

Tostudytheelectrophysiologicalpropertiesofneuronalnetworks,invitrostudiesbasedon microelec-trodearrayshavebecomeaviabletoolforanalysis.Althoughinconstantprogress,achallengingtask stillremainsinthisarea:thedevelopmentofanefficientspikesortingalgorithmthatallowsanaccurate signalanalysisatthesingle-celllevel.Mostsortingalgorithmscurrentlyavailableonlyextractaspecific featuretype,suchastheprincipalcomponentsorWaveletcoefficientsofthemeasuredspikesignalsin ordertoseparatedifferentspikeshapesgeneratedbydifferentneurons.However,duetothegreatvariety intheobtainedspikeshapes,thederivationofanoptimalfeaturesetisstillaverycomplexissuethat currentalgorithmsstrugglewith.Toaddressthisproblem,weproposeanovelalgorithmthat(i)extracts avarietyofgeometric,Waveletandprincipalcomponent-basedfeaturesand(ii)automaticallyderivesa featuresubset,mostsuitableforsortinganindividualsetofspikesignals.Thus,thereisanewapproach thatevaluatestheprobabilitydistributionoftheobtainedspikefeaturesandconsequentlydetermines thecandidatesmostsuitablefortheactualspikesorting.Thesecandidatescanbeformedintoan individ-uallyadjustedsetofspikefeatures,allowingaseparationofthevariousshapespresentintheobtained neuronalsignalbyasubsequentexpectationmaximisationclusteringalgorithm.Testresultswith sim-ulateddatafilesanddataobtainedfromchickembryonicneuronsculturedonmicroelectrodearrays showedanexcellentclassificationresult,indicatingthesuperiorperformanceofthedescribedalgorithm approach.

© 2012 Elsevier B.V.

1. Introduction

Themostbasicgoalofneuroscienceisundoubtedlyto

under-standtheprocessesofthehumanbrain,notonlytocomprehend

thethoughtprocessitself,butalsotofindmethodsthattreat

neu-rologicaldiseases suchas epilepsy,depression and Alzheimer’s

disease.Onewaytostudythebehaviourofneuronalnetworksis

tomonitorelectricsignalsgeneratedbyvariousneuronsinsuch

anetwork. In vitroexperiments enabletheanalysisof cell–cell

interactionsincontrolledlaboratoryconditionsandfurtheroffer

improvedresearchopportunities becauseofa reduced network

complexity,aswellasbetteraccessibility.Thestandardtechnique

∗ Corresponding authorat: WürzburgerStraße 45, D-63743 Aschaffenburg, Germany.Tel.:+496022813612;fax:+496022813611.

E-mailaddress:[email protected](R.Bestel).

toderiveneuronalsignalsisthepatch-clampmethod(Sakmann

andNeher,2009).Inrecentyears,however,theusageof

micro-electrodearrays(MEAs)hasbecomeincreasinglypopular,asthis

techniqueallowsnon-invasiveandlabel-freerecordingsfrom

mul-tiplepositionsintheneuronalnetwork(Thomasetal.,1972).

However,fundamentalnetworkanalysisstillfacescertain

dif-ficulties,sinceitcannotbeguaranteedthattherecordedelectrode

signaloriginatesonlyfromonesinglecellratherthanfrom

mul-tipleneuronsintheproximityofthatspecificelectrode. Dueto

thisuncertainty,specialmethodsarerequiredtoseparate

differ-entcellsignalswithinthemeasuredelectrodesignal.Onepossible

approachtakesintoaccountthefactthatthederivedshapeofthe

signalchangeswithrespecttothepositionoftheneuronandthe

correspondingelectrode(seeFig.1).

However, none of the proposed spike sorting methods can

bebroadlyestablished,mainlyduetoaccuracyandperformance

issues.Inthefollowingreport,wepresentanoverviewofspike

0165-0270© 2012 Elsevier B.V.

http://dx.doi.org/10.1016/j.jneumeth.2012.08.015

Open access under CC BY-NC-ND license.

(2)

Fig.1.Schematicofthesignalcomponentsthatcanparticipateinthegenerationofameasuredelectrodesignal.Thesignalcancontainspikesgeneratedbydifferentneurons. Theshapesoftheseneuralspikesshouldvaryduetodifferentcouplingqualitiesbetweentheindividualneuronandtheelectrode.Inaddition,noise,originatingfromvarious sources,isusuallypickedupbytherespectiveelectrode.

sortingalgorithmsalreadydevelopedinthisarea.Wealsopresent

anewapproachtothespikesortingproblem,focussingon

possi-blefeaturesthatareusedtodescribethedifferentshapepatterns

ofneuronalspikes.Insteadoflimitingthepoolofpossiblefeatures

toacertaintype,suchasprincipalcomponentsorWavelet-based

features, the proposed algorithm calculates a variety of spike

parameters. Witha newlydeveloped evaluation method,based

onexpectationmaximisation(EM)approximation,thealgorithm

selectsfeaturesthatarethemostsuitableforthespikesortingofa

particulardataset.

2. Stateoftheart

Ananalysisofthecurrentliteratureonneuronalpattern

recog-nition reveals a wide variety of spike sorting approaches (see

Table1).Usually,mostspikesortingalgorithmscanbedividedinto

twomajorsteps:featureextractionfollowedbyclassificationor

clustering.Inaddition,Table1alsoincludesinformationaboutthe

usedspikedetectiontechnique,asthispre-processingstepcanalso

influencetheresultofthespikesortingsignificantly.Many

differ-entapproacheshavealreadybeentestedandintroducedinorder

toaddresstheproblemathand.

Intermsofspikedetection,themostcommonmethodsuse

sim-plethresholdingtechniquestoextracttheactualspikesignalsfrom

thenoisymeasurementdata.Moreelaboratedalgorithmsfurther

addspecificsignaltransformations,e.g.usingthenonlinearor

Tea-gerenergyoperator(NEOorTEO)(KimandKim,2000;Choietal.,

2006)tothethresholdingprocessinordertodecreasethe

influ-enceofnoiseorlowfrequencysignalartefacts.Anotherapproach

presentedbyHulataetal.(2002)usesWavelet-based

decomposi-tiontodiscriminateactualspikeshapesfromnoise.Suchextended

detectionmethodsusuallyyieldamoreprecisedetectionresult,

whichfacilitatestheactualsortingprocess,butattheexpenseofa

highercomputationalcomplexity.

Themost commonmethodof feature extractionand

reduc-tionisstilltheprincipalcomponentanalysis(PCA)(Woodetal.,

2004; Wanget al., 2006;Biffi et al.,2008).However,

Wavelet-orWaveletpacket-basedfeatureshavealsobecomequite

popu-larastheresultingcoefficientsarealsocapableofdescribingthe

differencesofvariousspikesignals(OweissandAnderson,2002;

Hulataetal.,2002;Quirogaetal.,2004).Adrawbackofthis

com-plexmethod is thenecessity ofan additional selection step to

identifythecoefficientsthatmostpreciselydiscriminatebetween

thevariousspike shapes.In somecases, geometricfeaturesare

alsoobtainedfromthemeasuredspikeevents(Feeetal.,1996;

Vogelsteinetal.,2004)asanalternativeor,morecommonly,a

com-plementtostandardPCAfeatures.Thevarietyoftheusedspike

sortingfeaturesindicatesthatnooptimalfeaturesethasyetbeen

established.Thismightbeduetothefactthatitisquestionable

whetherafixedsetoffeaturesexiststhatcangenerallyguarantee

asuitablespikeshaperepresentation.

While ananalysis ofcommon spikesorting features already

showscertaindiversity,thisisevenmoreapparentintheareaof

possibleclassificationalgorithms.Manydifferentpattern

recogni-tionapproaches,rangingfromsimpleclusteringtechniquessuch

ask-means(Hulataetal.,2002;Satoetal.,2007),moreelaborate

expectationmaximisation(KimandKim,2003;Woodetal.,2004)

andsuperparamagneticclusteringmethods(Quirogaetal.,2004),

totrainableclassificationalgorithmssuchasneuralnetworks(Kim

andKim,2000)andsupportvectormachines(SVMs)(Vogelstein

etal.,2004),havealreadybeentestedforneuralspikesorting.

How-ever,noneoftheseapproacheshasbeenabletoprovidegenerally

optimalresultstobewidelyestablished.Whereastrainable

algo-rithmscertainlyprovideagreaterpotential,theirperformancestill

seemstosufferduetoalackoffundamentaltrainingdata.Unlikein

otherareas,forexamplevoicerecognitionorpersonidentification,

inwhichbasictrainingdatabasesalreadyexist,suchdatasetshave

notbeenestablishedforneuralpatterns,therebylimitingactual

algorithmtraining.

In addition, many spike sorting algorithms also include a

final template matching step to further refine the

(3)

Table1

Overviewofthepublishedspikesortingalgorithms.

Author/year Detection Features Classification

Hulataetal.(2002) Threshold;Wavelet reconstruction

Waveletcoefficientsbasedon Shannonentropy

k-meansclustering

Wangetal.(2006) NEO+threshold PCA Subtractive;k-meansclustering;templatematching Vargas-IrwinandDonoghue(2007) Threshold PCA Densitygridcontourclustering;templatematching Feeetal.(1996) Threshold Waveformalignmentbasedon

multi-electrodewaveformpair

Bisectionclustering;iterativefusionofclusterprocesses

Woodetal.(2004) Threshold PCA Spectralclustering;EMwithMoG

Satoetal.(2007) Peak–peakthreshold PCA Iterativek-meansusingDBVI;templatematching KimandKim(2003) – Projectionpursuit/neg.entropy

maximisation

EMwithMoG

KimandKim(2000) NEO+threshold Spikeshape Artificialneuralnetwork(MLPwithRBFN)

Shohametal.(2003) Threshold PCA EMwitht-distributions

Biffietal.(2008) Adaptivenoisefiltering; pos.+neg.thresholddetection

PCA Hierarchicalclustering

Zhangetal.(2004) Threshold PCA Subtractiveclustering;templatematching Hortonetal.(2007) Threshold PCA+spikeshapecurveindices Kohonennetwork

OweissandAnderson(2002) Threshold Waveletpacket decomposition;bestbasis evaluatedbySVD

Quirogaetal.(2004) Threshold Waveletpacketcoefficients (Lillieforstest)

Superparamagneticclustering

Choietal.(2006) MTEO PCA Fuzzyc-meansclustering

Vogelsteinetal.(2004) SVM Spikeshapes SVM

Chenetal.(2011) NEO+threshold Waveletcoefficientsevaluated byPCA

Watershedalgorithm Takegawaetal.(2010) Threshold Waveletcoefficientsevaluated

bydistribution

VariationalBayesclusteringusingt-distributions

2007). However, since the result depends on prior template

extractionmethods,whichusuallyincludethealready-mentioned

clustering techniques, inaccuracies cannot always be

compen-sated.

Inconclusion,therestillseemstobenooptimalmethodthat

isgenerallycapableofsortingneuralspikepatterns.Onereason

forthismightbethelackoffundamental testdatasets

contain-ingaprincipalspectrumoftheoreticallypossiblespikeshapesthat

couldbeusedtoevaluatethedifferentclassificationalgorithms.As

thisproblemalsopreventsathoroughevaluationofthe

suitabil-ityofdifferentspikefeatures,itisgenerallydifficulttoevaluate

thefeasibilityofspecifictypesoffeaturessuchasprincipal

compo-nentsorWaveletcoefficients.Althoughcertainstudieshaveshown

thatWavelet-basedfeatureextractionpotentiallyoutperformsthe

PCA,thisfactcannotbegeneralisedtoallpossibledatasets(Pavlov

etal.,2007).Therefore,anaprioriexclusionofPCAfeaturescan

limittheability todiscriminatebetweencertainspikeforms. In

fact,this isalsotrueforotherfeatures thatmightbeinferiorin

mostcasesbutnotingeneral.Hence,itmightnotbeusefultofocus

onaspecificfeatureextractiontechnique,asotherfeaturesmight

bemoresuitable onspecificoccasions.Therefore,incontrastto

mostmethods,thealgorithmpresentedinthispaperregards

var-iouspossiblefeatures,suchasprincipalcomponentsanddifferent

Waveletpacketcoefficients,aswellasgeometricfeatures.Assuch

anapproachrequiresauniquefeaturereductionstepinorderto

determinethefeaturesmostsuitablefortheparticularspikesof

thesignal,anewlydevelopedfeatureevaluationstepisproposed

inthispaper.Byanalysingtheprobabilitydistributionofthe

dif-ferentfeaturecoefficients,thecandidateswiththemostdistinctive

multimodalcharactercanbefound.Resemblingthedifferentspike

shapecharacteristics,thismethodallowsthederivationofspike

sortingfeaturesetsthatarecustomisedforeachindividualsetof

neuronaldata.

3. Method

Aftertheshortoverviewofspikesortingalgorithmsgiveninthe

previouschapter,thefundamentaltheoryofthemethodpresented

inthispaperisdescribedinthefollowing.

3.1. Featureextraction

Theimplementationofageneralisedfeatureevaluationstepin

thespikesortingprocessdoesnotonlyprovidetheopportunityto

useamixtureofPCAandWaveletfeaturesfortheclassification

step,butalsoallowstheinclusionofotherfeaturesthatare,for

example,basedontheshapeofthemeasuredspikesthemselves.

Certain characteristicsof the spike,e.g. itsnegative or positive

amplitude,aswellasvariousgradientsoritssignalenergycanbe

usedasadditionalcandidates.Ontheonehand,such

geometry-basedfeaturesareusuallyvulnerabletonoisesignalsandvoltage

offsets.Ontheotherhand,theycanbeverypotentinparticular

situations,inwhichsuchafeaturecanspecificallyhighlighta

dif-ferencebetweentwospikeshapes.Furthermore,thesefeaturescan

bequicklyandeasilycalculatedandrepresentnoimminentthreat

tothealgorithm’scomputationalcomplexity(seeTable2).

3.1.1. Geometricfeatures

Thecalculationofthepositiveandnegativeamplitudeis

basi-callyself-explanatory.Theleftandrightspikeanglesrepresentthe

leftandtherightgradientofthecharacteristicspikeminimumto

thepointinthesignal,atwhichitcrossesthemarginof50%of

thatvalue,asshowninFig.2.Thechoiceofthisparticularmargin

canbeexplainedbylowliabilitytothenoisecomponentsofthe

signal.Theresultingvaluecaneasilybetransformedintoan

angu-larmeasure,usingbasictrigonometricfunctions.Inaddition,the

(4)

Table2

Overviewofneuronalsignalfeatures.

Feature Advantages Disadvantages

Negativeamplitude Easytointerpret Vulnerabletosignaloffsets

Positiveamplitude Stressesspecificshapecharacteristics Onlysuitableinparticularcases;vulnerabletosignal offset

Left/rightspikeangle Stressesspecificshapecharacteristics Onlysuitableinparticularcases;vulnerabletonoise distortions

Neg./pos.signalenergy Robustfeaturesintermsofnoisedistortionsorlow samplerates

Naturallycorrelateswithamplitudeandangle features;vulnerabletosignaloffsets

Corespikeduration Stressesspecificshapecharacteristics Onlysuitableinparticularcases;vulnerabletonoise distortions

NEOcoefficient Higherresolutionthannegativeamplitudein particularcases

Naturallycorrelateswithnegativeamplitude Principalcomponents Usuallyaccuratedescriptionofdatasetswithasetof

uncorrelatedparameters

Possibleinaccuraciesifmorethantwospikeshapesare presentinthedataset

Distribution/ShannonWaveletcoefficients PotentWaveletscalefeatureswithnosignificant correlation

Highcomputationalcomplexitycomparedtoother methods;multi-resolutionanalysislimitedbysample rate

intersectionpointsofbothgradientswiththe0Vlevel.The

posi-tiveandnegativesignalenergyofacontinuous-timesignals(t)can

beapproximatedbasedonthefollowingequation(OhmandLüke,

2010):

Es=



−∞

s2(t)dt (1)

Likewise,thesignalenergycanalsobecalculatedfordiscrete-time

signalsbysummingthesquareddiscretesignalvalues.Inthiscase,

thepositiveandnegativeenergycomponentsofthespikeare

calcu-latedinparticulartoseparatemonopolarfrombipolarspikeshapes

moreaccurately.

3.1.2. NEOcoefficients

ThevalueoftheNEOoperator(x[n])iscalculatedforthe

char-acteristicspikeminimumvaluewiththeequationbelow(Choietal.,

2006):

(x[n])=x2[n]−x[n−1]×x[n+1] (2)

Theresultingparameter(x[n])givesanestimateoftheenergy

contentofthesignalatthediscretetimestepnofthesignalx(n).

3.1.3. Principalcomponents

Thefirstfourprincipalcomponentsarederivedbythestandard

principalcomponentanalysis(PCA)algorithmbasedonthe

eigen-vectors and their corresponding eigenvalues of the covariance

(5)

res

ulting

wavelet

coe

fficien

ts

h[n] g[n] h[n] g[n] 2

1. iteration

2. iteration

3. iteration

ti

m

e re

so

lu

ti

o

n

fr

equenc

y r

esol

u

tion

discrete

time sign

al

digital hig

h

pass fil

ter

digital low

pass fil

ter

h[n] g[n]g[n] h[n] g[n]g[n] h[n] g[n] h[n] g[n]g[n] h[n] g[n]g[n] 2 2 2 2 2 2 2 2 2 2 2 2 2

Fig.3.SchematicoftheWPDmulti-resolutionanalysis.

matrix(Joliffe,2002).Thelatteriscomputedwithallthevoltage

valuesrepresentingtheshapeofeachspike,forallspikeevents

foundinthepreviousspikedetectionstep.

3.1.4. Waveletcoefficients

TwodifferenttypesofWaveletcoefficientsarecalculatedusing

theWaveletpacketdecomposition(WPD),whichisanextension

ofthe standard discrete multi-resolutionWavelet analysis.The

Wavelettransformationofacontinuous-timesignalcanbewritten

assuch(Stark,2005): Lf(a,t)= 1



(|a|)



∞ −∞ 



ut a



f(u) du (3)

wherethefunctiona,t(u)denotestheresultingWaveletbasedon

aspecificmotherWaveletandmodifiedbythetwoindependent

variablesaandu.Thelatterisanadditionaltimevariabledescribing

thetemporalshiftoftheWaveletwithrespecttotheactualsignal,

whiletheactualfrequencyorscaledecompositionisvariedbythe

parametera.Inotherwords,differentscalingvaluesofagenerate

differentdaughterWavelets,amodificationofthemotherWavelet,

whichhighlightsspecificfrequencyareasinthedecomposed

sig-nal.Fordiscrete-timesignals,aniterativeWaveletdecomposition

process,alsocalledmulti-resolutionanalysis,isusuallydescribed

asafilteringprocessthatbreaksupthesubjacenttimesignalat

alow-passandaband-passscalewitheachiteration.Hence,the

resultingfrequencyscaleresolutionincreaseswitheverystepofthe

process,whereasthetimeresolutiondecreasesrespectivelywith

eachsplitting.Astheratiobetweenfrequencyandtimeresolution

ispredefinedbytheuncertaintyprinciple,thestandard discrete

Wavelettransform(DWT)multi-resolutionanalysisonlyfocuses

onthelow-passcomponentinsubsequentiterations,sincean

accu-ratefrequencydeterminationisespeciallycrucialinthisparticular

range.Incontrast,theWPDcontinuestodecomposeboththe

high-andlow-passcomponentwitheachstepoftheanalysis(seeFig.3).

Thisallowsamoreaccuratesignalanalysisattheexpenseof

additionalcomputationalcomplexity.Theresultisusuallyreferred

toastheWaveletpackettree(seeFig.3),withadifferentsetof

Waveletcoefficients,callednodes,foreachWaveletfilter.Eachof

thesecoefficientsdescribesthemagnitudeofacertainfrequency

rangeinaspecifictimeinterval,whichresultsinaconsiderable

amount of redundant information with every additional

itera-tion. As a consequence, the most crucial step with respect to

thespikesortingtaskis probablythederivationoftheWavelet

scalecoefficientsthatdescribethedifferencesofthespikeshapes

detectedintheactualtimesignal.

TheShannoninformationcriteriacanbeusedtodeterminethe

amountof information explained byeach node ofthe Wavelet

packettree(Hulataetal.,2002).Basedontheresultingvalues

cal-culatedforeverydecomposedspike,itispossibletoevaluatethe

divergenceofexplainedinformationforallnodesthroughoutall

spikeevents.Themostsuitablenodecanbeanalysedbasedonthe

distributionofthevaluesoftheShannoninformationcriteriaacross

everyspike.Undertheassumptionthattwodifferentspikeshapes

produceamultimodaldistributionatcertainnodes,a

determina-tionofthesecanprovideusefulfeaturetypeforthespikesorting

process.

Toobtainasecondtype ofWavelet-basedfeatures,thesame

process can be implemented for the resulting Wavelet packet

coefficientsthemselves.Again,thegoalistofindacoefficientthat

showsamultimodaldistributionconsideringitsvalueacrossall

detectedspikes.Therefore,thefundamentalextractionalgorithm

isidenticalandonlytheinputvariablesareactuallychanged.

Intheproposedalgorithm,threeWaveletcoefficientsforboth

typeswiththemostpromisingprobabilitydistributionwere

cho-senasspikesortingfeatures.Inotherwords,thebestthreeWavelet

coefficientscalculated withtheShannoninformation criteriaas

wellasthebestthreeWaveletcoefficientswithrespecttotheir

naturaldistributionwereused.Asonlytheresultingdistribution

and not the calculated values themselves wereof interest, the

(6)

Fig.4. Exampleofdifferentfeaturedistributionsextractedfromaneuronaldataset.The“negativeamplitude”andthe“firstprinciplecomponent”inAandB,respectively, seemtobesuitable,whereasthesecondnativeWaveletcoefficientinConlyoffersalimitedperspective.ThefirstShannonWaveletcoefficientinDmayalsobequitesuitable, butitsdistributionislessdistinctivecomparedtothatinB.

themulti-resolutionanalysis,sinceitprovidesthefastest

comput-ingtimewithoutsignificantlychangingtheoutcomecomparedto

othermotherWavelets.Toavoidahighlyredundantfeatureset,the

correlationbetweenthecandidateswasalsotakenintoaccount.

EvenapromisingWaveletcoefficientwasdiscardedifits

correla-tionvaluewithrespecttothesuperiorcoefficientsalreadychosen

crossedacertainthresholdvalue.

Inconclusion,thedistributionapproachanalysesthe

decom-position result in further detail; however, it also requires a

considerableamountof additional computation time compared

to the Shannon entropy-based method. Nevertheless, as each

approachcanidentifydistinctspikecharacteristics,itisbeneficial

toextractbothtypesofWaveletfeaturesforfurtherevaluation.

3.2. Featureevaluationandreduction

Witheveryspikesortingfeaturecalculated,thekeyistofind

thesetthat is most suitablefor the taskathand. Therefore, it

isimportant tokeep evaluatingthefeature’s distributions with

respecttotheirseparability.Inthefollowingsection,thecrucial

partofthepresentedalgorithmisdescribed,whichisusedto

eval-uateandcomparetheextractedfeatures.Forthesakeofclarity,this

isthesamealgorithmthatwasappliedtoidentifythemostsuitable

WaveletcoefficientsforbothWaveletfeaturetypes.

Tofurtherspecifytheactualgoalofthealgorithm,itishelpful

tolookatcertaindistributionfunctionsthatcanbefoundinthis

context(seeFig.4).

LookingatFig.4,itisquiteobviousthat thetwo featuresin

Fig.4AandBseemtobepromisingexampleswithaclearly

identifi-ablemultimodalcharacter,possiblyallowingtheseparationoftwo

spikeshapes.Thus,thedesiredfeaturedistributionshouldnotonly

showatleasttwodistinctmaxima,butalsooughttobeclearly

sep-aratedfromeachother.Therefore,asbothcriteriashouldbeideally

describedinasinglevalue,thisinevitablyleadstosomeblurriness

whenevaluatingthevariousfeaturecandidates.Asaconsequence,

itiscrucialthatbothcriteriaareratedasaccuratelyaspossiblein

ordertominimisetheresultinguncertainty.Hence,the

expecta-tionmaximisation(EM)methodwaschosenforthispurpose,asit

offerssuitableparametersforassessment.AnEMalgorithmusing

Gaussianbasisfunctionswasselected,assumingthatthesumof

thenoisecomponents,usuallypresentineverymeasurementdata,

leadstoaGaussiandistortionofgenerallystablespikeshapes.The

EMalgorithmisusedtoapproximatethecalculateddistributionof

eachfeaturewithaso-calledmixtureofGaussians(MoGs)thatcan

bemathematicallydescribedwiththefollowingequation(Polanski

andKimmel,2007): fmix(x,˛1,...,˛k,p1,...,pk)= K



k=1 ˛kfk(x,pk) (4)

where˛kspecifiestheweightofeachfunctionwithinthemixture

modelandpkdenotesitsactualparameters.Inturn,everyGaussian

componentisspecifiedwithitscharacteristicformula:

fk(x,pk)=fk(x,k,k)= 1 



(2n) exp



(x−)2 22 k



(5)

Therefore,theEMalgorithmneedstoscalethevaluesk,kand

˛kforeachmixturecomponent.Tofindasuitableapproximation,

ititeratesbetweentwodistinctivesteps,theexpectation(E-step)

andthemaximisation(M-step).Atthestartoftheprocess,either

arandomoranapriorivalueispresumedforeachparameter.On

thisbasis,thedifferenceQ(p,pold)betweentheactualdistribution

andthecurrentapproximationiscalculatedintheE-step,usingthe

log-likelihoodfunction(PolanskiandKimmel,2007):

Q (p,pold)= N



n=1 K



k=1 p(k|xn,pold)ln˛k + N



n=1 K



k=1 p(k|xn,pold)lnfk(xn,pk) (6)

TheparameterNequalsthenumberofdatapointsofthefeature

distribution,whileKstilldenotesthenumberofGaussiansinthe

mixture.InthesubsequentM-step,anewsetofparametersis

deter-minedtoimprovetheparitybetweenapproximationandfeature

distribution.Inthiscase,theparameterspk,newk ,knewand˛newk

canbecalculatedwiththefollowingsetofequations(Polanskiand

Kimmel,2007): new k =

N n=1xn×p(k|xn,pold)

N n=1p(k|xn,pold) (7) (new k ) 2 =

N n=1(xn−newk ) 2 ×p(k|xn,pold)

N n=1p(k|xn,pold) (8)

(7)

˛new k =

N n=1(k|xn,p old) N (9)

Thesenewparameterscanbeevaluatedandrefinedwithanother

iterationoftheE-stepandM-step,respectively,eitheruntilQ(p,

pold)cannotbeimprovedanyfurtheroruntilacertainthreshold

criterionismet.

Tofacilitatetheapproximationprocess,eachnativefeature

dis-tribution,derivedwithitshistogram,issmoothedinordertolimit

theeffectsofoutlyingdatapoints.Furthermore,thenumberand

correspondingpositions of existing maximaare determined by

simplecurvesketchingandserveasinitialvaluesfortheEM

pro-cess.

Oncethefinal approximationresultisacquired,theobtained

parametersprovideasuitablebasisfor evaluation.Whereasthe

distance between each Gaussian component in the mixture is

expressedbynew

k ,˛ new

k modelsthedistinctionoftheindividual

functions,whilenew

k canbeusedtodeterminetheoverlapping

oftheidentifiedGaussians.Asitishardtodetermineany

hierar-chyamongtheseparameters,theirvalueshavetobetransformed

intoanoverallratingfortheexaminedfeatures.Forthispurpose,

thefollowingequationhasbeenspecificallydesignedtoprovidea

solutiontotheresultingproblem:

v

(k)=

K

k/=1Co

v

(i,k)

((max)−(k))ײ(k) (10)

Thevalue

v

(k) ratesthe distinctivenessof an individual

Gauss-ianfunction,k,presentinthemixturecomparedtotheGaussian

surroundingitsglobalmaximum,denoted by(max).Onlythe

covariancemeasureputseverymixturecomponentintoaccount,

toaccuratelyexpresstheactualoverlap.AsdescribedinSection

3.3,thepresentedspikesortingalgorithmisspecificallydesigned

toseparatetwospecificclusters.Therefore,thefeaturesshowinga

bimodalcharacteristicareofparticularinterest.Asaconsequence,

inthefirststep

v

(k)iscalculatedindividuallyforeveryGaussian

componentofeachmixture.Afterwards,theminimumvaluefor

v

(k)overallcomponentsofthemixtureisdeterminedandserves

inasanoverallratingforthecorrespondingfeature.Inaddition,

featuresthatappeartohaveonlyaunimodaldistributionarealso

evaluatedbythevariancetheyexplain,butaretreatedassecondary

candidates.

It is this rating that allows thederivation of a unique

fea-turesetforeachindividualdataset.Theparticularcandidatesthat

minimise

v

(k)areselected,althoughagainwiththecorrelationas

anadditionalcriterion.Inthisalgorithm,themaximumsubsetsize

wasadjustedtosix,simplytolimitthecomputationalcomplexity

ofthesubsequentclusteringprocess.However,itispossiblethat

thisnumberisnotreached,asthecorrelationcriteriacanfurther

limitthesizeofthedeterminedsubset.

3.3. Clustering

Asthefinalstepinthespikesortingprocess,theresulting

fea-turesethastobeclassifiedintodistinctclustersthatcontainthe

particularspikes ofan individualneuron. For this purpose, the

derived subsetis firstused tospan a multidimensionalfeature

space.Inordertoidentifytheclustersinvolved,theEMalgorithm

isusedonceagain.Ontheonehand,thisclusteringmethodhasthe

advantageofprovidingasoftclassificationresult,whichallowsthe

exclusionofoutliers,unlike,forexample,thek-meansalgorithm

(Clarkeetal.,2009).Ontheotherhand,asaplainclustering

algo-rithm,itdoesnotrequireanytrainingprocessinadvance,whichis

especiallyadvantageousinthiscase.

Similarlytothefeatureevaluationstepdescribedinthe

previ-ouschapter,theEMalgorithmapproximatesthedatadistribution

ofagivenspace,usingGaussianbasisfunctions.Consequently,each

datapoint, resemblinga specificspike event,is sortedintothe

resultingmixturecomponentsbasedontheprobabilityassociated

witheachGaussian.Asmentionedbefore,thealgorithmisdesigned

tospecificallyseparatethedataintotwoGaussianclusters.Firstof

all,thiseliminatestheproblematictaskofdeterminingthenumber

ofclusters,i.e.thenumberofspikingcells.Secondly,thisgivesthe

opportunitytoderivefeaturesetsthatcanresemblethedifference

inshapeoftheremainingspikesignalsmoreaccurately,thereby

enhancingtheseparability.Thedescribedalgorithmisgenerally

abletoclusterthedataintomorethanjusttwoclasses.

Neverthe-less,intheproposedalgorithm,aniterativeapproachisfavoured,

asitgivestheopportunitytoseparateonespecificclassfromthe

restofthesignalineachiteration.Thus,thederivationofsuitable

featuresbecomessignificantly easier,improvingthesubsequent

clusteringresult.Asafinal consequencethisapproach basically

minimisesthechanceofoversorting,whichmeanstheseparation

ofthespikesof oneindividual cellintomultipleclusters, since

thefeatureevaluationprimarilycreditsthedistinctivenessofthe

Gaussiansinthemixtureofafeatureratherthanthenumberof

Gaussiansinit.ThoughMoG’swithmorethantwoGaussian

com-ponentsarenotexcludedingeneral,thecovariancecriteriaofEq.

(10)ensuresthatsuchfeaturesarepicked,onlyifthesecomponents

canbeclearlyseparated.

4. Resultsanddiscussion

Thealgorithm described inthepreviouschapterwas

imple-mented in a self-made Matlab©-based software. To test the

feasibilityof thedeveloped algorithm,both simulated dataand

recordeddataderived from3Dneuronal cellcultures, so-called

spheroids(LayerandWillbold,1993;Layeretal.,2002),wereused.

4.1. Resultswithsimulateddata

Simulateddatasetswereprimarilyappliedtovalidatetheresults

ofthedescribedspikesortingalgorithm.Suchsimulateddata

sam-pleshavetheadvantageofbeinglabelledandbettersuitedforthe

testingofageneralalgorithm.Hence,simulateddatapublishedfor

suchpurposesbytheUniversityofLeicesterwasused(cf.Quiroga

etal.,2004).Itofferedvariousexamplesofneuronaldata,

simu-latedatasamplerateof24kHz.Thedifferentsetsalsovariedin

theirsignal-to-noiseratio(SNR),allowinganevaluationofthe

sor-tingresultsasafunctionofSNR.Asanexemplarydataset,thefile

cdifficult2noise01withaSNRof10wasapplied(seeFig.5).

Thedatasetcontainedthreebasicspikeshapesthatwere

ran-domlydistortedbyGaussiannoisecomponents.Asapreliminary

step,thespikeeventsweredetectedatasimplevoltagethreshold

thatwascalculatedbasedonthestandarddeviationofthenoise

containedinthesignal(Chiappaloneetal.,2005).Toexcludeany

unwantednoiseeventsthatweredetected,afirstsortingiteration

wasperformed.

Next,theactualspikesortingtaskwasmetwithasecond

iter-ationofthespikesortingalgorithm.Variousspikefeatureswere

calculatedforeachspikeevent,basedona1mswindow(24data

points)aroundthedetectedtime-stampatitsvoltageminimum.

Inthefirstattempt,thedatawereclusteredviatheEMalgorithm,

applyingWavelet-basedfeaturesexclusivelyandinanother

sor-tingattempt,principalcomponentsonly(seeFig.6).Inorderto

improveclarity,theclusteringresultwasprojected onlyonthe

firsttwo dimensions of themultidimensionalfeature space.To

excludeoutlyingdatapointsusuallyoriginatingfromhighly

dis-tortedoroverlappingspikes,aconfidenceprobabilityintervalof

5%wasused.Inotherwords,datapointsbelongingtoacertain

clusterwithaprobabilityoflessthan5%wereseparatedintoa

(8)

Fig.5. 150mssectionofthesimulateddatasampleofdatafromtheUniversityofLeicester.Thesignal,sampledatarateof24kHz,containsthreebasicspikeshapeswith anSNRvalueof10.Inordertoprocessthedatawiththeanalysisframework,thenormalisedsamplewasmultipliedwithanoffsetvalueof70␮V.Furthermore,thesignal wasinverted,asthespikedetectionalgorithmofourMatlab©interfacegenerallyexpectsneuronalspikeswithnegativeamplitudes.Thedifferentspikeshapesaremarked withtheirrespectiveclusternumber.

Fig.6. Clusteringresultofthedatasetcdifficult2noise01usingtheEMalgorithmonlywithWavelet-basedfeatures,indetailthethreeWaveletpacketcoefficientswiththe mostsuitableprobabilitydistribution(left)andonlywiththefirstthreeprincipalcomponents,describing90%ofthedata’svariance(right).Forimprovedclarity,theplots onlyshowaprojectionofthemultidimensionalfeaturespaceontothefirsttwofeatures.Neithercombinationcorrectlyseparatedallthreespikeshapes,asonlytwodistinct clusterscouldbedetermined.Outlyingdatapointswereseparatedintoaparticularclusterusingaconfidenceprobabilityintervalof5%.

algorithms,couldseparateallthreespikeshapesintheparticular

featurespacecorrectly.

Hence,thesubsequentspikesortingresultwasnotsatisfying

either.Inthenextstep,thenovelautomatedfeatureevaluation

algorithm was used to find a suitable mixture of spike shape

features.Thedeterminedsetoffeatureswasthenfedintothe

sub-sequentclusteringstep.Fig.7showstheresultoftheEMalgorithm

Fig.7. Clusteringresultofthedatasetcdifficult2noise01usingthe(1)Wavelet coef-ficient(distribution)andthe(1)principalcomponent.Thesortingalgorithmcould separatethethreedistinctclusters,whileoutlyingdatapointswereseparatedinto aparticularclusterusingaconfidenceprobabilityintervalof5%.

inthetwo-dimensionalfeaturespacespannedbythetwobest

fea-tures,namelythefirstprincipalcomponentandthefirstWavelet

coefficientevaluatedfromitsdistribution.

InthefeaturespaceseeninFig.7,threeclustersinsteadoftwo

wereidentified.Hence,thefeaturecombinationfoundbythe

algo-rithmclearlydescribedtheparticulardatasetmoreaccuratelythan

thesetsofthetwopreviousapproaches.

Therefore,themixtureofdifferenttypesoffeaturesprovidesa

betterclusteringresultcomparedtothecommonattemptswith

oneparticulartypeonlyusedinmostofthealgorithmsalready

established.To lookat the resultsin furtherdetail, thesorting

resultwasevaluatedforeachspikebycomparingthecalculated

classificationresultandtheoriginallabel(seeTable3).

Iftheeventsofoverlappingspikesareexcluded,thespike

sor-tingalgorithmalmostyieldsaperfectresult,withonlyninefalsely

clusteredevents.Thisnumber,however,issignificantlydecreased,

onceoverlappingeventsaretakenintoaccount.Inorderto

exam-inethealgorithm’sresultsinmoredetail,theerrorsmadebetween

missing and incorrectlyassigned spikes ina certain clusterare

summarisedinTable4.

Mostspikesclassifiedinoneofthethreeclustersweresorted

correctly,even when consideringoverlapping events. This

phe-nomenoncanbeexplainedbytwoseparateerrors.First,spikesthat

followtoocloselyafteroneanotherweredetectednotastwo

dif-ferentbutonesingleevent.Hence,oneofthetwocontainedspikes

wasbasicallyomittedandtherefore,lostbeforethesortingstep.

Second,ifbothspikesweredetectedseparately,butdistortedeach

(9)

Table3

Comparisonoforiginalandcalculatedspikesortingresults,usingthedatasetcdifficult2noise01withaSNRof10.

Classes No.oforiginalspikes(with overlappingspikes)

No.ofcorrectlysortedspikes (withoverlappingspikes)

%ofcorrectlysortedspikes (withoverlappingspikes)

Overall 2742(3462) 2733(3351) 99.67(96.79)

Cluster1 957(1187) 953(1146) 99.58(96.55)

Cluster2 898(1136) 898(1113) 100(97.98)

Cluster3 887(1139) 882(1092) 99.44(95.87)

Table4

Classificationoftheresultingclusteringerrorsofthealgorithm,usingthedatasetc difficult2noise01withaSNRof10. Classes No.ofcorrectlysortedspikes

(withoverlappingspikes)

No.ofmissedspikes(with overlappingspikes)

No.offalsepositives(with overlappingspikes)

Overall 2733(3351) 9(111) 9(14)

Cluster1 953(1146) 4(41) 5(5)

Cluster2 897(1112) 0(23) 1(1)

Cluster3 883(1093) 5(47) 3(8)

specialremnantcluster(seeredclusterinFig.7).Inotherwords,

onlyoverlappingspikesthatstillshowedacertainamountofits

distinctiveshapecouldactuallybeprocessedcorrectly.

Onewayof facingthechallengeof overlapping spikesis by

implementinganadditionaltemplatematchingstep,whichrefines

thediscardedspikeevents.Byfindingacombinationofthe

discov-eredspikeshapesinthespikesortingstepthatmatchestheshape

ofaneventinsidetheremnantcluster,thespikesinvolvedcanbe

identified(cf.Zhangetal.,2004).

Inordertoevaluatetheinfluenceofdifferentnoiselevels,the

algorithmwasalsotestedwithanotherdatasetwithaSNRof6.67.

Theused setcdifficult2noise015 basicallyconsists ofthe same

threespikeshapesasthedataabove,yetthedistributionofthe

differentspikesis changed slightly. Due totheincreased noise

distortionitismoredifficulttofindsuitablespikefeatures that

candiscriminateall threeshapes in just onefeatureextraction

step.Hence,usingthefirsttwoWaveletcoefficients(distribution)

andthefirsttwoprincipalcomponents,thealgorithmcouldonly

separateoneofthecellsignalsfromtheother,butnotallsignal

shapesintheinitialprocess.Inthiscasetheadvantageofthe

iter-ativestructureoftheproposedalgorithmcomesintoeffect.Asit

ismucheasiertofindfeaturesthatspecificallydescribethe

dif-ferencesofthetworemainingspikeshapes,thesortingresultis

greatlyimprovedonce thefirstcellsignalisexcludedfromthe

dataset.UsingtheadjustedfirstandsecondWaveletcoefficients

(distribution)aswellastheadaptedfirstprincipalcomponentthe

tworemainingcellsignalscanbeseparatedfromeachother.Ina

finaliterationtheremainingspikeshapecanbeexcludedfromthe

obligatoryremnantcluster.Theresultofthisprocessissummarised

inTables5and6,whicharestructuredsimilartoTables3and4of

thefirstdataset.

Asexpectedtheaccuracyofthespikesortingprocess

deterio-rateswithincreasingsignal-to-noiseratios.Inthiscaseboththe

resultdiscardingoverlappingspikesaswellasresultthatincludes

theseeventsareeffectedbyaboutthesamemargin.Eventhough

theincreasedsignal-to-noiseratioposesanadditionalchallenge,

themajorityofthespikescontainedinthedataweresuccessfully

sortedinthecorrectclusters.Thusthenovelfeatureextractionand

evaluationstepsofthedescribedalgorithmshowsignificant

advan-tagescomparedtothemethodscommoninmostestablishedspike

sortingalgorithms,especiallywhenusedinaniterativeprocess.

4.2. Resultswithspheroiddata

Tofurtherassessthepresentedspikesortingalgorithm,data

obtainedfromchickembryonicneuronscoupledtomicroelectrode

arrayswereused.

Forthispurposechickneuroepithelialtissue(stages28and29,

HamburgerandHamilton(1951))waschoppedandenzymatically

dissociatedwithTrypsin/EDTA(all chemicals: CCPro,Oberdorla,

Germany, unless stated otherwise), similarly as described for

cardiactissue inEgertand Meyer(2005).Approximately2

mil-lioncellsweretransferredto35mmPetridishescontaining2ml

of cell culture medium (DMEM with 10% FCS, 2% CS (Sigma,

Munich, Germany), 0.5mM glutamine, 50U/ml penicillin, and

50␮g/mlstreptomycin).Thecellswerere-aggregatedinto

three-dimensionalsphericalinvitronetworks,socalledspheroids,by

Table5

Comparisonoforiginalandcalculatedspikesortingresults,usingthedatasetcdifficult2noise015withanSNRof6.67.

Classes No.oforiginalspikes(with overlappedspikes)

No.ofcorrectlysortedspikes (withoverlappedspikes)

%ofcorrectlysortedspikes (withoverlappedspikes)

Overall 2631(3440) 2577(3243) 97.95(94.27)

Cluster1 858(1142) 839(1065) 97.78(93.28)

Cluster2 851(1113) 824(1040) 96.83(93.44)

Cluster3 922(1185) 914(1138) 99.13(96.03)

Table6

Classificationoftheresultingclusteringerrorsofthealgorithm,usingthedatasetcdifficult2noise015withanSNRof6.67. Classes No.ofcorrectlysortedspikes

(withoverlappedspikes)

No.ofmissedspikes(with overlappedspikes)

No.offalsepositives(with overlappedspikes)

Overall 2577(3243) 54(197) 53(92)

Cluster1 858(1065) 19(77) 34(49)

Cluster2 851(1040) 27(73) 7(26)

(10)

Fig.8.100mssectionoftherecordedspheroiddatasample,recordedwith50kHz.Thespikeswerelabelledaccordingtotheclusteringresultofthesubsequentclassification (cf.Fig.9).

placingthePetridishesonagyratoryshaker(72rpm)withinan

incubator(37◦C,5%CO2)(Dausetal.,2011).

The resulting spheroids were then plated on

microelec-trodearrays(MEA;MultichannelSystems,Reutlingen,Germany)

for extracellular recording. The MEA chips consisted of 60

substrate-embeddedtitaniumnitrideelectrodes(30␮min

diam-eter) arranged in an 8×8 matrix. Extracellular field potentials,

so-called spikes, wererecorded (25–50kHz sampling rate) and

saved for offline analysis. The timestamps of the spikes were

determinedbyathresholdseparatingneuronalsignalsfromnoise

(Chiappaloneetal.,2005).Thesedataposeanadditionaldifficulty,

asmultipleneuronscontributetotheelectrodesignal.However,

mostoftheseneuronshadaninsufficientcouplingwiththe

elec-trode,which significantly hamperedthe ability toseparatethe

particularspikeshapesoftheemittingneurons(seeFig.8).

Thespikedetectionprocessforthistypeofdatawasasdescribed

above.However,duetothesignal’scomplexity,asupplementary

sortingstepwasnecessarytoexcludeanyshapesoriginatingfrom

distantneuronsthatcouldnotbesatisfyinglyprocessedandonly

actedasadditionalnoisyartefacts.

Similartotheevaluationprocessusingthesimulateddata,the

spikesofthespheroid signalweresortedusingeither

Wavelet-based features, principal components or the feature mixture

derived byournovelapproach.Allmethods couldidentifytwo

differentspikeshapesinthespheroidsignal,whichmeansthat

therecordedspheroidsignalcontainedatleastthesignalsoftwo

neurons,withatightcouplingtotheelectrode(cf.Fig.9).Yetthe

differentfeatureextractionapproachesshowavaryingabilityto

discriminatebetweenthetwoshapes.Theresultingclustersofthe

Wavelet-basedapproachshowedasignificantoverlap,unlikethe

othertwomethods.Theresultusingprincipalcomponents

how-everwasquitesimilartotheclusteringofourproposedmixture

approach,onlywithslightdifferencesattheborderbetweenthe

twoidentifiedclusters.Eventhoughtheresultusingthefeature

mixtureoftheproposedalgorithmmightbeconsideredmarginally

moreaccurate,suchimprovementwouldbequitedifficulttoassess,

duetothelackoflabellingfortherecordedspheroiddata.

In general,eventhough both Waveletand principal

compo-nentbasedfeatureextractiontechniquesarecapableofproducing

acceptablespike sorting results, theyusually fail toachieve an

Fig.9.Scatterplotofthespheroiddatainthefeaturespacespannedbythenegative amplitudeandthe(2)Waveletcoefficient(distribution).Againthescatterplotshows aprojectionoftheclusteringresultsontothetwodimensionalfeaturespaceoffirst andsecondfeature.Thedatacanbesortedintotwospikeshapeclustersusingthe (2)principalcomponentasadditionalfeature.Thedatapointsinredagainhighlight theremnantclustercontainingoutliersinthespheroiddata.(Forinterpretationof thereferencestocolorinthisfigurelegend,thereaderisreferredtothewebversion ofthearticle.)

optimaldiscriminationbetweendifferentcellsignals,simplydue

totheirlack ofvariability. Whereasouralgorithm includes the

strengthofprincipalcomponentswithoutneglectingthepotency

ofotherfeatures,themethodsusingonlyasinglefeaturetype

suf-ferfromtheirrespectiveweaknesses.Hence,itismostfeasibleto

includebothfeaturetypesinordertopreservetheirstrengthsand

tocompensatetheirweaknessesrespectively.

5. Conclusionandoutlook

Insummary,thedescribedalgorithmyieldsaccurateresultsfor

simulatedand recordedneuronaldatasets.Theconcept of

eval-uatingabroadsetofextractedfeaturestofindasuitablesubset

thatdescribesaparticulardatasetmostaccuratelyseemstohave

(11)

featuretype, thedifferentcapabilitiesofvariousfeature

extrac-tiontechniquesbecomeavailableforthesubsequentsortingstages

andtherefore,significantlyfacilitatetheclusteringtask.Therefore,

thisenablesevenacomparablysimpleEMclassificationmethodto

produceapreciseclusteringresult.

Basedonthisfeatureevaluationtechnique,itisalsopossible

toidentifythefeaturesthatexplicitlydiscriminatebetweentwo

differentspikeshapes,whichisadvantageousinneuralrecordings

thatincludemorethanjusttwosignalshapes.Thisabilityisused

bythepresentedalgorithm,giventhataniterativesubtraction

pro-cessofparticularspikeshapesfromtherestofthesignalcanbe

performedifnecessary.Thisallowstheidentificationofanew

fea-turesubsetmostsuitablefortheremainingspikesinthedataset

afterexcludingacertainshape.

Theonly issuethatcouldbeidentifiedisthelackof an

ade-quatehandlingofoverlappingspikeevents.Nevertheless,thiscan

befixedquiteeasilywithanadditionaltemplate matchingstep

basedonthealgorithm’sclusteringresult.

Theremightstillbesomeroomforusefuladaptations.One

pos-sibleapproach,forexample,istheuseoft-distributionsinsteadof

Gaussiansasabasisfunction(Shohametal.,2003).Withan

addi-tionaladjustmentparameter,thedegreesoffreedom(DOF),these

functionsmightbemorerobusttoanynon-Gaussiannoise

distort-ions.However,asadrawback,thisadditionalparametercouldalso

resultinasignificantlyhighercomputationalcomplexity.

Onthe otherhand,the EMalgorithm itselfcouldbe

substi-tutedwiththeVariationalBayes(VB)algorithm(Takegawaetal.,

2010).BeingamoregeneralapproachthanEM,itoffersthesame

advantages,butalsodisadvantages,astheswitchfromGaussianto

t-distributions.

Anotherissuenecessarytoaddressisthecomputational

com-plexityofthealgorithmitself.Thetimerequiredtoapproximatethe

distributionofacertainfeaturesignificantlydependedonthe

par-ticulardistributioncharacteristics.Therefore,itishardtogenerate

ageneralisedstatementregardingtheprocessingtimeneededby

thealgorithm.However,thecurrentversionofthealgorithmhas

tobemodifiedinordertofunctioninanonlinesignalanalysis

pro-cess.Forthispurpose,itispossibletocomputeexplicitoperations

inparallel,usingmultipleCPUorGPUcores(cf.Chenetal.,2011).

Thus,theapproximationofdifferentfeaturesviatheEMalgorithm

canbecalculatedsimultaneously,effectivelyspeedingupthespike

sortingalgorithmaltogether.

Althoughtheremightbesomepossibleadaptationsofthe

algo-rithm, itsgeneral feasibility and advantages compared tomost

common spike sorting algorithms are quite obvious. The

rea-son for this is the improved feature extraction approach that

enablesanadaptivedeterminationofsuitable spikesorting

fea-tures.Thisallowstheinclusionofavarietyofdifferentfeaturetypes,

whichincreasesthechanceofidentifyingparticularkeydifferences

betweenasetofuniquespikeshapes.

In conclusion, the disadvantage of only beingable to use a

limitedsubsetoffeaturetypesthatmostestablishedspikesorting

methodsfacecan beovercomewiththenovelapproaches

pre-sentedinthispaper.Hence,itisalsopossibletoenhancealready

existingalgorithmswiththeseideas,improvingtheirclusteringor

classificationsignificantly.

Acknowledgement

ThisworkwassupportedbytheBMBFAN2208PT-AIF.

References

BiffiE,GhezziD,PedrocchiA,FerrignoG.Spikedetectionalgorithmimprovement, spikewaveformsprojectionswithPCAandhierarchicalclassification.IETConf Pub2008(CP540):122–6.

ChenY,TsaiY,ChenL.Algorithmandimplementationofmulti-channelspikesorting usingGPUinahome-caresurveillancesystem.ICMEIEEE2011(CFP11):1–6. ChiappaloneM,NovellinoA,VajdaI,VatoA,MartinoiaS,vanPeltJ.Burstdetection

algorithmsfortheanalysisofspatiotemporalpatternsincorticalnetworksof neurons.Neurocomputing2005;65:653–62.

ChoiJH,JungHK,KimT.AnewactionpotentialdetectorusingtheMTEOandits effectsonspikesortingsystemsatlowsignal-to-noiseratios.IEEETransBiomed Eng2006;53:738–46.

ClarkeB,FokouéE,ZhangHH.Principlesandtheoryfordataminingandmachine learning.NewYork:Springer;2009.p.408–11.

DausAW, GoldhammerM,LayerPG,ThielemannC. Electromagneticexposure ofscaffold-freethree-dimensionalcellculturesystems.Bioelectromagnetics 2011;32:351–9.

EgertU,MeyerT.Heartonchip—extracellularmultielectroderecordingsfrom car-diacmyocytesinvitro.In:DheinS,editor.Practicalmethodsincardiovascular research.Springer:Berlin;2005.p.432–53.

FeeMS,MitraPP,KleinfeldD.Automaticsortingofmultipleunitneuronalsignals inthepresenceofanisotropicandnon-Gaussianvariability.JNeurosciMethods 1996;69:175–88.

HamburgerV,HamiltonHL.Aseriesofnormalstagesinthedevelopmentofthe chickembryo.JMorphol1951;88:49–92.

HortonPM,NicolAU,KendrickKM,FengJF.Spikesortingbaseduponmachine learningalgorithms(SOMA).JNeurosciMethods2007;60:52–68.

HulataE,SegevR,Ben-JacobE.Amethodforspikesortinganddetectionbased onwaveletpacketsandShannon’smutualinformation.JNeurosciMethods 2002;117:1–12.

JoliffeIT.Principalcomponentanalysis.2nded.NewYork:Springer;2002.p.1–6. KimKH,KimSJ.Neuralspikesortingundernearly0-dBsignal-to-noiseratiousing

nonlinearenergyoperatorandartificialneural-networkclassifier.IEEETrans BiomedEng2000;47:1406–11.

Kim KH, Kim SJ. Method for unsupervised classification of multiunit neural signal recording under low signal-to-noise ratio. IEEE Trans Biomed Eng 2003;50:421–31.

LayerPG,WillboldE.Histogenesisoftheavianretinainreaggregationculture:from dissociatedcellstolaminarneuronalnetworks.IntRevCytol1993;146:1–47. LayerPG,RobitzkiA,RothermelA,WillboldE.Oflayersandspheres:thereaggregate

approachintissueengineering.TrendsNeurosci2002;25(3):131–4. OhmJR,LükeHD.Signalübertragung.11thed.Berlin:Springer;2010.p.203. OweissKG,AndersonDJ.Spikesorting:anovelshiftandamplitudeinvariant

tech-nique.Neurocomputing2002;44:1133–9.

PavlovA,MakarovVA,MakarovaI,PanetsosF.Sortingofneuralspikes:when waveletbasedmethodsoutperformprincipalcomponentanalysis.Natural Com-put2007;6(3):269–81.

PolanskiA,KimmelM.Bioinformatics.Berlin:Springer;2007.p.37–45.

QuirogaRQ,NadasdyZ,Ben-ShaulY.Unsupervisedspikedetectionandsorting withwaveletsandsuperparamagneticclustering.NeuralComput 2004;16: 1661–87.

SakmannB,NeherE.Singlechannelrecording.2nded.NewYork:Springer;2009. p.4.

SatoT,SuzukiT,MabuchiK.Fastautomatictemplatematchingforspike sor-tingbasedonDavies–Bouldinvalidationindices.ConfProcIEEEEngMedSoc 2007:3200–3.

ShohamS,FellowsMR,NormannRA.Robust,automaticspikesortingusingmixtures ofmultivariatet-distributions.JNeurosciMethods2003;127:111–22. StarkHG.Waveletsandsignalprocessing:anapplication-basedintroduction.Berlin:

Springer;2005.p.21–28.

TakegawaT,IsomuraY,FukaiT.Accuratespikesortingformulti-unitrecordings. EurJNeurosci2010;31:263–72.

ThomasCA,SpringerPA,LoebGE,Berwald-NetrY,OkunLM.Aminiature micro-electrodearraytomonitorthebioelectricactivityofculturedcells.ExpCellRes 1972;74:61–6.

Vargas-IrwinC,DonoghueJP.Automatedspikesortingusingdensitygrid con-tourclusteringandsubtractivewaveformdecomposition.JNeurosciMethods 2007:164.

VogelsteinRJ,MurariK,ThakuPH,DiehlC,ChakrabarttyS,CauwenberghsG.Spike sortingwithsupportvectormachines.In:IEEEEMBS;2004.p.4009–12. WangG,ZhouY,ChenA,ZhangP,LiangPA.Arobustmethodforspikesortingwith

automaticoverlapdecomposition.IEEETransBiomedEng2006;53:1195–8. WoodF,FellowsM,DonoghueJP,BlackMJ.Automaticspikesortingforneural

decoding.IEEETransBiomedEng2004;51:912–8.

ZhangP,WuJ,ZhouY,LiangP,YuanJ.Spikesortingbasedonautomatictemplate reconstructionwithapartialsolutiontotheoverlappingproblem.JNeurosci Methods2004;135:55–65.

References

Related documents

In the present analysis, it is seen that high rate of heat transfer takes place between the gas flow and solid element under high values of solid to gas conduction ratio, such

From the case study and data collected, we observe the following distinctions of ERP maintenance: (1) the ERP-using organization, in addition to addressing internally

In comparison with conventional methods, the proposed method shows lower PSNR than BM3D when pseudo white Gaussian noise is added, but higher PSNR than the spatial nonlocal

Digital forensics [1] is a branch of forensic science encompassing the recovery and investigation of material found in digital devices, often in relation to computer

It promotes the development, integration, and periodic evalua- tion of services and facilities in the community, and is designed- (1) to find individuals who have rheumatic fever

They have reported that the energy payback time (EPBT) of a PV/T air collector is reduced by taking into account the growth in annual energy availability of the thermal energy

Certainly having written rules is good practice but should an award of unfair dismissal be increased by 25% because, for example, an employer did not have a written rule that

Using a measure of the innovation in the average relative level of peers’ stock prices that we construct to limit the overlap with the information contained in the firms’ own