Supplemental Information. Discrete Stepping and Nonlinear Ramping Dynamics. Underlie Spiking Responses of LIP Neurons. during Decision-Making

(1)

Neuron, Volume

102

Supplemental Information

Discrete Stepping and Nonlinear Ramping Dynamics

Underlie Spiking Responses of LIP Neurons

during Decision-Making

David M. Zoltowski, Kenneth W. Latimer, Jacob L. Yates, Alexander C. Huk, and Jonathan

W. Pillow

(2)

+high +low zero

-low

-high

200 700

population

time (ms)

step +history linear +history linear +b, +history sqrt +b, +history

200 700 200 700200 700 200 700

200 700

time (ms) firing rate (sp/s) 5

30

200 700

5 30

200 700200 700 200 700

B

A

-.010 0 >.02 5

10

-.010 0 >.02 5

10

+low - -low

in-RF out-RF

# of cells

in-RF out-RF

<-.02 00 >.04 5

10 15

<-.020 0 .01 5

10

-high - -low

+high - +low

variance

5 30

C

fraction hit upper boundary

# of cells

in-RF in-RF, +high

0 0.5 1

0 10 20

Figure S1. Choice-conditioned analyses, related to Figure2. A.Left: the PSTH and spike count variance conditioned on in-RF (solid lines) and out-RF (dashed line) choices, averaged across all cells. Right: simu-lated PSTHs and spike count variances conditioned on choice from the models using the inferred parameters for each cell. For the stepping model, we simulated choices based on the simulated step direction. The probability of an in-RF choice was 0.99, 0.01, and 0.5 given a step to the up state↵2, down state↵1, no step during the trial, respecitvely. In the ramping models, the probability of an in-RF choice was computed using a sigmoid function asp(in-RF choice) = _1+exp( _k1₍_x_T _x0₎₎, wherexT was the value of the latent trajectory at the end of the trial,x0was the inferred mean diffusion start location parameter in the ramping model, and

k = 5. We thresholdedx0to be in the range [0.1,0.9]. B.We computed the posterior mean of the latent

ramping trajectoriesx1:T for each trial to test whether the motion coherence had an effect on the firing rate for trials with the same choice. We computed the average latent ramping slope during the first200ms of each trial for +low and -low coherence trials, separated by choice. The +low coherence trials had more positive slopes than -low coherence trials on both in-RF (p = 0.0059, one-sided paired t-test) and out-RF choice trials (p= 0.0024, one-sided paired t-test). The +high coherence trials had more positive slopes than +low coherence trials on in-RF choice trials (p= 0.0042, one-sided paired t-test), while the -high coherence trials had more negative slopes than -low coherence trials on out-RF choice trials (p= 1.03e 5, one-sided paired t-test). C.The distribution across cells of the fraction of trials for which the median of the posterior over the ramping bound-hit time was before the analysis window ended for in-RF and in-RF, +high trials.

(3)

ΔWAIC (55/60 correct)

ΔDIC (26/60 correct)

0 60

-600 0 600

Δ firing rate

D

E

Cell ΔW

AIC

ramp +history

A

true ramp +history true step +history

step +history

-500 0 100

-100 0 500

ramp +b, +history

true ramp +b, +history true step +history

step +history

B

0 100

<-100 0 >100

Δ firing rate

ΔWAIC (74/80 correct)

ΔDIC (45/80 correct) single coherence simulations without history

Cell ΔW

AIC

ΔWAIC ΔDIC correct

ΔWAIC ΔDIC -500

0 100

-100 0 500

single coherence simulations with history

C

Cell ΔW

AIC

-500 0 100

-1000 500

true sqrt +b, +history true step +history

sqrt +b, +history step +history

Figure S2. The WAIC correctly identifies simulated data from different models, related to Figures4and5.

A.The WAIC reliably distinguished data from the ramping and stepping models with spike-history simulated from the mean parameters of the ramping (left) and stepping (right) models with spike-history for each cell. The number of simulated trials per motion coherence was matched to the true number for each cell. B.

Same asA, except with ramping with non-zero baseline and history model. C.Same asA-B, except with the square root ramping with non-zero baseline and history model. D.-E. Model comparison results from simulated single-coherence ramping data as a function of the change in firing rate from the beginning to the end of the trial using the DIC and WAIC. The WAIC correctly identifies data simulated from single-coherence ramping models (Chandrasekaran et al.,2016).

(4)

A

B

random onset & offset (1)

0 2 4 0

3 6

0 20 40

0 3 6 random onset & offset (2) random offset

random onset (2)

-80 -40 0 40 80 -80 -40 0 40 80

0 3 6

# simulations

firing rate (sp/s)

200 1200

time (ms)

200 1200

time (ms) 0

20 40

firing rate (sp/s)

0 20 40

firing rate (sp/s)

0 20 40

firing rate (sp/s)

0 20 40

firing rate (sp/s)

0 2 4

ΔWAIC ΔWAIC

-80 -40 0 40 80 -80 -40 0 40 80 200 1200

time (ms)

200 1200

time (ms) ΔWAIC ΔWAIC

-80 -40 0 40 80

200 1200

time (ms) ΔWAIC

# simulations

# simulations # simulations

# simulations

random onset (1)

C

D

E

ramp step

Figure S3. Simulated model comparison between the stepping model with history and the linear ramping model with history with random integration onset and/or offset times, related to Figures4and 5. A. Left: Example simulated firing rates from the ramping model with history with a random integration offset time. On each trial, the offset time was set to100ms after a competing accumulator with perfectly anticorrelated diffusion noise hit the upper boundary (Mazurek et al.,2003;Latimer et al.,2017). Right: WAIC differences between the ramping and stepping models with history fit to 16 simulated cells. Positive values favor the step-ping model with history.B.Same asA, except for with a random integration onset time on each trial that was set to200ms plus10ms times a sample from a negative binomial distribution. Here, the parameters of the negative binomial distribution werer= 1.2andp= 0.4, which generated integration onset times with mean 207.9ms and std11.2ms.C.Same asB, except with onset times simulated from a negative binomial distri-bution with parametersr= 1.4andp= 0.8, which generated onset times with mean255.2ms and std52.2 ms.D-E.Same asA, except with both random integration onset and offset times. The integration onset times were generated from the distributions inBandC, respectively. In all but one simulation, the WAIC favored the ramping model with history. For all simulations, we set the following ramping with history model parameters to

(5)

cell

21 28 12

5

linear +history step +history

-150 -100 -50 0 50 100

Cell ΔLOO

cell cell

14 21

19₁₀

13 16

24₁₃

linear +b, +history sqrt +b, +history

-200 -150 -100 -50 0 50 100

A

C

B

linear +history step +history

-0.4 -0.2 0 0.2 0.4 0.6

-high -low zer

o +low +high out-RF in-RF

coherence choice

mean trial ΔLOO

step +history

Cell ΔLOO

-0.2 0 0.2

fit zer o

fit +cohfit in-RF

Figure S4. Model comparison using leave-one-out cross validation, related to Figures 4and5. We used PSIS-LOO to estimate the Bayesian leave-one-out cross validation performance (LOO) on held out trials

(Ve-htari et al.,2017). In Bayesian LOO, predictive performance on a held-out trial is evaluated by integrating over

the posterior distribution of the parameters conditioned on the rest of the trials. PSIS-LOO uses importance sampling to estimate this integral. The proposal distribution is the posterior distribution conditioned on all of the trials (which we sample from in our MCMC procedure) and the importance weights are stabilized through regularization. Model comparison using LOO provided nearly identical results to the model comparison using WAIC.A.Sorted LOO differences between the stepping and linear ramping models with spike-history for all neurons (error bars indicate±1SEM). Color conventions are the same as in Figure4. B. Left: Mean LOO differences across trials of each motion coherence or choice. Right: Mean LOO differences across trials computed for models fit only to data from zero coherence, positive coherence, or in-RF trials (respectively).

C.Sorted LOO differences between the stepping model and the linear (left) and sqrt (right) ramping models with non-zero baseline and spike-history for all cells.

(6)

3 3.5 0

2 4

3 3.2 3.4 0

2 4

3 3.2 3.4 0

2 4

2 2.2 2.4 0

5

1.5 2 2.5 0

2 4

2 2.2 2.4 0

2 4

1.6 1.8 2 0

5

1.6 1.8 2 0

2 4

1.7 1.8 1.9 0

2 4

3.8 4 4.2 0

2 4

3.8 3.9 4 0

2 4

3.8 4 4.2 0

2 4

step +history

linear +history linear +b, +history

frequency

WAIC [x104_]

Cell 1

Cell 2

Cell 3

Cell 4

simulated data

Figure S5. Comparison of WAIC values fit to simulated data and WAIC values for the experimental data, related to Figures4and5. For four different cells, we simulated 16 synthetic datasets each from the inferred parameters of the linear ramping with zero baseline, linear ramping with non-zero baseline, and stepping models (all with history). We then fit the corresponding model to each simulated dataset and computed the WAIC of the simulated data given the true model (light blue histograms). We compared these values with the WAIC values computed on the experimental data (dark blue arrow). For the linear ramping with non-zero baseline and stepping models with history, the WAIC values of the true data were most often within the range of WAIC values computed on simulated data. For this analysis, we selected two of the cells with the strongest choice selectivity (cells 1 and 2) and two random cells (cells 3 and 4).

(7)

0 10 20 30 40 50 mean firing rate (sp/s)

-0.8 -0.6 -0.4 -0.2 0 0.2 0.4

normalized W

AIC dif

fer

ence

zero baseline nonzero baseline

B

A

0 10 20 30 40

in-RF - out-RF (sp/s) -0.6

-0.3 0.3

normalized W

AIC dif

ference

0

Figure S6. Relationship between WAIC differences and firing rates, related to Figure4. A-B.For each cell, the average WAIC difference across trial is plotted against (A) the overall mean firing rate for that cell and (B) the difference in firing rates at the end of in-RF vs. out-RF trials for that cell. The different color dots correspond to comparison of the (red) linear ramping model with zero baseline and history and (blue) linear ramping model with baseline and history against the stepping model with history, respectively. The correlation between the mean firing rate and the normalized WAIC differences is weak (⇢= 0.01and⇢= 0.15for the zero baseline and nonzero baseline conditions, respectively). The data exhibit negative correlations between the modulation in firing rate across the in-RF and out-RF conditions and the WAIC differences (⇢ = 0.35 and⇢= 0.52for the zero baseline and nonzero baseline conditions, respectively).

(8)

0 >1000

Cell ΔW

AIC

B

C

7 8 8

2

3 8 8

6

2 8 8 6

+b +b

+history sqrt quad exp

linear reaction time task

Cell ΔW

AIC

-60 0 80

-80 0 40

cell step +history

linear +history

linear +b, +history

sqrt +b, +history

population PSTH step +history linear +history linear +b, +history sqrt +b, +history

200 600

20 70

firing rate (sp/s)

+high +low zero -low -high

200 600 200 600 200 600 200 600

A

time (ms)

Figure S7. Analysis of LIP responses (N = 16) in a reaction-time (RT) task (Roitman and Shadlen,2002), related to Figure7. A. Population PSTH and simulated PSTHs from the fitted models, aligned to motion onset.B.Model comparison of extended ramping models and linear ramping model. Including spike-history dependence and non-zero baseline firing rates improved ramping model performance but different nonlin-earities performed similarly. C. Comparison of the stepping model and different ramping models, all with spike-history. See FigureS8for a comparable analysis using cross-validation.

(9)

reaction time task discrete-pulse task

A

B

-250 0 >250

Cell ΔLOO

cell

step +history

linear +history linear +b, +history sqrt +b, +history

-30 0 40

-40 0 30

-40 -10 20

7

8 8 2

3

10 6 5

2

8 8

5

Cell ΔLOO

cell

step +history

linear +history linear+b, +history sqrt +b, +history 71

91 24

14

49

75 40

26

55

73 42

28

Figure S8.Model comparison using leave-one-out cross validation (LOO) for the (A) discrete-pulse and (B) reaction time data, related to Figures7andS7. LOO was estimated using PSIS-LOO (Vehtari et al.,2017, see FigureS4). Model comparison using LOO for these datasets provided similar results to model comparison using WAIC.