Several behaviors were typical when rats encountered a delay upon entering a zone. Second
Row. If rats decided to stay, they generally proceeded to the reward site and waited until the tone
counted down and reward was delivered (as indicated by the very low average speed for the
remainder of the time in zone). On these passes VTE was typically quite low. Third Row. If the
delay was above threshold, rats would often skip the zone relatively quickly (decrease in speed at
1 second followed by increasing speed after 2 seconds), spending little time in the current zone.
VTE on these passes was typically low. Fourth Row. If rats encountered a close to threshold
delay and chose to skip the reward, VTE remained high. Rats remained relatively stationary for a
longer period of time (from 1 to 5 seconds) before finally locomoting and leaving the current
zone for the next zone. Fifth Row. On close to threshold delays, rats demonstrated stronger
VTE. If rats chose to sample the reward, they would proceed towards the feeder and wait through
the remainder of the delay (early fluctuation in speed indicates high VTE, followed by decrease,
near 0 cm/s speed indicates the rat has arrived at the feeder location where he remains until
reward is received).
Supplemental Figure S2. Comparison of thresholds within session by rat. Thresholds were
consistent within each session. If we compared the thresholds from the first half to the second
half, no thresholds were significantly different between the first and second half of each session.
Red bars represent the standard error.
Supplemental Figure S3. Overall food handling time. After consuming food, rats typically
took 20-30 seconds before leaving the zone. This did not change as a function of the delay the rat
had waited before receiving the food.
Supplemental Figure S4. Increasing the number of pellets increases the average delay
waited. To determine if the rats took value into account when making decisions to stay or go (a
key tenet in neuroeconomics (Montague and Berns, 2002, Padoa-Schioppa and Assad, 2006,
Kable and Glimcher, 2007, Rangel et al., 2008)), two of the rats (R231 and R234) underwent an
additional variation of the Restaurant Row task following completion on the unmodified version
of the task.
In this modified version, sessions consisted of four 20 minute blocks. During each 20 minute
block, one reward flavor site dispensed three food pellets rather than two pellets (i.e. 3x 45 mg),
while the other sites only dispensed one food pellet (i.e. 1x 45 mg). The four blocks allowed us
to have each site be the more valuable site for one block. The order was randomly determined
each day. Delays were randomly selected, as in the original task. Each 20 minute block was
followed by a one minute rest, during which time the rat was removed to a small flower pot to
the side. Each rat ran one complete session of four blocks per day.
Rats were willing to wait longer for the larger reward (errors bars represent +/ standard error).
This manipulation indicates that increasing the reward size increased the time rats were willing
to wait, which implies that increasing reward size had more value, and that the rats were
behaving economically.
2
Offer at current
Delay > threshold
Delay > threshold
Delay > threshold
Delay < threshold
300
1000
250
200
150
100
30
Zone3
Threshold 26(s)
Delay 12(s)
Stay
200
50
0
R210 Session 10
Lap 5
Zone4
Threshold 14(s)
Delay 1(s)
Stay
VTE
IdPhi
Number of samples
Supplemental Figure S1
20 15 10 5
0
5
10 15 20
+/ Delay time from threshold (threshold at 0 s)
25
20
15
Duration waited 10 5
(s)
0
25
15 20
Delays (s)
10
30
Zone1
Threshold 13(s)
Delay 15(s)
Skip VTE
0.025
0.02
0.015
0.01
0.005
At Feeder
Receives Reward
Threshold 14(s)
Delay 9(s)
0
2
10
0
0
0.025
0.02
0.015
0.01
0.005
0
2
10
9 10
2.5
30
Threshold 14(s)
Delay 29(s)
0
0
0.025
0.02
0.015
0.01
0.5
1.5
30
0.03
0.03
Percent of events at given VTE score
Zone2
Threshold 13(s)
Delay 21(s)
Skip
0.005
0
2
10
Threshold 14(s)
Delay 16(s)
0
0
0.025
0.02
0.015
0.01
30
0.03
At
Feeder
Receives
Reward
0.005
0
2 3 4 5 6 7 8 9 10
Log IdPhi value: higher = more VTE
Threshold 14(s)
Delay 14(s)
0
0
5
10
15
Time from zone entry to zone exit (s)
Supplemental Figure S2
Zone 1
Threshold (s)
30
25
30
R210
Zone 2
30
Zone 3
30
25
25
25
20
20
20
20
15
15
15
15
10
10
10
10
Early
Zone 1
30
25
Late
30
Early
Late
Zone 2
30
Early
Late
Zone 3
5
30
25
25
25
20
20
20
20
15
15
15
15
10
10
10
10
R222
Early
Late
Zone 1
30
5
30
R231
Early
Late
Zone 2
5
30
Early
Late
Zone 3
5
30
25
25
25
20
20
20
20
15
15
15
15
10
10
10
10
25
Early
Late
Zone 1
Early
Late
Early
Late
Zone 3
30
30
25
25
25
20
20
20
15
15
15
15
10
10
10
10
30
25
R234
20
Early
Late
Early
Late
Early
30
Late
Zone 4
Early
Late
Zone 4
Early
Late
Zone 4
Early
Late
Zone 4
Early
Late
Supplemental Figure S3
120
Timespent(s)
100
0.08
80
0.06
60
0.04
40
0.02
20
10
20
Delay(s)
30
Supplemental Figure S4
15
Larger 3 pellets
Smaller 1 pellet
14
13
12
11
10
8
Cherry
Banana
Plain
Flavor
Chocolate
Supplemental Figure S5
Bregma 1.70mm
R234
R231
R222
R210
AcbC
CPu
Bregma 3.70mm
AchCh
aca
LO
VO
AI
Supplemental Figure S6
OFC
15
10
10
30
0
4
0
2
25
30
15
20
10
0
0
2
4
Reward triggered at 0 (s)
15
10
10
5
0
2
0
0
2
4
Reward triggered at 0 (s)
30
25
10
0
4
15
10
2
0
2
5
0
25
6
4
2
0
2
20
15
10
5
0
0
2
4
Reward triggered at 0 (s)
0
4
25
15
10
5
0
0
2
4
Reward triggered at 0 (s)
20
15
10
5
0
0
2
4
Reward triggered at 0 (s)
R22220110731TT12_1
Reward site (banana)
0
2
0
2
10
0
4
0
2
3
2
1
0
2
15
10
5
0
0
2
4
Reward triggered at 0 (s)
25
10
15
0
4
20
15
25
20
20
10
15
25
20
25
5
25
0
4
0
2
20
0
2
10
15
0
2
20
4
0
2
15
15
0
4
25
R22220110805TT12_11
Reward site (banana)
8
20
0
2
Event #
20
10
15
0
2
Event #
0
2
10
30
Event #
15
15
Event #
20
0
0
2
4
Reward triggered at 0 (s)
35
30
25
Event #
R22220110805TT07_3
Reward site (banana)
5
20
20
15
10
5
0
0
4
25
20
15
10
0
2
Event #
20
20
25
10
25
30
Event #
R23420111207TT08_1
Reward site (banana)
30
12
Event #
0
0
2
4
Reward triggered at 0 (s)
0
4
15
25
20
10
0
2
15
10
15
15
10
10
5
0
2
30
0
0
2
4
Reward triggered at 0 (s)
R21020110209TT01_3
Reward site (banana)
25
20
20
15
15
10
10
5
0
2
30
25
20
0
4
0
2
30
20
0
0
2
4
Reward triggered at 0 (s)
25
30
15
20
10
10
5
0
2
50
0
4
25
25
20
20
15
15
15
15
10
10
10
10
0
2
0
0
2
4
Reward triggered at 0 (s)
0
2
0
0
2
4
Reward triggered at 0 (s)
20
25
20
20
15
20
15
15
10
10
5
0
2
30
25
20
15
0
2
20
0
0
2
4
Reward triggered at 0 (s)
R21020110209TT02_3
Reward site (banana)
25
10
10
30
10
10
15
25
15
15
0
2
0
4
Event #
25
d
40
20
30
25
0
4
30
20
15
15
10
10
5
0
2
15
0
0
2
4
Reward triggered at 0 (s)
20
10
15
10
15
0
4
10
30
20
10
0
2
15
25
10
20
10
10
0
0
2
4
Reward triggered at 0 (s)
0
4
20
15
5
0
2
25
15
5
50
40
5
0
2
0
4
Event #
Event #
30
Event #
0
2
10
Event #
20
10
30
10
5
Event #
10
15
15
Event #
R21020110201TT03_2
Reward site (banana)
15
20
Event #
Supplemental Figure S7
vStr
R23420111213TT03_2
10
5
0
2
0
0
2
4
Reward triggered at 0 (s)
Supplemental Figure S8
OFC
0.35
0.4
0.3
0.35
0.25
0.3
Posterior probability
Posterior probability
vStr
0.2
0.15
0.1
0.25
0.2
0.15
0.1
0.05
0.05
Cherry
De
De
Banana
co
de
co
Plain
re
wa
Banana
l
tua
Ac
vStr
shuffle
Posterior probability
Posterior probability
0.5
0.25
0.2
0.15
0.1
0.05
0
ard
ew
al r
u
Act
0.3
rd
a
rew
OFC
0.35
wa
rd
Cherry
0.4
re
Plain
Other
Choc
Choc
rd
de
shuffle
0.4
0.3
0.2
0.1
0
Cherry
Banana
De
co
De
de
Plain
dr
ew
Choc
ar
Choc
d
Plain
Other
Banana
Cherry
de
dr
ew
ar
ard
w
l re
tua
Ac
co
tu
Ac
rd
wa
e
al r
Supplemental Figure S9
OFC
Decoded activity of each zone compared to all other zones
vStr
Decoded activity of each zone compared to all other zones
0.35
0.35
0.3
0.3
Posterior probability
Posterior probability
0.25
0.2
0.15
0.1
0.25
0.2
0.15
0.1
0.05
0.05
Cherry
Banana
De
De
Plain
co
de
Choc
zo
ne
co
Choc
de
dz
Plain
Other
Banana
Cherry
l
tua
Ac
ne
ne
l zo
ua
Act
vStr
0.4
0.5
0.35
shuffle
0.3
shuffle
0.4
Posterior probability
Posterior probability
zo
OFC
on
0.25
0.2
0.15
0.1
0.05
0
0.3
0.2
0.1
0
Cherry
Banana
De
Plain
co
de
dz
Choc
on
Choc
Plain
Other
Banana
Cherry
A
lz
a
ctu
ne
De
co
de
dz
on
on
al z
tu
Ac
OFC
vStr
b
Decoded activity of reward at each zone compared to all other zones
0.25
0.25
0.2
0.2
Posterior probability
Posterior probability
0.15
0.1
0.05
0.15
0.1
0.05
0
0
Cherry
Banana
De
co
Plain
de
dr
ew
ard
Choc
co
de
ew
ar
Banana
Cherry
tua
ne
l zo
a
ctu
n
l zo
Ac
OFC
vStr
0.4
0.5
shuffle
0.35
0.3
shuffle
0.4
Posterior probability
Posterior probability
dr
Plain
Other
De
Choc
0.25
0.2
0.15
0.1
0.05
0
0.3
0.2
0.1
0
Cherry
Banana
De
co
de
Plain
dr
Choc
Choc
ew
ar
Plain
Other
Banana
Cherry
co
de
re
wa
rd
ne
l zo
a
ctu
De
Ac
l
tua
ne
zo
0.45
Current
Previous
Next
Opposite
p(Reward)
p(Reward)
0.45
shuffle
0.25
4
Reward at 0 (s)
0
4
Zone entry at 0 (s)
shuffle
0.45
f
p(Reward)
0.25
shuffle
0.25
0.45
shuffle
0.25
shuffle
0.45
p(Zone)
p(Zone)
0.45
p(Reward)
0.25
0
1
shuffle
0.25
0
1
4
Zone entry at 0 (s)
Current
Previous
Next
Opposite
0.25
p(Reward)
0.25
0.05
0.05
1.5 1
1
2
Time from one entry (s)
1.5 1
1
2
Time from zone entry (s)
0.25
p(Reward)
0.25
0.05
0.05
1.5 1
1
2
Time from zone entry (s)
1.5 1
1
2
Time from zone entry (s)
0.25
p(Reward)
OFC
vStr
0.25
Current
Previous
Next
Opposite
0.15
0.15
shuffle
0.05
1.5 1
shuffle
0.05
1
2
3
Time from zone entry (s)
1.5 1
1
2
3
Time from zone entry (s)
d
0.25
p(Reward)
0.25
0.15
0.15
shuffle
0.05
0
0
1.5 1
0
1
2
Time from zone entry (s)
shuffle
0.05
1.5 1
1
2
Time from zone entry (s)
a
1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
10
15
20
25
Delay (s)
30
35
40
45
Control 2 CDF
Regret-Inducing CDF
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
10
15
20
25
Delay (s)
30
35
40
45
Supplemental Figure 15
0.2
OFC
vStr
Opposite
Next
0.1
p(Reward)
Previous
Opposite
Current
Next
Previous
0
0.2
Opposite
Current
Next
0.1
shuffle
shuffle
Previous
0
1.5
3
4 1.5
0
Time zone entry at 0 (s)
Current
0.5
0.4
c
Posterior probability
p(Zone)
Posterior probability
p(Zone)
Previous
Opposite
Current
Next
0.3
0.2
0.5
Opposite
0.4
0.3
0.2
Previous
0.1
1
0
1
2
Time from zone entry (s)
1
0
1
2
Time from zone entry (s)
Current
Delay < Threshold
Opposite
Posterior probability
p(Zone)
Posterior probability
p(Zone)
0.5
0.5
Next
Next
0.4
0.4
0.3
0.3
0.2
0.2
Previous
Current
Delay > Threshold Delay < Threshold
1
0
1
2
Time from zone entry (s)
1
0
1
2
Time from zone entry (s)
Above threshold delay wait for food then encounter below threshold delay
OFC
0.5
vStr
Posterior probablity
p(Zone)
0.4
0.3
0.2
Previous
Opposite
Current
Next
0.1
1
0.5
shuffle
shuffle
Posterior probability
p(Zone)
0.4
0.3
0.2
0.1
1