I'm trying to do a model of reinforcement learning but I can't get my turtles to hatch correctly. Here's how the program is meant to work.
To start, a state is chosen at random. This is the state-class. Then a turtle with this state-class is chosen at random. Next a turtle with an act-class that is the same breed as the previous turtle is chosen. The value of the act-class is compared to the value of the state-class to determine the payoff. Finally hatch payoff many turtles of the first kind and payoff many turtles of the second kind.
My problem is that an arbitrary number of turtles of random kinds are being hatched. Any idea what's wrong with my code?
to go
ask turtles[let state random 4
let category one-of turtles with [state-class = state]
ifelse [category1?] of category
[let act1 one-of category1 with [act?]
ifelse state = [act-class] of act1
[set payoff 4]
[ifelse (state = 0 or state = 3) and ([act-class] of act1 = 1 or [act-class] of act1 = 2)
[set payoff 3]
[ifelse ([act-class] of act1 = 1 or [act-class] of act1 = 2) and (state = 1 or state = 4)
[set payoff 3]
[set payoff 1]]]
hatch-category1 1 * payoff [ setxy random-xcor random-ycor set state-class state
set state? true set act? false
set color [color] of category]
hatch-category1 1 * payoff [ setxy random-xcor random-ycor
set act-class [act-class] of act1
set state? false set act? true
set color [color] of act1]]
[let act2 one-of category2 with [act?]
ifelse state = [act-class] of act2
[set payoff 4]
[ifelse (state = 0 or state = 3) and ([act-class] of act2 = 1 or [act-class] of act2 = 2)
[set payoff 3]
[ifelse ([act-class] of act2 = 1 or [act-class] of act2 = 2) and (state = 1 or state = 4)
[set payoff 3]
[set payoff 1]]]
hatch-category2 1 * payoff [ setxy random-xcor random-ycor set state-class state
set state? true set act? false
set color [color] of category]
hatch-category1 1 * payoff [ setxy random-xcor random-ycor
set act-class [act-class] of act2
set state? false set act? true
set color [color] of act2]]]
tick
end