marcwhoward,

@NicoleCRust
So for #2 the Bellman equation solves a problem---estimate expected future reward without directly estimating the future---that is not faced by the brain. Insofar as the brain has an explicit temporal memory of the continuous past (and it certainly does), it is straightforward to construct a direct estimate of the distant future via simple Hebbian associations. We didn't know that in the 80's when Sutton and Barto were working on this problem or the mid-90's when the dopamine mapping was made. This paper makes these arguments in much more depth:
https://arxiv.org/abs/2302.10163

(will try to write answer to #3 later, time for dinner)

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • DreamBathrooms
  • mdbf
  • ethstaker
  • magazineikmin
  • cubers
  • rosin
  • thenastyranch
  • Youngstown
  • InstantRegret
  • slotface
  • osvaldo12
  • kavyap
  • khanakhh
  • Durango
  • megavids
  • everett
  • tacticalgear
  • modclub
  • normalnudes
  • ngwrru68w68
  • cisconetworking
  • tester
  • GTA5RPClips
  • Leos
  • anitta
  • provamag3
  • JUstTest
  • lostlight
  • All magazines