#ReinforcementLearning

brohrer, 1 month ago to random

This is an excellent practical introduction to #ReinforcementLearning experimentation and benchmarking from @araffin
https://youtu.be/7-PUg9EAa3Y?si=qBHmyYV2uvgA6s6x


chrisoffner3d, 5 months ago to ArtificialIntelligence

This is brought to you by our Reinforcement Learning homework in which we were asked to implement the SAC algorithm. Something tells me this was not the ideal choice for literally the first Reinforcement Learning assignment given to students. 🫠

https://arxiv.org/abs/1801.01290

#RL #ReinforcementLearning #DeepLearning #SAC #SoftActorCritic #ETHZurich #ETHz


computingnature, 9 months ago to random

Make your next discovery using #Rastermap, a visualization method for large-scale neural data. Paper now out: https://www.biorxiv.org/content/10.1101/2023.07.25.550571v1 (click on the gif)

gif of neural activity re-sorted by Rastermap algorithm


computingnature, 9 months ago

#Rastermap sorting of neurons from #ReinforcementLearning agents playing Atari games:


Jigsaw_You, 11 months ago to machinelearning Dutch

Impressive engineering and useful application of reinforcement learning.

#machinelearning #reinforcementlearning @machinelearning

https://www.nature.com/articles/d41586-023-01883-4


tero, 11 months ago to random

Faster sorting #algorithms discovered using #DeepReinforcementLearning | #Nature

"Here we show how #ArtificialIntelligence can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep #ReinforcementLearning agent, #AlphaDev, to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the #LLVM standard C++ sort library. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning."

https://www.nature.com/articles/s41586-023-06004-9


erinmikail, 11 months ago to random

Today's the Day! 🥂

We're hosting a workshop to get folks started with #RLHF!

Join Jimmy Whitaker, Nikolai Liubimov, and me for an entry-level workshop on Reinforcement Learning from Human Feedback.

📅 May 30 — 2-3 PM EDT

🔗 https://lu.ma/RLHF

#ReinforcementLearning #Tutorial #MachineLearning #ML


bwaber, 1 year ago to random

I had a very full day, but I did manage to get out for a bunch of walks (and found witch's butter!) and listen to talks for my #AcademicRunPlaylist! (1/9)

A cherry blossom tree starting to bloom against a clear sky
Dark magenta flowers blooming on trees


bwaber, 1 year ago

Next was a great talk by Anne Collins on bridging #cognition, #neuroscience, and computation in #RL at the Learning Salon. After some bombastic claims that "RL is all you need" to explain cognition, Collins and the broader group dissect what's missing from this picture https://www.youtube.com/watch?v=YLbZh-bH8V0 (3/9) #ReinforcementLearning


seanpatrickphd, 1 year ago to gaming

#introduction - I'm a #scientist whose research focused on #ReinforcementLearning. I pivoted out of academia and into #DataScience last year and have no regrets.

For fun I write #ScienceFiction and #Fantasy (especially #PostApocalyptic stories) as well as #poetry, and I'd love to meet the Mastodon #WritingCommunity. Big on #gaming, #DnD and #TTRPG.

I identify as #queer #nonbinary #bisexual and #asexual, I use they/them pronouns.

Say hi!
