brohrer, to random
@brohrer@recsys.social avatar

This is an excellent practical introduction to experimentation and benchmarking from @araffin
https://youtu.be/7-PUg9EAa3Y?si=qBHmyYV2uvgA6s6x

chrisoffner3d, to ArtificialIntelligence

This is brought to you by our Reinforcement Learning homework in which we were asked to implement the SAC algorithm. Something tells me this was not the ideal choice for literally the first Reinforcement Learning assignment given to students. 🫠

https://arxiv.org/abs/1801.01290

video/mp4

computingnature, to random
@computingnature@neuromatch.social avatar

Make your next discovery using #Rastermap, a visualization method for large-scale neural data. Paper now out: https://www.biorxiv.org/content/10.1101/2023.07.25.550571v1 (click on the gif)

gif of neural activity re-sorted by Rastermap algorithm

computingnature,
@computingnature@neuromatch.social avatar

sorting of neurons from agents playing Atari games:

Jigsaw_You, to machinelearning Dutch
@Jigsaw_You@mastodon.nl avatar

Impressive engineering and useful application of reinforcement learning.

@machinelearning

https://www.nature.com/articles/d41586-023-01883-4

tero, to random
@tero@rukii.net avatar

Faster sorting discovered using |

"Here we show how can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep agent, , to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the standard C++ sort library3. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning."

https://www.nature.com/articles/s41586-023-06004-9

erinmikail, to random
@erinmikail@mastodon.social avatar

Today's the Day! 🥂

We're hosting a workshop to get folks started with !

Join Jimmy Whitaker, Nikolai Liubimov, and myself for an entry-level workshop on Reinforcement Learning with Human Feedback.

📅 May 30 — 2-3 PM EDT

🔗 https://lu.ma/RLHF

bwaber, to random
@bwaber@hci.social avatar

I had a very full day, but I did manage to get out for a bunch of walks (and found witch's butter!) and listen to talks for my ! (1/9)

A cherry blossom tree starting to bloom against a clear sky
Dark magenta flowers blooming on trees

bwaber,
@bwaber@hci.social avatar

Next was a great talk by Anne Collins on bridging , , and computation in at the Learning Salon. After some bombastic claims that "RL is all you need" to explain cognition, Collins and the broader group dissect what's missing from this picture https://www.youtube.com/watch?v=YLbZh-bH8V0 (3/9)

seanpatrickphd, to gaming
@seanpatrickphd@mastodon.social avatar

- I'm a whose research focused on . I pivoted out of academia and into last year and have no regrets.

For fun I write and (especially stories) as well as , and I'd love to meet the Mastodon . Big on , and .

I identify as and , I use they/them pronouns.

Say hi!

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • GTA5RPClips
  • DreamBathrooms
  • everett
  • magazineikmin
  • Durango
  • InstantRegret
  • Youngstown
  • mdbf
  • slotface
  • rosin
  • thenastyranch
  • kavyap
  • ethstaker
  • megavids
  • tacticalgear
  • cubers
  • cisconetworking
  • osvaldo12
  • khanakhh
  • ngwrru68w68
  • modclub
  • tester
  • anitta
  • normalnudes
  • Leos
  • provamag3
  • lostlight
  • All magazines