machinelearning

incogtino, in Alternative to Generating images: get AI to generate query for real image (Unsplash)

I think I broke it. I told it my company was called Waldo’s Wands and I was in the transport industry - I got a transport intro, a list of magic wand services, and a cannabis picture!

https://lemmy.zip/pictrs/image/6c7766df-d768-464f-9ffb-17da06290281.webp

tomjuggler,

I did that on purpose! It’s my shop, gotcha.

Seriously though, the Unsplash search is pretty rubbish. Also, I’m only returning the first image from the search (I only have 50/h), so I could probably improve the search on my side by checking metadata or something. At least the plant has the right number of leaves, right?

tomjuggler,

Actually, now that I think of it: I think I made the last image search using only the company name (to avoid the same images as the ones above), so you probably got “Waldo’s Weed” there!

Sims, in Who else here loves the end-to-end robotics model that seem to go out on a weekly basis?

Cool. Just adding the pdf for more info: github.com/umi-gripper/…/umi.pdf

I follow AI on YT but focus on cognitive agent architectures and text models, so I don’t have any sources dedicated to robotic AI. Can you recommend a channel that focuses on progress in that area?

keepthepace,

Thanks!

I don’t see much info on YouTube, but this is the last use I have for Twitter: there are several accounts posting about these there. I mostly follow LLM and robotics subjects. You can take a look at my following list: twitter.com/ktp_programming/following

Or do as I did, follow someone like @DrJimFan and follow a new person every time he retweets something you find interesting.

howrar, in Model Design Theory Tips/Tricks/Docs (for a card game agent)

You’ve read a bit on MDPs, which is a good start. You may also want to look into reinforcement learning for how to optimize said MDP.
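If it helps make that concrete, here’s a minimal tabular Q-learning sketch on a toy card-drawing MDP. The game is made up purely for illustration (it is not your card game): the state is the current hand total, the actions are “draw” or “stop”, and the reward favours stopping close to 10 without going over.

```python
import random
from collections import defaultdict

# Toy MDP: state = current hand total, actions = "draw" or "stop".
# Stopping pays out total/10; drawing a card (worth 1-5) past a total
# of 10 busts for -1. Tabular Q-learning learns when to stop.

ACTIONS = ("draw", "stop")

def play_episode(Q, eps=0.1, alpha=0.1, gamma=0.9):
    state = 0
    while True:
        if random.random() < eps:                      # explore
            action = random.choice(ACTIONS)
        else:                                          # exploit
            action = max(ACTIONS, key=lambda a: Q[(state, a)])
        if action == "stop":
            reward = state / 10.0
            Q[(state, "stop")] += alpha * (reward - Q[(state, "stop")])
            return
        next_state = state + random.randint(1, 5)      # draw a card
        if next_state > 10:                            # bust: episode over
            Q[(state, "draw")] += alpha * (-1.0 - Q[(state, "draw")])
            return
        best_next = max(Q[(next_state, a)] for a in ACTIONS)
        Q[(state, "draw")] += alpha * (gamma * best_next - Q[(state, "draw")])
        state = next_state

random.seed(0)
Q = defaultdict(float)
for _ in range(5000):
    play_episode(Q)
# After training, the agent prefers "stop" at the limit (drawing at 10
# always busts) and "draw" at low totals.
```

The states, actions, transitions, and rewards are the MDP; the Q-table update is the reinforcement learning step howrar mentions. A real card game would need a richer state representation, but the loop has the same shape.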

taaz,

Thank you very much, I was not sure if it’s the right direction.

A_A, in The Paradox of AI Consciousness: Navigating the Boundaries of Machine Learning

Consciousness in Artificial Intelligence: Insights from the Science of Consciousness arxiv.org/abs/2308.08708

tools : " …
1- recurrent processing theory,
2- global workspace theory,
3- higher-order theories,
4- predictive processing, and
5- attention schema theory
… "

QubaXR, in The Paradox of AI Consciousness: Navigating the Boundaries of Machine Learning

Just a few sentences in, I’m 99% positive this has been written using GPT-4

Ultra_Unlimited,

Interestingly, not accurate. Please try again.

nirogu, in Training AI to Play Pokemon with Reinforcement Learning

Very well explained! Especially given how difficult RL can be sometimes

A_A, (edited ) in [R] Unraveling the Mysteries: Why is AdamW Often Superior to Adam+L2 in Practice?

Please explain like I’m 5 years old.

Maybe I understand the following :
(my apologies if this is grossly simplified and doesn’t help)

1- Better neural networks need to contain more (stacked) layers.
2- The input layer at one end of the stack is exposed to messy information from the real world.
3- At the other end, the output layer provides the network’s results.
4- The first step in making this work is training the network, during which the learning is done.
5- Instabilities and stagnation in some layers often occur when learning does not happen in an optimal way. This problem increases exponentially with the number of layers.
6- Here, learning is done all at once across all the layers. Something called rotation, which I don’t understand, is used to stabilize and optimize the learning.

I feel this is very different from human learning, where it happens in stages: we first learn words, then try to assemble them into simple sentences, then evolve to make sense of more complex notions, and so on. I wish this approach could also apply to artificial intelligence development.
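For what it’s worth, the concrete mechanical difference between Adam+L2 and AdamW that the thread title asks about fits in a few lines. This is my own single-parameter sketch of the two standard update rules (an illustration with conventional hyperparameter names, not a reference implementation):

```python
import math

def adam_l2_step(w, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999,
                 eps=1e-8, wd=0.01):
    grad = grad + wd * w                  # L2: decay folded into the gradient,
    m = b1 * m + (1 - b1) * grad          # so it also gets the 1/sqrt(v) scaling
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)             # bias correction
    v_hat = v / (1 - b2 ** t)
    w = w - lr * m_hat / (math.sqrt(v_hat) + eps)
    return w, m, v

def adamw_step(w, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999,
               eps=1e-8, wd=0.01):
    m = b1 * m + (1 - b1) * grad          # decay is NOT in the moments
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    w = w - lr * m_hat / (math.sqrt(v_hat) + eps)
    w = w - lr * wd * w                   # decoupled: plain shrink toward 0
    return w, m, v
```

With Adam+L2 the decay term is folded into the gradient and therefore divided by sqrt(v) like everything else, so weights with a large gradient history barely decay; AdamW applies the decay directly to the weight, independent of the gradient statistics. That decoupling is the usual explanation for why AdamW generalizes better in practice.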

wagesj45,

The human brain isn't a blank slate when it comes into existence. There are already structures that are designed to do certain things. These structures come "pre-trained", and a lot of the learning humans do is more akin to the fine-tuning we do for foundation models.

vluz, (edited ) in Recommendations for a context aware text classifier

While designing a similar classifier, I've considered the idea of giving it the whole thread as "context" of sorts.
Not just the parent comment, but the whole thread up to the original post.

I've abandoned the idea.
A comment must stand on its own, and the way I was planning to do it would put limits on the results.
I might be very wrong; your insight into this would be very helpful.

My original idea was to go recursively through the thread and test each comment individually.
Then I would influence the actual comment's results with the combined results of its parents.
No context during inference, just one comment at a time.

For example consider thread OP->C1->C2->C3.
My current model takes milliseconds per test with little resources used.
It would be OK even for very large threads, but would include a limit to keep answer times down.
I want to determine if Comment 3 is toxic in the context of C2, C1, and OP.
Test C3, test C2, test C1, test OP. Save results.
My current model gives answers in several fields ("toxic", "severe toxic", "obscene", "threat", "insult", and "identity hate").
The idea was to then combine the results of each into a final result for C3.

How to combine? I haven't figured that out yet, but it would be result manipulation rather than inference with context, etc.
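For the "how to combine" part, one simple option is a decayed weighting of the ancestor scores blended into the target's score. This is just a sketch of that idea; the decay and blend weights are arbitrary placeholders, not tuned values:

```python
def combine_thread_scores(scores, decay=0.5, blend=0.5):
    """scores: per-comment toxicity in thread order, e.g. [OP, C1, C2, C3].
    Returns an adjusted score for the last comment, nudged toward a
    weighted average of its ancestors (nearest parent weighs most)."""
    *ancestors, target = scores
    if not ancestors:
        return target
    weight, total, acc = 1.0, 0.0, 0.0
    for s in reversed(ancestors):       # walk from parent up to OP
        weight *= decay
        acc += weight * s
        total += weight
    context = acc / total               # decayed mean of ancestor scores
    adjusted = (1 - blend) * target + blend * context
    return max(0.0, min(1.0, adjusted))
```

A clean thread dampens a borderline comment and a toxic thread amplifies it, while everything stays post-hoc result manipulation: still no context at inference time, one comment per test.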

Edit: Is there any way you can point me at examples difficult to classify? It would be a nice real world test to my stuff.
Current iteration of model is very new and has not been tested in the wild.

Bluetreefrog,

(“toxic”, “severe toxic”, “obscene”, “threat”, “insult”, and “identity hate”)

You aren’t the author of Detoxify are you by any chance? It uses the same classifications. I was originally using it but switched to my own model as I really only needed binary classification and felt a new dataset that better suited Lemmy was needed anyway. I have 2 outputs (toxic and not-toxic).

I’ve been building my own dataset, as the existing ones on Hugging Face seemed to contain a lot of content you might see on Twitter and were a poor match for Lemmy. Having said that, I’ve generally avoided putting that sort of content into the dataset, as I figured that if I can’t easily decide whether it’s toxic, how could a model?

Is there any way you can point me at examples difficult to classify? It would be a nice real world test to my stuff. Current iteration of model is very new and has not been tested in the wild.

Here are a few where I’ve had to go back to the parent comment or post to try and work out whether it was toxic:

  • Do your research on the case and the answers will be very obvious. (What comment prompted this? Is it a trolling comment or a reasonable response to a trolling comment)
  • Because you’re a fascist. The fact that they disagree with you is secondary (Is the commenter calling another commenter a fascist, or continuing a discussion?)
  • Me tard, you tard, retard nation! (Is this a quote from a movie or TV show or an insult to another commenter? Not sure.)
  • Fuck you shoresy! (pretty sure this is a quote from a tv show)

A comment must stand on its own, and the way I was planning to do it would put limits on the results. I might be very wrong; your insight into this would be very helpful.

I originally thought that too, and I’m actively tuning my model to try and get the best results from the comment alone, but I don’t think I’ll ever get better than about 80% accuracy. I’ve come to the conclusion that the cases in the grey zone, where toxic ~= not-toxic, can only be resolved by looking upstream.

vluz,

Oof, pop-culture references are hard, and I had not considered those at all.
Thanks for the examples; I'll have a think about how to deal with them.

My only insight is one you already had.
Test at least the comment before, and then use the output to dampen or amplify the final result.
Sorry for being no help at all.

--

My project is very basic but I'll post it here for any insight you might get out of it.
I teach Python in a variety of settings and this is part of a class.

The data used is from Kaggle: https://www.kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge/
The original data came from Wikipedia toxic comments dataset.
There is code there from several users too, very helpful for getting some insight into the problem.

The data is dirty and needs cleanup, so I've done that and posted the result on HF here:
https://huggingface.co/datasets/vluz/Tox

Model is a very basic TensorFlow implementation intended for teaching TF basics.
https://github.com/vluz/ToxTest
Some of the helper scripts are very wonky, need fixing before I present this in class.

Here are my weights after 30 epochs:
https://huggingface.co/vluz/toxmodel30

And here is it running on a HF space:
https://huggingface.co/spaces/vluz/Tox

Clbustos, in New ChatGPT rival, Claude 2, launches for open beta testing

I used it to read some PDFs and it works very well!

minticecream, in Looking for resources on music generation

Check out Meta’s AudioCraft AI.

Here’s some samples.

And here’s their GitHub repo.

f4hy,

Thanks!

Spott, in Discussion of llama source code

Do you have anything specific you want to know?

I’m working on rewriting it with all the multi-processor code removed so you can better understand the algorithm, but I’d love to know what you are trying to better understand.

astinmiura, in What tools/libraries do you for MLOps?

ray[tune] + mlflow

kromem, in Generative AI Goes 'MAD' When Trained on AI-Created Data Over Five Times

Now that this effectively replicates the same end result as the Stanford study, I’ll be really interested to see what inevitably turns out to fix it in the next 12 months.

iam, in GPT-4 API general availability and deprecation of older models in the Completions API

Does this mean that I won’t have to pay a dime to use GPT-3.5 Turbo and GPT-4?

keepthepace, (edited )

I don’t think so. But the GPT-4 API was waitlisted (I was stuck on the waitlist for a long time), and every paying user now has access to it. It is still billed per token, and GPT-4 tokens are more expensive (30x) than GPT-3.5 Turbo’s.
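Back-of-envelope on the "30x", using the completion-token prices at the time (these are historical and change often, so treat the numbers as illustrative):

```python
# USD per 1K completion tokens at the time (historical; check current pricing)
gpt35_turbo_per_1k = 0.002
gpt4_8k_per_1k = 0.06

ratio = gpt4_8k_per_1k / gpt35_turbo_per_1k      # ~30, hence "30x"
cost_gpt4 = 10_000 / 1000 * gpt4_8k_per_1k       # 10K tokens: ~$0.60
cost_gpt35 = 10_000 / 1000 * gpt35_turbo_per_1k  # same tokens: ~$0.02
```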

asterfield, in Great series by Andrej Karpathy on machine learning and training

I’ve been slowly chipping away at this series for months. It’s beyond excellent, but fully digesting everything he’s saying is taking me a while.

I’m also trying to apply each lesson to a side project. I’m failing at most of these projects, but it’s helping me cement the lessons and intuitively understand which approaches work, which don’t, and why.
