Large Language Models

sebsauvage, 5 hours ago French

#IA #LLM
Essayons de résumer où on en est sur ces IA de type LLM (+ une nouvelle faille) : https://sebsauvage.net/links/?0aif1Q

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

tradjincal, 5 hours ago

@sebsauvage j'ai l'impression aussi qu'economiquement parlant, il y a que Nvidia qui fait sont beurre et que les autres boîtes sur les investissements des banques.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

sebsauvage, 5 hours ago

@tradjincal
tout à fait !
J'avais oublié, je l'ajoute.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

drahardja, 6 hours ago

There was a paper shared recently about the exponential amount of training data to get incremental performance gains in #llm #ai, but I seem to have misplaced it. Do you know what I’m referring to? Mind sharing the link if you have it?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ dgoldsmith, NormanDunbar, sysop408

anmey, 1 day ago

I think one of the biggest fears people have about AI is that it isn't perfect as assumed, but that, like us humans, it takes the given information, assumes the most likely outcome, and presents it plausibly.

#ki #LLM #AI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

kellogh, 22 hours ago

@anmey yeah, there’s this paradox — we kinda want computers to think like humans, but when they get plausibly good at it, we complain that they don’t think like computers anymore

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

grumpybozo, 1 day ago

I’d like to trust this story, but it fails to link to its supposed source or provide enough info to find it elsewise. A few clicks around the site makes me think that it may well be nothing but a #LLM-composed content farm. https://cosocial.ca/@kgw/112498693958537559

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

feld, 1 day ago

@grumpybozo the paper they're referencing is here

Sorry clicked wrong PDF first

https://media.bikeshed.party/pleroma/b1be6f7add5dd4b53c31836c752b0da8d4b8b34f167f0283403c2be6528632ee.pdf

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

dvshkn, 2 days ago

I gave the pizza question to Golden Gate Claude. This is truly the people's LLM. Google btfo.

#llm

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

tripplehelix, 2 days ago

@dvshkn What bridge?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

chikim, 3 days ago

Llama.cpp now supports the distributed inference, meaning you can use multiple computers to speed up the response time! Network is the main bottleneck, so all machines need to be hard wired, not connected through wifi. ##LLm #AI #ML https://github.com/ggerganov/llama.cpp/tree/master/examples/rpc

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999

cheukting_ho, 4 days ago

#PyConIT2024 opening keynote by @t_redactyl - #LLM and illusions

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

pauleveritt, 4 days ago

@cheukting_ho @t_redactyl Sunday night: saying bye to Jodie, thinking "whew, glad I’m not getting on a plane tomorrow direct to next conference.”

Today: sad I wasn't there to see Jodie.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

chikim, 5 days ago

Microsoft released Phi3 Small, Medium, and Vision! #LLM #AI #ML https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ miki

kellogh, 5 days ago

@chikim i love what they’ve been doing with phi!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

hypolite, 6 days ago

How would anyone trust the products these people put worth? They aren’t working on making LLMs more accurate (spoiler alert: they can’t, by design), they’re working to make them more appealing to companies targeting unsuspecting consumers. By any means necessary.

#LLM #AI #StochasticParrot
RE: mastodon.social/users/nixCraft…

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

heluecht, 6 days ago

@hypolite @nixCraft I just watched some demos. They are very irritating. I really don't appreciate this cheeky behaviour of that system.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

troed, 7 days ago

I'm worried for my tech friends.

The vitriol, and - honestly - ignorance around LLM-based "AI" is starting to fill my feeds from normally sane and technologically literate people.

You should be able to see through the hype and misuse. LLMs aren't encyclopedias - they're tools that are able to manipulate data of various sorts in ways that are very similar to how humans do it.

Yes, I compare LLMs to human brains. It's not the same as saying they're conscious (yet) - but the way LLMs work is apparently in many ways similar to how our brains work.

One fascinating insight into that comes from research done on what happens to the ability of LLMs to recall information as they are exposed to large and larger corpuses. Apparently they're better at recalling the early and late information, whilst starting to lose some in the middle.

In human psychology we call that the primacy and recency effect - because our brains do the same.

LLMs are absolutely awesome for a wide variety of tasks (and we have by no means found them all). Every second you spend not understanding this is a second on the way to your own irrelevance (if these tools would aid someone in your chosen area of work) or to become a grumpy old person yelling at clouds.

#LLM #AI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

angiebaby, 6 days ago

@troed

Snake oil in tech form is still snake oil.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

smurthys, 7 days ago

I just finished a productive Copilot session on a complex programming task. I came up with much of the algorithms, and wrote a lot of the code, and had to guide it a lot throughout, but credit where due, Copilot did make small but meaningful contributions along the way.

Overall, not a pair programmer but someone useful to talk to when WFH alone on complex tasks.

Enough for Copilot to earn a ✋🏽. And I like how it responded to that. It has got that part down. 😉

#Copilot #LLM #programming

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ poppastring

finestructure, 9 days ago

•This• is the compelling #LLM use case for me. If I use a translator to write messages in French I'm not forced to come up with an initial attempt and I lose the learning aspect of that.

If instead I put something into ChatGPT and it not only corrects but explains what my mistakes were that's a huge win in terms of learning from your mistakes.

(I still don't trust the thing 100% but it's also not a high stakes situation – I'm not engaging in a nuclear arms treaty after all 😅)

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

finestructure, 8 days ago

@groue Yeah, my partner pointed that and a few other things out as well and now I’m less convinced this is working as well as I thought it was.

Yet another example where the answer sounds good but only because I don’t have the expertise to verify.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

finestructure, 8 days ago

@rene I’ve actually noticed that about French punctuation, too, at least when it comes to ! and ? 🤷‍♂️

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

m0bi13, 9 days ago Polish

Ciekawa musi być praca nad bezpieczeństwem "sztucznej inteligencji" 🤔

#LLM #AI #SztucznaInteligencja

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ 74

rotnroll666, 10 days ago

Absolutely unbelievable but here we are. #Slack by default using messages, files etc for building and training #LLM models, enabled by default and opting out requires a manual email from the workspace owner.

https://slack.com/intl/en-gb/trust/data-management/privacy-principles

What a time to be alive in IT. 🤦‍♂️

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ onepict, grrrr_shark, GhostOnTheHalfShell, ftranschel +5 more

tanepiper, 9 days ago

@rotnroll666 @schmidt_fu and why I absolutely cannot trust #Slack....

https://tane.dev/2023/09/slack-wants-you-you-know-this-privacy-exploit-is-fine/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

LChoshen, 9 days ago

Do LLMs learn foundational concepts required to build world models? (less than expected)

We address this question with 🌐🐨EWoK (Elements of World Knowledge)🐨🌐

a flexible cognition-inspired framework to test knowledge across physical and social domains

https://ewok-core.github.io

#llm #llms #evaluation #ml #machinelearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

luis_in_brief, 9 days ago

@LChoshen I was just talking about this problem with a friend the other day. Really interesting data, thank you for sharing!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

dw_innovation, 9 days ago

As #publishers negotiate #licenses with companies that have built #LLM, they are starting to think about how to assign a dollar #value to their #news.

There are three parts to this problem:

Understanding what can be licensed

Setting a price

Getting the companies to agree to pay

Interesting article by Anya Schiffrin for Poynter: https://www.poynter.org/reporting-editing/2024/google-search-ai-effect-news-publishers-deals/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cs

lpwaterhouse, 10 days ago

To those concerned about #slack now using your chats, including trade secrets, NDA stuff, etc., to train their #llm: #WTF did you expect using a third party with full content access to discuss those things? That they'd be gentlemen and not read your mail? That they somehow wouldn't try to find a way to monetize that juicy data? I am flabbergasted that people working for corporations just as immoral could have been that naive...

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

bpavuk, 10 days ago

@lpwaterhouse many startups use discord for the same reason as corporates use slack
tencent has a share in discord inc. and thus an access to all the data
everybody running a startup knows that
nobody cares
idk, maybe that's applicable for startups only

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

lpwaterhouse, 10 days ago

@bpavuk Judging by my timeline a lot of people seem to care now; I am confused as to how they either a) didn't see it coming at all (read: naive in face of overwhelming prior examples) or b) managed to actively "not care" until it was "too late" (Hint: It was "too late" the second you put data in, not when you learned that they are using it). "a)" I can imagine, like a character in a bad novel, though it's a tall order; "b)" on the other hand... My brain does not work like that. At all. Of course there is always c) The people now clamoring are not ones that did, in fact, choose (they might have still been forced to) to use slack in the first place, but are merely expressing their indignation by stylistically putting themselves in that position to increase emotional involvement. Those are... vacuous...

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ianRobinson, 11 days ago

The hype and utility about LLMs are overstated and will cause problems due to leadership teams in organisations (and Governments) buying into the hype.

However, they do have value as personal assistants, research assistants, and sounding boards as long as you treat all LLM output critically, especially on topics where you are not an expert.

I’m using Claude 3 Opus as a research assistant. It’s read more of the world’s info than I ever will. I am also trying out ChatGPT-4o.

#LLM #AI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

hirad, 11 days ago

@ianRobinson Among the commercial ones that are available online, Claude is definitely the best one.
Though I personally run LLMs on my own PC 90% of the time.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ianRobinson, 11 days ago

@hirad Agreed. I thought about running a local one (or a private Cloud LLM) and loading it with the 1000 or so articles I’ve written over the last decade. I still might.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mofosyne, 11 days ago

https://blog.miguelgrinberg.com/post/how-llms-work-explained-without-math this is an interesting layperson general explanation of LLMs

#llm #laymen #ai

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

peterbutler, 11 days ago

@mofosyne That was a helpful read for me. Thanks

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

smach, 11 days ago

New free course on #LLM agents from DeepLearning AI and crewAI:

“With crewAI, an open source library for building multi-agent systems, you'll get hands-on experience building agent crews for processes like:

💻 Tailoring resumes and interview prep for job applications
💻 Researching, writing, and editing technical articles
💻 Conducting customer outreach campaigns
💻 Financial analysis
💻 Planning events”

Taught by crewAI founder João (Joe) Moura

https://www.deeplearning.ai/short-courses/multi-ai-agent-systems-with-crewai/
#GenAI #AI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ stevensanderson

cigitalgem, 12 days ago

Giving my #LLM Risks talk (BIML work) today for Google on a global forum. If you work for Google, hop on! #MLsec #ML #AI

Noon Eastern....hosted by Zurich.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 11 days ago

T minus 90 minutes.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ianRobinson, 13 days ago

“Claude.ai is now available to users in the EU”

Via a T&Cs update email. Claude 3 Opus is my favourite LLM. I haven’t had a chance to fully test ChatGPT-4o yet to compare them.

#LLM #Claude3

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

q7AtQ1Pvy3kx, 12 days ago

#Anthropic is killing it with their AI game, especially for a small startup. Their models are way better than #OpenAI's, but they're focusing more on enterprise stuff rather than hyping it up. This might be a risky move since they don't have a cult following like other AI companies. Still, gotta give them props for their impressive tech. It'll be interesting to see how they balance enterprise with getting more attention from the AI community.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

vancha, 13 days ago Western Frisian

Looks like today I finally found a good application for #llm 's: Learning languages!
I've been attempting to learn #arabic through duolingo for a while now, without much success. I figured if there's one thing language models should be good at, it's languages. So far the thing has actually been pretty helpful.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

kellogh, 12 days ago

@vancha i’ll be interested in how gpt-4o works. the main feature i’m interested in is “please slow down, i’m a n00b”

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

williamtries, 13 days ago

I just posted a short tutorial on setting up a simple #LLM on your #oneplus6 running #postmarketOS . The LLM (7b alpaca in this case) is not terribly accurate but possibly useful in some cases.

Warning! You phone will get hot. I haven't tested it long enough to know if it will harm your device, but do be careful.

Oh! I have a website now! Wow! My history with websites is hit and miss, so enjoy it while it lasts. I have half a dozen posts in the works already.

https://www.williamtries.ovh/llmonpmos/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ postmarketOS, linmob

chikim, 13 days ago

If you missed it, check out the new GPT4O demo. #LLM #ML #AI #OpenAI https://www.youtube.com/watch?v=DQacCB9tDaw

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

chikim, 13 days ago

@bryansmart FAlso funny. Google IO event is tomorrow. OpenAI intentionally revealed their model today. lol

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

bryansmart, 13 days ago

@chikim The responsive interruptible conversation interface, plus the visual stuff, really makes me believe we aren't too many years off from glasses, with a voice assistant, that can describe whatever is in front of us, answer questions about it, etc. Amazing stuff!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...