On the limits of #LargeLanguageModels:
Language is eminently embodied, whereas #LLM are merely inferential models with no notion of truth, no emotions, and no commitment to others or to the future...
"Machines such as LLMs can generate text strings that signify emotions and moods. But these are statistical constructions. Having no concerns and no bodies, machines have no emotions and no moods, and no means to develop sensibilities for them."
As some tout the good tidings and marvels of AI, LLMs, and marketing obfuscation ad nauseam, let's not lose our grasp on how much our own ethics affect the real impact these tools have on all of us. And if we can't do that, how are we supposed to instill a sense of ethics in these new conscious minds we pride ourselves on creating?
On one hand, we have new papers showing how merely using the language of a specific human group can trigger implicit, hidden biases in #LargeLanguageModels;
on the other hand, we have software developers building tools that automatically retrieve information that may be of interest, and that try to anticipate your interests. High point so far: https://new.computer/
Does anyone have a good list of logical questions for judging large language models' ability to reason?
Questions like "if it takes 3 hours for 3 towels to dry, how long does it take for 9 towels to dry?"
I'm playing around with Mistral's leaked 70B Miqu LLM and want to test its reasoning skills for a project I'm working on. I've been really impressed so far. It's slower than Mistral & Mixtral, but it's been producing the best-reasoned answers I've seen from an LLM. And it's running locally!
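For anyone assembling such a list, here's a minimal sketch of how a reasoning test set could be structured in Python. The questions, expected answers, and the keyword-based scoring are all illustrative assumptions, not a standard benchmark; you'd swap in your own client call (llama.cpp, Ollama, etc.) to get the model's answers.

```python
# Illustrative reasoning test set for probing an LLM locally.
# Each entry pairs a trick question with a substring the correct
# answer should contain. How you query the model is up to you.

REASONING_QUESTIONS = [
    # Trick: towels dry in parallel, so 9 towels still take 3 hours.
    {"question": ("If it takes 3 hours for 3 towels to dry, "
                  "how long does it take for 9 towels to dry?"),
     "expected": "3 hours"},
    # Classic bat-and-ball: intuition says $0.10, but the ball costs $0.05.
    {"question": ("A bat and a ball cost $1.10 together. The bat costs "
                  "$1.00 more than the ball. How much does the ball cost?"),
     "expected": "$0.05"},
]

def score(answers):
    """Count how many model answers contain the expected substring."""
    correct = 0
    for item, answer in zip(REASONING_QUESTIONS, answers):
        if item["expected"] in answer:
            correct += 1
    return correct
```

Substring matching is crude (a model could state "3 hours" while reasoning incorrectly), so for anything serious you'd want to check the final answer specifically.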
Also, no I will not join your research project looking at how #LargeLanguageModels and #GenerativeAI can help solve climate change. There is no possible world in which that makes any sense.
It also depends on what you want the #LargeLanguageModels and/or #generativeAI to do, and if you care to put in the time, effort, and investment to curate the training data or not.
Many of these operations don’t want to do the work in terms of curating their training data (whether that means screening it or asking for permission or whatever) because it’s not cheap or fast!