ALTAnlp, to Futurology
@ALTAnlp@sigmoid.social avatar

In the lead up to #ALTA2024, we're highlighting #research papers from previous #workshops.

Here, the ChatGPT C-LARA-Instance, Belinda Chiera, Cathy Chua, Chadi Raheb, Manny Rayner, Annika Simonsen, Zhengkang Xiang, and Rina Zviel-Girshin use the #OpenSource #CLARA platform to evaluate #GPT4's ability to perform #linguistics #NLP tasks such as #segmentation, #lemmatization and #glossing.

🔗 C-LARA platform: https://www.c-lara.org/

🔗 Paper: https://aclanthology.org/2023.alta-1.3/

ianRobinson, to llm
@ianRobinson@mastodon.social avatar

I didn’t know that Drafts App had an Action to allow conversations with the OpenAI GPT 4 API. Just installed and tried it. It works a treat.

https://directory.getdrafts.com/a/2RB

I think I'll settle on paying for Anthropic Claude 3 via their web interface (I'll check out the API access at some point too), and use PAYG API credits via Drafts for access to GPT 4. The GPT 4 selector in the API currently redirects to gpt-4-turbo.

brunus, to tech French
@brunus@mamot.fr avatar

Hayé, l'IA est aussi conne que l'humain !
GPT-4 a passé le test de Turing.

DarKou,
@DarKou@mastodon.darkou.fr avatar

@brunus tu veux dire qu'une IA créée par des humains n'est pas plus intelligente que ses créateurs ?

Mais qui aurait pu le prédire ?! 🤷‍♂️

brunus,
@brunus@mamot.fr avatar

@DarKou À la limite, si l'IA est capable de dire qu'on peut sans états d'âmes utiliser les fachos et les capitalistes soit comme composte soit comme population crash-test sur Mars... on pourra éventuellement dire que c'est plus intelligent que certain.e.s humain.e.s

jaybird110127, to random

"Experts?" a father shouted from the crowd. "What experts have experience with school assemblies turning into monsters?"

ErikJonker, to ai
@ErikJonker@mastodon.social avatar

ChatGPT from OpenAI is a service, it's not necessarily the same as the model (GPT-4) that it is using in the background. OpenAI adds some elements like code interpreter which makes it perform (much) better then models without such features. Regardless OpenAI faces some good competition from the Llama3 models, i hope it will stimulate them to quickly release GPT-5.

ErikJonker, to ai
@ErikJonker@mastodon.social avatar

The score of Llama3 70B on the LMSYS leaderboard is impressive. Although it's also clear that the latest GPT-4 is still a lot better. However Llama3 is opensource and freely available and a larger version (400B parameters) is on the way and will be closer to GPT4 with regard to performance on the various benchmarks.
https://chat.lmsys.org/?leaderboard

stefano,
@stefano@bsd.cafe avatar

@ErikJonker That's the point: having control over hosting, even if it means sacrificing some capabilities, can be a game changer for privacy and security reasons.

ErikJonker,
@ErikJonker@mastodon.social avatar

@stefano true, especially for (european) governments the privacy and security of models is very important otherwise their use will probably be illegal

PrivacyDigest, to security
@PrivacyDigest@mas.to avatar

OpenAI's GPT-4 can real by reading advisories

While some other LLMs appear to flat-out suck

https://www.theregister.com/2024/04/17/gpt4_can_exploit_real_vulnerabilities/

br00t4c,
@br00t4c@mastodon.social avatar

@PrivacyDigest Skynet will start building itself any day now 🤣

kubikpixel, to random German
@kubikpixel@chaos.social avatar

Ich hoffe, das Passkeys diesbezüglich nicht betroffen ist so wie Passwort-Manager wie @keepassxc, @bitwarden inklusive 2FA schon einen grösseren Schutz gegenüber der KI ergibt.

»GPT-4 kann eigenständig bekannte Sicherheitslücken ausnutzen:
Forscher haben festgestellt, dass GPT-4 allein anhand der zugehörigen Schwachstellenbeschreibungen 13 von 15 Sicherheitslücken erfolgreich ausnutzen kann.«

🤖 https://www.golem.de/news/mit-cve-beschreibung-gpt-4-kann-eigenstaendig-bekannte-sicherheitsluecken-ausnutzen-2404-184301.html


realbloginista,
@realbloginista@social.cologne avatar

@kubikpixel @keepassxc @bitwarden

Es ist bald soweit. Die Maschinen erheben sich…

😱

HonkHase, to random German
@HonkHase@chaos.social avatar

Mit #CVE-Beschreibung: #GPT4 kann eigenständig bekannte #Sicherheitslücken ausnutzen

"Forscher haben festgestellt, dass GPT-4 allein anhand der zugehörigen #Schwachstellenbeschreibungen 13 von 15 Sicherheitslücken erfolgreich ausnutzen kann."
https://www.golem.de/news/mit-cve-beschreibung-gpt-4-kann-eigenstaendig-bekannte-sicherheitsluecken-ausnutzen-2404-184301.html

larsmb,
@larsmb@mastodon.online avatar

@HonkHase "Gut beschriebene Angriffsmuster können durch Statistik und Substitution halbautomatisch recht erfolgreich auf andere Instanzen übertragen werden" klingt halt etwas weniger spannend und anthromorphisierend.

vampirdaddy,
@vampirdaddy@chaos.social avatar

@HonkHase
Das ist aber weniger ein Zeichen von Intelligenz, sondern vielmehr ein Indikator vom Scraping-Umfang des Trainings (und der Banalität vieler Lücken).

cassidy, to ai
@cassidy@blaede.family avatar

“AI” as currently hyped is giant billion dollar companies blatantly stealing content, disregarding licenses, deceiving about capabilities, and burning the planet in the process.

It is the largest theft of intellectual property in the history of humankind, and these companies are knowingly and willing ignoring the licenses, terms of service, and laws that us lowly individuals are beholden to.

https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m

cassidy,
@cassidy@blaede.family avatar

It’s a shame that the industry is in the midst of such a circlejerk around the term “AI,” too, because I think a lot of machine learning is genuinely incredible and is the most underappreciated (and often invisible) aspect of a bunch of technology we use: our cameras, keyboards, copy/paste, voice recognition, smart homes, and more are often powered by machine learning models.

But let’s call everything “AI” now because a bunch of billion dollar companies decided it’s a fun space to compete in.

cassidy,
@cassidy@blaede.family avatar

I guess we wait this one out until the “AI” bubble bursts due to the incredible subsidization the entire industry is undergoing. It is not profitable. It is not sustainable.

It will not last—but the damage to our planet and fallout from the immense amount of wasted resources will.

https://arstechnica.com/information-technology/2023/10/so-far-ai-hasnt-been-profitable-for-big-tech/

mattlav1250, to ai
@mattlav1250@journa.host avatar

artificial INTELLIGENCE...

This is from the paid PREMIUM version of GPT4 and DALL-E 3...

·E

image/png
image/png
image/png

gimulnautti,
@gimulnautti@mastodon.green avatar

@mattlav1250 Is it AI hype yet? 🙂

bornach, to llm
@bornach@masto.ai avatar

Asked (formerly ) a familiar riddle but with numbers changed to make it impossible. It generated the same solution but substituting the numbers so that it ends up with the nonsense claim:

10 + 5 = 23

bornach,
@bornach@masto.ai avatar

I tried the prompt:

"I have an empty opaque bag. I put two apples and one banana in the bag. I either remove the banana or I remove one apple. I then remove all remaining fruits from the bag. Is it possible to tell what is in the bag now?"

with which got the right answer, but it confused Perplexity.ai which also cited a website on how to build a disaster survival kit.

glyph,
@glyph@mastodon.social avatar

@bornach I was going to post something like "I guess programmers' jobs are safe" but as I was looking at it I realized that for most companies, 15 is close enough to 23 that they'll just use the AI and call it a win

ErikJonker, to ai
@ErikJonker@mastodon.social avatar

Illustrates my personal experience with LLMs
"The finding underscores the notion that AI will likely be most useful as a tool to augment, not replace, the human reasoning process."
https://www.bidmc.org/about-bidmc/news/2024/04/chatbot-outperformed-physicians-in-clinical-reasoning-in-head-to-head-study

bornach, to ai
@bornach@masto.ai avatar

Stephen Falken: "Except, that I never could get Joshua to learn the most important lesson."

David Lightman: "What's that?"

Stephen Falken: "Futility. That there's a time when you should just give up."

bornach,
@bornach@masto.ai avatar

Meanwhile, Perplexity.AI seems to be picking up bad habits from GPT-3.5

Given a trivially easy jug filling problem it cites the unnecessarily complicated solution by GPT-3.5 that was discussed on an online forum last year.

bornach,
@bornach@masto.ai avatar

Or maybe it was taught by BingChat/Copilot - isn't Microsoft reportedly using GPT-4? Its solution is even more tortuous and later admits to measuring the wrong amount of water. It never realises it could have stopped after step 1.

ErikJonker, to ai
@ErikJonker@mastodon.social avatar

Veel posts over wat GPT4 niet kan verhullen af en toe wel hoe hoe goed het is in kennisvragen over complexe onderwerpen, ook met de betrouwbaarheid en de noodzaak tot controleren in het achterhoofd, heeft het daar veel toegevoegde waarde ten opzichte van Google Search. Met name in pure tekstvragen, uitleg van bepaalde concepten, theorieeen, frameworks etc in elke wetenschap die je kunt bedenken.

ErikJonker,
@ErikJonker@mastodon.social avatar

@Jigsaw_You ...natuurlijk maar ze kunnen zeker helpen bij het begrijpen van bepaalde materie, je wijzen op invalshoeken waar je niet op bent gekomen of bronnen die je niet kent , ik heb genoeg praktijkvoorbeelden varierend van geschiedenis tot quantum/thermodynamica . Niet ter vervanging van originele bronnen, want checken blijft nodig, maar het heeft zeker een nuttige functie, voor uitleg en inspiratie.

ErikJonker,
@ErikJonker@mastodon.social avatar

@Jigsaw_You ...LLMs zijn niet de oplossing voor alles en op zichzelf zelfs wellicht een doodlopende weg, maar als onderdeel van AI systemen onmisbaar, ook in de toekomst.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • megavids
  • thenastyranch
  • magazineikmin
  • cisconetworking
  • cubers
  • Youngstown
  • everett
  • ngwrru68w68
  • slotface
  • ethstaker
  • InstantRegret
  • kavyap
  • DreamBathrooms
  • Durango
  • JUstTest
  • rosin
  • khanakhh
  • modclub
  • osvaldo12
  • mdbf
  • normalnudes
  • Leos
  • GTA5RPClips
  • tacticalgear
  • tester
  • provamag3
  • anitta
  • lostlight
  • All magazines