#GPT4 - kbin.social

ALTAnlp, 2 days ago to Futurology

In the lead up to #ALTA2024, we're highlighting #research papers from previous #workshops.

Here, the ChatGPT C-LARA-Instance, Belinda Chiera, Cathy Chua, Chadi Raheb, Manny Rayner, Annika Simonsen, Zhengkang Xiang, and Rina Zviel-Girshin use the #OpenSource #CLARA platform to evaluate #GPT4's ability to perform #linguistics #NLP tasks such as #segmentation, #lemmatization and #glossing.

🔗 C-LARA platform: https://www.c-lara.org/

🔗 Paper: https://aclanthology.org/2023.alta-1.3/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ KathyReid

Jigsaw_You, 10 days ago to ai

Following much optimism regarding #GPT4's capabilities, recent studies highlight its limited effect on reply time and the potential risks associated with using #AI to draft replies to patient messages.

A study conducted by University of California San Diego School of Medicine showed an increase in response length and reading time and no effect on the reply time using #GenA.

https://healthairegister.com/news/2024/04/29/limited-effect-and-serious-risk-when-using-ai-to-draft-patient-replies/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Jigsaw_You

ianRobinson, 12 days ago to llm

I didn’t know that Drafts App had an Action to allow conversations with the OpenAI GPT 4 API. Just installed and tried it. It works a treat.

https://directory.getdrafts.com/a/2RB

I think I'll settle on paying for Anthropic Claude 3 via their web interface (I'll check out the API access at some point too), and use PAYG API credits via Drafts for access to GPT 4. The GPT 4 selector in the API currently redirects to gpt-4-turbo.

#LLM #Claude #GPT4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

brunus, 17 days ago to tech French

Hayé, l'IA est aussi conne que l'humain !
GPT-4 a passé le test de Turing.

#IA #Turing #GPT4 #tech #science

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

jaybird110127, 17 days ago to random

"Experts?" a father shouted from the crowd. "What experts have experience with school assemblies turning into monsters?"
#NoContext #GPT4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 20 days ago to ai

ChatGPT from OpenAI is a service, it's not necessarily the same as the model (GPT-4) that it is using in the background. OpenAI adds some elements like code interpreter which makes it perform (much) better then models without such features. Regardless OpenAI faces some good competition from the Llama3 models, i hope it will stimulate them to quickly release GPT-5.
#AI #Llama3 #opensource #GPT4 #GPT5 #ChatGPT

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 20 days ago to ai

The score of Llama3 70B on the LMSYS leaderboard is impressive. Although it's also clear that the latest GPT-4 is still a lot better. However Llama3 is opensource and freely available and a larger version (400B parameters) is on the way and will be closer to GPT4 with regard to performance on the various benchmarks.
https://chat.lmsys.org/?leaderboard
#AI #GPT4 #LMSYS #Leaderboard #Llama3 #opensource

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

PrivacyDigest, 20 days ago to security

OpenAI's GPT-4 can #exploit real #vulnerabilities by reading #security advisories

While some other LLMs appear to flat-out suck
#llm #gpt4 #openai #ai

https://www.theregister.com/2024/04/17/gpt4_can_exploit_real_vulnerabilities/

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ br00t4c

kubikpixel, 22 days ago to random German

Ich hoffe, das Passkeys diesbezüglich nicht betroffen ist so wie Passwort-Manager wie @keepassxc, @bitwarden inklusive 2FA schon einen grösseren Schutz gegenüber der KI ergibt.

»GPT-4 kann eigenständig bekannte Sicherheitslücken ausnutzen:
Forscher haben festgestellt, dass GPT-4 allein anhand der zugehörigen Schwachstellenbeschreibungen 13 von 15 Sicherheitslücken erfolgreich ausnutzen kann.«

🤖 https://www.golem.de/news/mit-cve-beschreibung-gpt-4-kann-eigenstaendig-bekannte-sicherheitsluecken-ausnutzen-2404-184301.html

—
#passkey #passwort #hack #ki #gpt4 #2fa #itsicherheit #sicherheitslucken

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

HonkHase, 22 days ago to random German

Mit #CVE-Beschreibung: #GPT4 kann eigenständig bekannte #Sicherheitslücken ausnutzen

"Forscher haben festgestellt, dass GPT-4 allein anhand der zugehörigen #Schwachstellenbeschreibungen 13 von 15 Sicherheitslücken erfolgreich ausnutzen kann."
https://www.golem.de/news/mit-cve-beschreibung-gpt-4-kann-eigenstaendig-bekannte-sicherheitsluecken-ausnutzen-2404-184301.html

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ bodomenke, MagicLike

cassidy, 1 month ago to ai

“AI” as currently hyped is giant billion dollar companies blatantly stealing content, disregarding licenses, deceiving about capabilities, and burning the planet in the process.

It is the largest theft of intellectual property in the history of humankind, and these companies are knowingly and willing ignoring the licenses, terms of service, and laws that us lowly individuals are beholden to.

https://www.nytimes.com/2024/04/06/technology/tech-giants-harvest-data-artificial-intelligence.html?unlocked_article_code=1.ik0.Ofja.L21c1wyW-0xj&ugrp=m

#AI #GenAI #LLM #LLMs #OpenAI #ChatGPT #GPT #GPT4 #Sora #Gemini

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Taffer, runarcn, Wildebeest, PhieLaidMignon +10 more

mattlav1250, 1 month ago to ai

artificial INTELLIGENCE...

This is from the paid PREMIUM version of GPT4 and DALL-E 3...

#ai #gpt4 #openai #microsoft #DALL·E #dalle3

image/png
image/png
image/png

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

bornach, 1 month ago to llm

Asked #Copilot (formerly #BingChat) a familiar riddle but with numbers changed to make it impossible. It generated the same solution but substituting the numbers so that it ends up with the nonsense claim:

10 + 5 = 23

#GPT4 #LLM #AI #fail

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 1 month ago to ai

Illustrates my personal experience with LLMs
"The finding underscores the notion that AI will likely be most useful as a tool to augment, not replace, the human reasoning process."
https://www.bidmc.org/about-bidmc/news/2024/04/chatbot-outperformed-physicians-in-clinical-reasoning-in-head-to-head-study
#AI #GPT4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

bornach, 1 month ago to ai

Stephen Falken: "Except, that I never could get Joshua to learn the most important lesson."

David Lightman: "What's that?"

Stephen Falken: "Futility. That there's a time when you should just give up."

#Copilot #BingChat #GPT4 #AI #LLM #fail

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 1 month ago to ai

Veel posts over wat GPT4 niet kan verhullen af en toe wel hoe hoe goed het is in kennisvragen over complexe onderwerpen, ook met de betrouwbaarheid en de noodzaak tot controleren in het achterhoofd, heeft het daar veel toegevoegde waarde ten opzichte van Google Search. Met name in pure tekstvragen, uitleg van bepaalde concepten, theorieeen, frameworks etc in elke wetenschap die je kunt bedenken.
#AI #GPT4

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

schizanon, 1 month ago to programming

I don't know if AI is going to replace programmers or not but there will be a lot of jobs just to delete AI generated code.

#programming #ai #llm #llms #chatgpt #gpt4 #gpt5 #copilot #gemini #claude

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

tero, 1 month ago to LLMs

#LLMs have really created a paradigm shift in machine learning. It used to be so that you would train an #ML model to perform a task by collecting a dataset reflecting the task, with task output labels, and then using supervised learning to learn this task by doing.

Now a new paradigm has emerged: Train by reading about the task. We have such generalist models that we can let them learn about the domain by reading all the books and other content about it, and then utilize that learned knowledge to perform the task. Note that task labels are missing. You might need those to measure the performance but you don't need those for training.

Of course if you have both example performances as task labels and lots of general material about the topic, you can actually use both to get even better performance.

Here is a good example of training the model not by example performances, but by general written knowledge about the topic. #GPT4 surpasses the quality levels of previous state-of-the-art despite not having been trained for this task.

This is the power of generalist models; they unlock new ways to train them, which for example allow us to surpass human-level by side-stepping imitative objectives. This isn't the only way to train skills these models enable, there are countless other ways, but this is an uncharted territory.

The classic triad of supervised learning, unsupervised learning and reinforcement learning are going to have an explosion of new training methodologies to become their peers because of this.

https://www.nature.com/articles/s41592-024-02235-4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kellogh

ErikJonker, 1 month ago to ai

Interesting, some empirical research on how GPT4 /ChatGPT performs in summarizing. Still the low rate of errors can be unacceptable in some contexts. As noted "Life-critical medical decisions should remain based on full, critical, and thoughtful evaluation of the full text of research articles in context with clinical guidelines.".
https://www.annfammed.org/content/22/2/113
#AI #ChatGPT #GPT4 #summary #medical

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 1 month ago (edited 1 month ago) to ai

Claude 3 is officially on the top of the leaderbord, although it's just one leaderboard/benchmark and added value always depends on use and context, it's still the end of GPT4 total dominance (unil GPT5 arrives probably). Interesting is also the performance of the Claude 3 Haiku model which is relatively small/cheap.
https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard
#leaderboard #Claude3 #GPT4 #AI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 1 month ago to security

Autofix on github, logical use of GPT4 in coding.
https://techcrunch.com/2024/03/20/githubs-latest-ai-tool-that-can-automatically-fix-code-vulnerabilities
#security #github #openai #GPT4 #autofix #ai

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 1 month ago to ai

Jensen Huang: OpenAI's latest model has 1.8 trillion parameters and required 30 billion quadrillion FLOPS to train.
Billion quadrillion somewhat hard to grasp... 😂
#AI #Nvidia #GPT4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ErikJonker, 1 month ago to GraphicsProgramming

(continued from previous post)...blackwell GPU will cost $ 30.000 (minimum), so training a GPT4 model with 2000 GPUs costs approx. $ 60 million ? (in 90 days, at a minimum because there are also other costs)
#training #GPT4 #GPU #Nvidia #Blackwell #AI #LLM

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

Usernamez, 1 month ago to DOOM

paper about DOOM and GPT4.
https://arxiv.org/pdf/2403.05468.pdf

#doom #E1M1 #gpt4 #ai #gamedev

image/jpeg

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ angelus_04

kubikpixel, 2 months ago to ChatGPT German

Schade ich mag ASCII-Art sehr doch wundern mich deswegen nicht. Dies ist eigentlich klar definiert und #Bilde'r in ASCII zu umwandeln ist schon lange keine #Kunst.

«#KI-Sicherheit –Wie Ascii-Art #GPT4 und Gemini austricksen kann:
Verschiedene Sicherheitsmechanismen sollen verhindern, dass euch #ChatGPT die Bauanleitung für eine #Bombe gibt. Jetzt haben #Sicherheit'sforscher:innen herausgefunden, dass sich die umgehen lassen – mit #Ascii-Kunst.»

🧷 https://t3n.de/news/ki-jailbreak-gpt-gemini-claude-1613119/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...