simon

@simon@simonwillison.net

Open source developer building tools to help journalists, archivists, librarians and others analyze, explore and publish their data. https://datasette.io and many other #projects.

This profile is from a federated server and may be incomplete. Browse more on the original instance.

simon, 22 hours ago to random

I wrote about a common misconception I see people have about LLM tools like ChatGPT

Training is not the same as chatting: ChatGPT and other LLMs don’t remember everything you say

https://simonwillison.net/2024/May/29/training-not-chatting/

reply

expand (29)

collapse (29)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Binder, futurebird, juandesant, matt +1 more

simon, 22 hours ago

If you spend a lot of time with LLMs it's easy to fall into the trap of assuming that other people already understand things like this - which can lead to frustrating conversations where people are bringing very different mental models of how these things work

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 20 hours ago

@ericflo yeah I don't think that's documented at all - for different chat bots, how is hitting the context limit in a conversation handled? Some might truncate earlier messages but there are summarization tricks that might end up used as well

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 20 hours ago

This is yet another of those unintuitive things that stem from wrapping a chat interface around an autocompletion language model

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 20 hours ago

@troed I mention that in my article - there are plenty of reasonable reasons that people end up believing this!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 18 hours ago

@mcc that's part of my frustration here: I can't say for sure how this stuff works because OpenAI don't document it!

I frequently use prompt leaking tricks (or chat exports) to confirm that I understand how their low-level prompting for ChatGPT works myself, but that's a pretty unreliable form of documentation

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 18 hours ago

@glyph @mcc headlines are always hard, especially wince thee days it's clear that a lot of people genuinely won't read more than the headline

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 18 hours ago

@mcc @glyph the thing I care about most here is that lots of people really do believe that anything they say to a model is instantly memorized and becomes part of its global "brain" (available to all users) - and that's what the term "training" means to them

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 17 hours ago

@me1000 I made that exact comparison back in December! https://simonwillison.net/2023/Dec/14/ai-trust-crisis/#facebook-dont-spy-microphone

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 1 day ago to random

Wrote up an alternative way of getting Cloudflare to redirect one domain (in this case a no-www domain) to another using redirect rules https://til.simonwillison.net/cloudflare/redirect-rules

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 1 day ago

One of those TILs where I figure something out, then go to my TIL website to write it up and discover I already figured out a similar solution just a few months ago! https://til.simonwillison.net/cloudflare/redirect-whole-domain

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ juandesant

simon, 22 hours ago

@alexelcu I have a detailed TIL about how the most recent version of that works here https://til.simonwillison.net/shot-scraper/social-media-cards

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

KevinGimbel, 1 day ago to random

They be putting AI into everything, and they destroy years of trust.

https://kevingimbel.de/blog/2024/05/re-trust/

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 19 hours ago

@KevinGimbel was that Golden Gate Bridge suicide one confirmed as real? I got the impression it might have been one of the faked screenshots that were floating around (pretty condemning that it's so easy for us to believe in those fakes if that's true though)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 1 day ago to random

Weeknotes: mainly PyCon US and updates to both LLM and some Datasette plugins to support GPT-4o and Google's Gemini Flash https://simonwillison.net/2024/May/28/weeknotes/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jonkeegan, 1 day ago to random

NEW POST on my @Beautifulpublicdata newsletter:

New FOIA records from the FAA shed light on the frantic effort in 2015 to rename navigation waypoints related to Donald Trump and reveal the list of naughty waypoint names that were changed over the years.

https://www.beautifulpublicdata.com/trump-naughty-faa-waypoints/

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ stefan

simon, 1 day ago

@jonkeegan @Beautifulpublicdata Is it possible to FOIA the rationale for more of those renames? As a big Narwhal fan I'd love to know why NARWL become FOLET for example

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

tannewt, 2 days ago to random

@simon I’m finely understanding the value of GitHub copilot when coding. Thanks for encouraging folks to try llm tools. Any tips for trying the open code completion models when coding? I use sublime text but am curious enough to use another editor. Thanks!

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 1 day ago

@tannewt I haven't spent much time with copilot alternatives yet - not sure what the best tooling is for that right now

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

22, 2 days ago to random

@simon I hope it's ok to ask for support in this manner, if not I apologize!

Does the llm package allow me to download (and sync locally) chats I've had with the web version of ChatGPT? Or does logging only support chats done through the llm package itself?

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 2 days ago

@22 it doesn't support that

It's an interesting idea for a plugin though! Could work by first having you request your JSON export from ChatGPT, then converting that JSON to the LLM SQLite schema

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 4 days ago to random

Just had a delightful voice conversation with ChatGPT (4o) where I asked it how come there was an Elizabeth line train on platform 8 (surface platform) at London Paddington and it explained that there is a set of ramps at Paddington to allow trains to get from the deep below ground lines up to the surface, and then wrote some Python code to render me a diagram https://chatgpt.com/share/1afcc398-b1cf-424a-8835-5b6a2985168b

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 4 days ago

@researchbuzz yeah the ChatGPT iPhone app has a little headphones icon that starts an audio conversation, currently using whisper for speech-to-text and their TTS model for text-to-speech - it's really fun

(At some point they'll be switching that over to their creepy new 4o voice mode but that's not enabled yet https://simonwillison.net/2024/May/15/chatgpt-in-4o-mode/ )

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 3 days ago

@boffbowsh I am 99% confident that no such ramp exists!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 3 days ago

To clarify, I am 99.9% confident that such a ramp does not exist!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 3 days ago

@eliocamp yes, but I found it VERY amusing

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jonty, 4 days ago to random

Well, that was the worst day I have had in months. Please send animal gifs.

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 4 days ago

@jonty here's a really big chicken I met the other day

It's a very big dark brown/black chicken (possibly a rooster?) drinking from a purple bucket

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 5 days ago to random

There is a limited time opportunity right now to try a version of Claude that's completely obsessed with the Golden Gate Bridge, and it is howlingly entertaining https://www.anthropic.com/news/golden-gate-claude

Visit https://claude.ai/ and click the little bridge icon

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ binaryphile, miki

simon, 5 days ago

In tragic news, it looks like Golden Gate Claude is no longer available

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mpesce, 6 days ago to random

Now it can be told:

While doing some AI engineering work for a client, I developed a prompt - completely inadvertently - that reduced every AI chatbot to gibberish (except Anthropic's Claude 3). I then spent a week trying to alert the LLM vendors to this issue - and largely failed. There is no mechanism to report flaws in these models that are already deployed to billions of users. Read the whole story in @theregister

https://www.theregister.com/2024/05/23/ai_untested_unstable/

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ 65dBnoise, j_bertolotti, oblomov, samhainnight +3 more

simon, 4 days ago

@mpesce @researchbuzz @theregister this keeps on happening with prompt injection

Here's an example that was responsibly disclosed in December, nothing happened, the researcher published 4 months later and THEN Google finally mitigated it in response to the public disclosure https://embracethered.com/blog/posts/2024/google-notebook-ml-data-exfiltration/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...