I was following it correctly up until the part where you have to place a child on your laptop. I wish these things would let you know the parts required beforehand.
fwiw I’ve used it pretty extensively on screenshots of text I keep getting sent at work, so far I haven’t noticed any mistakes at all. may just be the type of images though
I’m actually impressed by the reasonably coherent (though nonsense) text. If you think about how generative AI works it’s very surprising it could form words in images.
Microsoft’s image generator has been getting better and better at text. There are still plenty of problems, especially with small text, but someone on another forum was able to get it to output this with a very small prompt:
I worked in a print shop in the 90s (until 95) and we still used xacto knives for our layouts. We had a computer but on now really knew how to use it for graphic design yet.
Yep, sure, it’s a wild world we live in and this topic is changing fast. Missing this memo won’t matter when the next one will be the next generation but generations are only 6 months apart.
I disagree. This is as you say Precisely the type of thing that happens when an image generator is asked to make a chart/diagram, so to me it seems a really wild leap to go from “This looks like exactly what happens when X” to “someone must have designed this to look like what happens when X”.
If it were human designed, I think it would be intentionally funny (which realistically would backfire, but anyway…)
(And besides, paid ChatGPT does indeed connect to DALL-E 3 now)
Tbf I thought DALL-E3 was still just available via bing image creator, missed the memo that ChatGPT was hooked up to it too.
Still, for me though it still looks like it’s human generated to try and be funny (it’s just haha-AI-so-silly isn’t groundbreakingly funny any more). It’s mostly the information continuity throughout the image that I’ve not really seen from an image generating AI before (especially when not even prompted for it), and I’ve had a play around with DALL-E3 so I would expect the ChatGPT version to be equivalent.
Maybe I’m too cynical, but this just reeks of fake to me.
ChatGPT takes the liberty of creating a DALL-E prompt that it doesn’t feel the need to share with the user. You can, however, ask ChatGPT to share the exact prompt and seed with you to reproduce the image. Here is the actual prompt and seed DALL-E ended up working with:
Prompt: “A step-by-step visual guide on using Optical Character Recognition (OCR) in Microsoft Word. The guide includes steps like opening Microsoft Word, inserting an image into a Word document, selecting the image, and using the OCR feature to convert the text in the image into editable text. The layout should be clear and easy to follow, with each step labeled and illustrated in a user-friendly manner, catering to users with basic proficiency in Microsoft Word.”
Seed: 3993182816
To be clear, ChatGPT decided on its own to create and send this prompt to DALL-E in response to my request for tech support.
Add comment