john,
@john@sauropods.win avatar

Can somebody please get an AI image generator to generate a fox with a human face or head for me?

The other way around is really easy and does not count.

kromeboy,
@kromeboy@mastodon.uno avatar

@john Something like this?

Stable Diffusion
Model: ArtUniverse
method: inpainting then image to image

Prompt in the alt text

john,
@john@sauropods.win avatar

@kromeboy I don’t think I painting counts. It’s too much like just compositing in photoshop.

kromeboy,
@kromeboy@mastodon.uno avatar

@john by prompt alone I think that is really hard because the models are trained on man dressed as animals but not animals dressed as man 😄

john,
@john@sauropods.win avatar

@kromeboy Yes, they are conservative. Dall-e doesn't want to do it, but you can make it by giving it a really long flowery prompt.

BlackPhi,

@john Something like this? The trickiest bit was getting it so it doesn't look photoshopped. The inpainting was denoised at 0.9. I don't know if the Americanisations made any difference, I suspect not. There is cleaning up to do around the tail and legs, but that is to be expected with . The initial model was SD's 768-v-ema and the inpainting used the Realistic Vision 5.1 inpainting model, masking the eyes and muzzle.

john,
@john@sauropods.win avatar

@BlackPhi Yeah, infilling is possible, but I guess what I'm interested in is not so much how to get a certain thing done (I can paint/photomanipulate/etc. myself), but what is going on with AI imaging in general .

I'm finding the coaxing aspect interesting. It's just not as simple as the model doesn't have reference images, because DALL-e makes some things hard, but with some cajoling will actually do a pretty good job.

akira28,
@akira28@mastodon.social avatar

@john first try in dalle-3 via chatGPT. Prompt: a fox with a human head and face. A bit creepy for the proportions, but I would say it’s a good job

john,
@john@sauropods.win avatar

@akira28 Yes, if you can load the rest of the thread (unreliable on Mastodon, I know), you'll see that ChatGPT gives a more elaborate prompt to Dall-E, based on your prompt. If you use Bings version you need a fairly elaborate prompt to get it to work.

trachelipus,
@trachelipus@masto.ai avatar

@john I'm insufficiently motivated to spend my own money buying DallE credits in service of this question, but I wonder how it would handle the prompt "A sphinx with the body of a fox"

john,
@john@sauropods.win avatar

@trachelipus You asked for: soft-core furry porn. Vending:

trachelipus,
@trachelipus@masto.ai avatar

@john Furry porn indeed, lol. I was hoping it would know a sphinx has the head of a human and the body of a lion,and thus understand it was supposed to keep the human head while replacing the lion parts with fox. Instead it went full on Egypt while keeping the fox face. A centaur with the body of a fox isn't quite what you want, since I assume you don't want the human torso. Hmm. Interesting challenge.

Twarda,
@Twarda@sauropods.win avatar

@john Out of curiosity. Why do you need human faced foxes xD

john,
@john@sauropods.win avatar

@Twarda Well it would save me a lot of time painting things like this

Twarda,
@Twarda@sauropods.win avatar

@john Lol that's valid

zillophane,

@john midjourney "a person whose body ONLY has been transformed into the body of a fox, but still with the face of a human" partial success

john,
@john@sauropods.win avatar

@zillophane Eh, that's the old fox head on a human body, which Dall-e also really likes to do.

Interesting the aesthetic aspect to this. There's more than your prompt going on.

zillophane,

@john another failure "a photograph of a man caught outside in a realistic fox costume, with his shocked face looking directly at the camera, in the style of nature photography"

john,
@john@sauropods.win avatar

@zillophane Your fox costume idea was good. Didn't work of course, but you never know.

zillophane,

@john this obviously didn't work, but it's still beautiful. Midjourney must have been really selective about their training set

john,
@john@sauropods.win avatar

@zillophane I think they're adding stuff to your prompts to get stylised results.

zillophane,

@john nature beat us to it: the Tibetan fox

john,
@john@sauropods.win avatar

@zillophane And God prompted, “let there be a fox, with the face of a fox, and yet of a man, so that no man knoweth why it unsettles him so" and so it was.

TEG,
@TEG@mastodon.online avatar

@john Enjoy this lovely and not at all horrible creation!

john,
@john@sauropods.win avatar

@TEG What AI is this? It looks Stable Diffusion-ish. What was the prompt?

TEG, (edited )
@TEG@mastodon.online avatar

@john I have very little clue, it's from https://www.craiyon.com/, with the prompt (I think...):

A realistic orange vulpine lammasu, with the body of a fox. The head belongs to Albert Einstein. The fox is orange. The head is smiling. The body is visible. The tail is bushy. The head is a scientist with his tongue out. The face is pink. The head is fully human.

And negative prompt: fox-headed

The vague idea was to get a well-known human head in there to force the AI into using it.

john,
@john@sauropods.win avatar

@TEG Dall-e mini, apparently, which is an attempt to match Dall-e with an open source model. It's seems to pay more attention to the prompt than Stable Diffusion, which gave me this:

pbloem,
@pbloem@sigmoid.social avatar

@john DALL-E 3 seems to manage, and it's suitably majestic.

john,
@john@sauropods.win avatar

@pbloem Damn, you did it! How many times did you try?

pbloem,
@pbloem@sigmoid.social avatar

@john First try, I promise...

john,
@john@sauropods.win avatar

@pbloem I burned all my credits on Bing and it wouldn't work. @AggroBoy tried for ages on DALL-E4 and nope.

I wonder if putting it through the chat interface made it work?

pbloem, (edited )
@pbloem@sigmoid.social avatar

@john @AggroBoy The success rate is 3/5. Not counting this one, which is technically correct, I guess.

pbloem,
@pbloem@sigmoid.social avatar

@john @AggroBoy ChatGPT does do some stuff behind the scenes, including writing its own prompt. You can see part of that in the filename when you download the image. The prompt it wrote for the first image I posted was

"A surreal and imaginative depiction of a fox with a human head, blending the natural orange and white fur of the fox with the distinct features of a h..." (the filename cuts off there).

pbloem,
@pbloem@sigmoid.social avatar

@john @AggroBoy Ah, I can just ask it what prompt it used. Here's the full thing.

john,
@john@sauropods.win avatar

@pbloem @AggroBoy AIs are better prompters than humans. There we go.

BlueTurtleAI,
@BlueTurtleAI@hachyderm.io avatar

@john @pbloem @AggroBoy At least ChatGPT, because it is trained for that purpose. But it can only talk to dall-e, I guess other image generation ais don’t like the generated prompts very much.

john,
@john@sauropods.win avatar

@BlueTurtleAI @pbloem @AggroBoy I thing Midjourney has a prompt-rewriter in there.

hrbrmstr,
@hrbrmstr@mastodon.social avatar

@john one try (via ChatGPT+)

john,
@john@sauropods.win avatar

@hrbrmstr It seems that ChatGPT knows how to get DALL-E to do what it wants, but people less so.

hrbrmstr,
@hrbrmstr@mastodon.social avatar

@john i asked “Please create a realistic image of a fox with a human head/face.” and the resultant prompt it created was this (full text in alt-txt)

john,
@john@sauropods.win avatar

@hrbrmstr Yeah, I pasted the ChatGPT generated prompt upthread into Bing and got this. You need to write a short story to get what you want.

Stable Diffusion still utterly fails though.

sjosjo,
@sjosjo@mas.to avatar

deleted_by_author

  • Loading...
  • hrbrmstr,
    @hrbrmstr@mastodon.social avatar

    @sjosjo @john I try to not get out of the habit of "please" since i know it'll carry over to humans if I do.

    catselbow,
    @catselbow@fosstodon.org avatar

    @john
    By "other way around" do you mean "get a fox with a human head to generate an AI image generator"?

    john,
    @john@sauropods.win avatar

    @catselbow That’s how we got them, right?

    Joking aside, this is serious, be serious and get prompting!

    AggroBoy,
    @AggroBoy@mastodon.social avatar

    @john I spent about half an hour trying last night and just couldn't get it to do it. In a great many attempts, I got one example out of DallE4 (that I didn't save) where it had sortof superimposed a textureless grey human face on top of a fox's head, but it didn't look like it was actually part of the head. The rest were either normal foxes, or humans wirh fox heads. Weirdly, it really liked generating humans with HUGE foxes heads.

    AggroBoy,
    @AggroBoy@mastodon.social avatar

    @john I guess it's just a rare (unique?) enough composition that there are no examples to extrapolate from in the various training sets.

    john,
    @john@sauropods.win avatar

    @AggroBoy Yeah, but I would have thought it could just composite the concepts, like it does with subject and background, for example.

    miekeroth,
    @miekeroth@socialserver.science avatar

    @john seriously..

    john,
    @john@sauropods.win avatar

    @miekeroth It’s trained on way too much furry art!

  • All
  • Subscribed
  • Moderated
  • Favorites
  • ai
  • Durango
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • tacticalgear
  • khanakhh
  • Youngstown
  • mdbf
  • slotface
  • rosin
  • everett
  • ngwrru68w68
  • kavyap
  • InstantRegret
  • JUstTest
  • cubers
  • GTA5RPClips
  • cisconetworking
  • ethstaker
  • osvaldo12
  • modclub
  • normalnudes
  • provamag3
  • tester
  • anitta
  • Leos
  • megavids
  • lostlight
  • All magazines