Visiting The Fae

Visiting The Fae

2024, Mar 29    

These first few posts are going to be about fairies and mushrooms. It’s not my usual thing, really, but I’m doing some clipart type images of mythical creatures in my signature ink & watercolor style. We’ll get into adding my style later. First we have to have a decent image to apply that style to.

I came up with this prompt, just on a whim

a group of fairies dancing in a circle around a mushroom. the mushroom is on fire.

I started with this a while ago, and I believe Midjourney v6 has advanced quite a ways since i started (the worst part about trying to document process on MJ is that it moves so quickly). A lot of people say that this construction works well for MJ 6, and I’d agree. It can take subsequent sentences about the things you’ve already asked for, as long as there’s no ambiguity. “The mushroom is on fire” is better than “it is on fire” because, while it’s probably clear to humans, the model will have trouble figuring out if “it” is the mushroom or the fire or the circle or maybe even the fairies.

After some false starts, I ended up with this prompt:

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –style raw –stylize 800

Even though I’ve been at it for months, I’m still trying to get the hang of style vs stylize. I know academically what they’re supposed to do, but I keep finding the results unpredictable. So I did a quick exercise to isolate some of the differences

I’ll start with a new run of my prompt so i can copy the seed value. We’ll start with no arguments. I run with stylize medium, which sets --s to 100, so we’re still involving the model’s visual sensibilities, but not too much.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background

My original image has seed 623181606 so we’ll specify that from here on out.

Here we get general prompt adherence, and enough of the model’s style to arrange things decently inside the image.

Here’s what I get with --style raw to take some aesthetic decisions away from the model. Taking some of those choices away, you get images closer to the prompt, but maybe at the expense of the images being a touch less coherent, making a little less sense visually.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –style raw –seed 623181606

Now we’ll play with the interaction between these two values. Here is --style raw with a lower --style 50 argument, equivalent to setting “stylize low”. Weirdly I think this is noticably poorer prompt adherence, with the right elements but just kind of thrown all over the canvas.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –s 50 –style raw –seed 623181606

if we go all the way to --s 0 we get quite a lot of prompt adherence. This is fine with me, as we can “pretty it up” later.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –s 0 –style raw –seed 623181606

For illustration, we’ll go --s 250, equivalent to stylize high. None of the shading and depth that MJ wants to give it by default, but otherwise nicely arranged.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –s 250 –style raw –seed 623181606

Just to prove our point about involving the model, we’ll also look at these without raw mode.

Here’s --s 50 without --style raw. Notice the overall cohesive circular arrangement, along with some odd details like mushrooms growing out of mushrooms or the mushroom cap being upside down. But the image still has some symmetry to it and you get a nicely composed image, even if what’s in the image is odd.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –s 50 –seed 623181606

And here’s --s 250 without --style raw. We have a lot more detail in the characters, but we still aren’t completely sure it knows what a mushroom looks like.

fantasy-themed clipart of a group of fairies dancing in a circle around a mushroom. the mushroom is on fire. 2d flat vector art on white background –s 250 –seed 623181606

In general these are the use cases I find for these paramters:

Add --style raw along with a low --s value when the model doesn’t seem to be listening to my prompt. It will take less liberty and give me something closer to the prompt text, even though I’ll have to dress up the presentation later. This is fine by me because I tend to add styles last.

High --s values are great for early in your process, when I’m still trying to see what’s possible. The more iterations I do, the lower I set the --s param so that I’m more in charge of what’s changing.