By downloading this model, you agree not to use it for any commercial purpose, including selling images generated using this model. If you are interested in commercial licensing or commissioning a model, please contact me and we can discuss your use case.
If you use my models, please take the time to post some images to the pages so I can see how they are being used and make improvements in the future. Also feel free to reach out if you have ideas for future models!
Overview
This model is my first attempt at a multi-concept amputee model, and it really shows off what Qwen-2.5 VL is capable of as a text encoder. I tried for a long time to accomplish something like this with Flux and was not able to do so, mainly because t5xxl couldn't represent the concepts with enough fidelity.
How to Prompt (Please Read)
Qwen strongly prefers prose based prompts. This applies to negative prompts as well.
This model does not utilize a trigger word, and in general, I would recommend against trigger word approaches with Qwen, as its text encoder is significantly more expressive than what we have seen from older generation models based on CLIP or t5xxl.
Describe limb differences in a semi-clinical fashion. "She has bilateral hip disarticulations" works much better than "She is a dhd amputee" since common abbreviations like "dhd", "dae", etc don't encode to vectors with reasonable relationships during text encoding but the text encoder absolutely understands hips, shoulders, and disarticulation as English words.
Plan to spend a decent amount of words and sentences on the limb schema. Generally each caption contains 1-3 sentences of information, and sometimes more for complex/inconsistent schemas.
Limb Differences
Here are some examples of how to describe some limb differences:
She has bilateral {hip|shoulder} disarticulations
She has a {left|right} {above|below} knee leg amputation
She has a right {hand|foot} amputation
She is a limbless amputee. She has bilateral hip and shoulder disarticulations.
You can try describing the length of residual limbs, or where they end. Generally the model gets it to a point, but the more picky you are the lower the hit rate.
Fixing Extraneous Amputations
The model sometimes removes more limbs than you asked for, especially when you request asymmetric schemas like ooe, since like most models, Qwen Image expects lateral symmetry in human bodies.
If you find this happening, the fix is pretty simple: describe the intact limbs clearly. Where are they? Is the elbow bent? Etc. Add details about them until they are showing up reliably. With a good prompt, I can usually get an 80-90% hit rate on asymmetric schemas.
Lefts and Rights
Unfortunately, Qwen is not good at lefts and rights, and this form of fine tuning is not able to repair that across the board, despite careful captioning and consistent use of stage directions, which is the best practice. This likely won't be fixable with Qwen Image. You can always flip the image horizontally after the fact if you're really picky about it.
Scarring
This model has much more information about scars in its training set than my Flux models. The captions used during training took this into account. Here's some captioning advice:
For disarticulations, refer to the scarred areas as "amputation sites".
For limb or hand amputations, refer to the scarred areas as "residual limbs"
This was done consistently, and helps reinforce the differences between disarticulations and amputations with a residual limb.
Sample phrasing:
She has bilateral shoulder disarticulations with faint scarring at her shoulder amputation sites
She has bilateral hip and shoulder disarticulations with faint scarring at her amputation sites
She has bilateral above-knee amputations with short residual limbs. She has bilateral above-elbow amputations with very short residual limbs. She is supported by distal ends of her residual arms and legs.
Her amputation sites are smooth.
Her residual limbs are smooth.
I recommend avoiding the term "linear scars" as it tends to produce weird behavior. "scar tissue" is OK and will produce slightly different results than "scarring" which you may prefer. If the scars are coming out too gorey or wound-like, mention "wounds" in your negative prompt.
Finally, some images in the training set were described as "Electronic Surgery Images" if they were built in photoshop. I have not experimented much with prompting for this, but it's something you can play with if you have some nostalgia for that vibe.
Photorealism
Qwen's photorealism and lighting choices are not very good without prompting for specifics. You'll want to explicitly caption for photographic qualities that you find desirable:
crisp 35mm photograph of .... Subtle film grain enhances analog texture. The lighting is cool, even, diffused
35mm photograph of ... The gallery's recessed spotlights cast dramatic light across her torso and shoulders, emphasizing ... . Subtle film grain enhances analog texture while...
35mm Photograph of ... The setting features even, diffused daylight highlighting her complexion, with soft shadows emphasizing the contrast of...
Text
Qwen is great at text, and you can go wild with tattoos, captions, overlays, etc in a way that was not possible with Flux. However, be aware that text can leak unintended impacts onto the rest of the image.
If you are playing with text, try the same prompt/seed with and without the text to understand the tradeoffs. If you don't like what happens, you can always use Qwen Edit to add the text after the fact if it's causing you issues.
About the Sample Images
All sample images use the simplest Qwen ComfyUI workflows, with Qwen Image fp8 as the base, and no other LoRAs loaded. 20 Steps, Euler, Guidance of 3.0-4.0, and lora weight=1.0.
Training Details
This model was trained with a new, 500-image dataset that was LLM-captioned using Gemma3 12B, and then all of the concept-specific aspects were captioned by hand, one at a time.
I also used a data set of 10,000 high quality photographs for regularization which helps the model generalize without overfitting while fixing some of Qwen's built-in tropes and defects. The regularization dataset was also captioned using Gemma3 12B.
Generating NSFW content was not a goal during model training, and NSFW images will exhibit typical Qwen Image defects. Feel free to experiment with other LoRA's if that is your goal. In my experience so far, mixing Loras works much better with Qwen Image than it did with Flux.
Commercial Use Policy
By downloading this model, you agree not to use it for any commercial purpose, including selling images generated using this model. If you are interested in commercial licensing or commissioning a model, please contact me and we can discuss your use case.
Description
FAQ
Details
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.