Fyre, a fictional character, redhead with freckles.
Trained on 20 Klein 9b generated images generated from an initial z-image generated image.
Trained with diffusion-pipe, this is a 128-rank lora trained on z-image (non-turbo, aka "base"), wan 2.2 14B low noise, and qwen image 2512, using the ProdigyPlusScheduleFree optimizer (https://github.com/tdrussell/diffusion-pipe/pull/483), and masked training with backgrounds masked to 10%.
The model was evaluated every 200 steps (10 repeats of each training image) using FaceNet512 to evaluate the face similarity, using the same settings, seed, etc for inference, calculating the average distance to a subset of 10 of the training images. For the z-image model, the best face similarity was achieved around epoch 16 and 17 (3,200 steps and 3,400 steps). For the qwen image 2512 model, the best face similarity was achieved at epoch 20 (4,000 steps). For wan 2.2 low noise, the best face similarity was achieved at epoch 10 (2,000) steps.
The attached training data zip includes: the diffusion pipe config files; the training images, prompts, and masks; the validation images (subset of training images); and - for the z-image model - the output of MirrorMetrics evaluation of epoch16 and 17 vs. the validation set.
z-image model evaluation:
For average distances, lower is better. Distance under 0.3 is considered a face match.
Matching percentage is the percent of the 10 validation images that the generated face was under 0.3 distance from.
epoch 16 (3,200 steps)
z-image (non-turbo) inference
Average distance: 0.187
Matching percentage: 100%
z-image-turbo inference
Average distance: 0.262
Matching percentage: 80%
epoch 17 (3,400 steps)
z-image (non-turbo) inference
Average distance: 0.193
Matching percentage: 100%
z-image-turbo inference
Average distance: 0.255
Matching percentage: 80%
Evaluation using MirrorMetrics (html output included in training data attachment)

Training config:
(included in attached training data attachment)
Training command:
PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True NCCL_P2P_DISABLE="1" NCCL_IB_DISABLE="1" deepspeed \
--num_gpus=1 train.py \
--deepspeed \
--regenerate_cache \
--config /home/user/diffusion-pipe-config.tomlDescription
version 1, epoch 16 (3,200 steps)








