CivArchive
    Yet Another Watercolor Style - D4A2 Florence2
    NSFW
    Preview 1Preview 2Preview 3Preview 4Preview 5Preview 6Preview 7Preview 8Preview 9Preview 10

    Yet another watercolor style, trained using synthetic images generated with SDXL and SDXL Niji SE. The goal of the LoRA is flexibility, so that it can be used with other LoRAs without introducing too much of its own aesthetics.

    Known problem with the two earlier versions: the face is always Caucasian. The problem has been fixed by mixing in images of Asians.

    All versions work well (no captioned version seems a bit more flexible though), and can produce nice images at 8 steps with https://civitai.com/models/686704/flux-dev-to-schnell-4-step-lora

    (Seems to work a bit less well with https://civitai.com/models/678829/schnell-lora-for-flux1-d, but the tests are not conclusive)

    The last epochs are posted here. You can find earlier epochs here:

    tensor. art/models/79947554??39540972/YAWC-Watercolor-5CosD4A2R20NoC-2024-11-23-22:59:14-Ep-9

    tensor. art/models/799093298??4978244/YAWC-Watercolor-5CosD6A3R20NoC-2024-11-22-23:12:45-Ep-9

    tensor. art/models/798371619585168991/YAWC-Watercolor-5CosD4A2R20E8-2024-11-21-00:29:10-Ep-8

    tensor. art/models/798765094424184653/YAWC-Watercolor-5CosD4A2R20Cap-2024-11-21-02:52:27-Ep-8

    Training parameters:

    23 to 26 512x512 (downscaled from 1024x1024) images, captioned with Florence2 or by using "no captioning" training with just the trigger "yawc1 watercolor".

    FLUX.1 - dev-fp8

    Trigger: yawc1 watercolor

    Repeat: 20 Epoch: 8 or 9 epochs.

    Unet LR: 0.0005 Scheduler: cosine Optimizer: AdamW/AdamW8bit

    Network Dim: 4 Alpha 2 or Dim 6 Alpha 3

    Description

    Same dataset, but now using Florence 2 captioning.

    Seems a little bit less flexible compared to the "no caption" version (for example, it likes to make people sit on a chair). Also, seed to require more steps when using the Schnell 4-8 Steps LoRAs.

    So my recommendation is to use the "No Caption" version, but feel free to run your own tests and draw your own conclusions.

    Trained with 26 512x512 (downscaled from 1024x1024) images with Florence 2 caption (had to change "walking" to "running" and change "woman" to "Wonder Woman"

    Repeat: 20 Epoch: 8 total of 4160 steps

    Unet: 0.0005 Scheduler: cosine Optimizer: AdamW

    Network Dim: 4 Network Alpha: 2

    Epoch Loss

    1 0.415

    2 0.406

    3 0.403

    4 0.378

    5 0.363

    6 0.354

    7 0.350

    8 0.343