CivArchive
    post_rl_indexed_v2_lora - self-fit-3
    NSFW
    Preview 1

    LoRAs trained using DRaFT-like post-RL using JoyQuality with a new pairwise head

    Don't expect sensible results on any other model than indexed v2

    These are just experiments. I will often not avoid over-training and forgo stronger regularization as I am more so looking for what effects are most prominent for varying data regimes.

    Description

    retraining pairwise head on different data mixture
    different regularization during rl training

    LORA
    Illustrious

    Details

    Downloads
    2
    Platform
    SeaArt
    Platform Status
    Available
    Created
    11/30/2025
    Updated
    12/13/2025
    Deleted
    -

    Files

    Available On (1 platform)

    Same model published on other platforms. May have additional downloads or version variants.