CivArchive

    Version 4.0 was based on AnimagineXL. Version 4.1 was trained with PonyXL v6 as the base, using a dataset of over 24k images selected to teach concepts that were missing from the model. Version 4.2 is a merge of 4.1 and 4.0, with further training afterwards to stabilize the merge and add styles besides anime. As a result, use of this model must adhere to the licenses of both Pony and Animagine. The main focuses of this model are NSFW, holding weapons, and horror/monster themes (which is where the "unsafe" name comes from), but it is meant to be a generalist model capable of handling images with or without those concepts. There is also a focus on being able to prompt facial expressions and general art styles.

    Because the base is an anime model trained on booru tags, the text encoder was fully trained for this version. It should therefore have much better prompt adherence when using booru tags, but worse natural language performance. Because the model has been trained on a variety of styles, you'll get more consistent results if you explicitly prompt for the style you want, such as "photorealistic", "realistic", "3d", "anime", "cel shading", "painterly", "1980s (style)", "oil painting (medium)".
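    As a rough illustration of the advice above, the sketch below assembles a comma-separated booru-tag prompt that always includes one explicit style tag. The helper name build_prompt, the choice to put the style tag first, and the subject tags in the usage line are assumptions made for the example; only the style tags themselves come from the list above.

```python
# Style tags quoted in the description above.
STYLE_TAGS = (
    "photorealistic", "realistic", "3d", "anime", "cel shading",
    "painterly", "1980s (style)", "oil painting (medium)",
)


def build_prompt(subject_tags, style):
    """Join booru-style tags into one prompt string with an explicit style tag.

    Hypothetical helper: placing the style tag first is an assumption,
    not something the model description specifies.
    """
    if style not in STYLE_TAGS:
        raise ValueError(f"unknown style tag: {style!r}")
    seen = set()
    tags = []
    # Drop empty entries and duplicates while preserving order.
    for tag in [style, *subject_tags]:
        tag = tag.strip()
        if tag and tag not in seen:
            seen.add(tag)
            tags.append(tag)
    return ", ".join(tags)


# Illustrative subject tags only; substitute your own.
prompt = build_prompt(["1girl", "holding sword", "angry"], "oil painting (medium)")
print(prompt)  # oil painting (medium), 1girl, holding sword, angry
```

    The resulting string can be pasted into whatever front end you use as the positive prompt.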

    Description

    This version was trained from version 3.2, which uses Juggernaut v6 as a base, but also had the DPO LoRA merged in near the end. The major change this time is training the text encoder on a number of tags, but only one tag at a time, to avoid affecting the natural language part more than necessary. The dataset now consists of 12k images to train towards and 4k images used for negative training.