Incomplete anime pony - CivArchive (CivitAI Archive)

Incomplete anime pony - v1.0

NSFW

Version Arknights update:

Arknights 版本使用50K张明日方舟图片训练。

因为发现对画风造成了有趣影响，于是做成了大模型。

基于原版pony训练。

在A100 40G 上训练了 50小时。

The Arknights version was trained using 50,000 images from Arknights.

Due to the interesting effects observed on the art style, it was developed into a large model.

Based on the original Pony model.

Training took 50 hours on an A100 40G.

Prompt

score_9,score_8_up,score_7_up

Negative prompt

logo,score_4,score_5,score_6,lowres,(bad),text,error,fewer,extra,missing,worst quality,jpeg artifacts,low quality,watermark,unfinished,displeasing,oldest,early,chromatic aberration,signature,extra digits,artistic error,username,scan,[abstract],

Version Alpha update:

-Further fine-tuning the furry style in the pony base model to a Japanese anime illustration style.

-The current version of the model can be directly used for generating images.

The base model remains chenkin_20w.safetensors trained using a 200,000 training dataset.

Several manually selected high-quality illustrations are used for targeted training (provided by Euge).

Each illustration is of exceptional quality, labeled as "amazing."

The trained model is merged with chenkin_20w.safetensors with a weight of 0.7.

Therefore, the current quality indicators are:

amazing,best，hight,score_9，

or simply:

amazing,best，hight,

Alpha版本更新：

-pony底模中的furry风格进一步微调为日系动漫插画风格。

-现在版本的模型，可以直接用于出图。

底模依然为使用20w训练集训练的chenkin_20w.safetensors

对若干张人工挑选的超高质量插画进行针对训练。（由尤吉提供）

每一张都是千中挑一的水准，打上了`amazing` 的质量标签。

训练后的模型，以0.7的权重与chenkin_20w.safetensors 合并。

因此，现在的质量提示词是：

amazing,best，hight,score_9，

或者，仅仅使用：

amazing,best，hight,

特别鸣谢：算力赞助 / GPU Sponsor: Neta

特别鸣谢：算力赞助 / GPU Sponsor: nieta.art

This model may not be suitable for generating images and it is not recommended for beginners to download.

Intended as a reference (or lesson) for colleagues who are training on Pony Diffusion V6 XL.

Based on Pony Diffusion V6 XL, trained on 200k anime images, using an A40 48G for over 7 days.

The model was trained to generate images in the style of Japanese anime, but did not achieve the expected results.

1. chenkin_pony.safetensors

Pretrained on 13k random anime images.

2. chenkin_20w.safetensors (Current model)

Trained on top of chenkin_pony.safetensors.

Used 190k selected images from yande with no quality labels (provided by Miss Erity).

Used 10k manually selected high-quality illustrations with the "hight" quality label (provided by Euge).

Using 100 manually selected ultra-high-quality illustrations with the "amazing" quality label (provided by Euge).

amazing,hight,score_9

Trained using the bmaltais/kohya_ss (github.com) project.

This project consumed a significant amount of GPU computing power, thanks to Neta for providing the computing resources.

Special thanks to GPU Sponsor: Neta.

这个模型可能不适合用来生成图片，同时不建议初学者下载。

旨在为更多在Pony Diffusion V6 XL上训练的同仁提供参考（或者说教训）

基于Pony Diffusion V6 XL 训练，使用200k的动漫图片，在 A40 48G 训练了超过7天。

该模型是为了更好生成日本动漫风格的图片而训练，但是没有取得预期。

1.chenkin_pony.safetensors

使用了13k随机动漫图片进行预训练。

2.chenkin_20w.safetensors （当前模型）

在chenkin_pony.safetensors的基础上进行训练

使用了190k张来自yande的精选图片，无质量标。（由二小姐提供）

使用10k张人工挑选的高质量插画，打上了`hight` 的质量标签。（由尤吉提供）

使用100张人工挑选的超高质量插画，打上了`amazing` 的质量标签。（由尤吉提供）

amazing,hight,score_9

使用bmaltais/kohya_ss (github.com ) 项目进行训练。

此项目耗费了大量的GPU的算力，感谢 Neta 愿意为我提供算力。

特别鸣谢：算力赞助 / GPU Sponsor: Neta

训练参数如下（train set）：

[sdxl_arguments]
cache_text_encoder_outputs = false
no_half_vae = false
min_timestep = 0
max_timestep = 1000

[model_arguments]
pretrained_model_name_or_path = "/root/autodl-tmp/stable-diffusion-webui/models/Stable-diffusion/chenkin_pony/chenkin_pony.safetensors"

[dataset_arguments]
shuffle_caption = true
debug_dataset = false
train_data_dir = "/root/autodl-tmp/20w"
dataset_repeats = 1
keep_tokens_separator = "|||"
resolution = "1024, 1024"
caption_dropout_rate = 0
caption_tag_dropout_rate = 0
caption_dropout_every_n_epochs = 0
token_warmup_min = 1
token_warmup_step = 0
enable_bucket = true
min_bucket_reso=640 
max_bucket_reso=2048
bucket_reso_steps=64


[training_arguments]
output_dir = "/root/autodl-tmp/stable-diffusion-webui/models/Stable-diffusion/chenkin_20w"
output_name = "chenkin_20w"
save_precision = "fp16"
train_batch_size=6         
vae_batch_size=4 
max_train_epochs=1 
save_every_n_steps = 2000
max_token_length = 225
mem_eff_attn = false
xformers = true
sdpa = false

max_data_loader_n_workers = 8
persistent_data_loader_workers = true
gradient_checkpointing = true
gradient_accumulation_steps = 1
mixed_precision = "fp16"

[sample_prompt_arguments]
sample_every_n_steps = 200
sample_sampler = "euler_a"
sample_prompts="/root/example.txt"

[saving_arguments]
save_model_as = "safetensors"

[optimizer_arguments]
optimizer_type = "AdaFactor"
learning_rate = 7.5e-7
train_text_encoder = false
learning_rate_te1 = 0
learning_rate_te2 = 0
optimizer_args = [ "scale_parameter=False", "relative_step=False", "warmup_init=False",]
lr_scheduler = "constant_with_warmup"
lr_warmup_steps = 100
max_grad_norm = 0

Description

FAQ

Comments (20)

flyx3Mar 6, 2024· 2 reactions

CivitAI

精神支持~！

说句可能遭喷的话，pony在2D方面的下限比animagine要低，更难适配和微调（上限孰高孰低无法判断），2.5D方面是远好于animagine的（可能和animagine的训练集更集中在近期的acg作品和角色，而pony则比较杂食）。另外就是200K的体量其实对XL模型影响有限，20M应该会有比较大的改善，那就是另外一个算力和时间的故事了~

╮(╯_╰)╭

Chenkin

Author

Mar 6, 2024

很好的建议，其实。pony非常非常难调，我之前1W3数据集炖了好久，然后效果甚微，后面一怒之下，把20W砸进去了（还是效果甚微）。目前下一步的打算是等pony V7 ,或者微调其他模型

Chenkin

Author

Mar 6, 2024

后续如果我还不放弃pony的话，大概会使用lokr的方式微调（更省时间），或者用大量同画风的素材（如nai3）去把原本的furry欧美风格覆盖掉。会考虑使用融合以外的一切手段去微调pony

flyx3Mar 6, 2024

@Chenkin 其实方法都大差不差（数据量足够大情况下），

隔壁有用animagine微调的，22K的体量自然也就是洒洒水拉~“Yesmix XL”，测试了一下，和animagine XL V3 基本是95%相似度（同参数同步数同lora下，出图基本差别不大）。

大模型体量还是核心（NAI3估计在200M级别），没有A100和H100还是不要碰了

Chenkin

Author

Mar 6, 2024

@flyx3 其实我还真能用A100和H800，H800我在某个平台大概有100小时的GPU时间还没用。不过考虑到A3.1已经在训练了，近期应该不会大规模地微调。近期琥珀青叶有另外一种微调大模型的方式，方式比传统ft可能快个2倍速度，我正在了解。

Chenkin

Author

Mar 6, 2024

@flyx3 哈，我近期确实有微调A3XL的打算（100K人工精选图），不过，考虑到22K这个体量对大模型的影响也有限，我大概会谨慎考虑了。

HITTRAKKZMar 10, 2024· 1 reaction

@Chenkin I'm always looking through fan art, i probably have close to that amount of high resolution images from a bunch of different danbooru artists collected to train multiple different style lora's but haven't gotten around to it since I'm still very new to creating my own lora's but all the images i have are 2d i personally don't like cg or 2.5d looking art so i may be able to help you with that

Chenkin

Author

Mar 10, 2024

@HITTRAKKZ

I think this will be useful for me. I have enough computational power to fine-tune the Pony model, but I need high-quality organized data to display — categorized by artist or character.

https://discord.com/invite/DeGPNXSY

HITTRAKKZMar 10, 2024

@Chenkin i joined your discord, this is my username █⁃🅷𝒾𝓉🆉⁃█

Chenkin

Author

Mar 10, 2024

@HITTRAKKZ

It seems that searching is not possible due to unknown characters in the name. Perhaps we can communicate through private messages by adding each other as friends?

HITTRAKKZMar 10, 2024· 1 reaction

@Chenkin if you search hittrakkz you'll find me, i have the same pfp/avatar as i do on here

create023Mar 6, 2024· 1 reaction

CivitAI

試用過後感覺未來可期，表現力強，主要問題是線條不乾淨、肢體不穩定

Chenkin

Author

Mar 7, 2024

是的，毕竟是是半成品，如果可以多训练几个epoch效果会好很多。（但是pony微调成本太大了）

Chenkin

Author

Mar 7, 2024