CivArchive

    The latest version of the workflow and nodes is here:

    https://github.com/peterkickasspeter-civit/ComfyUI-ZImageTurboProgressiveLockedUpscale/tree/experiment

    =====

    Custom node: https://github.com/peterkickasspeter-civit/ComfyUI-ZImageTurboProgressiveLockedUpscale

    Workflow is in the showcase images. More details here - https://www.reddit.com/r/StableDiffusion/comments/1r1qyj2/zimageturboprogressivelockedupscale_works_with_z/

    Distill lora for base - https://huggingface.co/alibaba-pai/Z-Image-Fun-Lora-Distill/tree/main
    *use either Z-Image-Fun-Lora-Distill-8-Steps.safetensors or Z-Image-Fun-Lora-Distill-4-Steps-2602-ComfyUI.safetensors

    Description

    GPT4o Prompt:

    I am planning to train a LoRA for the Stable Diffusion text-to-image model, which uses the T5XXL transformer in its architecture. The prompts should be in natural language and follow a specific format. I will upload images and need you to help me create detailed prompts based on those images. The prompts should start with "Amateur photography of" and end with "on flickr in 2007, 2005 blog, 2007 blog." Always give me the prompt in a single paragraph.
    The format should be:
    Subject Description: Start by describing all the people in the image in detail. It is very important to include their race and ethnicity, physical attributes (such as height, build, skin tone, and hair color), facial features, attire, and any expressions or poses they are making. Be as specific as possible. Make sure to always include the build of the subjects (e.g., plus size, slim, petite) without missing it.
    Scene Description: Accurately convey what exactly the people are doing in the picture. Describe the setting, background elements, any objects they are interacting with, and the overall environment (urban, rural, indoor, outdoor, etc.).
    Image Quality Tags: Include descriptive tags that highlight the quality of the image. Use terms like slight motion blur, cluttered background, warm tones, bright natural light, high contrast, vivid colors, etc. These tags should reflect the mood and feel of the image as well.
    The final output should combine all these elements into a cohesive, detailed prompt that accurately reflects the image.
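As a rough illustration of the caption format described above, the following Python sketch assembles the three parts into a single paragraph with the required opening and closing phrases. The function name `build_caption` and its parameters are hypothetical and not part of the author's workflow; the author used GPT4o to write the captions, so this only shows the target shape of the output.

```python
# Hypothetical helper: illustrates the caption layout only, not the author's tooling.
PREFIX = "Amateur photography of"
SUFFIX = "on flickr in 2007, 2005 blog, 2007 blog."


def build_caption(subject: str, scene: str, quality_tags: list[str]) -> str:
    """Join subject, scene, and quality tags into one single-paragraph caption."""
    # Drop trailing punctuation so the parts join cleanly with commas.
    parts = [subject.strip().rstrip(".,"), scene.strip().rstrip(".,")]
    if quality_tags:
        parts.append(", ".join(tag.strip() for tag in quality_tags))
    body = ", ".join(part for part in parts if part)
    # Collapse all whitespace (including newlines) so the result is one paragraph.
    return " ".join(f"{PREFIX} {body} {SUFFIX}".split())


caption = build_caption(
    "a slim woman with brown hair",
    "sitting on a park bench",
    ["warm tones", "slight motion blur"],
)
print(caption)
```

The fixed prefix and suffix act as the trigger phrases, so keeping them as constants avoids accidental variation between captions.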

    Comments (32)

    EKKIVOKAug 22, 2024· 9 reactions
    CivitAI

    the most useful lora on CivitAI. by far !

    BNP222222Aug 23, 2024· 1 reaction
    CivitAI
    Hello, first of all, thank you for your great work. It seems that you are using Forge, but after I downloaded your image, I can't read the metadata in Forge. Would you mind uploading the image with the metadata in it so that I can reproduce it? Thx!
    peterkickasspeter
    Author
    Aug 23, 2024

    which picture metadata do you want? i dont want to delete and reupload. i will just give them here

    BNP222222Aug 24, 2024

    white bear PLZ :-)

    peterkickasspeter
    Author
    Aug 24, 2024

    @BNP222222 pls click on the image. someone asked it yesterday. i already posted it

    Archi_teknik101Aug 23, 2024· 3 reactions
    CivitAI

    fantastic lora, bringing realism one step closer, great job

    yoloswagg45Aug 23, 2024· 5 reactions
    CivitAI

    I am using Forge with dev-fp8, which uses around 12GB of VRAM. With this LoRA, VRAM usage maxes out at 16GB and generation basically comes to a halt. Why does a 300MB file make VRAM usage so high? Is this a bug in Forge? I haven't tried it in ComfyUI.

    960x1280 using a 4070 ti super (16GB)

    KotoshkoAug 23, 2024

    @yoloswagg45 Most likely it's still a Forge bug; I can't get LoRAs to work stably there. In Comfy, LoRAs also take longer to load, so image generation takes more time.

    rlewisfr346Aug 23, 2024· 3 reactions

    Make sure your "Diffusion in Low Bits" option is set to "Automatic (fp16 LoRA)".

    yoloswagg45Aug 24, 2024

    @rlewisfr346 Ok, figured it out, let's see if it works. This made a huge difference, thank you. :)

    dal_macAug 24, 2024
    CivitAI

    Captioned with Joy_caption? which LLM? did you purposely omit anything from the captions (style words)?

    peterkickasspeter
    Author
    Aug 24, 2024

    I used gpt4o. No I did not omit anything. See my post in stable diffusion subreddit for the prompt. What is joy caption?

    dal_macAug 24, 2024

    @peterkickasspeter 
    Thank you.
    Joy_caption:
    https://www.reddit.com/r/comfyui/comments/1ez78zl/natural_language_image_captioning_workflow_for/

    Uses Llama 3.1 8B. pretty nice for batch captioning and keeping workflow within comfy

    peterkickasspeter
    Author
    Aug 24, 2024

    @dal_mac I just tried it. I feel like gpt4o catches the tiny details in the image that make a huge difference when generating with Flux. But it's good though.

    defnotarobotAug 24, 2024· 4 reactions
    CivitAI

    My article for version 2: https://civitai.com/articles/6897

    peterkickasspeter
    Author
    Aug 24, 2024· 1 reaction

    weight 1 is terrible bro hahahaha. but thanks so much for the tests. your tests are more detailed than mine tbh

    defnotarobotAug 24, 2024

    @peterkickasspeter Thanks for training them.

    RudyBagaAug 25, 2024· 3 reactions
    CivitAI

    This LoRA is terrific! I also want to apologize to the creator. I left a previous comment that was incorrect because of a mistake I made in my generation. The issues I noted indeed seem to have been largely resolved in the new version.

    I appreciate them responding to me and giving me the opportunity to double check myself. Kudos to them for all their hard work!

    peterkickasspeter
    Author
    Aug 25, 2024

    No problem. Glad you liked it

    gx_ground136Aug 26, 2024· 1 reaction
    CivitAI

    great lora, needs an nsfw version to increase skin detail

    peterkickasspeter
    Author
    Aug 27, 2024· 1 reaction

    I don't know. I don't want to mix NSFW images into the dataset I am using to train this specific LoRA. Maybe try using it with other LoRAs for skin texture and see if that works out for you. See @defnotarobot's article.

    defnotarobotAug 26, 2024· 1 reaction
    CivitAI

    My Article about mixing this LoRA with another: https://civitai.com/articles/6992

    peterkickasspeter
    Author
    Aug 26, 2024· 1 reaction

    I appreciate the testing, thanks so much. Would you be able to test it using the kind of prompts in https://www.reddit.com/r/StableDiffusion/comments/1eywnv8/comment/ljgjtw2/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button? I feel like some of the images are not utilizing the LoRA (they don't have that everyday look to them, although they look amateurish because of the high LoRA weights. Perhaps it's because none of the trigger words are in the prompt?)

    defnotarobotAug 26, 2024· 1 reaction

    @peterkickasspeter I did notice the vast majority of the prompts looked less-amateur even if they did look photorealistic. I will get back to this soon.

    peterkickasspeter
    Author
    Aug 26, 2024· 1 reaction

    Take a look at this - https://civitai.com/images/26281196
    I just gave your image to gpt4o for the prompt

    defnotarobotAug 26, 2024

    @peterkickasspeter That is wild. Thanks for the tip.

    GodAlMightyAug 29, 2024· 9 reactions
    CivitAI

    God approves this Lora. God was an amateur too once. Especially when he created humanity. Keep up the good work and you'll surely get eternal life.

    ailu91Aug 30, 2024

    Just give me a RTX9090 with 2TB VRAM and let me die in peace

    GodAlMightyAug 30, 2024

    @ailu91 Get a good job and i will provide it for you. Now say thank you God.

    yessy_boemAug 31, 2024· 1 reaction
    CivitAI

    so if i understand correctly, V1 is obsolete, right? or are both versions just different? or should i perhaps combine them at lower strength?

    peterkickasspeter
    Author
    Aug 31, 2024· 1 reaction

    V1 captioning is not so good. V2 and V3 have gpt4o captions. So use one of them.

    yessy_boemSep 2, 2024

    @peterkickasspeter k ^^