Important notice: This model was designed for Text2Image. I do not know how well it works for video generation.
Latest page update: Released model.
My recommended local WAN2.1 Text2Image ComfyUI workflow for maximum quality:
Trigger (must include in prompt): early 2010s snapshot photo captured with a phone and uploaded to facebook, featuring dynamic natural lighting, and a neutral white color balance with washed out colors
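Since the trigger has to appear in every prompt, a small helper can add it automatically. This is a minimal sketch, not part of the author's workflow: the function name `with_trigger` is hypothetical, and placing the trigger at the start of the prompt is an assumption (the page only says it must be included).

```python
# Trigger phrase copied verbatim from the model page.
TRIGGER = (
    "early 2010s snapshot photo captured with a phone and uploaded to facebook, "
    "featuring dynamic natural lighting, and a neutral white color balance "
    "with washed out colors"
)


def with_trigger(prompt: str, trigger: str = TRIGGER) -> str:
    """Return the prompt with the trigger prepended, unless it is already present.

    Hypothetical helper: prepending (rather than appending) is an assumption.
    """
    prompt = prompt.strip()
    if trigger.lower() in prompt.lower():
        return prompt  # trigger already included, leave the prompt untouched
    if not prompt:
        return trigger
    return f"{trigger}, {prompt}"


full_prompt = with_trigger("a ferris wheel at dusk")
```

The resulting `full_prompt` string can then be pasted into the positive prompt field of whichever workflow you use.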
Please note that the online generator will NOT give you results as good as running the model locally in ComfyUI, because it lacks the recommended sampler and scheduler and the other customization options provided by my recommended local ComfyUI workflow!
Donations: If you want to support me financially, you can donate here: https://ko-fi.com/aicharacters
Latest version notes:
Initial release, no further updates planned for now
LoRa description:
This LoRa model was designed to generate images that look like casual snapshot photos taken with a smartphone.
Disclaimer:
I do not claim to be the current owner, original creator of, person or legal representative of a person depicted in, any of the concepts that my models, including this one, aim to emulate nor the training data used to train this model. I do not claim that any of my models have received an endorsement by those respective individuals. I also do not claim that the emulation attempted by my models represents a 100% accurate depiction of the original concept in question nor do I claim that the quality of my emulation reaches the same quality of the original concept. All credit goes to the respective current owners, original creators, or people and I encourage you to support them in any way you can.
If you are the current owner, original creator of, person or legal representative of a person depicted in, any of the concepts that any of my models aims to emulate and would like that specific model to be removed, then please leave me a private message here or on Reddit with proof of authenticity of your identity and claim and I will remove it.
Additionally, I do not endorse my models being used to violate the law or to commit acts that are still legal but immoral, such as spreading misinformation using deepfakes.
Comments (36)
This is insanely good! Thank you for your commitment. Works like a charm even with videos. Comparing it to generations without the LoRA, it feels like a game-changer. Too bad the sampler/scheduler combination takes away the speed boost I got used to. Never mind, the quality makes up for it.
Interesting, can you show a video example from using this LoRa?
Can you please post some comparisons of with and without using this Lora?
Please upload your others, this is brilliant work.
Half of them are done so far.
Is this Wan2.1 Text2Image only a ComfyUI thing? Can I do it on WanGP2? This is my first time learning about this, very interesting.
It's just setting the length to 1, i.e. generating a single frame, which is an image.
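For anyone scripting this rather than editing the graph by hand, the same idea can be sketched in Python. This assumes the workflow has been exported in ComfyUI's API JSON format, where each node is an entry with `class_type` and `inputs`. The function name `force_single_frame`, the node IDs, the node type, and the resolution below are illustrative assumptions, not taken from the author's workflow.

```python
import copy


def force_single_frame(workflow: dict) -> dict:
    """Return a copy of an API-format workflow with literal `length` inputs set to 1.

    Hypothetical helper: it assumes the frame count is stored under an
    input named `length`, which may differ for other node types.
    """
    patched = copy.deepcopy(workflow)
    for node in patched.values():
        if not isinstance(node, dict):
            continue
        inputs = node.get("inputs", {})
        # Only overwrite literal integers; linked inputs are lists like ["12", 0].
        if isinstance(inputs.get("length"), int):
            inputs["length"] = 1
    return patched


# Hypothetical two-node excerpt, for illustration only.
example = {
    "5": {
        "class_type": "EmptyHunyuanLatentVideo",  # assumed latent node type
        "inputs": {"width": 1088, "height": 1920, "length": 81, "batch_size": 1},
    },
    "6": {
        "class_type": "CLIPTextEncode",
        "inputs": {"text": "a ferris wheel", "clip": ["2", 0]},
    },
}

single = force_single_frame(example)
```

The original dictionary is left unchanged, so the video version of the workflow can still be queued alongside the single-frame one.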
This is really interesting, training loras for single images on a video model. I've been using Wan a lot, and it's become very clear to me that Wan understands how to make a coherent image better than any image generation model I've ever used.
It has weird transitions in animation sometimes, but it seems like all the training on video has helped it to understand visuals so much more deeply than training on still images. Like, of course it would know how to make a human body better if it also had to know it well enough to change the viewing angle at every single degree of rotation, and the bend of a limb at every single degree of bend, also viewed from every angle.
Wan actually feels like it knows what arms and legs do, and when it makes horrors, it's still using those parts with a higher level of coherence and logic. It seems like when it gets it wrong, it's merely getting confused about what you want, rather than being confused about the object it's showing, unlike models like Flux or SDXL, which you can tell can't really understand anything about the pose it tries to do, or how arms/legs work with that pose.
Anyway, this is an interesting idea, and I might start trying to use it this way.
it's merely getting confused about what you want
That moment when you feel the gguf is alive.
legit, creepy feel good vibes.
Broken workflow. Even with all the pre-requisites installed, the link between image saver and scheduler fails, even though they're connected properly. Even tried changing image saver versions. Tried to fix manually, but ferris wheel image did not match at all with the same settings.
But my linked workflow doesn't even include image saver. Only my samples do, because I need it to provide metadata for CivitAI. But the one I linked for everyone else to use is stripped down. Try that one.
nice work
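I'd like to know how this should be trained. I have a large dataset, but I don't know how to train on it. If you could tell me how, I would be very grateful!!!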
This is actually pretty amazing - I had no idea Wan could generate such excellent hi-res stills, but this works really well. Thanks :)
Awesome stuff! Did you share your LoRA training config for Wan t2v anywhere yet?
I will at some point, but it's 1 to 1 my FLUX config except where I had to change it because Musubi-trainer lacks those options.
Very cool, thx for sharing! Could text2image also work with Wan character Loras? Would love to try it with your Aloy Lora
The word "phone" in your Trigger sentence adds a phone to the image. It is more effective to have a meaningless collection of letters as a Trigger word.
I have never had that issue and no one has ever reported that kind of issue before, so I don't see a reason to change the trigger word atm, sry. It is worded in that way for a good reason. If a meaningless combination of tokens were more effective in this case I would have already used one, as I have done with my other LoRas already.
Create a prompt with items on a desk and you'll find a phone among the items.
lykiote Okay but I don't see the issue with that. If you want specific items to appear, then write them out instead of saying generic "items". I cannot swap my trigger words for ones that function much worse across the board just because of such a minor issue.
Did you maybe use the instagirl LoRA in addition? Because it also added a smartphone in a girl's hand when combining them :D
marqs89 That LoRa is newer than this comment chain.
marqs89 No I did not use that particular lora. I solved it by leaving the trigger word out altogether. It seems that trigger words don't matter that much using WAN; it looks the same with or without the trigger word.
lykiote I find that there is a substantial difference between using one or not. That being said, I did notice the issue you are talking about in WAN2.2 now (but still not WAN2.1), so I am looking into fixing that now.
lykiote Try this new version, that should fix the issue: [WAN2.2] Smartphone Snapshot Photo Reality [STYLE] - v3.0 [WAN2.2]-High-noise | Wan Video 14B t2v LoRA | Civitai
Where do you get the bong_tangent scheduler? And the res_2s sampler? Mine gives an error when I try to use it, and it's not selectable in the list. It's only in your workflow.
Read the notes they carefully attached to the workflow. It tells you to install the Res4Lyf custom node.
@doxelom321899 Ah, there it is. Thank you.
How is this LoRA so understated? For image generation, the quality is like that of the next version of the model. Truly stunning.
Can you do an inpainting-at-full-resolution workflow with this? Pleeeease, thank you :D
Btw, I have one working with Flux if you want it as a starting point! When I mask a region, it lets me inpaint at a given resolution on the selected mask area, which is amazing for fixing faces, for example, and gives amazing quality.
With Wan 2.1 for t2i I was able to get consistently good complex interactions and motion, and with this LoRA I was able to get back natural-looking images. Both the people it invents and the camera angles it chooses don't feel like they were made with AI. I can't believe both of these problems got resolved at one time.
This is a must have if you want the highest quality image!
Yes it does work for video generation and it's pretty awesome! I used it with Wan 2.2
Details
Files
WAN2.1_SmartphoneSnapshotPhotoReality_v1_by-AI_Characters.safetensors
Mirrors
WAN2.1_SmartphoneSnapshotPhotoReality_v1_by-AI_Characters.safetensors
wan21-t2v-14b-realism.safetensors