ByteDance USO makes subject and style fusion simple and powerful.
Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: one-click results with zero tuning — you still choose inputs, prompts, and settings.
Open preloaded workflow on RunComfy
Open preloaded workflow on RunComfy (browser)
Why RunComfy first
- Fewer missing-node surprises — run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout — useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON — the zip follows the same runnable workflow you can open on RunComfy.
When downloading for local ComfyUI makes sense — you want full control over models on disk, batch scripting, or offline runs.
How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.
Expectations — First run may pull large weights; cloud runs may require a free RunComfy account.
Overview
With this workflow, you can achieve seamless creative generation by combining both subject and style control into a unified process. It allows you to place characters into different scenes while preserving their identity with high accuracy. You can also apply diverse artistic styles from reference images to bring unique visual effects to your work. The system offers a combined mode that merges both subject-driven and style-driven guidance into one setup. It is built to deliver sharp, consistent, and flexible results for creators who value output precision. This makes it ideal for designers, digital artists, and content creators seeking efficient control over their visuals. The workflow streamlines generation tasks so you can achieve professional-grade imagery without a complex setup.
Important nodes:
Key nodes in Comfyui ByteDance USO workflow
USOStyleReference (#56)
Applies a style image to the current model stream using the USO projector and CLIP‑Vision features. Use one reference for a strong, coherent look or chain two for nuanced blends; the second reference refines the first. If the style dominates too much, try a single, cleaner reference or simplify its content.
ReferenceLatent (#44)
Injects the encoded subject latent into the conditioning path so ByteDance USO preserves identity. Works best with uncluttered identity photos that clearly show the character’s face or defining features. If identity slips, feed a more complete reference or reduce conflicting style cues.
FluxKontextMultiReferenceLatentMethod (#41)
Combines multiple reference signals within the FLUX context pathway. This is where subject and prompt context are balanced before sampling. If results feel over‑constrained, relax references; if they drift, strengthen subject imagery or simplify the prompt.
FluxGuidance (#35)
Controls the strength of text guidance relative to reference signals. Lower values let subject/style lead; higher values enforce the prompt more strongly. Adjust when you see either prompt underfitting (raise guidance) or style/subject being overridden (lower guidance).
ImageScaleToMaxDimension (#109)
Prepares the identity image for stable feature extraction. Smaller max sizes favor broader composition; larger sizes help when the reference is a tight portrait and you need crisper identity cues. Tune based on whether your subject ref is full‑body or a headshot.
EasyCache (#95)
Speeds up inference by reusing intermediate states when changes are minor. Great for prompt tweaks and rapid iteration, but it can slightly reduce micro‑details. Disable it for final, highest‑quality renders.
KSampler (#31)
Runs the diffusion steps and controls stochasticity via seed and sampler choice. Increase steps for more detail, or lock the seed to reproduce a look while changing references. If textures look noisy, try a different sampler or fewer steps with stronger style guidance.
Notes
ByteDance USO in ComfyUI | Unified Style and Subject Workflow — see RunComfy page for the latest node requirements.
Description
Initial release — ByteDance-USO.