Qwen Image
Qwen Image — 20B parameter text-to-image generation model by Qwen (Alibaba).
mods install qwen-image 61 models for FLUX, Wan 2.2, LTX-Video, Stable Diffusion & more.
Install any model with mods install <name>
Qwen Image — 20B parameter text-to-image generation model by Qwen (Alibaba).
mods install qwen-image FLUX.2 Dev — next-generation image model from Black Forest Labs.
mods install flux2-dev mods install flux2-klein-4b FLUX.2 Klein 9B — mid-size model in the FLUX.2 Klein family.
mods install flux2-klein-9b AI-powered image editing model based on Qwen 2.5 VL architecture.
mods install qwen-image-edit Distillation LoRA for Qwen Image Edit that reduces inference from ~50 steps to 4-8 steps.
mods install qwen-image-edit-lightning Mistral 3 Small text encoder (bf16) for FLUX.2 Dev.
mods install flux2-mistral-text-encoder mods install flux2-qwen3-4b-text-encoder Qwen 3 8B text encoder (fp8 mixed) for FLUX.2 Klein 9B models.
mods install flux2-qwen3-8b-text-encoder mods install flux2-vae Qwen 2.5 Vision-Language 7B text/vision encoder for Qwen Image Edit.
mods install qwen-image-clip mods install qwen-image-vae FLUX.1 Kontext Dev — 12B parameter image editing model by Black Forest Labs.
mods install flux-kontext-dev LTX-2 19B parameter video generation model. Supports text-to-video,
mods install ltx-2-19b Embeddings connector / text encoder for LTX-2 19B video model.
mods install ltx-2-text-encoder mods install ltx-2-vae LTX-Video 13B parameter model (v0.9.8). Higher quality than the 2B version.
mods install ltx-video-13b UMT5-XXL multilingual text encoder for Wan 2.1/2.2 video generation models.
mods install umt5-xxl Wan 2.1 VACE 14B — Video All-in-one Control & Edit model.
mods install wan21-vae Wan 2.2 14B image-to-video model (high-noise expert). Converts static images
mods install wan22-i2v-high-noise-14b Low-noise expert for Wan 2.2 14B image-to-video MoE architecture.
mods install wan22-i2v-low-noise-14b Wan 2.2 14B text-to-video model (high-noise expert). Uses MoE architecture
mods install wan22-t2v-high-noise-14b Low-noise expert for Wan 2.2 14B text-to-video MoE architecture.
mods install wan22-t2v-low-noise-14b Wan 2.2 hybrid text-to-video and image-to-video 5B model. Fits on 8GB VRAM
mods install wan22-vae Efficient 2B parameter video generation model by Lightricks. Fast inference,
mods install ltx-video-2b mods install flux-dev mods install flux-schnell FLUX.1 Canny Dev — canny edge conditioned generation model on FLUX architecture.
mods install flux-canny-dev Depth-conditioned ControlNet for FLUX.1 Dev. Allows generating images
mods install flux-depth-controlnet FLUX.1 Fill Dev — inpainting and outpainting model based on FLUX architecture.
mods install flux-fill-dev FLUX.1 Redux Dev — image variation adapter for FLUX. Takes image input and
mods install flux-redux-dev CLIP-L (Large) text encoder. Used as secondary text encoder by FLUX models
mods install clip-l mods install flux-vae T5-XXL text encoder in fp16 precision. Required by FLUX models
mods install t5-xxl-fp16 T5-XXL text encoder quantized to fp8 precision. Uses half the VRAM
mods install t5-xxl-fp8 mods install ip-adapter-faceid-sdxl IP-Adapter for SDXL using ViT-H image encoder. Enables image-prompted
mods install ip-adapter-sdxl mods install lcm-lora-sd15 mods install lcm-lora-sdxl Stable Diffusion XL base model. High resolution text-to-image generation
mods install sdxl-base-1.0 ControlNet trained on SDXL for Canny edge detection conditioning.
mods install sdxl-controlnet-canny mods install sdxl-refiner-1.0 SDXL Turbo — distilled from SDXL 1.0 using Adversarial Diffusion Distillation.
mods install sdxl-turbo High-quality background removal / image segmentation model.
mods install birefnet-dis mods install 4x-ultrasharp mods install realesrgan-x4plus The original Stable Diffusion 1.5. Lightweight, fast, huge ecosystem
mods install sd-1.5 Stable Diffusion 2.1, fine-tuned from SD 2.0 with improved aesthetics.
mods install sd-2.1 mods install sd-vae-ft-mse ControlNet v1.1 for SD 1.5 — Canny edge detection conditioned generation.
mods install sd15-controlnet-canny ControlNet v1.1 for SD 1.5 — Depth map conditioned generation.
mods install sd15-controlnet-depth ControlNet v1.1 for SD 1.5 — OpenPose body pose conditioned generation.
mods install sd15-controlnet-openpose mods install sdxl-vae-fp16-fix mods install bsrganx2 Qwen 3 4B text encoder used by Z-Image-Turbo.
mods install z-image-text-encoder Distilled 6B parameter text-to-image model from Alibaba Tongyi Lab.
mods install z-image-turbo mods install z-image-turbo-controlnet-union mods install z-image-turbo-distill-lora mods install z-image-vae