stable-diffusion-xl-1.0-inpainting-0.1 | Model | 43/100 via “mask-aware latent concatenation for region-preserving inpainting”
Text-to-image model. 235,004 downloads.
Unique: Concatenates the original latent directly to the UNet input instead of routing it through a separate masking network. This reduces model complexity and lets the encoded original latent be reused across multiple inpainting runs. Mask blending occurs in latent space at each diffusion step, which yields smooth transitions at the mask boundary without post-processing.
vs others: Direct latent concatenation is simpler and faster than a separate masking network (such as those used in some proprietary inpainting models). Boundary quality is comparable or better because the original latent is preserved throughout the entire diffusion process rather than blended in only at the end.
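
Example: the concatenation and per-step blending described above can be sketched in a few lines of NumPy. This is a minimal illustration and not the model's actual code. The 4 + 1 + 4 channel layout (noisy latent, mask, masked-image latent), the 8x8 latent size, and the helper names `build_unet_input` and `blend_step` are assumptions made for the example. A real pipeline would obtain the masked-image latent by encoding the masked image with the VAE and would resize the mask to latent resolution; here the mask is simply multiplied into the latent.

```python
import numpy as np


def build_unet_input(noisy_latent, mask, masked_image_latent):
    """Concatenate along the channel axis (assumed layout: 4 + 1 + 4 = 9)."""
    return np.concatenate([noisy_latent, mask, masked_image_latent], axis=1)


def blend_step(denoised_latent, original_latent, mask):
    """Keep the original latent where mask == 0 and the new latent where mask == 1."""
    return mask * denoised_latent + (1.0 - mask) * original_latent


rng = np.random.default_rng(0)

# Stand-ins for real latents, shaped (batch, channels, height, width).
original_latent = rng.standard_normal((1, 4, 8, 8))
noisy_latent = rng.standard_normal((1, 4, 8, 8))

# Mask at latent resolution: 1 marks the region to repaint, 0 the region to keep.
mask = np.zeros((1, 1, 8, 8))
mask[:, :, 2:6, 2:6] = 1.0

# Simplification: zero out the repaint region instead of VAE-encoding a masked image.
masked_image_latent = original_latent * (1.0 - mask)

unet_input = build_unet_input(noisy_latent, mask, masked_image_latent)
print(unet_input.shape)  # (1, 9, 8, 8)

# Blending as it would be applied after one denoising step; the single-channel
# mask broadcasts across the four latent channels.
blended = blend_step(noisy_latent, original_latent, mask)
```

Because the blend is applied to latents, pixels outside the mask are restored from the original latent at every step, so only the masked region is free to change.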