diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-06-02 00:01:34 +08:00

Files

MQ 10302496a6 [feat] JoyAI-JoyImage-Edit support (#13444 )

* [feat] JoyAI-JoyImage-Edit support

* [fix] remove rearrange

* [refactor] two pass when do cfg

* [refactor] remove repa, use wantimetextembeding, refactor modulate code

* [refactor] Joyimage Attention refactor

* remove vae tiling and autocast

* [fix] remove einops from setup.py

* [refactor] Refactor JoyImageEditPipeline to use explicit arguments instead of namespace and remove _build_arg

* [fix] remove deprecated method decode_latents

* [refactor] refactor the image pre-processing logic into a separate VaeImageProcessor subclass

* [refactor] add JoyImageAttention to align with Attention + AttnProcessor design and update conversion script for new weight key mapping (e.g. img_attn_qkv -> attn.img_attn_qkv)

* [refactor] simplify bucket logic in JoyImageEditImageProcessor by replacing runtime generation with precomputed lookup tables

* [fix] remove leftover training-only parameters

* [fix] add layerwise casting and fp32 module patterns to JoyImageTransformer3DModel. Reference WanTransformer3DModel to fix layer casting errors during inference.

* [test] add JoyImageEditPipeline fast tests and JoyImageEditTransformer3DModel model tests

* [fix] fix some pipeline args to support batch inference

* [fix] duplicate images to match batch size when fewer images than prompts in JoyImageEditPipeline

* [fix] remove no longer used config parameters

* Apply style fixes

* [fix] remove unused dataclass and rewrite helpers as inline functions

* [fix] make dummy objects for JoyImageEdit

* [fix] allow test_torch_compile_repeated_blocks to pass

* [fix] add examples on JoyImageEditPipeline

* fix code style issues with ruff and black

* Apply style fixes

* [fix] change default num_inference_steps to 40

* [fix] use forward hook to extract pre-norm hidden states for transformers 5.x compatibility

* [fix] change the assert to ValueError in pipeline

* [fix] rename JoyImageTransformer3DModel to JoyImageEditTransformer3DModel, clean up anything about the alias

* [fix] support gradient checkpointing

* [refactor] simplify RoPE utilities, inline helpers, copy WanTimeTextImageEmbedding locally and remove unused parameters

* [fix] remove _get_text_encoder_ckpt and qwen_processor

* [fix] change nn.RMSNorm to FP32LayerNorm

* [fix] small fixes for suggestions given by Claude

* [refactor] build model using from _pretained instead of config

* [refactor] auto-wrap prompt and support text-to-image in JoyImage Edit pipeline

* make style, make quality and make fix-copies

* [test] small fix to use vocab_size=1024

* [refactor] separate encode_prompt_multiple_images from encode_prompt, support prompt_embeds/prompt_embesd_mask/num_images_per_prompt in edit mode

* [test] fix CI: use strict=False for xfail and add @require_torch_accelerator to group offloading test

* [refactor] separate image_latents from latents in prepare_latents to align with flux2

* make style

---------

Co-authored-by: zhangmaoquan.1 <zhangmaoquan.1@jd.com>
Co-authored-by: huangfeice <huangfeice@gmail.com>
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

2026-05-07 10:57:56 -10:00

__init__.py

Fix conversion script

2022-07-15 17:00:41 +00:00

change_naming_configs_and_checkpoints.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

conversion_ldm_uncond.py

[OmegaConf] replace it with yaml (#6488 )

2024-01-15 20:02:10 +05:30

convert_ace_step_to_diffusers.py

Add ACE-Step pipeline for text-to-music generation (#13095 )

2026-04-30 18:30:44 -10:00

convert_amused.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_animatediff_motion_lora_to_diffusers.py

[core] AnimateDiff SparseCtrl (#8897 )

2024-07-26 17:46:05 +05:30

convert_animatediff_motion_module_to_diffusers.py

[Pipeline] AnimateDiff SDXL (#6721 )

2024-05-08 21:27:14 +05:30

convert_animatediff_sparsectrl_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_asymmetric_vqgan_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_aura_flow_to_diffusers.py

[Core] Add AuraFlow (#8796 )

2024-07-11 08:50:19 -10:00

convert_blipdiffusion_to_diffusers.py

Fix style (#10478 )

2025-01-07 11:06:36 +05:30

convert_cogvideox_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_cogview3_to_diffusers.py

[Fix] Syntax error (#10068 )

2024-12-02 11:28:00 +05:30

convert_cogview4_to_diffusers_megatron.py

CogView4 Control Block (#10809 )

2025-03-15 07:15:56 -10:00

convert_cogview4_to_diffusers.py

CogView4 Control Block (#10809 )

2025-03-15 07:15:56 -10:00

convert_consistency_decoder.py

docs: cleanup of runway model (#12503 )

2025-10-17 14:10:50 -07:00

convert_consistency_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_cosmos_to_diffusers.py

Cosmos Transfer2.5 Auto-Regressive Inference Pipeline (#13114 )

2026-02-25 14:42:29 -10:00

convert_dance_diffusion_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_dcae_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_ddpm_original_checkpoint_to_diffusers.py

Ruff: apply same rules as in transformers (#2827 )

2023-03-27 16:18:57 +02:00

convert_diffusers_sdxl_lora_to_webui.py

changed positional parameters to named parameters like in docs (#6905 )

2024-02-08 21:39:03 +05:30

convert_diffusers_to_original_sdxl.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_diffusers_to_original_stable_diffusion.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_dit_to_diffusers.py

Replace flake8 with ruff and update black (#2279 )

2023-02-07 23:46:23 +01:00

convert_flux2_to_diffusers.py

Flux2 klein (#12982 )

2026-01-15 09:10:54 -10:00

convert_flux_to_diffusers.py

Fix typos in docs and comments (#11416 )

2025-04-30 20:30:53 -10:00

convert_flux_xlabs_ipadapter_to_diffusers.py

Support Flux IP Adapter (#10261 )

2024-12-21 17:49:58 +00:00

convert_gligen_to_diffusers.py

Remove torch_dtype in to() to end deprecation (#6886 )

2024-02-08 09:38:57 +05:30

convert_hunyuan_image_to_diffusers.py

HunyuanImage21 (#12333 )

2025-10-23 22:31:12 -10:00

convert_hunyuan_video1_5_to_diffusers.py

[HunyuanVideo1.5] support step-distilled (#12802 )

2025-12-07 21:50:36 -10:00

convert_hunyuan_video_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_hunyuandit_controlnet_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_hunyuandit_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_i2vgen_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_if.py

Update access of configuration attributes (#7343 )

2024-03-18 08:53:29 -10:00

convert_joyimage_edit_to_diffusers.py

[feat] JoyAI-JoyImage-Edit support (#13444 )

2026-05-07 10:57:56 -10:00

convert_k_upscaler_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_kakao_brain_unclip_to_diffusers.py

[Core] move transformer scripts to transformers modules (#6747 )

2024-01-29 22:28:28 +05:30

convert_kandinsky3_unet.py

[@cene555][Kandinsky 3.0] Add Kandinsky 3.0 (#5913 )

2023-11-24 17:46:00 +01:00

convert_kandinsky_to_diffusers.py

[Core] move transformer scripts to transformers modules (#6747 )

2024-01-29 22:28:28 +05:30

convert_ldm_original_checkpoint_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_longcat_audio_dit_to_diffusers.py

[Bugfix] Fix shape mismatch in LongCatAudioDiTTransformer conversion (#13494 )

2026-04-16 16:49:58 -07:00

convert_lora_safetensor_to_diffusers.py

[LoRA test suite] refactor the test suite and cleanse it (#7316 )

2024-03-20 17:13:52 +05:30

convert_ltx2_to_diffusers.py

Add Support for LTX-2.3 Models (#13217 )

2026-03-19 14:58:29 -07:00

convert_ltx_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_lumina_to_diffusers.py

Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827 )

2025-03-13 09:24:21 -10:00

convert_mochi_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_models_diffuser_to_diffusers.py

Ruff: apply same rules as in transformers (#2827 )

2023-03-27 16:18:57 +02:00

convert_ms_text_to_video_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_music_spectrogram_to_diffusers.py

#7535 Update FloatTensor type hints to Tensor (#7883 )

2024-05-10 09:53:31 -10:00

convert_ncsnpp_original_checkpoint_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_omnigen_to_diffusers.py

Add OmniGen (#10148 )

2025-02-12 02:16:38 +05:30

convert_original_audioldm2_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_original_audioldm_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_original_controlnet_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_original_musicldm_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_original_stable_diffusion_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_original_t2i_adapter.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_ovis_image_to_diffusers.py

Add support for Ovis-Image (#12740 )

2025-12-02 11:48:07 -10:00

convert_pixart_alpha_to_diffusers.py

Fix PixArt 256px inference (#6789 )

2024-03-03 10:31:21 +05:30

convert_pixart_sigma_to_diffusers.py

PixArt-Sigma Implementation (#7654 )

2024-04-23 22:33:08 -10:00

convert_prx_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_rae_to_diffusers.py

feat: implement rae autoencoder. (#13046 )

2026-03-05 20:17:14 +05:30

convert_sana_controlnet_to_diffusers.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616 )

2025-11-10 14:47:18 +05:30

convert_sana_to_diffusers.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616 )

2025-11-10 14:47:18 +05:30

convert_sana_video_to_diffusers.py

add ltx2 vae in sana-video; (#13229 )

2026-03-17 18:09:52 -10:00

convert_sd3_controlnet_to_diffusers.py

Sd35 controlnet (#10020 )

2024-11-27 10:44:48 -10:00

convert_sd3_to_diffusers.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616 )

2025-11-10 14:47:18 +05:30

convert_shap_e_to_diffusers.py

Fix typos in docs and comments (#11416 )

2025-04-30 20:30:53 -10:00

convert_skyreelsv2_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_stable_audio.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616 )

2025-11-10 14:47:18 +05:30

convert_stable_cascade_lite.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616 )

2025-11-10 14:47:18 +05:30

convert_stable_cascade.py

fix: correct import path for load_model_dict_into_meta in conversion scripts (#12616 )

2025-11-10 14:47:18 +05:30

convert_stable_diffusion_checkpoint_to_onnx.py

Update more licenses to 2025 (#11746 )

2025-06-19 07:46:01 +05:30

convert_stable_diffusion_controlnet_to_onnx.py

Convert Stable Diffusion ControlNet to TensorRT (#4465 )

2023-08-11 08:12:26 +05:30

convert_stable_diffusion_controlnet_to_tensorrt.py

Convert Stable Diffusion ControlNet to TensorRT (#4465 )

2023-08-11 08:12:26 +05:30

convert_svd_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_tiny_autoencoder_to_diffusers.py

Remove code snippets containing is_safetensors_available() (#4521 )

2023-08-11 11:05:22 +05:30

convert_unclip_txt2img_to_image_variation.py

Replace flake8 with ruff and update black (#2279 )

2023-02-07 23:46:23 +01:00

convert_unidiffuser_to_diffusers.py

[WIP] Refactor UniDiffuser Pipeline and Tests (#4948 )

2023-10-02 18:24:55 +02:00

convert_vae_diff_to_onnx.py

make style

2023-03-06 10:40:18 +00:00

convert_vae_pt_to_diffusers.py

[BUG] Fix convert_vae_pt_to_diffusers bug (#11078 )

2025-04-10 06:59:45 +01:00

convert_versatile_diffusion_to_diffusers.py

[chore] change licensing to 2025 from 2024. (#10615 )

2025-01-20 16:57:27 -10:00

convert_vq_diffusion_to_diffusers.py

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

convert_wan_to_diffusers.py

Sunset Python 3.8 & get rid of explicit typing exports where possible (#12524 )

2026-02-13 18:16:51 +05:30

convert_wuerstchen.py

Fix typos in docs and comments (#11416 )

2025-04-30 20:30:53 -10:00

convert_zero123_to_diffusers.py

Remove dead code and fix f-string issue (#7720 )

2024-05-08 13:15:28 -10:00

extract_lora_from_model.py

[chore] add a script to extract loras from full fine-tuned models (#10631 )

2025-01-24 11:50:36 +05:30

generate_logits.py

Use model_info.id instead of model_info.modelId (#8912 )

2024-07-20 20:01:21 +05:30