xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
PythonApache-2.0
Pinned issues
Issues
- 0
RoadMap and Looking for Contributions
#213 opened by feifeibear - 0
parallel bug for CogVideoX
#295 opened by oahzxl - 2
- 4
Model flux fail
#282 opened by gty111 - 3
latte模型的pos_embed没有被正确初始化
#255 opened by weyeeji - 1
Add AuraFlow support
#291 opened by isidentical - 0
Parallel VAE fails when the degree of parallelism > 1
#271 opened by gty111 - 1
num_frames is not defined for latte
#283 opened by littletomatodonkey - 0
hunyuanDiT PipeFusion=8 on L40
#284 opened by feifeibear - 2
Flux.1 needs diffusers built from source code
#267 opened by feifeibear - 0
profile support
#272 opened by kuangdao - 0
- 1
- 1
变量is_dp_last_group未定义
#256 opened by weyeeji - 2
FLUX with SP 并行生成图像差异
#262 opened by lixiang007666 - 4
ValueError: Attention Processor class CogVideoXAttnProcessor2_0 is not supported by xFuser
#254 opened by weyeeji - 4
运行CogVideoX-2b模型显示缺少bin文件
#253 opened by weyeeji - 0
Flux sp=2 errors
#252 opened by feifeibear - 1
- 3
Why Latte Model can‘t support PipeFusion
#225 opened by philipwan - 2
TypeError: _flash_attn_forward() missing 1 required positional argument: 'softcap'
#217 opened by Pumbaa-peng - 2
Flux Speed Up with PipeFusion
#230 opened by mali-afridi - 1
- 4
RTX 4090, flux model, out of memory; approach is not compatible with quantization
#218 opened by csdY123 - 0
Running with torch.compile on Flux with ulysses_degree=2 results in incorrect outcomes.
#220 opened by feifeibear - 1
Image Difference for SD3 in the refactored code, when pipefusion_parallel_degree changes
#163 opened by khadijairfan2345 - 2
[feature] Loras support for SD and Flux models
#192 opened by tsubasakong - 0
为什么没有像Megatron一样,使用batch_isend_irecv做p2p通信?
#132 opened by taozhiwei - 3
[SERVER] Can give a http interface example?
#185 opened by senlyu163 - 2
- 1
How to set the offline download model path
#189 opened by lonngxiang - 0
[bug] latte output bug when sp=8
#206 opened by dannyxiaocn - 1
Flux Sequence Parallel Poor Scalability Issue
#204 opened by feifeibear - 1
RTX 4090, flux model, out of memory
#195 opened by csdY123 - 7
Cannot allocate memory
#190 opened by lonngxiang - 3
- 10
Thoughts to implement this on ODD number of GPUs?
#139 opened by alivisionrd - 12
Benchmark running error?
#154 opened by tensorflowt - 0
[feature] runtime check diffusers version
#170 opened by feifeibear - 1
Why this is inference only but not for training?
#164 opened by spacegoing - 0
add pipefusers to pypi
#133 opened by feifeibear - 0
Process stuck after all timesteps are finished
#147 opened by Eigensystem - 0
The best communication pattern for Sequence Parallel(SP)+PipeFusion(PP) hybrid parallelism
#146 opened by feifeibear - 1
- 1
- 3
# of parameters on each device
#135 opened by wonkyoc - 0
HunyuanDiT 2GPUs PipeFusion Bugs
#121 opened by feifeibear - 2
Adding support for pixart-sigma model?
#115 opened by foreverpiano - 0
HunyuanDiT pipefusion use_split_batch True errors
#122 opened by feifeibear - 7