Issues
- 0
Request for external caption files
#52 opened by mmderakhshani - 5
- 10
Training Loss Values.
#50 opened by mmderakhshani - 1
关于Loss的疑问
#51 opened by tulvgengenr - 4
Import BUG
#37 opened by Masaaki-75 - 8
代码中会把t2i、llm、mmu三部分的数据集混合起来训练
#48 opened by sherlockma11 - 2
Small bug in Showo.mmu_generate?
#49 opened by CladernyJorn - 0
代码中会把t2i、llm、mmu三部分的数据集混合起来训练?
#47 opened by sherlockma11 - 1
生成只能用magvit吗
#46 opened by sherlockma11 - 4
- 1
About checkpoints to be used by finetune
#44 opened by trmzpi02 - 2
- 4
About multimodal sequence input
#38 opened by tulvgengenr - 2
FileNotFoundError
#41 opened by Hannieliao - 1
二维码过期啦~!
#40 opened by Strike1999 - 6
- 0
- 2
Generation inference with interleaved input
#35 opened by ys-zong - 1
Evaluation on NLP tasks and training time
#36 opened by KebinWu - 3
Question about SHOW-O's CLIP version
#34 opened by hills-code - 1
Questions about generation quality
#33 opened by xizaoqu - 0
multimodal input -> image output
#28 opened by Redtides0 - 1
Omni-Attention Implementation
#32 opened by ChocoWu - 4
这个diffusion在哪里体现的??
#4 opened by Robootx - 3
- 2
- 6
- 3
Dataset Preparation Script
#29 opened by mmderakhshani - 4
Keyframes generation inference code
#11 opened by qqphung - 1
微信二维码过期了,麻烦更新下,谢谢
#26 opened by tgyy1995 - 1
图像生成推理问题
#25 opened by william-ljz - 1
What are the meaning of special tokens
#24 opened by Doctor-James - 3
ImportError: cannot import name 'SAFE_WEIGHTS_INDEX_NAME' from 'diffusers.utils'
#23 opened by kurolykin - 4
Question about the training of MAGVIT-v2
#22 opened by RobertLuo1 - 1
The reason of continuous feature is better than discrete feature is before the codebook size is small?
#19 opened by dongzhuoyao - 2
No module named 'parquet.parquet_dataset'
#21 opened by mrswang1 - 0
No module named 'parquet.parquet_dataset'
#20 opened by mrswang1 - 4
FlexAttention example (for `mmu_vit` mask)
#8 opened by Chillee - 6
- 1
Comparison with Transfusion
#12 opened by NickGao96 - 1
- 1
- 1
runtime error
#10 opened by junwenxiong - 1
Can Flash Attention be used?
#7 opened by wusize - 2
GPU
#5 opened by yuzhongruicn