YBYBZhang/ControlVideo
[ICLR 2024] Official PyTorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Python · MIT
Issues
Why not use conv3d directly
#35 opened by EveningLin - 1
pretrained file
#34 opened by xua222 - 4
About the evaluation data
#29 opened by FatLong666 - 2
First frame conditioning possible?
#33 opened by ivaniliash - 1
Combination with IP-Adapter
#32 opened by pikeyang - 0
Evaluation Question: 125 Video-prompt Pairs
#27 opened by mikolez - 2
Is the same frame noise important?
#30 opened by CHNxindong - 1
Pure text generation
#28 opened by QMME - 2
Typo in NEG_prompt
#25 opened by dhruvhacks - 3
questions about loading ckpt
#24 opened by koalaaaaaaaaa - 6
problem with triton
#15 opened by gonduras - 0
ValueError: Calling CLIPTokenizer.from_pretrained() with the path to a single file or url is not supported for this tokenizer. Use a model identifier or the path to a directory instead.
#23 opened by Rkunique - 2
ValueError: mutable default <class 'timm.models.maxxvit.MaxxVitConvCfg'> for fie
#20 opened by KaiJia2017 - 3
ERROR: Could not find a version that satisfies the requirement clip==1.0 (from versions: none) ERROR: No matching distribution found for clip==1.0
#19 opened by KaiJia2017 - 1
CUDA out of memory
#18 opened by Daybreak-Zheng - 9
Single Character
#17 opened by p0mad - 3
The idea of full-frames attention shares a great similarity with the spatial-temporal modeling approach used in Vid2Vid-Zeros.
#11 opened by jinxixiang - 2
Guide to running on Kaggle or Google Colab
#16 opened by sadath-12 - 5
no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory checkpoints/stable-diffusion-v1-5
#5 opened by didoll-john - 3
RuntimeError: The size of tensor a (24) must match the size of tensor b (12) at non-singleton dimension 2
#14 opened by liuxz-cs - 1
A question for the experts
#12 opened by xxxiaosong - 1
The size of tensor a (30) must match the size of tensor b (16) at non-singleton dimension 2
#9 opened by Laidawang - 1
Must the input include a source video?
#10 opened by fistyee - 1
smoother_step
#8 opened by dlutzzw - 2
ask for help
#4 opened by zcdliuwei - 1
Excellent work
#2 opened by 1171000410 - 1
Is an 11GB GPU the minimum requirement?
#1 opened by Njasa2k - 1
The prompt in the README seems wrong
#3 opened by thuwzy