qinziqiao's Stars
tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
eriklindernoren/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
facebookresearch/AugLy
A data augmentations library for audio, image, text, and video.
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
VITA-MLLM/VITA
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
lucidrains/byol-pytorch
Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch
albanie/convnet-burden
Memory consumption and FLOP count estimates for convnets
YBYBZhang/ControlVideo
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
lzx1413/PytorchSSD
pytorch version of SSD and it's enhanced methods such as RFBSSD,FSSD and RefineDet
ahong007007/awesomeCV
记录state-of-art计算机视觉相关论文。
LieluoboAi/radish
C++ model train&inference framework
dvornikita/context_aug
Context-driven data augmentation for Object Detection (ECCV'18)
lucidrains/mirasol-pytorch
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
shenwei356/ClipboardTextJoiner
Monitoring system clipboard change and joining multi-line text. It's very useful when copying multi-line text from PDF files.
sun-xl/ISC2021
sahaRatul/HarmonySearch
MATLAB Implementation of Harmony Search, a nature inspired optimization algorithm
cy5211/MobileNet-Caffe
Caffe Implementation of Google's MobileNets (v1 and v2)