AntonotnaWang

Maybe there was something we could have done.

The University of Hong KongHong Kong

AntonotnaWang's Stars

mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook37.8k 397 674k
chenfei-wu/TaskMatrix
Language:Python34.5k 300 3523.3k
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python30k 217 5452.7k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.3k 257 3042.7k
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language:Python23.6k 382 1782k
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook18.6k 153 4692.2k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.4k 103 355849
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Language:Python8.2k 87 217598
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python6.4k 41 296662
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language:Python6.1k 50 1k616
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
Language:Python5.9k 64 422406
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python3.6k 31 255336
karpathy/ng-video-lecture
Language:Python3.5k 56 28907
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.2k 133 18193
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.5k 32 129197
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python1.7k 23 106177
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
844 58 1333
jianzhnie/awesome-text-to-video
A Survey on Text-to-Video Generation/Synthesis.
589 15 277
YingqingHe/LVDM
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
Language:Python443 28 2316
kohjingyu/gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
Language:Jupyter Notebook423 16 4335
rsomani95/shot-type-classifier
Detecting cinema shot types using a ResNet-50
Language:Jupyter Notebook186 16 639
IBM/SALMON
Self-Alignment with Principle-Following Reward Models
Language:Python147 5 313
Ground-A-Video/Ground-A-Video
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)
Language:Python128 8 48
mayuelala/FollowYourHandle
[arXiv 2023] Follow-Your-Handle: This repo is the official implementation of "MagicStick: Controllable Video Editing via Control Handle Transformations"
82 10 12
kyegomez/LUMIERE
Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research
Language:Python51 7 04
bharathprabakaran/FPUS23
Language:Jupyter Notebook23 2 36
DIAL-RPI/Fed-MENU
A python (PyTorch) implementation of federated multi-encoding U-Net (Fed-MENU) method for federated learning-based multi-organ segmentation with inconsistent labels.
Language:Python13 2 11
stanford-rc/slurm-spank-stunnel
Slurm SPANK plugin to ease setup of SSH tunnels and port forwarding
Language:C11 5 22
quantumhpc/slurm-spank-stunnel
Slurm SPANK plugin to ease setup of SSH tunnels and port forwarding
Language:C4 3 02
AntonotnaWang/VL-model-for-ultrasound
A Multi-Task Ultrasound Image Analysis Model by Vision-language Co-training
2 2 0

AntonotnaWang

AntonotnaWang's Stars

mlabonne/llm-course

chenfei-wu/TaskMatrix

lllyasviel/ControlNet

Stability-AI/generative-models

microsoft/JARVIS

tloen/alpaca-lora

guoyww/AnimateDiff

THUDM/CodeGeeX

IDEA-Research/GroundingDINO

bitsandbytes-foundation/bitsandbytes

THUDM/CogVLM

rom1504/img2dataset

karpathy/ng-video-lecture

showlab/Awesome-Video-Diffusion

Doubiiu/DynamiCrafter

Vchitect/Latte

mayuelala/FollowYourClick

jianzhnie/awesome-text-to-video

YingqingHe/LVDM

kohjingyu/gill

rsomani95/shot-type-classifier

IBM/SALMON

Ground-A-Video/Ground-A-Video

mayuelala/FollowYourHandle

kyegomez/LUMIERE

bharathprabakaran/FPUS23

DIAL-RPI/Fed-MENU

stanford-rc/slurm-spank-stunnel

quantumhpc/slurm-spank-stunnel

AntonotnaWang/VL-model-for-ultrasound