AntonotnaWang's Stars
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
chenfei-wu/TaskMatrix
lllyasviel/ControlNet
Let us control diffusion models!
Stability-AI/generative-models
Generative Models by Stability AI
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
guoyww/AnimateDiff
Official implementation of AnimateDiff.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
karpathy/ng-video-lecture
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
jianzhnie/awesome-text-to-video
A Survey on Text-to-Video Generation/Synthesis.
YingqingHe/LVDM
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
kohjingyu/gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
rsomani95/shot-type-classifier
Detecting cinema shot types using a ResNet-50
IBM/SALMON
Self-Alignment with Principle-Following Reward Models
Ground-A-Video/Ground-A-Video
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)
mayuelala/FollowYourHandle
[arXiv 2023] Follow-Your-Handle: This repo is the official implementation of "MagicStick: Controllable Video Editing via Control Handle Transformations"
kyegomez/LUMIERE
Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research
bharathprabakaran/FPUS23
DIAL-RPI/Fed-MENU
A python (PyTorch) implementation of federated multi-encoding U-Net (Fed-MENU) method for federated learning-based multi-organ segmentation with inconsistent labels.
stanford-rc/slurm-spank-stunnel
Slurm SPANK plugin to ease setup of SSH tunnels and port forwarding
quantumhpc/slurm-spank-stunnel
Slurm SPANK plugin to ease setup of SSH tunnels and port forwarding
AntonotnaWang/VL-model-for-ultrasound
A Multi-Task Ultrasound Image Analysis Model by Vision-language Co-training