TomatoSlasher's Stars
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
papers-we-love/papers-we-love
Papers from the computer science community to read and discuss.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
KwaiVGI/LivePortrait
Bring portraits to life!
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
vikhyat/moondream
tiny vision language model
Kwai-Kolors/Kolors
Kolors Team
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Breakthrough/PySceneDetect
Python and OpenCV-based scene cut/transition detection program & library.
pytorch/torchtitan
A PyTorch native library for large model training
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
microsoft/mup
maximal update parametrization (µP)
google-deepmind/tapnet
Tracking Any Point (TAP)
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
character-ai/prompt-poet
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with PyTorch.
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
cloneofsimo/minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
AssafSinger94/dino-tracker
Official PyTorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)
KellerJordan/cifar10-airbench
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
hako-mikan/sd-webui-cd-tuner
Color/Detail control for Stable Diffusion web-ui
kijai/ComfyUI-ControlNeXt-SVD
RuiningLi/DragAPart
[ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.
moatifbutt/color-peel
We propose to generate a series of geometric shapes with target colors to disentangle (or peel off) the target colors from the shapes. By jointly learning on multiple color-shape images, we found that the method can successfully disentangle the color and shape concepts.