SherlockJane's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
openai/openai-cookbook
Examples and guides for using the OpenAI API
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
chenfei-wu/TaskMatrix
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
facebookresearch/AnimatedDrawings
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
openai/consistency_models
Official repo for consistency models.
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
forthespada/CampusShame
互联网仍有记忆!那些曾经在校招过程中毁过口头offer、意向书、三方的公司!纵然人微言轻,也想尽绵薄之力!
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
hysts/pytorch_image_classification
PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet
MichalGeyer/plug-and-play
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
szczyglis-dev/py-gpt
Desktop AI Assistant powered by o1, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, Bielik, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and more. Linux, Windows, Mac.
Ruu3f/freeGPT
freeGPT provides free access to text and image generation models.
nie-lang/UnsupervisedDeepImageStitching
TIP2021 - Unsupervised deep image stitching network
Vchitect/VideoBooth
[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts
zhangzc21/DynTet
Kobaayyy/Awesome-Low-Level-Vision-Research-Groups
A Collection of Low Level Vision Research Groups
pq-yang/PGDiff
[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance
apple/ml-agm
leoranlmia/CAR-DQN
[ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)