philexohf's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
facebookresearch/llama
Inference code for LLaMA models
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
lllyasviel/style2paints
sketch + style = paints :art: (TOG2018/SIGGRAPH2018ASIA)
sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
mlfoundations/open_clip
An open source implementation of CLIP.
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
bmaltais/kohya_ss
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
alembics/disco-diffusion
timothybrooks/instruct-pix2pix
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
ZHKKKe/MODNet
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
crowsonkb/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
InternLM/Tutorial
LLM&VLM Tutorial
SHI-Labs/Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
ziqihuangg/Collaborative-Diffusion
[CVPR 2023] Collaborative Diffusion
airockchip/RK3399Pro_npu
youssefHosni/Stable-Diffusion-Crash-Course
waterIKA/JARVIS-A-Smart-To-do-list-assistant
A Smart To do list with LLM