jotoy's Stars
microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
abpframework/abp
Open-source web application framework for ASP.NET Core! Offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET. Provides the fundamental infrastructure, cross-cutting-concern implementations, startup templates, application modules, UI themes, tooling and documentation.
Vision-CAIR/ChatCaptioner
Official Repository of ChatCaptioner
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
tmabraham/ddpo-pytorch
Reproduction of DDPO paper (RLHF for diffusion)
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
deep-floyd/IF
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
Nota-NetsPresso/BK-SDM
A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]
segmind/distill-sd
Segmind Distilled diffusion
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
NVlabs/stylegan3
Official PyTorch implementation of StyleGAN3
mli/paper-reading
深度学习经典、新论文逐段精读
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
kjerk/instructblip-pipeline
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
BDBC-KG-NLP/QA-Survey-CN
北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对学术界和工业界进行了相关总结。
google-research/google-research
Google Research