jotoy

jotoy's Stars

microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
Language:Python1.7k236
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
19.2k1.8k
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python5k489
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.7k432
abpframework/abp
Open-source web application framework for ASP.NET Core! Offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET. Provides the fundamental infrastructure, cross-cutting-concern implementations, startup templates, application modules, UI themes, tooling and documentation.
Language:C#13.3k3.5k
Vision-CAIR/ChatCaptioner
Official Repository of ChatCaptioner
Language:Jupyter Notebook46330
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.6k2.9k
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
Language:Python68959
kvablack/ddpo-pytorch
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
Language:Python51950
tmabraham/ddpo-pytorch
Reproduction of DDPO paper (RLHF for diffusion)
Language:Jupyter Notebook832
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language:Python1.4k71
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python40.6k5.2k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python59.5k6k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language:Python28.3k5.8k
deep-floyd/IF
Language:Python7.8k511
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python42.7k6.5k
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
Language:Python4.2k406
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
Language:Python3.5k506
Nota-NetsPresso/BK-SDM
A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]
Language:Python28420
segmind/distill-sd
Segmind Distilled diffusion
Language:Python59538
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Language:Python8.2k779
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12.6k1.6k
NVlabs/stylegan3
Official PyTorch implementation of StyleGAN3
Language:Python6.6k1.2k
mli/paper-reading
深度学习经典、新论文逐段精读
29.6k2.6k
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python8.6k801
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.9k3.4k
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python150k28k
kjerk/instructblip-pipeline
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
Language:Python302
BDBC-KG-NLP/QA-Survey-CN
北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答（KBQA），基于文本的问答系统（TextQA），基于表格的问答系统（TableQA）、基于视觉的问答系统（VisualQA）和机器阅读理解（MRC）等，每类任务分别对学术界和工业界进行了相关总结。
1.8k263
google-research/google-research
Google Research
Language:Jupyter Notebook35.2k8k