CodingMice's Stars
binary-husky/gpt_academic
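Provides a practical interactive interface for GPT, GLM, and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.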
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
lllyasviel/ControlNet
Let us control diffusion models!
danielgatis/rembg
Rembg is a tool to remove image backgrounds
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
nlpxucan/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
opendilab/awesome-RLHF
A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)
Doubiiu/DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
MasterBin-IIAU/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
SwinTransformer/Video-Swin-Transformer
This is an official implementation for "Video Swin Transformer".
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
xmu-xiaoma666/FightingCV-Paper-Reading
⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀
ShuhongChen/panic3d-anime-reconstruction
CVPR 2023: PAniC-3D Stylized Single-view 3D Reconstruction from Portraits of Anime Characters
ankanbhunia/PIDM
Person Image Synthesis via Denoising Diffusion Model (CVPR 2023)
drboog/ProFusion
Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
The-FinAI/PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
simoninithomas/awesome-ai-tools-for-game-dev
A curated list of awesome AI tools for game developers
aimagelab/multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
showlab/UniVTG
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
IDEA-Research/HumanSD
[ICCV 2023] The official implementation of the paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"
prasunroy/stefann
[CVPR 2020] STEFANN: Scene Text Editor using Font Adaptive Neural Network (official code).
salesforce/causalai
Salesforce CausalAI Library: A Fast and Scalable framework for Causal Analysis of Time Series and Tabular Data
ygtxr1997/ReliableSwap
Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'