sjf18's Stars
codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
jaywcjlove/awesome-mac
Now very large and different from the original idea: a collection of premium software in various categories.
localsend/localsend
An open-source cross-platform alternative to AirDrop
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
RVC-Boss/GPT-SoVITS
As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).
linexjlin/GPTs
leaked prompts of GPTs
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
bleedline/aimoneyhunter
The Ultimate Guide to Making Money with AI Side Hustles: learn how to leverage AI for side projects and earn extra income. Check out the English version for more insights.
InstantID/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
BartoszJarocki/cv
Print-friendly, minimalist CV page
Acly/krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
guofei9987/blind_watermark
Blind and invisible image watermark; extracting the watermark does not require the original image.
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
DLLXW/baby-llama2-chinese
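Repository for pretraining from scratch and SFT of a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to run it and obtain a chat-llama2 capable of simple Chinese question answering.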
zengyh1900/Awesome-Image-Inpainting
A curated list of image inpainting and video inpainting papers and resources
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Vincentqyw/image-matching-webui
🤗 image matching toolbox webui
OrionStarAI/Orion
Orion-14B is a family of models that includes a 14B multilingual foundation LLM and a series of derived models: a chat model, a long-context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model.
cvg/glue-factory
Training library for local feature detection and matching
KKGo1999/Stable-diffusion-person
High-definition, realistic portraits generated with the Stable-diffusion-based Chilloutmix model.
AlaaLab/InstructCV
[ICLR 2024] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
xuanyuzhang21/EditGuard
[CVPR 2024🔥] EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection