lonngxiang's Stars
kleinlee/DH_live
每个人都能用的数字人
OpenT2S/LlamaVoice
LlamaVoice is a llama-based large voice generation model, providing inference and training ability.
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
black-forest-labs/flux
Official inference repo for FLUX.1 models
onwidget/astrowind
⭕️ AstroWind: A free template using Astro 4.0 and Tailwind CSS. Astro starter theme.
InternLM/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
hustvl/EVF-SAM
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
sickcodes/Docker-OSX
Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers.
IDEA-Research/TAPTR
[ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection"
ShihaoZhaoZSH/Uni-ControlNet
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
kohya-ss/sd-scripts
bmaltais/kohya_ss
huggingface/optimum-quanto
A pytorch quantization backend for optimum
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
LykosAI/StabilityMatrix
Multi-Platform Package Manager for Stable Diffusion
Xiaojiu-z/Stable-Hair
Stable-Hair: Real-World Hair Transfer via Diffusion Model
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
DeepGraphLearning/ProtST
[ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts
mu-fazil-vk/FluxTube
A powerful and ad-free YouTube client built using Flutter. Watch YouTube videos without ads, subscribe to channels, retrieve video dislikes, read comments, save videos, and much more.
mpflutter/mpflutter
MPFlutter 是一个跨平台 Flutter 开发框架,可用于微信小程序以及 Web 应用开发。
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
mpetroff/pannellum
Pannellum is a lightweight, free, and open source panorama viewer for the web.
sxzxs/Real-time-translation-typing
实时打字翻译软件、语音实时打字、语音实时翻译、LOL 语音打字
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
npmstudy/indie-dev-with-ai
独立开发者的最佳技术栈
Ikaros-521/AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).