redpintings's Stars
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
ihmily/outfit-anyone
Outfit Anyone (latest fixed version): Ultra-high quality virtual try-on for Any Clothing and Any Person
kanadeblisst00/wechat_ocr
Call WeChat's local OCR service from Python
aiola-lab/whisper-medusa
Whisper with Medusa heads
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
AuvaLab/itext2kg
Incremental Knowledge Graphs Constructor Using Large Language Models
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Kwai-Kolors/Kolors
Kolors Team
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
thuhcsi/MagicMan
Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
liuzhao1225/YouDub-webui
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
jbilcke-hf/clapper
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
rosuH/EasyWatermark
🔒 🖼 Securely, easily add a watermark to your sensitive photos, so they cannot be leaked or misused by bad actors.
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
chromedp/chromedp
A faster, simpler way to drive browsers supporting the Chrome DevTools Protocol.
submato/xhscrawl
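Xiaohongshu data collection, Xiaohongshu reverse engineering, Xiaohongshu x-s reverse engineering, Xiaohongshu crawler, Xiaohongshu accounts and promotion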
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
BMPixel/moffee
moffee: Make Markdown Ready to Present
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
open-compass/GAOKAO-Eval
antvis/G6
♾ A Graph Visualization Framework in JavaScript.
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
FunAudioLLM/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
harry0703/AudioNotes
Quickly extract audio and video content and organize it into structured Markdown notes