DJXia's Stars
LiGameAcademy/godot_core_system
godot4.4 核心系统功能支持,包括事件总线、分层状态机、资源管理器、序列化和日志系统等
schoolpost/CinePI
OpenSource Cinema Camera using Raspberry Pi
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Tonejs/Tone.js
A Web Audio framework for making interactive music in the browser.
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
jianzongwu/DiffSensei
Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
juce-framework/JUCE
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
silent-chen/Shap-Editor
[CVPR 2024] SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds
HOIfHLI/Human-Object-Interaction-from-Human-Level-Instructions
zju3dv/EasyVolcap
[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research
Francis-Rings/StableAnimator
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
KRTirtho/spotube
🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile!
samxuxiang/BrepGen
[SIGGRAPH 2024] Official PyTorch Implementation of "BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry".
hmrishavbandy/FlipSketch
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
modelcontextprotocol/servers
Model Context Protocol Servers
richards199999/Thinking-Claude
Let your Claude able to think
Meshcapade/SMPL_blender_addon
This add-on allows you to edit, reshape and animate SMPL-H, SMPL-X, and SUPR bodies ("SMPL Bodies" for short) to your current Blender scene. Each body consists of a mesh, a shape specific skeleton, and blendshapes (also known as "shape keys") for body shape, facial expressions and pose correctives.
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
sickcodes/Docker-OSX
Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers.
3b1b/manim
Animation engine for explanatory math videos
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
microsoft/SynthMoCap
SynthMoCap Datasets
brightwang/dify-tool-service
为AI带路党Pro视频准备
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
google/flatbuffers
FlatBuffers: Memory Efficient Serialization Library
rhasspy/piper
A fast, local neural text to speech system