DJXia

DJXia's Stars

LiGameAcademy/godot_core_system
godot4.4 核心系统功能支持，包括事件总线、分层状态机、资源管理器、序列化和日志系统等
Language:GDScript9815
schoolpost/CinePI
OpenSource Cinema Camera using Raspberry Pi
1.3k48
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Language:Python6.4k520
Tonejs/Tone.js
A Web Audio framework for making interactive music in the browser.
Language:TypeScript13.8k998
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Language:Python21.1k2.4k
jianzongwu/DiffSensei
Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
Language:Python73060
juce-framework/JUCE
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
Language:C++7k1.8k
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language:Python9.4k785
silent-chen/Shap-Editor
[CVPR 2024] SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds
Language:Python33
HOIfHLI/Human-Object-Interaction-from-Human-Level-Instructions
10
zju3dv/EasyVolcap
[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research
Language:Python1.1k64
Francis-Rings/StableAnimator
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
Language:Python1.2k77
KRTirtho/spotube
🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile!
Language:Dart38.7k1.6k
samxuxiang/BrepGen
[SIGGRAPH 2024] Official PyTorch Implementation of "BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry".
Language:Python27835
hmrishavbandy/FlipSketch
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Language:Python32934
modelcontextprotocol/servers
Model Context Protocol Servers
Language:JavaScript23.4k2.4k
richards199999/Thinking-Claude
Let your Claude able to think
Language:TypeScript14.8k1.7k
Meshcapade/SMPL_blender_addon
This add-on allows you to edit, reshape and animate SMPL-H, SMPL-X, and SUPR bodies ("SMPL Bodies" for short) to your current Blender scene. Each body consists of a mesh, a shape specific skeleton, and blendshapes (also known as "shape keys") for body shape, facial expressions and pose correctives.
Language:Python19116
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组
Language:Python12k1.2k
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python105k8.2k
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Language:Python19.4k2.1k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Language:Python31.5k3.2k
sickcodes/Docker-OSX
Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers.
Language:Shell50k2.8k
3b1b/manim
Animation engine for explanatory math videos
Language:Python76.3k6.6k
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python5.9k513
microsoft/SynthMoCap
SynthMoCap Datasets
Language:Python43522
brightwang/dify-tool-service
为AI带路党Pro视频准备
Language:Python18843
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language:C++65.6k9.8k
google/flatbuffers
FlatBuffers: Memory Efficient Serialization Library
Language:C++24k3.3k
rhasspy/piper
A fast, local neural text to speech system
Language:C++8.3k620