DeepHansda's Stars
Zyphra/Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.
Wan-Video/Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
ipikuka/next-mdx-remote-client
A wrapper of `@mdx-js/mdx` for `Next.js` applications in order to load MDX content. It is a fork of `next-mdx-remote`.
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
emmabostian/developer-portfolios
A list of developer portfolios for your inspiration
actualize-ae/voice-chat-pdf
Use OpenAI's realtime API for a chatting with your documents
IGL-HKUST/DiffusionAsShader
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
unclecode/crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application.
mycfhs/DreamMix
The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
yformer/EfficientTAM
Efficient Track Anything
LadybirdBrowser/ladybird
Truly independent web browser
vicsejas/fastapi-with-tailwindcss
How to setup FastAPI with TailwindCSS
sunscrapers/fastapi-htmx-daisyui
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
nerfstudio-project/nerfstudio
A collaboration friendly studio for NeRFs
lizhe00/AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
karanpratapsingh/system-design
Learn how to design systems at scale and prepare for system design interviews
anil-sidhu/dsa-with-js
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
jina-ai/serve
☁️ Build multimodal AI applications with cloud-native stack
zhanymkanov/fastapi-best-practices
FastAPI Best Practices and Conventions we used at our startup
haoheliu/voicefixer
General Speech Restoration
Pandaily591/OnlySpeakTTS
Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
ChenyangLEI/All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas