DeepHansda

Asansol, West Bengal

DeepHansda's Stars

Zyphra/Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with, or even surpassing, top TTS providers.
Language: Python, 6.2k stars, 660 forks
Wan-Video/Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
Language: Python, 9.1k stars, 978 forks
ipikuka/next-mdx-remote-client
A wrapper of `@mdx-js/mdx` for `Next.js` applications in order to load MDX content. It is a fork of `next-mdx-remote`.
Language:TypeScript774
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Language: Python, 31.5k stars, 3.2k forks
emmabostian/developer-portfolios
A list of developer portfolios for your inspiration
Language: Python, 12.6k stars, 2.5k forks
actualize-ae/voice-chat-pdf
Use OpenAI's realtime API for chatting with your documents
Language:TypeScript24033
IGL-HKUST/DiffusionAsShader
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
Language:Python54818
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language: Python, 3.8k stars, 475 forks
unclecode/crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Language: Python, 34.1k stars, 3k forks
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose-built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts, and generate videos.
Language: Jupyter Notebook, 7.8k stars, 503 forks
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT / Claude / DeepSeek application.
Language: TypeScript, 58.2k stars, 12.3k forks
mycfhs/DreamMix
The official implementation of the paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
Language:Python1175
yformer/EfficientTAM
Efficient Track Anything
Language:Python50420
LadybirdBrowser/ladybird
Truly independent web browser
Language: C++, 36.4k stars, 1.5k forks
vicsejas/fastapi-with-tailwindcss
How to set up FastAPI with TailwindCSS
Language:CSS383
sunscrapers/fastapi-htmx-daisyui
Language:Python11017
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python, 15k stars, 1.3k forks
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Language: Jupyter Notebook, 4.6k stars, 398 forks
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language: Python, 4.2k stars, 484 forks
nerfstudio-project/nerfstudio
A collaboration-friendly studio for NeRFs
Language: Python, 10k stars, 1.4k forks
lizhe00/AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
Language:Python98164
karanpratapsingh/system-design
Learn how to design systems at scale and prepare for system design interviews
35.1k stars, 4k forks
anil-sidhu/dsa-with-js
Language:HTML7954
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python, 28.2k stars, 5.8k forks
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.6k stars, 928 forks
jina-ai/serve
☁️ Build multimodal AI applications with cloud-native stack
Language: Python, 21.5k stars, 2.2k forks
zhanymkanov/fastapi-best-practices
FastAPI Best Practices and Conventions we used at our startup
11k stars, 813 forks
haoheliu/voicefixer
General Speech Restoration
Language: Python, 1.1k stars, 134 forks
Pandaily591/OnlySpeakTTS
Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes speech generation much faster by default.
Language:Python5211
ChenyangLEI/All-In-One-Deflicker
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
Language:Python72543
