tripathiarpan20

On a venture to explore the boundaries of human creativity & efficiency reachable by AI

tripathiarpan20's Stars

black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python17.8k 150 01.3k
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Language:Python16.2k 111 305857
fishaudio/fish-speech
Brand new TTS solution
Language:Python14.7k 99 4121.1k
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.4k 49 244434
Picovoice/porcupine
On-device wake word detection powered by deep learning
Language:Python3.8k 65 555503
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language:Python3.5k 43 177299
Dhravya/cloudflare-saas-stack
Quickly make and deploy full-stack apps with database, auth, styling, storage etc. figured out for you. Add all primitives you want.
Language:TypeScript3.1k 17 31234
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Language:Python2.1k 31 101193
bghira/SimpleTuner
A general fine-tuning kit geared toward diffusion models.
Language:Python1.8k 20 387176
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python1.8k 21 69115
wordware-ai/twitter
AI Agent for Twitter Personality Analysis
Language:TypeScript1.3k 12 9221
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Language:Python1.1k 14 2482
muzishen/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual try-on.
Language:Python1k 14 4386
Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
Language:Python962 12 73114
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language:Python909 10 10460
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language:Python754 71 14110
gojasper/flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Language:Python476 9 1435
apple/ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
Language:Python448 13 2532
donahowe/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Language:Jupyter Notebook415 10 4731
maxin-cn/Cinemo
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
Language:Python237 8 1320
IDEA-Research/TAPTR
[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
Language:Python202 4 1112
kijai/ComfyUI-LuminaWrapper
Language:Python185 4 287
czg1225/AsyncDiff
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
Language:Python167 4 98
Yuanshi9815/Video-Infinity
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
Language:Python164 1 1015
hustvl/GaussianDreamerPro
GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enhanced Quality
163 18 44
hwjiang1510/Real3D
Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"
Language:Python146 8 92
snap-research/weights2weights
Official Implementation of weights2weights
Language:Jupyter Notebook121 11 84
PINTO0309/whisper-onnx-cpu
ONNX implementation of Whisper. PyTorch free.
Language:Python85 4 08
yandex-research/invertible-cd
[NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps
Language:Python85 5 51
GPT-Talker/GPT-Talker
24MM
Language:HTML3 1 0

tripathiarpan20

tripathiarpan20's Stars

black-forest-labs/flux

exo-explore/exo

fishaudio/fish-speech

snakers4/silero-vad

Picovoice/porcupine

Tencent/HunyuanDiT

Dhravya/cloudflare-saas-stack

KoljaB/RealtimeSTT

bghira/SimpleTuner

cambrian-mllm/cambrian

wordware-ai/twitter

ali-vilab/MimicBrush

muzishen/IMAGDressing

Zheng-Chong/CatVTON

DAMO-NLP-SG/VideoLLaMA2

Text-to-Audio/Make-An-Audio

gojasper/flash-diffusion

apple/ml-mdm

donahowe/AutoStudio

maxin-cn/Cinemo

IDEA-Research/TAPTR

kijai/ComfyUI-LuminaWrapper

czg1225/AsyncDiff

Yuanshi9815/Video-Infinity

hustvl/GaussianDreamerPro

hwjiang1510/Real3D

snap-research/weights2weights

PINTO0309/whisper-onnx-cpu

yandex-research/invertible-cd

GPT-Talker/GPT-Talker