RahulBhalley's Stars
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
apple/corenet
CoreNet: A library for training deep neural networks
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
XPixelGroup/DiffBIR
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
StanfordBDHG/HealthGPT
Query your Apple Health data with natural language 💬 🩺
rsxdalv/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
karpathy/nano-llama31
nanoGPT style version of Llama 3.1
firebase/firebase-admin-python
Firebase Admin Python SDK
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
huggingface/swift-transformers
Swift Package to implement a transformers-like API in Swift
steven-tey/extrapolate
Age transformation AI app powered by Next.js, Vercel, Replicate, Upstash, and Cloudflare R2 + Workers.
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
apeatling/ollama-voice-mac
Mac compatible Ollama Voice
EurekaLabsAI/tensor
The Tensor (or Array)
warchimede/RangeSlider
A simple range slider made in Swift
sidharthrajaram/StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
ZHO-ZHO-ZHO/ComfyUI-I2VGenXL
Unofficial implementation of I2VGenXL for ComfyUI
yusufdalva/VecGAN
Implementation for the works "VecGAN: Image-to-Image Translation with Interpretable Latent Directions" (ECCV 2022) and "Face Attribute Editing with Disentangled Latent Vectors"
KumapowerLIU/CLCAE
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint. CVPR 2023
ALucek/llama3-websearch-agent
warisqr007/ppg2ppg
Zero-Shot Foreign Accent Conversion without a Native Reference