Pinned Repositories
AMEERAZAM08
I'm Azam
Django-With-Cleary-Tasks
ML-QA
Pose-aware-masking-Using-Mediapipe
res-adapter
Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".
Road-Accident-Prediction-Using-ML
sam-sdxl-inpainting
sdxs
Official repo of paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
StyleLipSync
Official pytorch implementation of "StyleLipSync: Style-based Personalized Lip-sync Video Generation".
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
AMEERAZAM08's Repositories
AMEERAZAM08/Deploy-Lama-On-Baseten
AMEERAZAM08/Flux-Dev-24GB-Quantize-8bit
AMEERAZAM08/GaussianAvatars
Official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
AMEERAZAM08/story-adapter
AMEERAZAM08/BrokenSource
❤️‍🩹 Broken Source Software's Monorepo. A fragmented world, code shards converging to form a vibrant and dynamic software universe
AMEERAZAM08/Deploy-Model-On-Baseten
AMEERAZAM08/DiffTED
AMEERAZAM08/DiffuseHigh
Official implementation of DiffuseHigh, *Younghyun Kim, *Geunmin Hwang, Junyu Zhang, Eunbyung Park.
AMEERAZAM08/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
AMEERAZAM08/dreamhoi
DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors
AMEERAZAM08/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
AMEERAZAM08/facefusion
Industry leading face manipulation platform
AMEERAZAM08/FluxMusic
Text-to-Music Generation with Rectified Flow Transformer
AMEERAZAM08/GAGAvatar
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
AMEERAZAM08/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
AMEERAZAM08/JoyHallo
JoyHallo: Digital human model for Mandarin
AMEERAZAM08/Lotus
Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
AMEERAZAM08/mediapipe
MediaPipe is the simplest way for researchers and developers to build world-class ML solutions and applications for mobile, edge, cloud and the web.
AMEERAZAM08/MeGA
The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing".
AMEERAZAM08/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
AMEERAZAM08/MVANet
AMEERAZAM08/RobustSAM
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
AMEERAZAM08/ScanTalk
[ECCV 2024] ScanTalk: 3D Talking Heads from Unregistered Scans
AMEERAZAM08/SEG-SDXL
The implementation of the paper "Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention" (NeurIPS'24)
AMEERAZAM08/sliders
Concept Sliders for Precise Control of Diffusion Models
AMEERAZAM08/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
AMEERAZAM08/TANGO-hf
AMEERAZAM08/transformer_latent_diffusion
Text to Image Latent Diffusion using a Transformer core
AMEERAZAM08/voice-pro
The best gradio web-ui for ai transcription, translation and TTS. Automatic subtitle creation using faster whisper. Easy one click installation. Fully portable.
AMEERAZAM08/whisper.cpp
Port of OpenAI's Whisper model in C/C++