Rizwanali324's Stars
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
kobinabrandon/Hourly-Divvy-Trip-Predictor
An end-to-end batch machine learning system that produces hourly predictions of the number of arrivals and departures that will take place at various stations in Lyft's bike sharing system in Chicago.
PKU-YuanGroup/ConsisID
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
RexanWONG/text-behind-image
https://textbehindimage.rexanwong.xyz - create text behind image designs easily
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
XLabs-AI/x-flux
Hillobar/Rope
GUI-focused roop
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
bachzz/UW-DiffPhys
bmaltais/kohya_ss
Nutlope/llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
khetansarvesh/CV
Implementation of algorithms like CNN, Vision Transformers, VAE, GAN, Diffusion .... for image data
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
warmshao/FasterLivePortrait
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
entbappy/AWS-CICD-Deploymennt
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
aihacker111/Efficient-Live-Portrait
Fast running Live Portrait with TensorRT and ONNX models
Mrkomiljon/Webcam_Live_Portrait
Bring portraits to life via webcam!
AarohiSingla/Automatic-Number-Plate-Recognition--ANPR-
Automatic Number Plate Recognition (ANPR) Using YOLOv8 and easyOCR
faridrashidi/kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
PromtEngineer/localGPT-Vision
Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs
FACEGOOD/FACEGOOD-Audio2Face
http://www.facegood.cc
media-sec-lab/Audio-Deepfake-Detection
Research progress on speech deepfake detection: Relevant datasets aggregated from the review literature and publicly available codes
evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
joeyism/linkedin_scraper
A library that scrapes Linkedin for user data
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
bornfree/talking_avatar_backend
Simple ExpressJS backend for talking avatars
RUB-SysSec/WaveFake
mostafa-saad/deep-activity-rec
Paper ibrahim et al, cvpr 2016 - A Hierarchical Deep Temporal Model for Group Activity Recognition -