mrtpk's Stars
soumik-kanad/diff2lip
ajay-sainy/Wav2Lip-GFPGAN
High-quality lip sync
Abhi7410/DASS_Project
This project aims to turn 2D photographs provided by the user into 3D versions that can be lip-synced with an audio file of the user's choice. The final product will be a lifelike video of the subject. The project also considers future additions such as integration with different backgrounds, text inclusion, and social media sharing.
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
raymond-li/tflite_tensor_outputter
Generates intermediate tensor outputs for TFLite
roboflow/supervision
We write your reusable computer vision tools. 💜
unslothai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
rmwkwok/transposed_convolution_in_numpy
NumPy implementation of transposed convolution as matrix multiplication
yerfor/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
ggerganov/ggml
Tensor library for machine learning
TannerGilbert/Computer-Vision-Synthetic-Data-Generation
Synthetic dataset generator for object detection and instance segmentation
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
drainingsun/ybat
Ybat - YOLO BBox Annotation Tool
jsbroks/awesome-dataset-tools
🔧 A curated list of awesome dataset tools
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Taeyoung96/Yolo-to-COCO-format-converter
YOLO to COCO annotation format converter
laclouis5/globox
A package to read and convert object detection datasets (COCO, YOLO, PascalVOC, LabelMe, CVAT, OpenImage, ...) and evaluate them with COCO and PascalVOC metrics.
yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
abhineet123/Deep-Learning-for-Tracking-and-Detection
Collection of papers, datasets, code and other resources for object tracking and detection using deep learning
adobe-research/MakeItTalk
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
gmongaras/AI_Girlfriend
Creating a waifu
pkhungurn/talking-head-anime-4
Talking Head(?) Anime from a Single Image 4: Improved Model and Its Distillation
harlanhong/awesome-talking-head-generation
flyerhq/flutter_chat_ui
Actively maintained, community-driven chat UI implementation with an optional Firebase BaaS.
PrettyPrinted/youtube_video_code
The code for all the videos I publish on YouTube.
comfyanonymous/ComfyUI
The most powerful and modular Stable Diffusion GUI, API, and backend with a graph/nodes interface.
segmind/distill-sd
Segmind Distilled diffusion
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.