DinoMan's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
karpathy/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
rasbt/pattern_classification
A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks
karpathy/ng-video-lecture
dmlc/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
google/generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
NVlabs/alias-free-gan
Alias-Free GAN project website and code
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
zhanglonghao1992/One-Shot_Free-View_Neural_Talking_Head_Synthesis
Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
rosinality/alias-free-gan-pytorch
Unofficial implementation of Alias-Free Generative Adversarial Networks. (https://arxiv.org/abs/2106.12423) in PyTorch
cydonia999/VGGFace2-pytorch
PyTorch Face Recognizer based on 'VGGFace2: A dataset for recognising faces across pose and age'
c0decracker/video-splitter
Simple Python script to split video into equal length chunks or chunks of equal size, duration, etc.
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
mpc001/Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
hche11/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
neeek2303/MegaPortraits
Supplementary materials for paper MegaPortraits [ACMM22]
SamsungLabs/pytorch-ensembles
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning, ICLR 2020
wichtounet/etl
Blazing-fast Expression Templates Library (ETL) with GPU support, in C++
mpc001/end-to-end-lipreading
Pytorch code for End-to-End Audiovisual Speech Recognition
fkodom/yet-another-retnet
A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (https://arxiv.org/pdf/2307.08621.pdf)
ahaliassos/RealForensics
Official code for Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection (CVPR 2022)
ibug-group/fpage
FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild
neeek2303/Depth-Enhancement-and-Super-Resolution
Towards Unpaired Depth Enhancement and Super-Resolution in the Wild paper code
neeek2303/Leaf-diseases-segmentation
Finale project of Deep Learning course
maxs-kan/InterpretableNeuroDL
yiminglin-ai/imdb-clean
A cleaned version of IMDB-WIKI dataset for facial age estimation.
neeek2303/Lenta-Hackathon
Code and files for skoltech/lenta hackaton sept.2020
ibug-group/ibug_head_pose_estimator