DinoMan

DinoMan's Stars

google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
28.3k 293 442.3k
karpathy/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Language:Jupyter Notebook11.5k 156 331.7k
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
Language:Python10.4k 112 716903
rasbt/pattern_classification
A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks
Language:Jupyter Notebook4.2k 386 31.3k
karpathy/ng-video-lecture
Language:Python3.9k 59 301k
dmlc/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
Language:C++2.1k 29 272172
google/generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
Language:Jupyter Notebook1.9k 74 124674
NVlabs/alias-free-gan
Alias-Free GAN project website and code
1.3k 345 042
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
Language:Python1.2k 13 25103
zhanglonghao1992/One-Shot_Free-View_Neural_Talking_Head_Synthesis
Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
Language:Python829 25 79147
rosinality/alias-free-gan-pytorch
Unofficial implementation of Alias-Free Generative Adversarial Networks. (https://arxiv.org/abs/2106.12423) in PyTorch
Language:Python506 26 2743
cydonia999/VGGFace2-pytorch
PyTorch Face Recognizer based on 'VGGFace2: A dataset for recognising faces across pose and age'
Language:Python493 6 1595
c0decracker/video-splitter
Simple Python script to split video into equal length chunks or chunks of equal size, duration, etc.
Language:Python476 24 26156
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
Language:Python410 7 65104
mpc001/Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
Language:Python394 13 3261
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Language:Python357 11 7379
hche11/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
Language:Python307 6 1833
neeek2303/MegaPortraits
Supplementary materials for paper MegaPortraits [ACMM22]
264 54 618
SamsungLabs/pytorch-ensembles
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning, ICLR 2020
Language:Jupyter Notebook235 15 524
wichtounet/etl
Blazing-fast Expression Templates Library (ETL) with GPU support, in C++
Language:C++222 20 318
mpc001/end-to-end-lipreading
Pytorch code for End-to-End Audiovisual Speech Recognition
Language:Python175 2 3250
fkodom/yet-another-retnet
A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (https://arxiv.org/pdf/2307.08621.pdf)
Language:Python104 2 1417
ahaliassos/RealForensics
Official code for Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection (CVPR 2022)
Language:Python90 2 1410
ibug-group/fpage
FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild
Language:Python59 3 38
neeek2303/Depth-Enhancement-and-Super-Resolution
Towards Unpaired Depth Enhancement and Super-Resolution in the Wild paper code
Language:Jupyter Notebook55 1 03
neeek2303/Leaf-diseases-segmentation
Finale project of Deep Learning course
Language:Jupyter Notebook53 1 03
maxs-kan/InterpretableNeuroDL
Language:HTML52 3 27
yiminglin-ai/imdb-clean
A cleaned version of IMDB-WIKI dataset for facial age estimation.
Language:Python46 1 04
neeek2303/Lenta-Hackathon
Code and files for skoltech/lenta hackaton sept.2020
Language:Jupyter Notebook37 1 01
ibug-group/ibug_head_pose_estimator
Language:Python2 2 00