labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Python · MIT
Issues
- 1
mha.py array shapes
#262 opened by erlebach - 2
Unet error in DDPM
#270 opened by TwinkleStarst - 1
How to Contribute to This Repository
#275 opened by terancejiang - 2
- 0
Request for Implementation of Mamba Paper
#261 opened by huyiwen - 2
How to use my own database for training and evaluating Retro for Question-Answering?
#260 opened by Zahin112 - 2
:kr: Korean Translation
#237 opened by movie5 - 1
ppo code running error
#245 opened by cxw-droid - 1
- 3
Question about RoPE code
#253 opened by rangehow - 1
gae formula bug
#255 opened by kangnil - 4
Chinese Translation
#257 opened by pengchzn - 1
Scripts Scripts_ Img2img get is empty
#217 opened by fu-jianhua - 2
Bug in implementation of Rotary Positional Embeddings
#215 opened by Inkorak - 4
question about RoPE code
#227 opened by yukyeongmin - 1
Bug in rotary positional embedding
#244 opened by scv11 - 2
Question about value_pe
#251 opened by Youngea - 2
Mistake in RoPE File
#256 opened by eliplutchok - 1
Website Code
#252 opened by fishbotics - 2
want to use CelebA dataset, but there is an issue
#190 opened by Z0Victor - 0
connection timed out
#254 opened by cxw-droid - 1
Question about gatv2 code
#228 opened by XiaokangORCA - 3
Training code or references for training the latent diffusion model on a custom dataset
#195 opened by mitchell-cheng - 1
- 0
"pip install labml-nn" generated errors. How to resolve it and complete the installation?
#247 opened by jxwanguab - 1
which one is the implementation of "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis"?
#200 opened by guyuezuntinggithub - 0
Implement new models/architectures:
#197 opened by Adesoji1 - 4
Make my own annotations
#236 opened by sonderlau - 0
Wrong Image Scale in DDPM
#241 opened by ww-rm - 0
DDPM interpolation formula error
#238 opened by jjjuuuun - 0
Translate, translate
#225 opened by Vector-Cross - 1
- 3
Network connection issues during training
#194 opened by XYTriste - 0
- 1
Incorrect expression in explanation
#220 opened by adrshsrvstv - 1
Is there some errors in transformers mha.py
#216 opened by lqzzy - 0
- 0
- 0
q,k,v have different shape but torch.stack works?
#202 opened by junsukha - 0
- 1
Bug in Transformer-XL shift method
#185 opened by Bearnardd - 2
can not run ViT(vision transformer) experiment file (failed to connect to https://api.labml.ai/api/vl/track?run%20wuid-87829.c05191leeae2db06088ee9ee4&labml%20version=0.4.162)
#189 opened by HiFei4869 - 0
Como
#188 opened by chinoresioe - 1
do you have code for BERT?
#187 opened by Sandy4321 - 2
MultiHeadAttention parameter setting
#180 opened by LXXiaogege - 0
- 0
A tiny bug in unet.py
#184 opened by lwb2099 - 0
Where can I get the sample
#186 opened by yaoysyao - 0
mah
#179 opened by LXXiaogege - 1
Unable to run In-paint images script
#178 opened by Vikramank