Foo1szz
I am Yang Hanjie, an ordinary graduate student at Dalian University of Technology.
DLUT, Dalian, China
Foo1szz's Stars
krahets/hello-algo
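"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.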
scutan90/DeepLearning-500-questions
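Deep Learning 500 Questions: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan, 2018.06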
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
datawhalechina/easy-rl
A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/
ZhongFuCheng3y/austin
Message push platform 🔥 Pushes and delivers message types including email, SMS, WeChat Official Account, WeChat Mini Program, WeCom, and DingTalk.
datawhalechina/daily-interview
Interview notes compiled by Datawhale members, covering machine learning, CV, NLP, recommendation, development, and more. Stars are welcome.
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
lucidrains/byol-pytorch
Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch
lucidrains/FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
nikhilbarhate99/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
shkrwnd/Deep-Reinforcement-Learning-for-Dynamic-Spectrum-Access
Using multi-agent Deep Q Learning with LSTM cells (DRQN) to train multiple users in cognitive radio to learn to share scarce resource (channels) equally without communication
gxywy/rl-plotter
✨ A plotter for reinforcement learning (RL)
baturaysaglam/RIS-MISO-Deep-Reinforcement-Learning
Joint Transmit Beamforming and Phase Shifts Design with Deep Reinforcement Learning
jerrodparker20/adaptive-transformers-in-rl
Adaptive Attention Span for Reinforcement Learning
shibhansh/loss-of-plasticity
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
stefanbschneider/mobile-env
An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks.
mxu34/prompt-dt
Official code repository for Prompt-DT.
LanqingLi1993/FOCAL-ICLR
Code for FOCAL Paper Published at ICLR 2021
YYCAAA/V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
PKU-RL/CORRO
CORRO code
lasseufpa/ITU-Challenge-ML5G-PHY-RL
Scripts for the "ITU-ML5G-PS-006: ML5G-PHY-Reinforcement learning: scheduling and resource allocation"
acyclics/MPO
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
Arya87/RL_draw_seabron
Use seaborn to draw RL plots
The-AI-Summer/byol-cifar10
Implementation of BYOL on CIFAR-10
jsikyoon/V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
AiltonOliveir/RL-env-for-communications
Reinforcement learning environment for MIMO communications.
kethan-1818/5G-channel-modulation-using-RL
I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforcement learning project.
cyj407/RL-MPO-DMC
qiuruiyu/rl-plotter
✨ A plotter for reinforcement learning (RL)