reasondk's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
numba/numba
NumPy aware dynamic Python compiler using LLVM
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
tensorlayer/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
yandexdataschool/Practical_RL
A course in reinforcement learning in the wild
keras-rl/keras-rl
Deep Reinforcement Learning for Keras.
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
CrazyBoyM/llama3-Chinese-chat
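Llama3 and Llama3.1 Chinese repository (accompanying book in progress... interesting weights from community and vendor fine-tunes and modded versions, plus training, inference, evaluation, and deployment tutorial videos and docs)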
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
wangshusen/DRL
Deep Reinforcement Learning
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
NeuronDance/DeepRL
Deep Reinforcement Learning Lab, a platform designed to make DRL technology accessible and fun for everyone
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
AI-Hypercomputer/maxtext
A simple, performant and scalable JAX LLM!
coreylynch/async-rl
TensorFlow + Keras + OpenAI Gym implementation of 1-step Q-learning from "Asynchronous Methods for Deep Reinforcement Learning"
luchris429/purejaxrl
Really Fast End-to-End JAX RL Implementations
cbyn/bitpredict
Machine learning for high frequency bitcoin price prediction
wdndev/llama3-from-scratch-zh
A Chinese-language version of implementing llama3 from scratch
marload/DeepRL-TensorFlow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
google-deepmind/dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.
jaromiru/AI-blog
Accompanying repository for Let's make a DQN / A3C series.
cyoon1729/deep-Q-networks
Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
THINK989/Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN
Kaixhin/NoisyNet-A3C
Noisy Networks for Exploration
CrazyBoyM/llama2-Chinese-chat
The first Chinese llama2 13b model (Base + Chinese dialogue SFT, enabling fluent multi-turn natural-language interaction between human and machine)
LuEE-C/PPO-Keras
My implementation of the Proximal Policy Optimization algorithm using Keras as a backend
cocolico14/N-step-Dueling-DDQN-PER-Pacman
Using N-step dueling DDQN with PER to play the Pacman game
Jannik0/RUG_ReinforcementLearning