helpingstar's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
moskomule/senet.pytorch
PyTorch implementation of SENet
microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
pfnet/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
gorisanson/pikachu-volleyball
Pikachu Volleyball implemented into JavaScript by reverse engineering the original game
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
RobertTLange/gymnax
RL Environments in JAX 🌍
instadeepai/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
google-deepmind/distrax
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
CausalInferenceLab/Causal-Inference-with-Python
Causal Inference for The Brave and True 책의 한국어 번역 자료입니다.
elodin-sys/elodin
Physics simulation software for space + aerospace
EdanToledo/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
instadeepai/flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
microsoft/onnxruntime-web-demo
demos to show the capabilities of ONNX Runtime Web
keraJLi/rejax
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
rust-kr/doc.rust-kr.org
The Rust Programming Language 한국어 번역
tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
microsoft/onnxruntime-nextjs-template
PufferAI/pokegym
Gymnasium environment for Pokemon Red
MyNameIsArko/RL-Flax
Various reinforcement learning algorithms written in Jax + Flax
gebob19/rl_with_jax
clear single-file JAX implementations of common RL algorithms
gebob19/natural-policy-gradient-reinforcement-learning
code for Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning
edd26/pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing