helpingstar

helpingstar's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python71k 576 08.4k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.2k 286 422.3k
google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language:Jupyter Notebook10.6k 425 1691.4k
moskomule/senet.pytorch
PyTorch implementation of SENet
Language:Python2.3k 16 0441
microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
Language:C++1.2k 38 162336
pfnet/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language:Python1.2k 91 75157
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language:Python1.1k 26 36189
gorisanson/pikachu-volleyball
Pikachu Volleyball implemented into JavaScript by reverse engineering the original game
Language:JavaScript980 10 10115
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python716 11 2260
PINTO0309/onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
Language:Python700 10 25774
RobertTLange/gymnax
RL Environments in JAX 🌍
Language:Python644 10 5562
instadeepai/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
Language:Python618 12 10380
google-deepmind/distrax
Language:Python535 18 4532
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python479 3 1020
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language:Python433 12 2091
CausalInferenceLab/Causal-Inference-with-Python
Causal Inference for The Brave and True 책의 한국어 번역 자료입니다.
Language:Jupyter Notebook423 9 1078
elodin-sys/elodin
Physics simulation software for space + aerospace
Language:Rust419 6 613
EdanToledo/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
Language:Python230 6 3124
instadeepai/flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
Language:Python210 13 1111
microsoft/onnxruntime-web-demo
demos to show the capabilities of ONNX Runtime Web
Language:TypeScript173 27 1341
keraJLi/rejax
Language:Python147 4 107
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Language:Python139 2 322
rust-kr/doc.rust-kr.org
The Rust Programming Language 한국어 번역
Language:Rust136 4 545
tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Language:Python129 3 1112
microsoft/onnxruntime-nextjs-template
Language:TypeScript101 3 723
PufferAI/pokegym
Gymnasium environment for Pokemon Red
Language:Python34 0 013
MyNameIsArko/RL-Flax
Various reinforcement learning algorithms written in Jax + Flax
Language:Python22 4 00
gebob19/rl_with_jax
clear single-file JAX implementations of common RL algorithms
Language:Python14 1 01
gebob19/natural-policy-gradient-reinforcement-learning
code for Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning
Language:Python4 1 00
edd26/pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
Language:Python2 0 00