Mamba413's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
coder/code-server
VS Code in the browser
psf/black
The uncompromising Python code formatter
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
soumith/ganhacks
starter from "How to Train a GAN?" at NIPS2016
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
hughjonesd/ggmagnify
Create a magnified inset of part of a ggplot object
yihaosun1124/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
sgkit-dev/sgkit
Scalable genetics toolkit
sagelywizard/pytorch-mdn
Mixture Density Networks for PyTorch
tomwenseleers/export
R package for streamlined export of graphs and data tables.
microsoft/mimic_sepsis
Sepsis cohort from MIMIC dataset
young-geng/JaxCQL
Conservative Q learning in Jax
ryanxhr/IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
callmespring/MDPOD
Simulation of Ridesharing Market and the MDP Order Dispatch Policy
BioAlgs/GM
Gaussian Mirror R Package
jaydu1/SparsePortfolio
High Dimensional Portfolio Selection with Cardinality Constraints
Mamba413/ROOM
Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
Tan-jianbin/Statistical-Inference-in-General-order-Dynamic-Systems
brtang63/A-Splicing-Algorithm-for-Best-Subset-Selection-in-Sliced-Inverse-Regression
Jianbin-Tan/GFPCA
Mamba413/msi-interpretability
Mamba413/Nonparametric-Statistical-Inference-via-Metric-Distribution-Function-in-Metric-Spaces
Reproducible materials for Nonparametric Statistical Inference via Metric Distribution Function in Metric Spaces (JASA, 2023+)
bbayukari/ScopeCpp
a C++ implement of SCOPE algorithm