Pinned Repositories
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
eval_gen
Evaluation code with models for the paper "On the Quantitative Analysis of Decoder-Based Generative Models"
flax
Flax is a neural network ecosystem for JAX that is designed for flexibility.
gmmn
Generative moment matching networks
jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
LIME
Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
python-ggp
General Game Playing (GGP) framework in Python
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
ray
A system for parallel and distributed Python that unifies the ML ecosystem.
tonywu95's Repositories
tonywu95/eval_gen
Evaluation code with models for the paper "On the Quantitative Analysis of Decoder-Based Generative Models"
tonywu95/LIME
Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
tonywu95/adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
tonywu95/gmmn
Generative moment matching networks
tonywu95/python-ggp
General Game Playing (GGP) framework in Python
tonywu95/ray
A system for parallel and distributed Python that unifies the ML ecosystem.
tonywu95/flax
Flax is a neural network ecosystem for JAX that is designed for flexibility.
tonywu95/jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
tonywu95/LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
tonywu95/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
tonywu95/trax
Trax — Deep Learning with Clear Code and Speed