Pinned Repositories
agent
Interpretability dashboard for reinforcement learners
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
CAGrad
Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)
cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
codellama
Inference code for CodeLlama models
DD2424--KTH-assignment
gans
Generative Adversarial Networks implemented in PyTorch and TensorFlow
gans-awesome-applications
Curated list of awesome GAN applications and demos
hiro_ant_maze
keras-io
Keras documentation, hosted live at keras.io
yunpeng-ma's Repositories
yunpeng-ma/hiro_ant_maze
yunpeng-ma/agent
Interpretability dashboard for reinforcement learners
yunpeng-ma/agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
yunpeng-ma/CAGrad
Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)
yunpeng-ma/cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
yunpeng-ma/codellama
Inference code for CodeLlama models
yunpeng-ma/DD2424--KTH-assignment
yunpeng-ma/gans
Generative Adversarial Networks implemented in PyTorch and TensorFlow
yunpeng-ma/gans-awesome-applications
Curated list of awesome GAN applications and demos
yunpeng-ma/keras-io
Keras documentation, hosted live at keras.io
yunpeng-ma/keras-rl
Deep Reinforcement Learning for Keras.
yunpeng-ma/minimalRL
Implementations of basic RL algorithms with minimal lines of code (PyTorch-based)
yunpeng-ma/kth_id2223
yunpeng-ma/ml-agents
Unity Machine Learning Agents Toolkit
yunpeng-ma/ML_Andrew_Ng
Machine Learning by Andrew Ng, Stanford University
yunpeng-ma/planet
Learning Latent Dynamics for Planning from Pixels
yunpeng-ma/Research
Research repository
yunpeng-ma/roboschool
DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.
yunpeng-ma/sac
Soft Actor-Critic
yunpeng-ma/safe_rl
yunpeng-ma/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
yunpeng-ma/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
yunpeng-ma/spinningup
An educational resource to help anyone learn deep reinforcement learning.
yunpeng-ma/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
yunpeng-ma/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
yunpeng-ma/TEAC
Trust Entropy Actor-Critic (ICLR 2021 submission)
yunpeng-ma/TensorFlow-Machine-Learning-Cookbook
Code repository for TensorFlow Machine Learning Cookbook by Packt
yunpeng-ma/Troubleshooting_and_debugging_techniques
Google's Coursera course on troubleshooting and debugging