yunpeng-ma

Karlstad UniversitySweden

Pinned Repositories

agent
Interpretability dashboard for reinforcement learners
Language:Python0 0 00
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language:Python0 0 00
CAGrad
Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)
Language:Python0 0 00
cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
Language:Jupyter Notebook0 0 00
codellama
Inference code for CodeLlama models
Language:Python0 0 00
DD2424--KTH-assignment
Language:MATLAB0 2 00
gans
Generative Adversarial Networks implemented in PyTorch and Tensorflow
Language:Jupyter Notebook0 0 00
gans-awesome-applications
Curated list of awesome GAN applications and demo
0 0 00
hiro_ant_maze
Language:Python1 1 00
keras-io
Keras documentation, hosted live at keras.io
Language:Jupyter Notebook0 0 00

yunpeng-ma's Repositories

yunpeng-ma/hiro_ant_maze
Language:Python1 1 00
yunpeng-ma/agent
Interpretability dashboard for reinforcement learners
Language:Python0 0 00
yunpeng-ma/agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language:Python0 0 00
yunpeng-ma/CAGrad
Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)
Language:Python0 0 00
yunpeng-ma/cleverhans
An adversarial example library for constructing attacks, building defenses, and benchmarking both
Language:Jupyter Notebook0 0 00
yunpeng-ma/codellama
Inference code for CodeLlama models
Language:Python0 0 00
yunpeng-ma/DD2424--KTH-assignment
Language:MATLAB0 2 00
yunpeng-ma/gans
Generative Adversarial Networks implemented in PyTorch and Tensorflow
Language:Jupyter Notebook0 0 00
yunpeng-ma/gans-awesome-applications
Curated list of awesome GAN applications and demo
0 0 00
yunpeng-ma/keras-io
Keras documentation, hosted live at keras.io
Language:Jupyter Notebook0 0 00
yunpeng-ma/keras-rl
Deep Reinforcement Learning for Keras.
Language:Python0 0 00
yunpeng-ma/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Language:Python0 0 00
yunpeng-ma/kth_id2223
Language:Jupyter Notebook1 0
yunpeng-ma/ml-agents
Unity Machine Learning Agents Toolkit
Language:C#0 0
yunpeng-ma/ML_Andrew_Ng
Machine Learning by Andrew Ng, Stanford University
Language:Jupyter Notebook1 0
yunpeng-ma/planet
Learning Latent Dynamics for Planning from Pixels
Language:Python0 0
yunpeng-ma/Research
Research repository
Language:Python0 0
yunpeng-ma/roboschool
DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.
Language:Python0 0
yunpeng-ma/sac
Soft Actor-Critic
Language:Python0 0
yunpeng-ma/safe_rl
Language:Python0 0
yunpeng-ma/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
Language:Python0 0
yunpeng-ma/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language:Python0 0
yunpeng-ma/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python0 0
yunpeng-ma/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language:Python0 0
yunpeng-ma/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python0 0
yunpeng-ma/TEAC
trust entropy actor critic (ICLR2021 submission)
Language:Python0 0
yunpeng-ma/TensorFlow-Machine-Learning-Cookbook
Code repository for TensorFlow Machine Learning Cookbook by Packt
Language:Python0 0
yunpeng-ma/Troubleshooting_and_debugging_techniques
Coursera course of google for troubleshooting and debugging
Language:Python1 0