Pinned Repositories
A2BCD
Code for the numerical test in the paper
ARock
Asynchronous parallel coordinate update algorithms (ARock)
AsyncQVI
A light c++11 package for three reinforcement learning algorithms.
bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
ENIAC
code for paper: Provably Correct Optimization and Exploration with Non-linear Policies
IPSVRG
StateDecoding
Reinforcement Learning via Latent State Decoding
AsyncQVI
A light C++11 package for three reinforcement learning algorithms.
IPSVRG
A light MATLAB package for acceleration of SVRG and Katyusha X by inexact preconditioning.
ARock
Experiments code for asynchronous parallel coordinate update algorithms (ARock)
FlorenceFeng's Repositories
FlorenceFeng/StateDecoding
Reinforcement Learning via Latent State Decoding
FlorenceFeng/ENIAC
code for paper: Provably Correct Optimization and Exploration with Non-linear Policies
FlorenceFeng/A2BCD
Code for the numerical test in the paper
FlorenceFeng/ARock
Asynchronous parallel coordinate update algorithms (ARock)
FlorenceFeng/AsyncQVI
A light c++11 package for three reinforcement learning algorithms.
FlorenceFeng/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
FlorenceFeng/IPSVRG
FlorenceFeng/FlorenceFeng.github.io
FlorenceFeng/Grid2Op
Grid2Op a testbed platform to model sequential decision making in power systems.
FlorenceFeng/gym
A toolkit for developing and comparing reinforcement learning algorithms.
FlorenceFeng/L2RPN_NIPS_2020_a_PPO_Solution
This is a solution for L2RPN_NIPS_Competitions_2020, and it wins 2nd prize.
FlorenceFeng/MATLAB
Resources for using C++ with MATLAB
FlorenceFeng/micoso-solver
Educational mixed-integer cone solver
FlorenceFeng/NeurIPS_2020_L2RPN_Comp_An_Approach
The implementation of NeurIPS_2020_L2RPN_Track1(Robustness) and Track2 (Adaptability) Competition
FlorenceFeng/PARL
A high-performance distributed training framework for Reinforcement Learning
FlorenceFeng/ProxSDP.jl
Semidefinite programming optimization solver
FlorenceFeng/TMAC
TMAC: A Toolbox of Modern Async-Parallel, Coordinate, Splitting, and Stochastic Methods