lushleaf's Stars
labuladong/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
IBM/FedMA
Code for Federated Learning with Matched Averaging, ICLR 2020.
a1600012888/YOPO-You-Only-Propagate-Once
Code for our nips19 paper: You Only Propagate Once: Accelerating Adversarial Training Via Maximal Principle
Cranial-XIX/CAGrad
Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)
dbmptr/EPOSearch
Exact Pareto Optimal solutions for preference based Multi-Objective Optimization
Jason-CKY/lunar_lander_DQN
Pytorch implementation of DQN on openai's lunar lander environment
vsaveris/lunar-lander-DQN
Implementation of a Deep Reinforcement Learning agent (Deep Q-Network) for landing successfully the ‘Lunar Lander’ from the OpenAI Gym.