Pinned Repositories
aes-rl
alfworld-v2
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Brainstorm-Optimization
A BSO algorithm
continual_world_v2
CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
deep-head-pose-lite
A lite version of Hopenet for head pose estimation with PyTorch
EC-CMAES
A demo version of the Covariance Matrix Adaptation Evolution Strategy (CMAES), provided mainly for educational purposes: reading, understanding, and running basic experiments
Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
P3
Official code for the paper: Pareto Policy Pool for Model-based Offline Reinforcement Learning
songs_album
Mainly records
stevenyangyj's Repositories
stevenyangyj/deep-head-pose-lite
A lite version of Hopenet for head pose estimation with PyTorch
stevenyangyj/Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
stevenyangyj/CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
stevenyangyj/P3
Official code for the paper: Pareto Policy Pool for Model-based Offline Reinforcement Learning
stevenyangyj/Brainstorm-Optimization
A BSO algorithm
stevenyangyj/continual_world_v2
stevenyangyj/songs_album
Mainly records
stevenyangyj/aes-rl
stevenyangyj/alfworld-v2
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
stevenyangyj/ARS
An implementation of the Augmented Random Search algorithm
stevenyangyj/CPPN-WGAN
Generative Art Experiments
stevenyangyj/DeepCF
DeepCF: A Unified Framework of Representation Learning and Matching Function Learning in Recommender System
stevenyangyj/EMO-2019
The code, data, and figures for a paper submitted to EMO-2019
stevenyangyj/FaceDetection-DSFD
stevenyangyj/imp
Infinite mixture prototypes for few-shot learning
stevenyangyj/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
stevenyangyj/MADRL
Repo containing code for multi-agent deep reinforcement learning (MADRL).
stevenyangyj/Maze-Solving-using-A-star-algorithm
In this repository, I have built a maze-solving system. The system takes an image of a maze from a camera and converts it into a grid. It then lets you find the shortest path between any two points.
stevenyangyj/metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
stevenyangyj/pg-is-all-you-need
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
stevenyangyj/practicalAI
📚A practical approach to learning and using machine learning.
stevenyangyj/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
stevenyangyj/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
stevenyangyj/RLlearning
Exercise code and implementations of some algorithms from the textbook "Reinforcement Learning: An Introduction"
stevenyangyj/SMTO
Code for stochastic multimodal trajectory optimization (SMTO)
stevenyangyj/som
PyTorch implementation of a Self-Organizing Map
stevenyangyj/stevenyangyj.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
stevenyangyj/svo-intersection
Simulation and code for a socially-compliant intersection manager that coordinates human and autonomous vehicles on the road.
stevenyangyj/swarmtools
stevenyangyj/Test-Functions-for-Optimization
A test function set for optimization in Python 3.x