stevenyangyj

stevenyangyj.github.io

University of Technology Sydney
Sydney, Australia; Shenzhen, China

Pinned Repositories

aes-rl
Language: Python
0 1 00
alfworld-v2
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Language: Python
0 0 00
Brainstorm-Optimization
A BSO algorithm
Language: Python
3 1 02
continual_world_v2
Language: Python
1 0 00
CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Language: Python
14 2 11
deep-head-pose-lite
A lite version of Hopenet for head pose estimation in PyTorch
Language: Python
187 2 1641
EC-CMAES
A demo version of the Covariance Matrix Adaptation Evolution Strategy (CMA-ES), provided mainly for educational purposes: reading, understanding, and running basic experiments
Language: Python
1 1 00
Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Language: Python
51 4 90
P3
Official code for the paper: Pareto Policy Pool for Model-based Offline Reinforcement Learning
Language: Python
6 2 00
songs_album
Mostly music albums
1 1 01

stevenyangyj's Repositories

stevenyangyj/deep-head-pose-lite
A lite version of Hopenet for head pose estimation in PyTorch
Language: Python
187 2 1641
stevenyangyj/Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Language: Python
51 4 90
stevenyangyj/CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Language: Python
14 2 11
stevenyangyj/P3
Official code for the paper: Pareto Policy Pool for Model-based Offline Reinforcement Learning
Language: Python
6 2 00
stevenyangyj/Brainstorm-Optimization
A BSO algorithm
Language: Python
3 1 02
stevenyangyj/continual_world_v2
Language: Python
1 0 00
stevenyangyj/songs_album
Mostly music albums
1 1 01
stevenyangyj/aes-rl
Language: Python
0 1 00
stevenyangyj/alfworld-v2
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Language: Python
0 0 00
stevenyangyj/ARS
An implementation of the Augmented Random Search algorithm
Language: Python
1 0
stevenyangyj/CPPN-WGAN
Generative Art Experiments
Language: Python
1 0
stevenyangyj/DeepCF
DeepCF: A Unified Framework of Representation Learning and Matching Function Learning in Recommender System
Language: Python
0 0
stevenyangyj/EMO-2019
Code, data, and figures for the paper submitted to EMO-2019
Language: Python
1 0
stevenyangyj/FaceDetection-DSFD
Language: Jupyter Notebook
1 0
stevenyangyj/imp
infinite mixture prototypes for few-shot learning
Language: Python
1 0
stevenyangyj/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language: Jupyter Notebook
0 0
stevenyangyj/MADRL
Repo containing code for multi-agent deep reinforcement learning (MADRL).
stevenyangyj/Maze-Solving-using-A-star-algorithm
In this repository, I have made a maze-solving system. The system takes an image of a maze as input from a camera. The image is converted into a grid, on which the system finds the shortest path between any two points.
Language: Python
1 0
stevenyangyj/metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
Language: Python
0 0
stevenyangyj/pg-is-all-you-need
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
Language: Jupyter Notebook
1 0
stevenyangyj/practicalAI
📚A practical approach to learning and using machine learning.
Language: Jupyter Notebook
1 0
stevenyangyj/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python
1 0
stevenyangyj/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language: Python
1 0
stevenyangyj/RLlearning
Exercise code and some algorithm implementations for the textbook "Reinforcement Learning: An Introduction"
Language: Python
1 0
stevenyangyj/SMTO
Code for stochastic multimodal trajectory optimization (SMTO)
Language: Python
1 0
stevenyangyj/som
PyTorch implementation of a Self-Organizing Map
Language: Python
1 0
stevenyangyj/stevenyangyj.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language: HTML
stevenyangyj/svo-intersection
Simulation and code for a socially-compliant intersection manager that coordinates human and autonomous vehicles on the road.
stevenyangyj/swarmtools
Language: Python
2 0
stevenyangyj/Test-Functions-for-Optimization
A test function set for optimization in Python 3.x
Language: Python
1 01