Pinned Repositories
AAGPT
AAGPT is another experimental open-source application showcasing the capabilities of large language models, such as GPT-3.5 and GPT-4.
distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
div-hindsight
This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Relay" [PRICAI 2021].
google-football-pytorch
It's the pytorch implementation of google research football.
hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
integrated-gradient-pytorch
This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.
metaworld-sac
mosse-object-tracking
This is the implementation of MOSSE tracking algorithm (correlation filter based).
reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
TianhongDai's Repositories
TianhongDai/reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
TianhongDai/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
TianhongDai/integrated-gradient-pytorch
This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.
TianhongDai/mosse-object-tracking
This is the implementation of MOSSE tracking algorithm (correlation filter based).
TianhongDai/google-football-pytorch
It's the pytorch implementation of google research football.
TianhongDai/div-hindsight
This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Relay" [PRICAI 2021].
TianhongDai/metaworld-sac
TianhongDai/esil-hindsight
This is the official code of our paper "Episodic Self-Imitation Learning with Hindsight" [Electronics 2020].
TianhongDai/deep-hdr-baselines
TianhongDai/react2-code
This is the official code of our paper "Machine Learning to Support Visual Auditing of Home-based Lateral Flow Immunoassay Self-Test Results for SARS-CoV-2 Antibodies" [Communications Medicine 2022].
TianhongDai/wavelet-hdr
This is the official code for our paper "Wavelet-Based Network For High Dynamic Range Imaging" [CVIU 2023].
TianhongDai/dockerfiles
It contains the dockerfiles for the purpose of machine learning / deep learning research.
TianhongDai/daim-rl
This is the official code of our paper "Diversity-Augmented Intrinsic Motivation for Deep Reinforcement Learning" [Neurocomputing 2021].
TianhongDai/abdn-hpc
University of Aberdeen HPC Cluster User Guides
TianhongDai/completor.vim
Async completion framework made ease.
TianhongDai/dhSegment-torch
dhSegment on pytorch
TianhongDai/dm_control
The DM Control Suite and Package is a tool for developing and testing reinforcement learning agents for the MuJoCo physics engine.
TianhongDai/domain-rand-interp
This is the official code of our paper "Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation" [Neurocomputing 2022].
TianhongDai/flow
Computational framework for reinforcement learning in traffic control
TianhongDai/garage
A toolkit for reproducible reinforcement learning research.
TianhongDai/JC3001-Tutorial
TianhongDai/LevDoom
Platform for training generalizable deep reinforcement learning agents
TianhongDai/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
TianhongDai/PSAMNet
TianhongDai/PyRep
A toolkit for robot learning research.
TianhongDai/ray
A fast and simple framework for building and running distributed applications.
TianhongDai/RLBench
A large-scale benchmark and learning environment.
TianhongDai/tianhongdai.github.io
Personal Website
TianhongDai/time-series-forecasting
TianhongDai/vim-profile
save my vimrc