TianhongDai

@ShadowFiendTeam United Kingdom

Pinned Repositories

AAGPT
AAGPT is another experimental open-source application showcasing the capabilities of large language models, such as GPT-3.5 and GPT-4.
Language:Python154 18 022
distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
Language:Python59 2 213
div-hindsight
This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Relay" [PRICAI 2021].
Language:Python10 2 12
google-football-pytorch
It's the pytorch implementation of google research football.
Language:Python38 1 011
hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Language:Python379 7 2777
integrated-gradient-pytorch
This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.
Language:Python178 5 326
metaworld-sac
Language:Python10 1 02
mosse-object-tracking
This is the implementation of MOSSE tracking algorithm (correlation filter based).
Language:Python116 3 440
reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Language:Python649 15 10105
self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
Language:Python64 2 013

TianhongDai's Repositories

TianhongDai/reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Language:Python649 15 10105
TianhongDai/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Language:Python379 7 2777
TianhongDai/integrated-gradient-pytorch
This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.
Language:Python178 5 326
TianhongDai/mosse-object-tracking
This is the implementation of MOSSE tracking algorithm (correlation filter based).
Language:Python116 3 440
TianhongDai/google-football-pytorch
It's the pytorch implementation of google research football.
Language:Python38 1 011
TianhongDai/div-hindsight
This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Relay" [PRICAI 2021].
Language:Python10 2 12
TianhongDai/metaworld-sac
Language:Python10 1 02
TianhongDai/esil-hindsight
This is the official code of our paper "Episodic Self-Imitation Learning with Hindsight" [Electronics 2020].
Language:Python7 2 52
TianhongDai/deep-hdr-baselines
Language:Python6 2 01
TianhongDai/react2-code
This is the official code of our paper "Machine Learning to Support Visual Auditing of Home-based Lateral Flow Immunoassay Self-Test Results for SARS-CoV-2 Antibodies" [Communications Medicine 2022].
Language:Python6 2 02
TianhongDai/wavelet-hdr
This is the official code for our paper "Wavelet-Based Network For High Dynamic Range Imaging" [CVIU 2023].
Language:Python6 1 1
TianhongDai/dockerfiles
It contains the dockerfiles for the purpose of machine learning / deep learning research.
Language:Dockerfile3 1 00
TianhongDai/daim-rl
This is the official code of our paper "Diversity-Augmented Intrinsic Motivation for Deep Reinforcement Learning" [Neurocomputing 2021].
Language:Python1 2 01
TianhongDai/abdn-hpc
University of Aberdeen HPC Cluster User Guides
TianhongDai/completor.vim
Async completion framework made ease.
Language:Python1 0
TianhongDai/dhSegment-torch
dhSegment on pytorch
Language:Python1 0
TianhongDai/dm_control
The DM Control Suite and Package is a tool for developing and testing reinforcement learning agents for the MuJoCo physics engine.
Language:Python2 0
TianhongDai/domain-rand-interp
This is the official code of our paper "Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation" [Neurocomputing 2022].
3 0
TianhongDai/flow
Computational framework for reinforcement learning in traffic control
Language:Python2 0
TianhongDai/garage
A toolkit for reproducible reinforcement learning research.
Language:Python1 0
TianhongDai/JC3001-Tutorial
1 0
TianhongDai/LevDoom
Platform for training generalizable deep reinforcement learning agents
Language:Python1 0
TianhongDai/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Language:Cython1 0
TianhongDai/PSAMNet
Language:Python1 0
TianhongDai/PyRep
A toolkit for robot learning research.
Language:Python2 0
TianhongDai/ray
A fast and simple framework for building and running distributed applications.
Language:Python2 0
TianhongDai/RLBench
A large-scale benchmark and learning environment.
Language:Python2 0
TianhongDai/tianhongdai.github.io
Personal Website
Language:HTML2 0
TianhongDai/time-series-forecasting
Language:Jupyter Notebook
TianhongDai/vim-profile
save my vimrc
1 0