idthanm

I am currently a Ph.D. candidate at Tsinghua University, Beijing, China. I am working on advanced technologies in autonomous driving and reinforcement learning.

Tsinghua Univ.Beijing, China

Pinned Repositories

admm_adp
Language:Python1 1 00
baselines_sil
Language:Python2 1 00
baselines_toyota_2018
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python2 0 00
env_build
The repo develops a general and extensible RL environment for large-scale autonomous driving tasks.
Language:Python47 1 421
exp4dirl
Language:Python2 1 00
h-DDPG
Language:Python2 1 00
idthanm.github.io
Language:HTML2 1 00
mpg
MPG is originated from the paper "Mixed policy gradient", which also contains a cluster of high-quality implementations of deep reinforcement learning algorithms.
Language:Python22 1 06
ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python39.1k 487 21.2k6.8k
experiment_driving
Real Vehicle Experiment for Integrated decision and control framwork
Language:Python6 1 02

idthanm's Repositories

idthanm/env_build
The repo develops a general and extensible RL environment for large-scale autonomous driving tasks.
Language:Python47 1 421
idthanm/mpg
MPG is originated from the paper "Mixed policy gradient", which also contains a cluster of high-quality implementations of deep reinforcement learning algorithms.
Language:Python22 1 06
idthanm/baselines_sil
Language:Python2 1 00
idthanm/baselines_toyota_2018
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python2 0 00
idthanm/exp4dirl
Language:Python2 1 00
idthanm/h-DDPG
Language:Python2 1 00
idthanm/idthanm.github.io
Language:HTML2 1 00
idthanm/admm_adp
Language:Python1 1 00
idthanm/apex_sac
Language:Python1 1 0
idthanm/baselines4dsac
Language:Python1 1 0
idthanm/dm_control
The DeepMind Control Suite and Package
Language:Python1 0 0
idthanm/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language:Jupyter Notebook1 0 0
idthanm/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
Language:Python1 0 0
idthanm/dreamerv2
Mastering Atari with Discrete World Models
Language:Python1 0 0
idthanm/GamestonkTerminal
The next best thing after Bloomberg Terminal
Language:Python1 0 0
idthanm/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Language:Python1 0 0
idthanm/Intersections
For toyota project
Language:Python1 0 0
idthanm/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Language:Python1 0 0
idthanm/mbrl-lib
Library for Model Based RL
Language:Python1 0 0
idthanm/obstacle-tower-challenge
Starter Kit for the Unity Obstacle Tower challenge
Language:Python1 0 0
idthanm/obstacle-tower-env
Obstacle Tower Environment
Language:Python1 0 0
idthanm/otc
used for the competition
Language:Jupyter Notebook1 1 0
idthanm/ray
A fast and simple framework for building and running distributed applications.
Language:Python1 0 0
idthanm/TRARS
Language:Python1 1 0

idthanm

Pinned Repositories

admm_adp

baselines_sil

baselines_toyota_2018

env_build

exp4dirl

h-DDPG

idthanm.github.io

mpg

ray

experiment_driving

idthanm's Repositories

idthanm/env_build

idthanm/mpg

idthanm/baselines_sil

idthanm/baselines_toyota_2018

idthanm/exp4dirl

idthanm/h-DDPG

idthanm/idthanm.github.io

idthanm/admm_adp

idthanm/apex_sac

idthanm/baselines4dsac

idthanm/dm_control

idthanm/dopamine

idthanm/dreamer

idthanm/dreamerv2

idthanm/GamestonkTerminal

idthanm/handful-of-trials

idthanm/Intersections

idthanm/mbpo

idthanm/mbrl-lib

idthanm/obstacle-tower-challenge

idthanm/obstacle-tower-env

idthanm/otc

idthanm/ray

idthanm/TRARS