Pinned Repositories
A-Guide-to-DeepMinds-StarCraft-AI-Environment
This is the code for "A Guide to DeepMind's StarCraft AI Environment" by Siraj Raval on Youtube
AirSim
Open source simulator based on Unreal Engine for autonomous vehicles from Microsoft AI & Research
Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
apollo
An open autonomous driving platform
Apollo-11
Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.
Arnold
Arnold - DOOM Agent
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
baselines-results
DRL_P3K
gluon-tutorials-zh
通过MXNet/Gluon来动手学习深度学习
HongdaZhang's Repositories
HongdaZhang/datasets
A collection of datasets of ML problem solving
HongdaZhang/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
HongdaZhang/Diffusion_RL
This repo has the code and suplementary materials of our 2024 RAL submission.
HongdaZhang/eat_tensorflow2_in_30_days
Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋
HongdaZhang/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
HongdaZhang/formation
ROS package for formation and rendezvous of multi-drone (T-Cyber 2020)
HongdaZhang/homework
Assignments for CS294-112.
HongdaZhang/KaTeX
Fast math typesetting for the web.
HongdaZhang/MaCA
HongdaZhang/MADDPG_torch
The code for maddpg using pytorch
HongdaZhang/MAgent
A Platform for Many-agent Reinforcement Learning
HongdaZhang/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
HongdaZhang/Multi-Agent-Deep-Deterministic-Policy-Gradients
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
HongdaZhang/Multi-Agent-Reinforcement-Learning
PyTorch implementations of MADDPG, MAPPO (coming)
HongdaZhang/nmea_navsat_driver
ROS package containing drivers for NMEA devices that can output satellite navigation data (e.g. GPS or GLONASS).
HongdaZhang/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
HongdaZhang/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
HongdaZhang/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
HongdaZhang/orbbec_competition
第四届3DV创新应用竞赛
HongdaZhang/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
HongdaZhang/Python
All Algorithms implemented in Python
HongdaZhang/ray
A fast and simple framework for building and running distributed applications.
HongdaZhang/rl-book
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
HongdaZhang/ROS-ENKI_robot_simulation
A framework for the development of new closed-loop AI algorithms
HongdaZhang/smac
SMAC: The StarCraft Multi-Agent Challenge
HongdaZhang/spinningup
An educational resource to help anyone learn deep reinforcement learning.
HongdaZhang/StarCraft
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
HongdaZhang/tensorflow_study
tensorflow学习代码
HongdaZhang/tvt
HongdaZhang/WorldModels
An implementation of the ideas from this paper https://arxiv.org/pdf/1803.10122.pdf