Pinned Repositories
cdrl
Collaborative Deep Reinforcement Learning
rpg
Ranking Policy Gradient
Simulator
Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning
async-rl
Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"
async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning
FISTA_exercise
iMTL
interactive Multi-Task Learning
MTIL-example
Multi-Task Feature Interaction Learning -- code example.
MTLayerNeuralNet-python
KaixiangLin's Repositories
KaixiangLin/federated-learning
KaixiangLin/baselines-results
KaixiangLin/cdrl
Collaborative Deep Reinforcement Learning
KaixiangLin/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
KaixiangLin/cpo
Constrained Policy Optimization
KaixiangLin/cs229t
Statistical Learning Theory (CS229T) Lecture Notes
KaixiangLin/DeepRLHacks
Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)
KaixiangLin/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
KaixiangLin/EvalAI-Starters
How to create a challenge on EvalAI?
KaixiangLin/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
KaixiangLin/gym-maze
A customizable gym environment for maze/gridworld
KaixiangLin/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
KaixiangLin/KaixiangLin
Config files for my GitHub profile.
KaixiangLin/kaixianglin.github.io
Personal website.
KaixiangLin/lantaoyu.github.io
Github Pages template for academic personal websites
KaixiangLin/luminous
KaixiangLin/magnet
MAGNet: Multi-agents control using Graph Neural Networks
KaixiangLin/mend
MEND: Fast Model Editing at Scale
KaixiangLin/ml-agents
Unity Machine Learning Agents Toolkit
KaixiangLin/moca
MOCA (Modular Object-Centric Approach) addresses the task of long horizon instruction following with a modular architecture that decouples a task into visual perception and action policy prediction.
KaixiangLin/NeuralDialog-ZSDG
PyTorch codebase for zero-shot dialog generation, It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
KaixiangLin/paper-notes
Random notes on papers, likely a short-term repo.
KaixiangLin/playground
PlayGround: AI Research into Multi-Agent Learning.
KaixiangLin/project-DRL16
Course project, deep reinforcement learning, open AI gym
KaixiangLin/TC-Bot
User Simulation for Task-Completion Dialogues
KaixiangLin/trl
Train transformer language models with reinforcement learning.
KaixiangLin/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
KaixiangLin/universe-starter-agent
A starter agent that can solve a number of universe environments.
KaixiangLin/USTC-Course
:heart:**科学技术大学课程资源
KaixiangLin/virtualhome_unity
Source Code for VirtualHome environment