Pinned Repositories
a3c
APL0
apv
ARS
An implementation of the Augmented Random Search algorithm
ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
Awesome-Multi-Modal-Reinforcement-Learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
bcline
blog
My blog, written with Tornado + AppEngine
blog-1
A simple Google App Engine blog
pso-ga-cnn
zhan0903's Repositories
zhan0903/APL0
zhan0903/Awesome-Multi-Modal-Reinforcement-Learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
zhan0903/apv
zhan0903/ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
zhan0903/CLIP
Contrastive Language-Image Pretraining
zhan0903/CSPS
zhan0903/d4pg-pytorch
PyTorch implementation of Distributed Distributional Deterministic Policy Gradients (https://arxiv.org/abs/1804.08617)
zhan0903/d4rl
A benchmark for offline reinforcement learning.
zhan0903/dify
Dify is an open-source LLM app development platform. It has the core tech required to build AI-native apps, including RAG, agent capabilities, model management, observability and more, packaged into one intuitive interface.
zhan0903/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
zhan0903/GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
zhan0903/habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
zhan0903/ICQ
Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight, https://arxiv.org/abs/2106.03400)
zhan0903/lifelong_rl
PyTorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Reset-Free Lifelong Learning with Skill-Space Planning.
zhan0903/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
zhan0903/MineCLIP
Foundation Model for MineDojo
zhan0903/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
zhan0903/mjrl
Reinforcement learning algorithms for MuJoCo tasks
zhan0903/mopo
Code for MOPO: Model-based Offline Policy Optimization
zhan0903/mvp
Masked Visual Pre-training for Motor Control
zhan0903/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
zhan0903/Off2OnRL
zhan0903/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
zhan0903/reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
zhan0903/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
zhan0903/rlkit
Collection of reinforcement learning algorithms
zhan0903/smac
SMAC: The StarCraft Multi-Agent Challenge
zhan0903/SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
zhan0903/train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
zhan0903/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.