zhan0903

Pinned Repositories

a3c
Language:Python0 1 00
APL0
Language:Python2 1 00
apv
Language:Python0 0 00
ARS
An implementation of the Augmented Random Search algorithm
Language:Python0 1 00
ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
Language:Python0 0 00
Awesome-Multi-Modal-Reinforcement-Learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
1 1 00
bcline
Language:C0 1 00
blog
My blog, written with Tornado + AppEngine
Language:Python00
blog-1
A simple Google App Engine blog
Language:Python0 2 00
pso-ga-cnn
Language:Python50

zhan0903's Repositories

zhan0903/APL0
Language:Python2 1 00
zhan0903/Awesome-Multi-Modal-Reinforcement-Learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
1 1 00
zhan0903/apv
Language:Python0 0 00
zhan0903/ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
Language:Python0 0 00
zhan0903/CLIP
Contrastive Language-Image Pretraining
Language:Jupyter Notebook
zhan0903/CSPS
Language:Python1 0
zhan0903/d4pg-pytorch
PyTorch implementation of Distributed Distributional Deterministic Policy Gradients (https://arxiv.org/abs/1804.08617)
Language:Python0 0
zhan0903/d4rl
A benchmark for offline reinforcement learning.
Language:Python0 0
zhan0903/dify
Dify is an open-source LLM app development platform. It has the core tech required to build AI-native apps, including RAG, agent capabilities, model management, observability and more, packaged into one intuitive interface.
Language:Python0 0
zhan0903/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
zhan0903/GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
Language:Python0 0
zhan0903/habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
zhan0903/ICQ
Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight https://arxiv.org/abs/2106.03400)
Language:Python0 0
zhan0903/lifelong_rl
Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Reset-Free Lifelong Learning with Skill-Space Planning.
Language:Python0 0
zhan0903/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
zhan0903/MineCLIP
Foundation Model for MineDojo
Language:Python0 0
zhan0903/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Language:Java
zhan0903/mjrl
Reinforcement learning algorithms for MuJoCo tasks
Language:Python0 0
zhan0903/mopo
Code for MOPO: Model-based Offline Policy Optimization
zhan0903/mvp
Masked Visual Pre-training for Motor Control
Language:Python0 0
zhan0903/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python0 0
zhan0903/Off2OnRL
Language:Python0 0
zhan0903/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python0 0
zhan0903/reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
zhan0903/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Language:Python0 0
zhan0903/rlkit
Collection of reinforcement learning algorithms
Language:Python0 0
zhan0903/smac
SMAC: The StarCraft Multi-Agent Challenge
zhan0903/SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
Language:Python0 0
zhan0903/train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
Language:Python0 0
zhan0903/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0