tianjunz

Pinned Repositories

agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language:Python0 1 00
awesome-deep-rl
For deep RL and the future of AI.
0 1 00
azure-cli-cheatsheet
Azure CLI Cheatsheet
0 0 00
BERT-pytorch
Google AI 2018 BERT pytorch implementation
Language:Python0 1 00
c-planning
0 2 00
cs162-group
Language:C0 1 00
HIR
Language:Python159 5 212
MADE
Language:Python18 4 05
NovelD
Language:Python38 2 36
TEMPERA
Language:Python42 3 48

tianjunz's Repositories

tianjunz/HIR
Language:Python159 5 212
tianjunz/TEMPERA
Language:Python42 3 48
tianjunz/NovelD
Language:Python38 2 36
tianjunz/MADE
Language:Python18 4 05
tianjunz/agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language:Python0 1 00
tianjunz/awesome-deep-rl
For deep RL and the future of AI.
0 1 00
tianjunz/azure-cli-cheatsheet
Azure CLI Cheatsheet
0 0 00
tianjunz/c-planning
0 2 00
tianjunz/DeepSpeedExamples
Example models using DeepSpeed
Language:Python0 0 01
tianjunz/dreamerv2
Mastering Atari with Discrete World Models
Language:Python1 0
tianjunz/guidance
A guidance language for controlling large language models.
Language:Jupyter Notebook0 01
tianjunz/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python1 0
tianjunz/Learn_Prompting
Language:TeX0 0
tianjunz/marLo
Multi Agent Reinforcement Learning using MalmÖ
Language:Python1 0
tianjunz/MemGPT
Create LLM agents with long-term memory and custom tools 📚🦙
Language:Python0 0
tianjunz/metaseq
Repo for external large-scale work
Language:Python1 0
tianjunz/ml-agents
Unity Machine Learning Agents Toolkit
Language:C#1 0
tianjunz/my-offlinerl
Language:Python1 0
tianjunz/ort
Accelerate PyTorch models with ONNX Runtime
Language:Python0 0
tianjunz/overcooked_ai
A benchmark environment for fully cooperative multi-agent performance.
Language:JavaScript1 0
tianjunz/poet
ML model training for edge devices
Language:Python0 0
tianjunz/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1 0
tianjunz/python
Official Python client library for kubernetes
Language:Python0 0
tianjunz/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python1 0
tianjunz/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
Language:Python1 01
tianjunz/raft
1 0
tianjunz/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language:Python1 0
tianjunz/tianjunz.github.io
Language:JavaScript2 0
tianjunz/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python0 0
tianjunz/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0