Pinned Repositories
AdaptationAgnosticMetaLearning
source code to ICML 2021 AutoML Workshop, 'Adaptation-agnostic Meta-training'
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC)
F2M
jiaxinchen666.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Kaggle_Lux_AI_2021
lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
Lux-Design-2021
Home to the design and engine of the @Lux-AI-Challenge Season 1, hosted on @kaggle
meta-theory
torchbeast-nmmo
A PyTorch Platform for Distributed RL
variational-scaling
A pytorch implementation of 'Variational Metric Scaling for Metric-based Meta-learning'
jiaxinchen666's Repositories
jiaxinchen666/meta-theory
jiaxinchen666/variational-scaling
A pytorch implementation of 'Variational Metric Scaling for Metric-based Meta-learning'
jiaxinchen666/torchbeast-nmmo
A PyTorch Platform for Distributed RL
jiaxinchen666/AdaptationAgnosticMetaLearning
source code to ICML 2021 AutoML Workshop, 'Adaptation-agnostic Meta-training'
jiaxinchen666/Kaggle_Lux_AI_2021
jiaxinchen666/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC)
jiaxinchen666/F2M
jiaxinchen666/jiaxinchen666.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
jiaxinchen666/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
jiaxinchen666/Lux-Design-2021
Home to the design and engine of the @Lux-AI-Challenge Season 1, hosted on @kaggle
jiaxinchen666/lux-open
jiaxinchen666/med-vqa
Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]
jiaxinchen666/Meta2Learning
jiaxinchen666/ml-agents
Unity Machine Learning Agents Toolkit
jiaxinchen666/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
jiaxinchen666/sample-factory
High throughput asynchronous reinforcement learning
jiaxinchen666/SDC-IL
Semantic Drift Compensation for Class-Incremental Learning (CVPR2020)
jiaxinchen666/starter-academic
jiaxinchen666/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs