alexlioralexli
PhD student in machine learning at Carnegie Mellon University. Prev: undergrad at UC Berkeley.
Pittsburgh, PA
Pinned Repositories
10-716-project
cmu-vision.github.io
compression
Data compression in TensorFlow
diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
flow-demos
generalized-hindsight
learned-fourier-features
Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
noncontrastive-ssl
Analyzing partial dimensional collapse in non-contrastive self-supervised learning. "Understanding Collapse in Non-Contrastive Siamese Representation Learning." In ECCV, 2022.
rllab-finetuning
diffusion-classifier
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
alexlioralexli's Repositories
alexlioralexli/rllab-finetuning
alexlioralexli/learned-fourier-features
Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
alexlioralexli/noncontrastive-ssl
Analyzing partial dimensional collapse in non-contrastive self-supervised learning. "Understanding Collapse in Non-Contrastive Siamese Representation Learning." In ECCV, 2022.
alexlioralexli/generalized-hindsight
alexlioralexli/flow-demos
alexlioralexli/10-716-project
alexlioralexli/cmu-vision.github.io
alexlioralexli/compression
Data compression in TensorFlow
alexlioralexli/diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
alexlioralexli/dm_control
The DM Control Suite and Package is a tool for developing and testing reinforcement learning agents for the MuJoCo physics engine.
alexlioralexli/doodad
alexlioralexli/hidden-networks
alexlioralexli/hindsight-experience-replay
A PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotics environments.
alexlioralexli/homework
Assignments for CS294-112.
alexlioralexli/models
Models and examples built with TensorFlow
alexlioralexli/mujoco-py-v0.5.7
alexlioralexli/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
alexlioralexli/papers
alexlioralexli/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
alexlioralexli/researcher
A jekyll based resume template
alexlioralexli/rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
alexlioralexli/sac
Soft Actor-Critic
alexlioralexli/simsiam
PyTorch implementation of SimSiam https://arxiv.org/abs/2011.10566
alexlioralexli/tape
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
alexlioralexli/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
alexlioralexli/website