alexlioralexli

PhD student in machine learning at Carnegie Mellon University. Prev: undergrad at UC Berkeley.

Pittsburgh, PA

Pinned Repositories

10-716-project
Language:Jupyter Notebook00
cmu-vision.github.io
Language:HTML00
compression
Data compression in TensorFlow
Language:Python00
diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
Language:Python00
flow-demos
Language:Python10
generalized-hindsight
Language:Python8 1 03
learned-fourier-features
Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
Language:Python16 1 03
noncontrastive-ssl
Analyzing partial dimensional collapse in non-contrastive self-supervised learning. "Understanding Collapse in Non-Contrastive Siamese Representation Learning." In ECCV, 2022.
Language:Jupyter Notebook11 2 21
rllab-finetuning
Language:Python27 2 19
diffusion-classifier
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
Language:Python386 16 3027

alexlioralexli's Repositories

alexlioralexli/rllab-finetuning
Language:Python27 2 19
alexlioralexli/learned-fourier-features
Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
Language:Python16 1 03
alexlioralexli/noncontrastive-ssl
Analyzing partial dimensional collapse in non-contrastive self-supervised learning. "Understanding Collapse in Non-Contrastive Siamese Representation Learning." In ECCV, 2022.
Language:Jupyter Notebook11 2 21
alexlioralexli/generalized-hindsight
Language:Python8 1 03
alexlioralexli/flow-demos
Language:Python10
alexlioralexli/10-716-project
Language:Jupyter Notebook00
alexlioralexli/cmu-vision.github.io
Language:HTML00
alexlioralexli/compression
Data compression in TensorFlow
Language:Python00
alexlioralexli/diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
Language:Python00
alexlioralexli/dm_control
The DM Control Suite and Package is a tool for developing and testing reinforcement learning agents for the MuJoCo physics engine.
Language:Python00
alexlioralexli/doodad
Language:Python
alexlioralexli/hidden-networks
alexlioralexli/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Language:Python
alexlioralexli/homework
Assignments for CS294-112.
Language:Python
alexlioralexli/models
Models and examples built with TensorFlow
Language:Python
alexlioralexli/mujoco-py-v0.5.7
Language:Python1 0
alexlioralexli/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Language:Python
alexlioralexli/papers
Language:Python
alexlioralexli/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python
alexlioralexli/researcher
A jekyll based resume template
Language:HTML
alexlioralexli/rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
Language:Python1 0
alexlioralexli/sac
Soft Actor-Critic
Language:Python
alexlioralexli/simsiam
PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566
Language:Python
alexlioralexli/tape
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
alexlioralexli/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language:Shell
alexlioralexli/website
Language:HTML

alexlioralexli

Pinned Repositories

10-716-project

cmu-vision.github.io

compression

diagnosing_qlearning

flow-demos

generalized-hindsight

learned-fourier-features

noncontrastive-ssl

rllab-finetuning

diffusion-classifier

alexlioralexli's Repositories

alexlioralexli/rllab-finetuning

alexlioralexli/learned-fourier-features

alexlioralexli/noncontrastive-ssl

alexlioralexli/generalized-hindsight

alexlioralexli/flow-demos

alexlioralexli/10-716-project

alexlioralexli/cmu-vision.github.io

alexlioralexli/compression

alexlioralexli/diagnosing_qlearning

alexlioralexli/dm_control

alexlioralexli/doodad

alexlioralexli/hidden-networks

alexlioralexli/hindsight-experience-replay

alexlioralexli/homework

alexlioralexli/models

alexlioralexli/mujoco-py-v0.5.7

alexlioralexli/oyster

alexlioralexli/papers

alexlioralexli/pytorch-a2c-ppo-acktr-gail

alexlioralexli/researcher

alexlioralexli/rllab

alexlioralexli/sac

alexlioralexli/simsiam

alexlioralexli/tape

alexlioralexli/TD3

alexlioralexli/website