Asap7772
I am a PhD student in Computer Science at Stanford University. My research interests are in scaling up decision-making methods such as reinforcement learning.
Stanford University, California
Pinned Repositories
antmaze_gen
byol_rl
Cal-QL
coq_softwarefoundations
Work on Software Foundations Course
DeepCriminalize
Project that uses GANs to produce a sketch-artist-like representation of a criminal. Winners of the Cal Hack Fellowship 2019.
OfflineRlWorkflow
This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL
PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.
Release
Release of Supervision Search. Findings recently published in the IEEE Journal of Translational Engineering in Health and Medicine: "A mobile application for keyword search in real-world scenes."
understanding-rlhf
widowx_control
Asap7772's Repositories
Asap7772/PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.
Asap7772/understanding-rlhf
Asap7772/DeepCriminalize
Project that uses GANs to produce a sketch-artist-like representation of a criminal. Winners of the Cal Hack Fellowship 2019.
Asap7772/widowx_control
Asap7772/antmaze_gen
Asap7772/byol_rl
Asap7772/D4RL
A collection of reference environments for offline reinforcement learning
Asap7772/epickitchensproc
Asap7772/gridworld_notebook
Asap7772/Cal-QL
Asap7772/PeerWalk
PeerWalk is a React Native app that lets verified college students schedule walks with peers so they can get around campus safely. Created at the Facebook San Francisco Hackathon (FBSF).
Asap7772/Asap7772.github.io
Resume Website
Asap7772/asr_lab
EE 225D ASR Lab
Asap7772/bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
Asap7772/dpo
Asap7772/genpaths
Asap7772/inac_baseline
Asap7772/JaxCQL
Conservative Q-Learning in JAX
Asap7772/jaxrl2_finetuning_benchmark
Asap7772/kitchen_eval
Asap7772/pythia
Asap7772/ReDS
Asap7772/rt1_eval
Asap7772/scripts_tpus
Asap7772/spring2024-assignment1-basics
Asap7772/TrajWeightingBaseline
Asap7772/transformers-latent-dpo-cleaned
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Asap7772/trl
Train transformer language models with reinforcement learning.
Asap7772/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Asap7772/website