Asap7772

I am a PhD student in Computer Science at Stanford University. My research interests are in scaling up decision-making methods such as reinforcement learning.

Stanford UniversityCalifornia

Pinned Repositories

antmaze_gen
Language:Python1 1 00
byol_rl
Language:Python1 1 00
Cal-QL
Language:Python0 1 00
coq_softwarefoundations
Work on Software Foundations Course
Language:Coq1 2 00
DeepCriminalize
Project that uses GAN's to develop a sketch artist like representation of a criminal. Winners of the Cal Hack Fellowship 2019
Language:Python2 3 01
OfflineRlWorkflow
This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL
Language:Python11 3 02
PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.
Language:Python29 3 13
Release
Release of Supervision Search. Recently published findings in the IEEE Journal of Translational Engineering in Health and Medicine: A mobile application for keyword search in real-world scenes.
Language:Java10
understanding-rlhf
Language:Python23 1 13
widowx_control
Language:Python2 1 01

Asap7772's Repositories

Asap7772/PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.
Language:Python29 3 13
Asap7772/understanding-rlhf
Language:Python23 1 13
Asap7772/DeepCriminalize
Project that uses GAN's to develop a sketch artist like representation of a criminal. Winners of the Cal Hack Fellowship 2019
Language:Python2 3 01
Asap7772/widowx_control
Language:Python2 1 01
Asap7772/antmaze_gen
Language:Python1 1 00
Asap7772/byol_rl
Language:Python1 1 00
Asap7772/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python1 0 01
Asap7772/epickitchensproc
Language:Jupyter Notebook1 1 0
Asap7772/gridworld_notebook
Language:Jupyter Notebook1 2 00
Asap7772/Cal-QL
Language:Python0 1 00
Asap7772/PeerWalk
PeerWalk is a React-Native app to allow verified college students to schedule walks with peers to safely walk around campus. Created at the Facebook San Francisco Hackathon (FBSF).
Language:JavaScript00
Asap7772/Asap7772.github.io
Resume Website
Language:JavaScript1
Asap7772/asr_lab
EE 225D ASR Lab
Language:Jupyter Notebook1 0
Asap7772/bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
Language:Python0 0
Asap7772/dpo
Language:Python1 0
Asap7772/genpaths
Language:Python1 01
Asap7772/inac_baseline
Language:Python1 0
Asap7772/JaxCQL
Conservative Q learning in Jax
Language:Python0 0
Asap7772/jaxrl2_finetuning_benchmark
Language:Jupyter Notebook0 0
Asap7772/kitchen_eval
Language:Python1 0
Asap7772/pythia
Language:Jupyter Notebook0 0
Asap7772/ReDS
Language:Python1 0
Asap7772/rt1_eval
Language:Jupyter Notebook1 01
Asap7772/scripts_tpus
Language:Shell1 0
Asap7772/spring2024-assignment1-basics
Language:Python0 0
Asap7772/TrajWeightingBaseline
Language:Python1 0
Asap7772/transformers-latent-dpo-cleaned
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python
Asap7772/trl
Train transformer language models with reinforcement learning.
Language:Python0 0
Asap7772/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python0 0
Asap7772/website
Language:HTML0 0