Pinned Repositories
tensorzero
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
backwards_dict
bats
best-action trajectory stitching
blackjack
This is my blackjack learning program. It is bullheaded.
brain-tokyo-workshop
🧠🗼
catalyst
Reproducible and fast DL & RL.
differentiable_grasp_quality
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
hasItDropped
A quick script that is gonna try and text me when Frank Ocean's album drops by trawling reddit
procedural_objects
A library for procedurally generating objects.
virajmehta's Repositories
virajmehta/differentiable_grasp_quality
virajmehta/bats
best-action trajectory stitching
virajmehta/procedural_objects
A library for procedurally generating objects.
virajmehta/backwards_dict
virajmehta/brain-tokyo-workshop
🧠🗼
virajmehta/catalyst
Reproducible and fast DL & RL.
virajmehta/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
virajmehta/gym
A toolkit for developing and comparing reinforcement learning algorithms.
virajmehta/gym-anm
Design Reinforcement Learning environments that model Active Network Management (ANM) tasks in electricity distribution networks.
virajmehta/hiphoptimes
This will trawl reddit every 10min for a while and store the number of current visitors at reddit.com/r/hiphopheads
virajmehta/hucrl
virajmehta/jeopardy-data
virajmehta/jeopardy-parser
Extracts clues from the J! Archive website.
virajmehta/MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
virajmehta/meshpy
A 3-D triangular mesh package for Python.
virajmehta/MetaCURE-Public
virajmehta/neatplot
Plotting utilities for Python
virajmehta/nvim
neovim config with autocomplete and copilot and stuff
virajmehta/PILCO
Bayesian Reinforcement Learning in Tensorflow
virajmehta/PointSetGeneration
Code for ``A Point Set Generation Network for 3D Object Reconstruction from a Single Image''
virajmehta/real-nvp
Implementation of Real NVP in PyTorch
virajmehta/rl-blackjack
virajmehta/rl-inference
Reinforcement Learning through Active Inference
virajmehta/sac
Soft Actor-Critic
virajmehta/statsmodels
Statsmodels: statistical modeling and econometrics in Python
virajmehta/subwaysign
Tell me when the train is coming
virajmehta/trl
Train transformer language models with reinforcement learning.
virajmehta/vae-training
Supporting code to the paper: Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
virajmehta/website-build
virajmehta/website-source
A beautiful, simple, clean, and responsive Jekyll theme for academics