Pinned Repositories
task-distillation
Code for Domain Adaptation Through Task Distillation (ECCV 20)
j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
verdict
Inference-time scaling for LLMs-as-a-judge.
gg
[deprecated] git workflow shortcuts
git-fire
:fire: Save Your Code in an Emergency
jbin
:package: Java Binary Executables
m385c
Graduate Measure Theory
p
:snake: Python Version Management Made Simple
ralph
:mouse: UNIX Aliases with Superpowers: Parameters, Sudo-Able, and More
robomaster-driver
DJI RoboMaster S1 Driver for Evaluating Real-World Deep Visuomotor Policies
qw3rtman's Repositories
qw3rtman/git-fire
:fire: Save Your Code in an Emergency
qw3rtman/gg
[deprecated] git workflow shortcuts
qw3rtman/p
:snake: Python Version Management Made Simple
qw3rtman/ralph
:mouse: UNIX Aliases with Superpowers: Parameters, Sudo-Able, and More
qw3rtman/jbin
:package: Java Binary Executables
qw3rtman/random-feature-maps
Fast Random Kernelized Features: Support Vector Machine Classification for High-Dimensional IDC Dataset
qw3rtman/gymrat-inflation
what does inflation look like for a gymrat?
qw3rtman/cyclegan-wandb
wandb hook cyclegan
qw3rtman/LearningByCheating
Driving in CARLA using waypoint prediction and two-stage imitation learning
qw3rtman/m385c
Graduate Measure Theory
qw3rtman/robomaster-driver
DJI RoboMaster S1 Driver for Evaluating Real-World Deep Visuomotor Policies
qw3rtman/coral
🌊 A Real Shell Package Manager
qw3rtman/cycada_release
Code to accompany ICML 2018 paper
qw3rtman/DirectFuturePrediction
Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017
qw3rtman/dockless-scooter-traffic
modeling dockless scooter rides
qw3rtman/habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
qw3rtman/habitat2robomaster
can we do it
qw3rtman/llama.cpp
LLM inference in C/C++
qw3rtman/mono-vo
An OpenCV based implementation of Monocular Visual Odometry
qw3rtman/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
qw3rtman/point-architecture
experiments that never worked
qw3rtman/Quirk
A drag-and-drop quantum circuit simulator that runs in your browser. A toy for exploring and understanding small quantum circuits.
qw3rtman/qw3rtman
qw3rtman/RB_GEN
RB_GEN is a simple package for generating random binning features for solving large-scale kernel classification, regression, and clustering.
qw3rtman/retype
Retype is an ✨ ultra-high-performance✨ static site generator that builds a website based on simple text files.
qw3rtman/slow-loop
A simple loop manager for robotic control pipelines with slow perception/planning modules
qw3rtman/ViZDoom
Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. :godmode:
qw3rtman/weave
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.