wassname
If you can't possibly remember what something is called, what it is, what it does, or why it does it, it instantly becomes a 'wassname'
I'm just a guy who likes to machine learnPerth, Australia
Pinned Repositories
makehuman-js
A library to build 3D human characters in the browser
attentive-neural-processes
implementing "recurrent attentive neural processes" to forecast power usage (w. LSTM baseline, MCDropout)
keywordshitter2
A website to find long-tail keywords using search suggestions
open_pref_eval
Hackable, simple, llm evals on preference datasets
repr-preference-optimization
align inner states not actions for better generalization? [wip]
rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)
viz_torch_optim
Videos of deep learning optimizers moving on 3D problem-landscapes
world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more work needed
wassname's Repositories
wassname/viz_torch_optim
Videos of deep learning optimizers moving on 3D problem-landscapes
wassname/prob_jsonformer
Generate Structured JSON with probs from Language Models
wassname/quiet-star
investigate Quiet-STaR paper, and it's thought scratchpad
wassname/awesome-interpretability
Awesome tools for interpreting, manipulating the internals of of deep neural networks.
wassname/adapters_can_monitor_lies
inspired by circuit breakers paper. honesty>harmless
wassname/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
wassname/detect_bs_text
Can we measure how good a text is by how much an LLM learns from it?
wassname/LoRA_are_lie_detectors
Experiment to see if low rank adapters can work as interventions for lie detection on LLM's
wassname/open_pref_eval
Hackable, simple, llm evals on preference datasets
wassname/iris_bigvae
experiment: IRIS but with pretrained LLM
wassname/lie_elicitation_prompts
Research dataset. We use prompts to get LLM's to lie. Using sys prompts and multi shot examples
wassname/repr-preference-optimization
align inner states not actions for better generalization? [wip]
wassname/scrape_r_rational
scraping book reccomendations from reddit r rational
wassname/baukit
wassname/chatGPTBox
Integrating ChatGPT into your browser deeply, everything you need is here
wassname/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers
wassname/Craftax
(Crafter + NetHack) in JAX
wassname/dreamerv3-torch
Implementation of Dreamer v3 in pytorch.
wassname/GENIES
Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains
wassname/jaxtyping
Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/
wassname/leaked-system-prompts
Collection of leaked system prompts
wassname/optuna-dashboard
Real-time Web Dashboard for Optuna.
wassname/rag_search_cite
Hackable frontend for LLM assisted searching with citations
wassname/REM
Implementation of our paper "Improving Token-Based World Models with Parallel Observation Prediction"
wassname/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
wassname/stampy_nb
wassname/twm
Transformer-based World Models
wassname/wassname
wassname/word_level_diff_writing_assistant
Spell check with an llm and quickly verify it with a word level diff
wassname/xbsjsonedit
A basic editor for xBrowserSync json backup files