Pinned Repositories
sn-react
React boilerplate for ServiceNow applications
deepspeed_llama
Finetuning LLaMA with DeepSpeed
gpt-honest-articulation
Exploring GPT-3 ability to articulate its knowledge
node-rsa-nopadding
A boilerplate for `node-rsa` package encryption/decryption without padding
ois-class-watcher
openpilot-pipeline
Training pipeline for end-to-end self-driving with Comma AI's Openpilot. WIP
react-election-registration
A simple election check-in app for use by students in university elections.
self-attention-rl
Re-implementation of an RL + Transformer paper: https://arxiv.org/abs/1907.08027
tgnews
My submission to Telegram Data Clustering contest (ranked 5th/122, team of 2)
wams
An API for making Multi-Surface Applications
mbalesni's Repositories
mbalesni/openpilot-pipeline
Training pipeline for end-to-end self-driving with Comma AI's Openpilot. WIP
mbalesni/deepspeed_llama
Finetuning LLaMA with DeepSpeed
mbalesni/self-attention-rl
Re-implementation of an RL + Transformer paper: https://arxiv.org/abs/1907.08027
mbalesni/gpt-honest-articulation
Exploring GPT-3 ability to articulate its knowledge
mbalesni/react-election-registration
A simple election check-in app for use by students in university elections.
mbalesni/mbalesni.github.io
mbalesni/tgnews
My submission to Telegram Data Clustering contest (ranked 5th/122, team of 2)
mbalesni/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
mbalesni/ai-safety-paper-notes
Summaries, notes and questions on AI safety research papers.
mbalesni/anthropic-hack-23
mbalesni/ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
mbalesni/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
mbalesni/censored-cognition
mbalesni/DeepTraffic
Deep Learning models for network traffic classification
mbalesni/diia-integration
mbalesni/ebm-driving
mbalesni/g-in-llms
mbalesni/grok
mbalesni/ibc
(Fork of) Official implementation of Implicit Behavioral Cloning, as described in our CoRL 2021 paper, see more at https://implicitbc.github.io/
mbalesni/iphone-checker
mbalesni/llm-security-challenge
Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
mbalesni/mats-3-aligning-lms
A common repo of the MATS 3.0 stream on Aligning Language Models
mbalesni/onnx2pytorch
Transform ONNX model to PyTorch representation
mbalesni/posters
mbalesni/presentations
mbalesni/setup-python
Set up your GitHub Actions workflow with a specific version of Python [ALWAYS CACHE]
mbalesni/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
mbalesni/TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
mbalesni/vote-verification
Web app with a custom anonymous & secure voting verification protocol.
mbalesni/whisper
Robust Speech Recognition via Large-Scale Weak Supervision