mbalesni

London, UK

Pinned Repositories

sn-react
React boilerplate for ServiceNow applications
Language:JavaScript33 6 141
deepspeed_llama
Finetuning LLaMA with DeepSpeed
Language:Python94
gpt-honest-articulation
Exploring GPT-3 ability to articulate its knowledge
Language:Jupyter Notebook1 2 00
node-rsa-nopadding
A boilerplate for `node-rsa` package encryption/decryption without padding
Language:JavaScript2 1 00
ois-class-watcher
Language:JavaScript1 1 00
openpilot-pipeline
Training pipeline for end-to-end self-driving with Comma AI's Openpilot. WIP
Language:Jupyter Notebook106 9 1738
react-election-registration
A simple election check-in app for use by students in university elections.
Language:JavaScript1 1 00
self-attention-rl
Re-implementation of an RL + Transformer paper: https://arxiv.org/abs/1907.08027
Language:Jupyter Notebook3 2 01
tgnews
My submission to Telegram Data Clustering contest (ranked 5th/122, team of 2)
Language:Python0 1 00
wams
An API for making Multi-Surface Applications
Language:JavaScript1 0 182

mbalesni's Repositories

mbalesni/openpilot-pipeline
Training pipeline for end-to-end self-driving with Comma AI's Openpilot. WIP
Language:Jupyter Notebook106 9 1738
mbalesni/deepspeed_llama
Finetuning LLaMA with DeepSpeed
Language:Python94
mbalesni/self-attention-rl
Re-implementation of an RL + Transformer paper: https://arxiv.org/abs/1907.08027
Language:Jupyter Notebook3 2 01
mbalesni/gpt-honest-articulation
Exploring GPT-3 ability to articulate its knowledge
Language:Jupyter Notebook1 2 00
mbalesni/react-election-registration
A simple election check-in app for use by students in university elections.
Language:JavaScript1 1 00
mbalesni/mbalesni.github.io
Language:HTML00
mbalesni/tgnews
My submission to Telegram Data Clustering contest (ranked 5th/122, team of 2)
Language:Python0 1 00
mbalesni/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Language:Python0 0
mbalesni/ai-safety-paper-notes
Summaries, notes and questions on AI safety research papers.
mbalesni/anthropic-hack-23
Language:Python
mbalesni/ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
Language:HTML0 0
mbalesni/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
mbalesni/censored-cognition
Language:Python
mbalesni/DeepTraffic
Deep Learning models for network traffic classification
mbalesni/diia-integration
Language:Python0 0
mbalesni/ebm-driving
Language:Jupyter Notebook0 0
mbalesni/g-in-llms
Language:Python0 0
mbalesni/grok
Language:Python
mbalesni/ibc
(Fork of) Official implementation of Implicit Behavioral Cloning, as described in our CoRL 2021 paper, see more at https://implicitbc.github.io/
Language:Python0 0
mbalesni/iphone-checker
Language:Python1 0
mbalesni/llm-security-challenge
Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
mbalesni/mats-3-aligning-lms
A common repo of the MATS 3.0 stream on Aligning Language Models
Language:Jupyter Notebook1 0
mbalesni/onnx2pytorch
Transform ONNX model to PyTorch representation
Language:Python0 01
mbalesni/posters
1 0
mbalesni/presentations
mbalesni/setup-python
Set up your GitHub Actions workflow with a specific version of Python [ALWAYS CACHE]
Language:TypeScript
mbalesni/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
mbalesni/TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
Language:Python0 0
mbalesni/vote-verification
Web app with a custom anonymous & secure voting verification protocol.
Language:JavaScript0 0
mbalesni/whisper
Robust Speech Recognition via Large-Scale Weak Supervision