Pinned Repositories
hearts-gym
Multi-agent Hearts card game environment
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
dotfiles
Optimized workflow
immo-search
Make real estate information more easily available
PyTorch-VeLO
VeLO optimizer in PyTorch
rldurak
Bachelor's thesis on reinforcement learning for the card game Durak
sb3-ppg
Phasic policy gradient algorithm for stable-baselines3
SMWLevelGenerator
Master's thesis on generating Super Mario World levels using deep neural networks
janEbert's Repositories
janEbert/PyTorch-VeLO
VeLO optimizer in PyTorch
janEbert/dotfiles
Optimized workflow
janEbert/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
janEbert/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
janEbert/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
janEbert/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
janEbert/BitMat
An efficient implementation of the method proposed in "The Era of 1-bit LLMs"
janEbert/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
janEbert/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
janEbert/flash-attention
Fast and memory-efficient exact attention
janEbert/fly
janEbert/gptel
A no-frills ChatGPT client for Emacs
janEbert/janEbert.github.io
Personal website
janEbert/lightning
Deep learning framework to train, deploy, and ship AI products Lightning fast.
janEbert/litgpt
Load, pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
janEbert/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
janEbert/localpilot
janEbert/megablocks
janEbert/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
janEbert/Megatron-LM
Ongoing research training transformer models at scale
janEbert/mne-python
MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python
janEbert/mup
maximal update parametrization (µP)
janEbert/NeMo
NeMo: a toolkit for conversational AI
janEbert/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
janEbert/Quicksetup-ai
A flexible template, as a quick setup for deep learning projects in pytorch-lightning
janEbert/safari
Convolutions for Sequence Modeling
janEbert/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
janEbert/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
janEbert/triton
Development repository for the Triton language and compiler
janEbert/utilities
Common Python utilities and GitHub Actions in Lightning Ecosystem