Pinned Repositories
attention-surgery
Attention surgery for LLMs
chessformer
Chessformer
flash-attention
Fast and memory-efficient exact attention
flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
portfolio
Personal Portfolio in Machine Learning
simple-decoder
A simple yet optimized decoder-only architecture
rustlm
RustLM: An efficient Rust CTC decoder supporting external language models
triton-rust
An API for interfacing the NVIDIA Triton Inference Server with Rust
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration