kabicm
A PhD student in Computer Science at ETH Zürich. Previously a software engineer at the Swiss National Supercomputing Centre. Enthusiastic about databases, cloud computing, and HPC.
ETH Zürich
Pinned Repositories
arbor
The Arbor multi-compartment neural network simulation library.
COSMA
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
COSTA
Distributed Communication-Optimal Shuffle and Transpose Algorithm
Tiled-MM
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
cp2k
Quantum chemistry and solid state physics software package
grid2grid
A library transforming between two arbitrary grid-like matrix data layouts over MPI ranks.
lu
LU factorization with ScaLAPACK
kabicm's Repositories
kabicm/alpa
Auto parallelization for large-scale neural networks
kabicm/apex
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
kabicm/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
kabicm/ColossalAI
Colossal-AI: A Unified Deep Learning System for Big Model Era
kabicm/conflux
Distributed Communication-Optimal LU-factorization Algorithm
kabicm/COSTA
Distributed Communication-Optimal Shuffle and Transpose Algorithm
kabicm/cuCollections
kabicm/cudf
cuDF - GPU DataFrame Library
kabicm/DFI-public
kabicm/DT-FM
kabicm/FasterTransformer
Transformer related optimization, including BERT, GPT
kabicm/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
kabicm/flash-attention
Fast and memory-efficient exact attention
kabicm/flax
Flax is a neural network library for JAX that is designed for flexibility.
kabicm/gavel
Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
kabicm/google-research
Google Research
kabicm/marius
Large-scale embeddings on a single machine.
kabicm/mesh
Mesh TensorFlow: Model Parallelism Made Easier
kabicm/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
kabicm/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
kabicm/parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
kabicm/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
kabicm/query-engine
LingoDB: A new analytical database system that blurs the lines between databases and compilers.
kabicm/semiprof
Simple thread-safe annotation-based C++ profiler.
kabicm/snn_toolbox
Toolbox for converting analog to spiking neural networks (ANN to SNN), and running them in a spiking neuron simulator.
kabicm/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
kabicm/sql-parser
SQL Parser for C++. Building C++ object structure from SQL statements.
kabicm/transformer-from-scratch
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
kabicm/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
kabicm/trax
Trax — Deep Learning with Clear Code and Speed