Pinned Repositories
Abstract-Algebra
Learning math
alpa
Training and serving large-scale neural networks with auto parallelization.
awesome-database-learning
A list of learning materials for understanding database internals
Awesome-Places-for-Food-Drinks
cheetah-fastclick
FastClick with the Cheetah elements
huggingface-utils
MCU-project
VE373 final project on a microprocessor-based system
model-inference
Utilities and tests for model inference
Multi-thread_DB
An experiment with multithreading, implemented as a database
MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models.
drunkcoding's Repositories
drunkcoding/huggingface-utils
drunkcoding/model-inference
Utilities and tests for model inference
drunkcoding/alpa
Training and serving large-scale neural networks with auto parallelization.
drunkcoding/Awesome-Places-for-Food-Drinks
drunkcoding/cheetah-fastclick
FastClick with the Cheetah elements
drunkcoding/core
The core library and APIs implementing the Triton Inference Server.
drunkcoding/CS411-Database-System
Project for the database systems course: an interactive website
drunkcoding/DeeperSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
drunkcoding/DLRM
drunkcoding/efficient-nlp
drunkcoding/eudyptula
Linux kernel challenge
drunkcoding/falcon
FALCON - Fast Analysis of LTE Control channels
drunkcoding/flaxformer
drunkcoding/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
drunkcoding/jiant
jiant is an NLP toolkit
drunkcoding/MIT-6.824-Distributed-System
Spring 2020
drunkcoding/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
drunkcoding/model-finetune
Fine-tune pre-trained models
drunkcoding/onnxruntime_backend
The Triton backend for the ONNX Runtime.
drunkcoding/open-moe-llm-leaderboard
drunkcoding/power-meter
A software power measurement tool for both CPU and GPU using vendor-provided APIs
drunkcoding/pytorch_backend
The Triton backend for the PyTorch TorchScript models.
drunkcoding/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
drunkcoding/ServerlessLLM
Fast, easy and cost-efficient multi-LLM serving.
drunkcoding/simple-shell
A simple functioning shell implemented in C
drunkcoding/swap-engine
drunkcoding/time-series-forecast
drunkcoding/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
drunkcoding/transformers-utils
Transformers utilities made easy
drunkcoding/wasmint
Library for interpreting/debugging Wasm code