WeihanLikk's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
triton-lang/triton
Development repository for the Triton language and compiler
state-spaces/mamba
Mamba SSM architecture
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
ShiArthur03/ShiArthur03
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
gpu-mode/lectures
Material for gpu-mode lectures
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
josephg/noisejs
JavaScript 2D Perlin & Simplex noise functions
fla-org/flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
yangkky/Machine-learning-for-proteins
Listing of papers about machine learning for proteins.
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
radarFudan/Awesome-state-space-models
Collection of papers on state-space models
proger/accelerated-scan
Accelerated First Order Parallel Associative Scan
erdogant/clustimage
clustimage is a Python package for unsupervised clustering of images.
ruke1ire/RTF
A State-Space Model with Rational Transfer Function Representation.
zju-bmi-lab/Fast-SNN
NicolasZucchet/minimal-LRU
Unofficial implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)
joao-semedo/communication-subspace
Code for "Cortical areas interact through a communication subspace", Semedo et al. (Neuron, 2019)
johnryan465/pscan
EEA-sensors/sequential-parallelization-examples
This is a collection of code samples aimed at illustrating temporal parallelization methods for sequential data.
bradhilton/o1-chain-of-thought
o1 Chain of Thought Examples
raphaelreme/torch-kf
Fast implementation of Kalman filtering with PyTorch
nerdslab/bams
PyTorch implementation of BAMS (https://multiscale-behavior.github.io/)
krey/rrpy
Reduced rank regression in Python
WeihanLikk/MRM-GP
Code for the paper "Multi-Region Markovian Gaussian Process: An Efficient Method to Discover Directional Communications Across Multiple Brain Regions" [ICML 2024]