Raphael-Hao's Stars
xai-org/grok-1
Grok open release
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
huggingface/candle
Minimalist ML framework for Rust
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
paramiko/paramiko
The leading native Python SSHv2 protocol library.
Netflix/metaflow
Open Source AI/ML Platform
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
apple/corenet
CoreNet: A library for training deep neural networks
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
pyinvoke/invoke
Pythonic task management & command execution.
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
beartype/beartype
Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.
buresdv/Cork
A fast GUI for Homebrew written in SwiftUI
openppl-public/ppl.nn
A primitive library for neural network
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
myscale/MyScaleDB
A @ClickHouse fork that supports high-performance vector search and full-text search.
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
lhao499/ringattention
Transformers with Arbitrarily Large Context
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
lucidrains/ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
ByungKwanLee/MoAI
[ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks.
pytorch-labs/float8_experimental
This repository contains the experimental PyTorch native float8 training UX
zorazrw/awesome-tool-llm
InternLM/AcmeTrace
microsoft/ParrotServe
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
microsoft/ConvStencil
uchuhimo/amanda