Frozenmad's Stars
huangb23/VTimeLLM
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
pipipi-pikachu/PPTist
PowerPoint-ist(/'pauəpɔintist/), An online presentation application that replicates most of the commonly used features of MS PowerPoint, allowing for the editing and presentation of PPT online.
SqueezeBits/QUICK
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
TUDB-Labs/mLoRA
An Efficient "Factory" to Build Multiple LoRA Adapters
rayleizhu/vllm-ra
[ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts
thunlp/Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)
efeslab/Atom
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
replicate/cog
Containers for machine learning
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
delicious-tasty/Paella
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Vahe1994/SpQR
v2fly/fhs-install-v2ray
Bash script for installing V2Ray in operating systems such as Debian / CentOS / Fedora / openSUSE that support systemd
flexflow/flexflow-train
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
megvii-research/Sparsebit
A model compression and acceleration toolbox based on pytorch.
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
modularml/mojo
The Mojo Programming Language
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
fpgaminer/GPTQ-triton
GPTQ inference Triton kernel
Stability-AI/StableLM
StableLM: Stability AI Language Models
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
toeverything/AFFiNE
There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
ROCm/HIP
HIP: C++ Heterogeneous-Compute Interface for Portability