accupham

Pinned Repositories

awesome-reasoning
a curated list of data for reasoning ai
0 0 00
FourBi
Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)
Language:Python0 0 00
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0 00
lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
Language:Python1.4k 14 11065
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Language:Python8.2k 72 407822
sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python5.4k 55 540393
QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
Language:Python259 11 4020
sdk-python
Temporal Python SDK
Language:Python454 24 36967
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.8k 228 4.7k4.1k

accupham's Repositories

accupham/awesome-reasoning
a curated list of data for reasoning ai
0 0 00
accupham/FourBi
Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)
Language:Python0 0 00
accupham/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0 00
accupham/sdk-python
Temporal Python SDK
Language:Python