Pinned Repositories
awesome-reasoning
a curated list of data for reasoning ai
FourBi
Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
sglang
SGLang is a fast serving framework for large language models and vision language models.
QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
sdk-python
Temporal Python SDK
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
accupham's Repositories
accupham/awesome-reasoning
a curated list of data for reasoning ai
accupham/FourBi
Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)
accupham/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
accupham/sdk-python
Temporal Python SDK