/vector-llm-compressor

Pipeline for quantizing/compressing LLM's in order to optimize them for deployment.

Primary LanguagePythonApache License 2.0Apache-2.0

vector-llm-compressor

Pipeline for quantizing/compressing LLM's in order to optimize them for deployment.