Convert & quantize HuggingFace models using llama.cpp on premises
Primary LanguageJupyter NotebookMIT LicenseMIT