LLM bootstrap loader for local CPU/GPU inference with fully customizable chat.
Primary LanguagePython