Issues
401 error on llama2 model while access granted
#289 opened by tomtomtomtom44 - 1
Runpod Serverless
#288 opened by stonejohnson - 0
Loading basaran on multiple gpus leads to error
#280 opened by tanmaylaud - 0
Add support for chat completion API
#140 opened by peakji - 0
Tried multiple different models but get "The model weights are not tied..." error every time..
#266 opened by jontstaz - 3
How to send Audio Inputs to the Basaran
#234 opened by Tushar-ml - 3
Llama 2 models not working - how to pass auth token?
#232 opened by arsaboo - 1
TypeError: issubclass() arg 1 must be a class
#253 opened by gsuuon - 2
:latest version tag
#138 opened by mariushosting - 0
Use basaran API as Langchain LLM
#256 opened by brightebyte - 0
Error when Running Vicuna's FastChat Model without GPU
#223 opened by davyeu - 0
FR support for using fine tuned models that use Peft
#221 opened by samos123 - 5
Langchain Prompt Format
#198 opened by 0xDigest - 1
I want use the function prefix_allowed_tokens_fn, where of basaran's source code shall I modify?
#220 opened by zoubaihan - 7
Falcon 40B : too slow and random answers
#204 opened by ArnaudHureaux - 3
GPTQ & 4bit
#180 opened by olihough86 - 2
QLoRa support
#202 opened by bitnom - 1
concurrent request supported?
#205 opened by hudengjunai - 12
Support for `v1/embeddings` endpoint
#179 opened by josephrocca - 14
in stream mode, the English word has no space after detokenizer and Chinese were messed up
#197 opened by lucasjinreal - 13
Vicuna problem
#160 opened by zhound420 - 2
Define chat history format using jinja template
#141 opened by peakji - 1
Do you have Discord community?
#185 opened by karfly - 5
ValueError: Tokenizer class LLaMATokenizer does not exist or is not currently imported.
#139 opened by josephrocca - 1
Support ARM Docker images
#110 opened by WillBeebe - 4
how to run model in total offline?
#109 opened by gitknu - 2
Support for llama.cpp/ggml models
#107 opened by codito - 1
RuntimeError: mat1 and mat2 shapes cannot be multiplied
#181 opened by lcw99 - 5
Getting error for model when using vicuna model
#152 opened by djaffer - 3
Possible to run on M-series chips/MPS?
#173 opened by fakerybakery - 5
Question about COMPLETION_MAX_PROMPT
#158 opened by nicpopovic - 1
The requested URL was not found on the server
#151 opened by artivis - 1
Instructions unclear
#137 opened by Anonym0us33 - 7
CORS headers
#143 opened by josephrocca - 0
Replace EventSource with POST requests in playground
#145 opened by fardeon - 0
Add a chat interface to playground
#142 opened by peakji - 2
Slow Streaming
#99 opened by manojpreveen