PotatoSpudowski/fastLLaMa

fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend.

CMIT

Issues

GGUF and/or LLama-3 support?
#86 opened 5 months ago by BHX2
0
"No module named 'fastllama.api' " after pip installation
#78 opened a year ago by cosimoiaia
10
how to load model in webui ?
#82 opened a year ago by valdesguefa
3
Webui UX issue on mobile
#84 opened a year ago by stduhpf
0
Port llama.cpp openCL support to fastllama?
#83 opened a year ago by stduhpf
0
Make running fastLLaMa on windows simple!
#43 opened a year ago by PotatoSpudowski
1
README.md is outdated in sections #running-llama and #running-alpaca-lora
#81 opened a year ago by stduhpf
1
Deciding the Schema for the protocol between webUI and webSocket Server
#77 opened a year ago by PotatoSpudowski
2
Integrating + Testing webUI and WebSocket Server
#80 opened a year ago by PotatoSpudowski
0
Implement the WebSocket Server
#79 opened a year ago by PotatoSpudowski
0
Pip support testing
#55 opened a year ago by PotatoSpudowski
21
Designing the UI
#75 opened a year ago by PotatoSpudowski
1
Pip uninstall not removing the package
#74 opened a year ago by stduhpf
2
How install on Windows?
#27 opened a year ago by soleimanian
4
TypeError: Model.generate() got an unexpected keyword argument 'stop_word'
#73 opened a year ago by vootshiclone
2
Should fastLLaMa support more than just Python?
#13 opened a year ago by PotatoSpudowski
3
Feature suggestions!
#38 opened a year ago by PotatoSpudowski
7
Enabling custom logger makes it crash at ingestion.
#65 opened a year ago by stduhpf
1
from build.fastllama import Model, ModelKind ModuleNotFoundError: No module named 'build.fastllama'
#56 opened a year ago by lucasjinreal
8
n_ctx argument is ignored
#29 opened 2 years ago by stduhpf
4
When stop words are reached, they get ingested, but are not forwarded to streaming_fn.
#62 opened a year ago by stduhpf
4
convert-pth-to-ggml.py expects 2 parts for ALPACA-LORA-13B, but it has only one
#57 opened a year ago by stduhpf
5
Cannot build this
#51 opened a year ago by bratislav
5
Passing arguments such as -ins
#41 opened a year ago by regstuff
2
RuntimeError: Unable to load model because of bad magic
#17 opened a year ago by yevhen-kalyna
12
AVX2 performance issue
#21 opened a year ago by Showdown76py
17
Bad Magic error
#58 opened a year ago by Th3F0x-code
6
Lora adaptor support
#49 opened a year ago by PotatoSpudowski
1
Cmake Error
#50 opened a year ago by notmehul
1
Is Alpaca 13B and 30B tested?
#12 opened 2 years ago by imranraad07
5
Unicode characters break tokenizer
#24 opened a year ago by stduhpf
18
ModuleNotFoundError: No module named 'fastLlama' after setup.py update
#32 opened a year ago by yevhen-kalyna
14
function wrap for getting the embedding
#36 opened a year ago by complexly
3
Add posibility to choose python version for module or make it independent from version
#16 opened a year ago by yevhen-kalyna
11
Fix multiple relative pointer transform
#15 opened a year ago by amitsingh19975
1
Return Log Probs in Output
#6 opened a year ago by PotatoSpudowski
2
Problems while Trying to Run code programatically
#40 opened a year ago by robin-coac
9
Error when using setup.py
#42 opened a year ago by SumYin
5
Make prompt ingestion faster!
#2 opened a year ago by PotatoSpudowski
2
Still slow on AVX2 CPUs
#37 opened 2 years ago by Showdown76py
1
Does not support Python 3.11
#25 opened 2 years ago by jhud
1
Error at ./build.sh
#18 opened 2 years ago by Niellai
8
Error using build.sh
#19 opened 2 years ago by poohzaza166
1
Getting Error with make command
#8 opened 2 years ago by imranraad07
22
Example doc has incorrect repo
#7 opened 2 years ago by raldebsi
1
Stop words is buggy!
#1 opened 2 years ago by PotatoSpudowski
1
quantize.py is not build. quantize binary is.
#4 opened 2 years ago by cosimoiaia
2
Unable to build bridge.cpp and link the 'libllama'
#3 opened 2 years ago by rohitgr7
2