bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Python · MIT license
Issues
too large numeric difference with pytorch inference
#1396 opened by weixsong - 1
CUDA Setup failed despite GPU being available
#1394 opened by carlxc911 - 0
I ran an NF4 72B model on 2xA6000 using llamafactory
#1392 opened by charleswg - 0
CUDA Architecture 80+ Causing Incorrect Model Behavior with BitsAndBytes Quantization
#1391 opened by gunjunlee - 0
ARM Runners December 2024
#1390 opened by johnnynunez - 3
Release v0.44 not available for Mac
#1378 opened by ACMCMC - 2
where are the outliers stored in LLM.int8 quantization for inference using the transformers library on AMD GPU?
#1320 opened by vbayanag - 0
Failed to import transformers.integrations.bitsandbytes because of the following error (look up to see its traceback): CUDA Setup failed despite GPU being available. Please run the following command to get more information: python -m bitsandbytes Inspect the output of the command and see if you can locate CUDA libraries. You might need to add them to your LD_LIBRARY_PATH. If you suspect a bug, please take the information from python -m bitsandbytes and open an issue at: https://github.com/TimDettmers/bitsandbytes/issues
#1387 opened by smillpine - 0
AdEMAMix NaN when loading from state_dict
#1382 opened by darius-lam - 12
FLUTE Integration for Fast Inference
#1293 opened by HanGuo97 - 1
Paged optimizer resuming from checkpoint - AttributeError: 'int' object has no attribute 'cpu'
#1381 opened by shivam15s - 0
Python 3.9 support broken in 0.44.0
#1376 opened by Benzhaomin - 0
Model architecture is modified when I use BitsAndBytesConfig with default params
#1371 opened by yunhao-tech - 2
CUDA is available but importing bnb raises an error
#1355 opened by ZeroneBo - 4
Merge LoRA into 405B
#1359 opened by junzhang-zj - 1
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasGemmEx
#1363 opened by LukeLIN-web - 0
Bug when using the 32-bit LAMB optimizer
#1350 opened by FrsECM - 1
Lion Optimizer With Triton Kernel
#1356 opened by lapp0 - 0
Model cannot be quantized
#1354 opened by alielfilali01 - 2
libcudart.so Not Found
#1313 opened by arunsandy1309 - 0
Torch autograd support for dequantize methods
#1347 opened by yaldashbz - 1
Cannot load decoder.lm_head.weight when loading 4 bit quantized model using VisionEncoderDecoder.from_pretrained
#1343 opened by AditiJain14 - 0
Pretrained Causal LM cannot be loaded in 4bit/8bit
#1331 opened by adrienchaton - 4
Any plan to support block size 32?
#1329 opened by lllyasviel - 0
Linear8bitLt cannot be moved back to CPU
#1332 opened by Nerogar - 1
'nf4' compute datatype?
#1321 opened by dorsa-zeinali - 1
RuntimeError: Failed to import transformers.integrations.bitsandbytes because of the following error (look up to see its traceback):
#1327 opened by pradeep10kumar - 1
Error while trying to install the multi-backend-refactor branch for ROCm in WSL2
#1323 opened by Kademo15 - 1
RuntimeError: CUDA Setup failed despite GPU being available. Please run the following command to get more information:
#1322 opened by pradeep10kumar - 1
NameError: name 'str2optimizer32bit' is not defined
#1281 opened by qingqinggu - 6
Communicate blocksize constraints to kernels that take blocksize as a runtime argument
#1317 opened by mm04926412 - 1
Runtime Error, cannot import name 'get_keys_to_not_convert' from 'transformers.integrations'
#1309 opened by zeruiz99 - 4
Unable to override PyTorch CUDA Version
#1315 opened by tinglvv - 2
Regarding bnb import error
#1306 opened by Mubashirshariq - 0
4bit quantized model.dequantize() fails on CPU
#1311 opened by npbool - 0
RuntimeError: CUDA Setup failed despite GPU being available. Please run the following command to get more information: python -m bitsandbytes Inspect the output of the command and see if you can locate CUDA libraries. You might need to add them to your LD_LIBRARY_PATH. If you suspect a bug, please take the information from python -m bitsandbytes and open an issue at: https://github.com/TimDettmers/bitsandbytes/issues
#1307 opened by senzawapoi - 0
bitsandbytes 8-bit quantized Llama 3.1 sometimes gets stuck when producing output
#1304 opened by Techbhatia - 1
Clarifying the quantization algorithm
#1283 opened by chrisjmccormick - 0
> I encountered the same issue on CUDA 11.6 and fixed it by building bitsandbytes from source. Below is my bash script for reference:
#1297 opened by insafim - 1
CUDA Setup failed despite GPU being available
#1289 opened by Keertiraj - 0
Who owns bitsandbytes?
#1288 opened by garrettbyrd