huggingface/diffusion-models-class

Hackathon - unable to get bitsandbytes to detect CUDA

StatsGary opened this issue · 1 comments

I have tried the notebook on Google Cloud and run into exactly the same error as with a local install. The instance is a Google Cloud VM with one Tesla T4 GPU and 30GB RAM. I have scaled the RAM up and down to try to make this run, but to no avail. The error message I am still seeing is:

CUDA Setup failed despite GPU being available. Inspect the CUDA SETUP outputs above to fix your environment! If you cannot find any issues and suspect a bug, please open an issue with detals about your environment: https://github.com/TimDettmers/bitsandbytes/issues

I tried the following steps to resolve this, but the issue persists:

  1. git clone git@github.com:TimDettmers/bitsandbytes.git
  2. cd bitsandbytes
  3. CUDA_VERSION=116
  4. python setup.py install

Is anyone else seeing the same issue? nvidia-smi shows that my GPU is detected, and it works well with PyTorch.

I had conflicting versions of CUDA on my machine, caused by installing the CUDA toolkit separately on my Ubuntu 20.04 system. I removed the CUDA toolkit from the environment once I realised that PyTorch already bundles it. For anyone else with the same problem: make sure you have only one CUDA install, then rerun the tool.
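
In case it helps anyone check for the same conflict, here is a rough sketch of a script that looks for more than one CUDA runtime on the machine. It is not part of bitsandbytes or PyTorch; the helper names (`find_cudart_libs`, `cuda_versions`, `has_conflict`) are made up for this example, and the search locations (`LD_LIBRARY_PATH` and `/usr/local/cuda*/lib64`) are assumptions about a typical Ubuntu setup, so adjust them for your own machine.

```python
import glob
import os
import re


def find_cudart_libs(search_dirs):
    """Map each directory to the libcudart.so* file names found in it."""
    found = {}
    for d in search_dirs:
        if not os.path.isdir(d):
            continue
        names = sorted(n for n in os.listdir(d) if n.startswith("libcudart.so"))
        if names:
            found[d] = names
    return found


def cuda_versions(found):
    """Collect distinct major.minor versions from names like libcudart.so.11.6.55."""
    versions = set()
    for names in found.values():
        for name in names:
            m = re.match(r"libcudart\.so\.(\d+)\.(\d+)", name)
            if m:
                versions.add(f"{m.group(1)}.{m.group(2)}")
    return versions


def has_conflict(versions):
    """More than one distinct CUDA runtime version suggests conflicting installs."""
    return len(versions) > 1


if __name__ == "__main__":
    # Assumed search locations; a conda or pip environment may keep libraries elsewhere.
    dirs = [p for p in os.environ.get("LD_LIBRARY_PATH", "").split(":") if p]
    dirs += glob.glob("/usr/local/cuda*/lib64")

    found = find_cudart_libs(dirs)
    for d, names in found.items():
        print(d, names)

    versions = cuda_versions(found)
    print("CUDA runtime versions found:", sorted(versions))
    if has_conflict(versions):
        print("More than one CUDA runtime version found; this may be the conflict.")

    # Optional: report the CUDA version PyTorch was built against, if PyTorch is installed.
    try:
        import torch
        print("PyTorch CUDA build:", torch.version.cuda)
        print("torch.cuda.is_available():", torch.cuda.is_available())
    except ImportError:
        print("PyTorch is not installed in this environment.")
```

This only compares file names, so treat a reported conflict as a hint about where to look rather than proof of the cause.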