AIWintermuteAI/aXeleRate

Training is not using GPU

Closed this issue · 5 comments

It seems CUDA_VISIBLE_DEVICES env variable has no effect on this training speed.

Can you provide more details? Are you running aXeleRate in Colab or local computer?
If local computer: native installation, virtual env or conda?

I am running on local computer with conda. I have another conda environment with tf2.2. There GPU works perfectly fine. aXcelerate seems to be doing something, I used tensorflow_gpu 1.15 but still no luck. Next I will see how much effort is there to switch to tf 2.2 while still using aXcelerate. Will try to run it on colab as well. I generally prefer local, but whatever gets the job done is good by me.

If you do port it to tf 2.2, make a PR :) not complicated, but that is quite a lot of work though.
Try making clean environment, then install aXeleRate with pip there and after that do conda install tensorflow-gpu==1.15 - that will make sure you have CUDA installed in conda environment you've activated. aXeleRate by itself doesn't influence GPU/CUDA usage. But from what I understand, tensorflow installed in conda environment needs CUDA installed with conda install command and will not use system-wide CUDA.

Try this
git clone https://github.com/AIWintermuteAI/aXeleRate.git
cd aXeleRate
python setup.py install
conda install tensorflow-gpu==1.15 keras

I just did this with new clean conda environment - after python setup.py install I wasn't able to use GPU, but when I did conda install tensorflow-gpu==1.15 keras it automatically installed necessary CUDA/CUDNN packages in the environment and GPU acceleration was back.

If this solution works for you, please do give feedback :)

That was strange. I had installed the same version using pip. But an install with conda of same package solved the probelm.
Thanks.