bytedeco/javacpp-presets

[PyTorch] torch.cuda.is_bf16_supported() is missing

haifengl opened this issue · 8 comments

I cannot find it anywhere.

That is a Python-only function.
From the presets, you can check the device compute capability, or just try to create a small BF16 tensor.

Thanks! How do I check the device compute capability? torch_cuda.getDeviceProperties() returns a plain Pointer.

Also, how do I get the CUDA runtime version, as with cudaRuntimeGetVersion()? torch.C10_CUDA_VERSION_MAJOR seems to be the compile-time version.

Thanks! How do I check the device compute capability? torch_cuda.getDeviceProperties() returns a plain Pointer.

Right. That's something I'm currently working on. The next version of the PyTorch presets will depend on the CUDA presets, and this kind of function will return the proper type.
In the meantime, you could directly use the CUDA presets.
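For instance, here is a minimal sketch that reads the compute capability of device 0 through the CUDA presets (assuming cudaGetDeviceProperties is exposed under that name; recent CUDA releases alias it to cudaGetDeviceProperties_v2):

```java
import org.bytedeco.cuda.cudart.cudaDeviceProp;
import static org.bytedeco.cuda.global.cudart.*;

public class ComputeCapability {
    public static void main(String[] args) {
        // Query the properties of CUDA device 0 via the CUDA presets.
        cudaDeviceProp prop = new cudaDeviceProp();
        int status = cudaGetDeviceProperties(prop, 0);
        if (status != cudaSuccess) {
            throw new RuntimeException("cudaGetDeviceProperties failed: " + status);
        }
        // BF16 matmul acceleration requires Ampere or newer,
        // i.e. compute capability >= 8.0.
        System.out.printf("Compute capability: %d.%d%n", prop.major(), prop.minor());
    }
}
```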

Also, how do I get the CUDA runtime version, as with cudaRuntimeGetVersion()? torch.C10_CUDA_VERSION_MAJOR seems to be the compile-time version.

I'm not sure. Maybe there is a way using the CUDA presets.
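The runtime API does have cudaRuntimeGetVersion(), and the CUDA presets should wrap it. A minimal sketch:

```java
import org.bytedeco.javacpp.IntPointer;
import static org.bytedeco.cuda.global.cudart.*;

public class RuntimeVersion {
    public static void main(String[] args) {
        // Unlike a compile-time constant, cudaRuntimeGetVersion() reports the
        // version of the CUDA runtime that is actually loaded at run time.
        IntPointer version = new IntPointer(1);
        if (cudaRuntimeGetVersion(version) != cudaSuccess) {
            throw new RuntimeException("cudaRuntimeGetVersion failed");
        }
        int v = version.get();  // encoded as 1000 * major + 10 * minor, e.g. 11080
        System.out.printf("CUDA runtime %d.%d%n", v / 1000, (v % 1000) / 10);
    }
}
```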

If your final objective is the one in your top post, I'd suggest creating a BF16 GPU tensor and catching the exception.
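Something along these lines, as a sketch only; the TensorOptions and factory signatures here are assumptions and may differ between preset versions:

```java
import org.bytedeco.pytorch.*;
import org.bytedeco.pytorch.global.torch;

public class Bf16Probe {
    public static void main(String[] args) {
        try {
            // Hypothetical probe: allocate a tiny BF16 tensor on the GPU and run
            // a matmul; libtorch throws if BF16 is rejected on this device.
            TensorOptions options = new TensorOptions(torch.ScalarType.BFloat16)
                    .device(new Device("cuda"));
            Tensor a = torch.ones(new long[]{4, 4}, options);
            Tensor b = torch.matmul(a, a);
            System.out.println("BF16 matmul succeeded");
        } catch (RuntimeException e) {
            // JavaCPP surfaces c10::Error as a RuntimeException.
            System.out.println("BF16 not usable here: " + e.getMessage());
        }
    }
}
```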

Thanks. BTW, torch.C10_CUDA_VERSION_MAJOR and torch.C10_CUDA_VERSION are always 0, which is not correct.

Creating a BF16 tensor is not a sufficient check on its own. On pre-Ampere hardware BF16 works, but it provides no speed-up over FP32 matmul operations, and some matmul operations fail outright. So I would like to check the CUDA version and the device compute capability.

Thanks!
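Putting the two checks together, one way to approximate torch.cuda.is_bf16_supported() from Java, assuming the CUDA preset calls sketched above (the thresholds mirror the Python helper at the time of writing: CUDA runtime >= 11 and compute capability >= 8.0):

```java
import org.bytedeco.cuda.cudart.cudaDeviceProp;
import org.bytedeco.javacpp.IntPointer;
import static org.bytedeco.cuda.global.cudart.*;

public class Bf16Support {
    // Approximates torch.cuda.is_bf16_supported(): CUDA runtime >= 11 and
    // device compute capability >= 8.0 (Ampere or newer). The thresholds may
    // change between PyTorch releases.
    static boolean isBf16Supported(int device) {
        IntPointer version = new IntPointer(1);
        if (cudaRuntimeGetVersion(version) != cudaSuccess || version.get() < 11000) {
            return false;
        }
        cudaDeviceProp prop = new cudaDeviceProp();
        return cudaGetDeviceProperties(prop, device) == cudaSuccess && prop.major() >= 8;
    }

    public static void main(String[] args) {
        System.out.println("BF16 supported on device 0: " + isBf16Supported(0));
    }
}
```

Unlike the allocation probe, this mirrors the version and capability test that the Python helper performs, so it avoids the pre-Ampere false positive described above.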

Although these methods work fine on a single-GPU box, they hang on a multi-GPU box.