Error calculating cat gradient on GPU

Question

Error calculating cat gradient on GPU

fabricerosay opened this issue 4 years ago · 4 comments

let

t(x)=sum(cat(x,x,dims=3))

Then julia grad(t)(rand(6,6,1,1)) works.
But julia grad(t)(KnetArray(rand(6,6,1,1))) returns the following errors CuArray has no field ptr.

Following the error message leads to this funtion in karray.jl

function KnetArray(x::CuArray{T,N}) where {T,N}
    p = Base.bitcast(Cptr, x.ptr)
    k = KnetPtr(p, sizeof(x), Int(CUDA.device().handle), x)
    KnetArray{T,N}(k, size(x))
end

If I change it with:

function KnetArray(x::CuArray{T,N}) where {T,N}
    p = Base.bitcast(Cptr, x.baseptr)
    k = KnetPtr(p, sizeof(x), Int(CUDA.device().handle), x)
    KnetArray{T,N}(k, size(x))
end

Then the calculation works, the field ptr in Cuarrays has changed to baseptr with CUDA.jl
Would this work as intended ?

Answer 1 · 2020-11-13T18:05:54.000Z

Fixed indeed the CUDNN errors in LSTM (as long as using a newer version of CUDA). Thanks!!!

Answer 2 · 2020-11-15T14:12:54.000Z

Similar issue when upgrading to CUDA 2.2.1 -- not just LSTM, but also in my 3D DenseNet.

Answer 3 · 2020-11-27T12:25:39.000Z

It seems like CuArray field name changed from ptr to baseptr in CUDA@2.1.0.
Using pointer(a) seems to work in both cases, I will fix this and check for other cases of a.ptr in the code.

Answer 4 · 2020-11-27T12:46:12.000Z

#636 fixes this.