POSTECH-CVLab/point-transformer

Trouble executing S3DIS : No module named 'pointops_cuda'

QuanticDisaster opened this issue · 3 comments

Hello,

I get an error when running the semantic segmentation on S3DIS with the command given in the README:

Totally 204 samples in train set.
Totally 68 samples in val set.
Traceback (most recent call last):
  File "/home/user/Documents/git_zip/point-transformer/exp/s3dis/pointtransformer_repro/train.py", line 371, in <module>
    main()
  File "/home/user/Documents/git_zip/point-transformer/exp/s3dis/pointtransformer_repro/train.py", line 88, in main
    mp.spawn(main_worker, nprocs=args.ngpus_per_node, args=(args.ngpus_per_node, args))
  File "/home/user/anaconda3/lib/python3.9/site-packages/torch/multiprocessing/spawn.py", line 230, in spawn
    return start_processes(fn, args, nprocs, join, daemon, start_method='spawn')
  File "/home/user/anaconda3/lib/python3.9/site-packages/torch/multiprocessing/spawn.py", line 188, in start_processes
    while not context.join():
  File "/home/user/anaconda3/lib/python3.9/site-packages/torch/multiprocessing/spawn.py", line 150, in join
    raise ProcessRaisedException(msg, error_index, failed_process.pid)
torch.multiprocessing.spawn.ProcessRaisedException: 

-- Process 0 terminated with the following error:
Traceback (most recent call last):
  File "/home/user/anaconda3/lib/python3.9/site-packages/torch/multiprocessing/spawn.py", line 59, in _wrap
    fn(i, *args)
  File "/home/user/Documents/git_zip/point-transformer/exp/s3dis/pointtransformer_repro/train.py", line 104, in main_worker
    from model.pointtransformer.pointtransformer_seg import pointtransformer_seg_repro as Model
  File "/home/user/Documents/git_zip/point-transformer/model/pointtransformer/pointtransformer_seg.py", line 4, in <module>
    from lib.pointops.functions import pointops
  File "/home/user/Documents/git_zip/point-transformer/lib/pointops/functions/pointops.py", line 7, in <module>
    import pointops_cuda
ModuleNotFoundError: No module named 'pointops_cuda'

The compilation of pointops itself completed without errors:

running install
running bdist_egg
running egg_info
writing pointops.egg-info/PKG-INFO
writing dependency_links to pointops.egg-info/dependency_links.txt
writing top-level names to pointops.egg-info/top_level.txt
reading manifest file 'pointops.egg-info/SOURCES.txt'
writing manifest file 'pointops.egg-info/SOURCES.txt'
installing library code to build/bdist.linux-x86_64/egg
running install_lib
running build_ext
building 'pointops_cuda' extension
Emitting ninja build file /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/build.ninja...
Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
[1/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/aggregation/aggregation_cuda.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/aggregation/aggregation_cuda.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/aggregation/aggregation_cuda.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensorApply.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensor.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THCTensor.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THC.h:12,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/aggregation/aggregation_cuda.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
[2/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/sampling/sampling_cuda.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/sampling/sampling_cuda.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/sampling/sampling_cuda.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensorApply.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensor.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THCTensor.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THC.h:12,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/sampling/sampling_cuda.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
[3/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/interpolation/interpolation_cuda.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/interpolation/interpolation_cuda.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/interpolation/interpolation_cuda.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensorApply.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensor.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THCTensor.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THC.h:12,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/interpolation/interpolation_cuda.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
[4/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/grouping/grouping_cuda.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/grouping/grouping_cuda.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/grouping/grouping_cuda.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensorApply.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensor.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THCTensor.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THC.h:12,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/grouping/grouping_cuda.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
[5/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/knnquery/knnquery_cuda.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/knnquery/knnquery_cuda.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/knnquery/knnquery_cuda.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensorApply.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensor.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THCTensor.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THC.h:12,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/knnquery/knnquery_cuda.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
[6/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/subtraction/subtraction_cuda.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/subtraction/subtraction_cuda.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/subtraction/subtraction_cuda.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensorApply.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH/THTensor.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THCTensor.h:4,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC/THC.h:12,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/subtraction/subtraction_cuda.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
[7/13] /usr/local/cuda-11.1/bin/nvcc  -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/subtraction/subtraction_cuda_kernel.cu -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/subtraction/subtraction_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 -std=c++14
[8/13] /usr/local/cuda-11.1/bin/nvcc  -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/sampling/sampling_cuda_kernel.cu -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/sampling/sampling_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 -std=c++14
[9/13] /usr/local/cuda-11.1/bin/nvcc  -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/aggregation/aggregation_cuda_kernel.cu -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/aggregation/aggregation_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 -std=c++14
[10/13] /usr/local/cuda-11.1/bin/nvcc  -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/grouping/grouping_cuda_kernel.cu -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/grouping/grouping_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 -std=c++14
[11/13] /usr/local/cuda-11.1/bin/nvcc  -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/knnquery/knnquery_cuda_kernel.cu -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/knnquery/knnquery_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 -std=c++14
[12/13] /usr/local/cuda-11.1/bin/nvcc  -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/interpolation/interpolation_cuda_kernel.cu -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/interpolation/interpolation_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 -std=c++14
[13/13] c++ -MMD -MF /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/pointops_api.o.d -pthread -B /home/user/anaconda3/envs/pt/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/TH -I/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/THC -I/usr/local/cuda-11.1/include -I/home/user/anaconda3/envs/pt/include/python3.7m -c -c /home/user/Documents/git_zip/point-transformer/lib/pointops/src/pointops_api.cpp -o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/pointops_api.o -g -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=pointops_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++14
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/Parallel.h:140,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include/torch/utils.h:3,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include/torch/nn/cloneable.h:5,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include/torch/nn.h:3,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/csrc/api/include/torch/all.h:13,
                 from /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/torch/extension.h:4,
                 from /home/user/Documents/git_zip/point-transformer/lib/pointops/src/pointops_api.cpp:2:
/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/include/ATen/ParallelOpenMP.h:87: warning: ignoring #pragma omp parallel [-Wunknown-pragmas]
   87 | #pragma omp parallel for if ((end - begin) >= grain_size)
      | 
creating build/lib.linux-x86_64-3.7
g++ -pthread -shared -B /home/user/anaconda3/envs/pt/compiler_compat -L/home/user/anaconda3/envs/pt/lib -Wl,-rpath=/home/user/anaconda3/envs/pt/lib -Wl,--no-as-needed -Wl,--sysroot=/ /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/pointops_api.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/knnquery/knnquery_cuda.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/knnquery/knnquery_cuda_kernel.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/sampling/sampling_cuda.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/sampling/sampling_cuda_kernel.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/grouping/grouping_cuda.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/grouping/grouping_cuda_kernel.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/interpolation/interpolation_cuda.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/interpolation/interpolation_cuda_kernel.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/subtraction/subtraction_cuda.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/subtraction/subtraction_cuda_kernel.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/aggregation/aggregation_cuda.o /home/user/Documents/git_zip/point-transformer/lib/pointops/build/temp.linux-x86_64-3.7/src/aggregation/aggregation_cuda_kernel.o -L/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/torch/lib -L/usr/local/cuda-11.1/lib64 -lc10 -ltorch -ltorch_cpu -ltorch_python -lcudart -lc10_cuda -ltorch_cuda_cu -ltorch_cuda_cpp -o 
build/lib.linux-x86_64-3.7/pointops_cuda.cpython-37m-x86_64-linux-gnu.so
creating build/bdist.linux-x86_64
creating build/bdist.linux-x86_64/egg
copying build/lib.linux-x86_64-3.7/pointops_cuda.cpython-37m-x86_64-linux-gnu.so -> build/bdist.linux-x86_64/egg
creating stub loader for pointops_cuda.cpython-37m-x86_64-linux-gnu.so
byte-compiling build/bdist.linux-x86_64/egg/pointops_cuda.py to pointops_cuda.cpython-37.pyc
creating build/bdist.linux-x86_64/egg/EGG-INFO
copying pointops.egg-info/PKG-INFO -> build/bdist.linux-x86_64/egg/EGG-INFO
copying pointops.egg-info/SOURCES.txt -> build/bdist.linux-x86_64/egg/EGG-INFO
copying pointops.egg-info/dependency_links.txt -> build/bdist.linux-x86_64/egg/EGG-INFO
copying pointops.egg-info/top_level.txt -> build/bdist.linux-x86_64/egg/EGG-INFO
writing build/bdist.linux-x86_64/egg/EGG-INFO/native_libs.txt
zip_safe flag not set; analyzing archive contents...
__pycache__.pointops_cuda.cpython-37: module references __file__
creating dist
creating 'dist/pointops-0.0.0-py3.7-linux-x86_64.egg' and adding 'build/bdist.linux-x86_64/egg' to it
removing 'build/bdist.linux-x86_64/egg' (and everything under it)
Processing pointops-0.0.0-py3.7-linux-x86_64.egg
creating /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/pointops-0.0.0-py3.7-linux-x86_64.egg
Extracting pointops-0.0.0-py3.7-linux-x86_64.egg to /home/user/anaconda3/envs/pt/lib/python3.7/site-packages
Adding pointops 0.0.0 to easy-install.pth file

Installed /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/pointops-0.0.0-py3.7-linux-x86_64.egg
Processing dependencies for pointops==0.0.0
Finished processing dependencies for pointops==0.0.0

Package versions:

# Name                    Version                   Build  Channel
_libgcc_mutex             0.1                        main  
_openmp_mutex             4.5                       1_gnu  
blas                      1.0                         mkl  
ca-certificates           2021.10.8            ha878542_0    conda-forge
certifi                   2021.10.8        py37h89c1867_1    conda-forge
cudatoolkit               11.1.74              h6bb024c_0    nvidia
freetype                  2.11.0               h70c0345_0  
giflib                    5.2.1                h7b6447c_0  
h5py                      2.10.0           py37hd6299e0_1    anaconda
hdf5                      1.10.6               hb1b8bf9_0    anaconda
intel-openmp              2021.4.0          h06a4308_3561  
jpeg                      9d                   h7f8727e_0  
lcms2                     2.12                 h3be6417_0  
ld_impl_linux-64          2.35.1               h7274673_9  
libffi                    3.3                  he6710b0_2  
libgcc-ng                 9.3.0               h5101ec6_17  
libgfortran-ng            7.3.0                hdf63c60_0    anaconda
libgomp                   9.3.0               h5101ec6_17  
libpng                    1.6.37               hbc83047_0  
libprotobuf               3.15.8               h780b84a_0    conda-forge
libstdcxx-ng              9.3.0               hd4cf53a_17  
libtiff                   4.2.0                h85742a9_0  
libuv                     1.40.0               h7b6447c_0  
libwebp                   1.2.0                h89dd481_0  
libwebp-base              1.2.0                h27cfd23_0  
lz4-c                     1.9.3                h295c915_1  
mkl                       2021.4.0           h06a4308_640  
mkl-service               2.4.0            py37h7f8727e_0  
mkl_fft                   1.3.1            py37hd3c417c_0  
mkl_random                1.2.2            py37h51133e4_0  
ncurses                   6.3                  h7f8727e_2  
ninja                     1.10.2           py37hd09550d_3  
numpy                     1.21.2           py37h20f2e39_0  
numpy-base                1.21.2           py37h79a1101_0  
olefile                   0.46                     py37_0  
openssl                   1.1.1l               h7f8727e_0  
pillow                    8.4.0            py37h5aabda8_0  
pip                       21.2.2           py37h06a4308_0  
pointops                  0.0.0                    pypi_0    pypi
protobuf                  3.15.8           py37hcd2ae1e_0    conda-forge
python                    3.7.11               h12debd9_0  
python_abi                3.7                     2_cp37m    conda-forge
pytorch                   1.9.0           py3.7_cuda11.1_cudnn8.0.5_0    pytorch
pyyaml                    5.3.1            py37h7b6447c_1    anaconda
readline                  8.1                  h27cfd23_0  
setuptools                58.0.4           py37h06a4308_0  
sharedarray               3.2.1            py37h6f94858_2    conda-forge
six                       1.16.0             pyhd3eb1b0_0  
sqlite                    3.36.0               hc218d9a_0  
tensorboardx              2.4                pyhd8ed1ab_0    conda-forge
tk                        8.6.11               h1ccaba5_0  
torchvision               0.2.2                      py_3    pytorch
typing_extensions         3.10.0.2           pyh06a4308_0  
wheel                     0.37.0             pyhd3eb1b0_1  
xz                        5.2.5                h7b6447c_0  
yaml                      0.2.5                h7b6447c_0    anaconda
zlib                      1.2.11               h7b6447c_3  
zstd                      1.4.9                haebb681_0  

nvcc -V gives:

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2020 NVIDIA Corporation
Built on Mon_Oct_12_20:09:46_PDT_2020
Cuda compilation tools, release 11.1, V11.1.105
Build cuda_11.1.TC455_06.29190527_0

Thx !

For anyone hitting the same problem, I seem to have found a workaround. The pointops_cuda.py file sits inside the pointops egg in your anaconda environment. For me it was here: /home/user/anaconda3/envs/pt/lib/python3.7/site-packages/pointops-0.0.0-py3.7-linux-x86_64.egg

Add this path to Python's module search path by adding the following two lines after the imports in the file "train.py":

import sys
sys.path.append('/home/user/anaconda3/envs/pt/lib/python3.7/site-packages/pointops-0.0.0-py3.7-linux-x86_64.egg')

However, a second problem occurred in my case: an error saying that the Python interpreter in use (3.9) was different from the one the library was compiled with (3.7). So I had to add eval "conda activate pt" after the first eval in the train.sh file.

The problem now seems to be resolved.
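A slightly more defensive variant of the sys.path workaround above could check the Python version encoded in the egg name before adding it, which would have flagged the 3.9 vs 3.7 mismatch early. This is only a sketch: the helper names egg_python_version and add_matching_egg are made up for illustration and are not part of the repository, and it assumes the egg follows the usual setuptools naming (pointops-<version>-py<X.Y>-<platform>.egg).

```python
import re
import sys
from pathlib import Path


def egg_python_version(egg_name):
    """Parse (major, minor) from an egg name such as
    'pointops-0.0.0-py3.7-linux-x86_64.egg'; return None if absent."""
    match = re.search(r"-py(\d+)\.(\d+)", egg_name)
    return (int(match.group(1)), int(match.group(2))) if match else None


def add_matching_egg(site_packages, running=None):
    """Append the first pointops egg built for the running Python to sys.path.

    `running` defaults to the current interpreter's (major, minor).
    Returns the path that was added, or None if no egg matches.
    """
    running = tuple(running or sys.version_info[:2])
    for egg in sorted(Path(site_packages).glob("pointops-*.egg")):
        if egg_python_version(egg.name) == running:
            if str(egg) not in sys.path:
                sys.path.append(str(egg))
            return str(egg)
    return None
```

In train.py this could be called as add_matching_egg('/home/user/anaconda3/envs/pt/lib/python3.7/site-packages'). A return value of None means the interpreter running the script is not the one the extension was built for, so the conda environment should be activated rather than the path patched.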

After this, I came across this problem:

Traceback (most recent call last):
  File "exp/s3dis/pointtransformer_repro/train.py", line 374, in <module>
    main()
  File "exp/s3dis/pointtransformer_repro/train.py", line 91, in main
    mp.spawn(main_worker, nprocs=args.ngpus_per_node, args=(args.ngpus_per_node, args))
  File "/home/liy0r/anaconda3/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 171, in spawn
    while not spawn_context.join():
  File "/home/liy0r/anaconda3/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 118, in join
    raise Exception(msg)
Exception:

-- Process 1 terminated with the following error:
Traceback (most recent call last):
  File "/home/liy0r/anaconda3/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 19, in _wrap
    fn(i, *args)
  File "/ibex/scratch/liy0r/cvpr/point-transformer/exp/s3dis/pointtransformer_repro/train.py", line 107, in main_worker
    from model.pointtransformer.pointtransformer_seg import pointtransformer_seg_repro as Model
  File "/ibex/scratch/liy0r/cvpr/point-transformer/model/pointtransformer/pointtransformer_seg.py", line 4, in <module>
    from lib.pointops.functions import pointops
  File "/ibex/scratch/liy0r/cvpr/point-transformer/lib/pointops/functions/pointops.py", line 7, in <module>
    import pointops_cuda
  File "/home/liy0r/anaconda3/envs/pt/lib/python3.7/site-packages/pointops-0.0.0-py3.7-linux-x86_64.egg/pointops_cuda.py", line 9, in <module>
    bootstrap()
  File "/home/liy0r/anaconda3/envs/pt/lib/python3.7/site-packages/pointops-0.0.0-py3.7-linux-x86_64.egg/pointops_cuda.py", line 7, in bootstrap
    mod = importlib.util.module_from_spec(spec)
ImportError: libtorch_cpu.so: cannot open shared object file: No such file or directory

Okay, I fixed it by adding conda activate pt to train.sh. Hahaha, interesting.
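Both errors in this thread appear to come from train.py being run by the base anaconda interpreter rather than the one in the pt environment. A quick check like the following, placed at the top of train.py, might make that visible before the import fails. It is a sketch under assumptions: the function names are hypothetical, "pt" and "3.7" are simply the values from this thread, and it relies on the CONDA_DEFAULT_ENV variable that conda activate sets.

```python
import os
import sys


def describe_interpreter(environ=None, prefix=None, version=None):
    """Collect the facts that matter here: Python version, install prefix,
    and the active conda environment name (None if conda is not active)."""
    environ = os.environ if environ is None else environ
    prefix = sys.prefix if prefix is None else prefix
    version = sys.version_info[:2] if version is None else version
    return {
        "python": "%d.%d" % tuple(version),
        "prefix": prefix,
        "conda_env": environ.get("CONDA_DEFAULT_ENV"),
    }


def check_env(expected_env, expected_python, **overrides):
    """Return a list of human-readable mismatches (empty if all is fine)."""
    info = describe_interpreter(**overrides)
    problems = []
    if info["conda_env"] != expected_env:
        problems.append(
            "active conda env is %r, expected %r" % (info["conda_env"], expected_env)
        )
    if info["python"] != expected_python:
        problems.append(
            "running Python %s from %s, expected %s"
            % (info["python"], info["prefix"], expected_python)
        )
    return problems
```

Calling check_env("pt", "3.7") and printing the result should return an empty list once conda activate pt has been added to train.sh. A non-empty list suggests the script is still being launched from the wrong environment.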