Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fails to build against CUDA 10 #834

Open
eigengrau opened this issue Oct 1, 2018 · 50 comments
Open

Fails to build against CUDA 10 #834

eigengrau opened this issue Oct 1, 2018 · 50 comments

Comments

@eigengrau
Copy link

eigengrau commented Oct 1, 2018

When building HEAD against CUDA 10.0.130, the build fails with:

/home/seb/tmp/build/torch7-cutorch-git/src/torch7-cutorch-git/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/run/user/1000/tmpxft_00007438_00000000-4_THCTensorIndex.cpp4.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
  Error generating file
  /home/seb/tmp/build/torch7-cutorch-git/src/torch7-cutorch-git/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o
@granzuglia
Copy link

I have exactly the same issue.

@nagadomi
Copy link

nagadomi commented Oct 10, 2018

EDIT: I have merged these changes to my distro repo. https://github.com/nagadomi/distro

Building Torch with CUDA10 has two problems.

  1. cmake/3.6/Modules/FindCUDA.cmake is outdated.
  2. atomicAdd(__half *address, __half val) is defined in /usr/local/cuda/include/cuda_fp16.h (duplicated).

How to build Torch with CUDA10

  1. Install the latest CMake from github repo (the latest FindCUDA.cmake will be installed)
$ sudo apt-get purge cmake
$ git clone https://github.com/Kitware/CMake.git
$ cd CMake
$ ./bootstrap; make; sudo make install
  1. Remove FindCUDA.cmake.
$ cd ~/torch
$ rm -fr cmake/3.6/Modules/FindCUDA*
  1. Apply the following patch to cutorch
diff --git a/lib/THC/THCAtomics.cuh b/lib/THC/THCAtomics.cuh
index 400875c..ccb7a1c 100644
--- a/lib/THC/THCAtomics.cuh
+++ b/lib/THC/THCAtomics.cuh
@@ -94,6 +94,7 @@ static inline __device__ void atomicAdd(long *address, long val) {
 }
 
 #ifdef CUDA_HALF_TENSOR
+#if !(__CUDA_ARCH__ >= 700 || !defined(__CUDA_ARCH__) )
 static inline  __device__ void atomicAdd(half *address, half val) {
   unsigned int * address_as_ui =
       (unsigned int *) ((char *)address - ((size_t)address & 2));
@@ -117,6 +118,7 @@ static inline  __device__ void atomicAdd(half *address, half val) {
    } while (assumed != old);
 }
 #endif
+#endif
$ cd extra/cutorch
$ cat > atomic.patch
<copy and paste the patch>
$ patch -p1 < atomic.patch
  1. Build
$ ./clean.sh
$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
$ ./install.sh

I was successfully build Torch with CUDA10 with the above steps.

@WhiskyChoy
Copy link

WhiskyChoy commented Oct 12, 2018

@nagadomi Yeah, it really works! Thanks a lot. I learnt elsewhere that a newer version of cmake is needed so I installed one. However I didn't remove the FindCUDA.cmake the torch package contained. Now it works well, after I finished the steps you recommended.

@Archit01
Copy link

Thanks

@HZRun
Copy link

HZRun commented Oct 19, 2018

@nagadomi It works following your steps. Thank you

@PhilipDro
Copy link

@nagadomi Thanks a lot that helped with my issue. It seems though, that removing FindCUDA will cause problems when trying to install cuDDN.

@nagadomi
Copy link

@PhilipDro
The master branch of cuDNN.torch does not support cuDNN v7. Installing from R7 branch probably works fine.

$ git clone https://github.com/soumith/cudnn.torch.git -b R7
$ cd cudnn.torch
$ luarocks make cudnn-scm-1.rockspec

@PhilipDro
Copy link

@nagadomi Thank you for your response.
Your suggested workaround does not work unfortunately.
I get following error:

`CMake Error at /home/philip/torch/install/share/cmake/torch/FindCUDA.cmake:617 (message):
Specify CUDA_TOOLKIT_ROOT_DIR
Call Stack (most recent call first):
CMakeLists.txt:14 (FIND_PACKAGE)

-- Configuring incomplete, errors occurred!
See also "/home/philip/cudnn.torch/build/CMakeFiles/CMakeOutput.log".

Error: Build error: Failed building.
`

@Anupama94
Copy link

Try changing the gcc version to gcc-6. Worked for me!

@Cdfghglz
Copy link

I also could not make it work until I did not kick out these lines:

diff --git a/install.sh b/install.sh
index ad65434..65a0163 100755
--- a/install.sh
+++ b/install.sh
@@ -99,9 +99,6 @@ fi
 if [ -x "$path_to_nvcc" ] || [ -x "$path_to_nvidiasmi" ]
 then
     echo "Found CUDA on your machine. Installing CMake 3.6 modules to get up-to-date FindCUDA"
-    cd ${THIS_DIR}/cmake/3.6 && \
-(cmake -E make_directory build && cd build && cmake .. -DCMAKE_INSTALL_PREFIX="${PREFIX}" \
-        && make install) && echo "FindCuda bits of CMake 3.6 installed" || exit 1
 fi

After this it just used the newest cmake (3.13 for me) and did not complain any more.

@JohnHammell
Copy link

Thank you @nagadomi ! Your instructions above were very helpful in getting Torch7 installed and running with CUDA 10 & cuDNN 7.3 on Ubuntu 18.04 LTS

@rahilwazir
Copy link

rahilwazir commented Nov 12, 2018

@nagadomi I'm trying to install it via luarocks install cutorch but your workaround didn't work for me. Please help me with this

...
...
[ 16%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathScan.cu.o
[ 17%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o
/tmp/luarocks_cutorch-scm-1-6307/cutorch/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/jet/tmp/tmpxft_000050de_00000000-12_THCTensorIndex.compute_70.cpp1.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
  Error generating file
  /tmp/luarocks_cutorch-scm-1-9488/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o


lib/THC/CMakeFiles/THC.dir/build.make:161: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....

Using the newer CMake with the applied patch

cmake --version
cmake version 3.13.20181111-g206ce3

CMake suite maintained and supported by Kitware (kitware.com/cmake).

@kyang-06
Copy link

@nagadomi I'm trying to install it via luarocks install cutorch but your workaround didn't work for me. Please help me with this

...
...
[ 16%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathScan.cu.o
[ 17%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o
/tmp/luarocks_cutorch-scm-1-6307/cutorch/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/jet/tmp/tmpxft_000050de_00000000-12_THCTensorIndex.compute_70.cpp1.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
  Error generating file
  /tmp/luarocks_cutorch-scm-1-9488/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o


lib/THC/CMakeFiles/THC.dir/build.make:161: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....

Using the newer CMake with the applied patch

cmake --version
cmake version 3.13.20181111-g206ce3

CMake suite maintained and supported by Kitware (kitware.com/cmake).

have the same issue with you

@nagadomi
Copy link

I'm trying to install it via luarocks install cutorch

luarocks install cutorch command downloads cutorch from git repo and installs it, so the patch will not be applied. To install modified cutorch from local disk, use the following command in cutorch dir.

$ luarocks make rocks/cutorch-scm-1.rockspec

@kyang-06
Copy link

I'm trying to install it via luarocks install cutorch

luarocks install cutorch command downloads cutorch from git repo and installs it, so the patch will not be applied. To install modified cutorch from local disk, use the following command in cutorch dir.

$ luarocks make rocks/cutorch-scm-1.rockspec

Great! It works. Thank you so much

@doneforaiur
Copy link

I'm trying to install it via luarocks install cutorch

luarocks install cutorch command downloads cutorch from git repo and installs it, so the patch will not be applied. To install modified cutorch from local disk, use the following command in cutorch dir.

$ luarocks make rocks/cutorch-scm-1.rockspec
/home/lelu/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(393): error: more than one operator "==" matches these operands:
            function "operator==(const __half &, const __half &)"
            function "operator==(half, half)"
            operand types are: half == half

/home/lelu/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(414): error: more than one operator "==" matches these operands:
            function "operator==(const __half &, const __half &)"
            function "operator==(half, half)"
            operand types are: half == half

2 errors detected in the compilation of "/tmp/tmpxft_000021ca_00000000-4_THCTensorMath.cpp4.ii".
CMake Error at THC_generated_THCTensorMath.cu.o.Release.cmake:279 (message):
  Error generating file
  /home/lelu/torch/extra/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath.cu.o


lib/THC/CMakeFiles/THC.dir/build.make:3113: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o] Error 1
make[2]: *** Bitmemiş işler için bekliyor....

lib/THC/CMakeFiles/THC.dir/build.make:4792: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathPairwise.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathPairwise.cu.o] Error 1
CMakeFiles/Makefile2:177: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed
make[1]: *** [lib/THC/CMakeFiles/THC.dir/all] Error 2
Makefile:129: recipe for target 'all' failed
make: *** [all] Error 2

Error: Build error: Failed building.

@nagadomi I'm still having this issue. ;-;

@nagadomi
Copy link

@doneforaiur
Try the following command before luarocks make..

$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"

@lh14576
Copy link

lh14576 commented Dec 1, 2018

@nagadomi Thank you! This just saved my day! Was just about to return my RTX2070

@fiandro
Copy link

fiandro commented Dec 8, 2018

Thanks to @nagadomi for the help, I could successfully build luatorch against cuda 10 and cudnn7 on Ubuntu 16.04, RTX2080. I canonly add that it is not necessary to unistall the system-wide cmake, it is sufficient to build it from the sources on some user directory such as /home/user/myrepos/and then do
export PATH="/home/user/myrepos/CMake/bin:${PATH}"
before building luatorch.

@nagadomi
Copy link

nagadomi commented Dec 8, 2018

@fiandro
I merged changes for CUDA10 into a personal distro repo yesterday.
It was not really necessary to install newest CMake, but a few lines of patches for cmake/3.6/Modules/FindCUDA.cmake.
diff: nagadomi/distro@946affd#diff-9ea7fc52fa0c8de646449b3b8cda3ac9

Related to nagadomi/waifu2x#253 (comment)

@AlekzNet
Copy link

AlekzNet commented Dec 8, 2018

Does it work with gcc-8?

@AlekzNet
Copy link

AlekzNet commented Dec 8, 2018

Compilation fails with the following errors:

luarocks make rocks/cutorch-scm-1.rockspec                                                                                                     Warning: unmatched variable LUALIB  

jopts=$(getconf _NPROCESSORS_CONF)

echo "Building on $jopts cores"
cmake -E make_directory build && cd build && cmake .. -DLUALIB= -DLUA_INCDIR=/home/alekz/torch/install/include -DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS} -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/home/alekz/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/home/alekz/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$jopts install

Building on 8 cores
-- The C compiler identification is GNU 8.2.0
-- The CXX compiler identification is GNU 8.2.0
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Found Torch7 in /home/alekz/torch/install
-- Looking for pthread.h
-- Looking for pthread.h - found
-- Looking for pthread_kill
-- Looking for pthread_kill - not found
-- Looking for pthread_create in pthreads
-- Looking for pthread_create in pthreads - not found
-- Looking for pthread_create in pthread
-- Looking for pthread_create in pthread - found
-- Found Threads: TRUE  
-- Found CUDA: /usr/local/cuda (found suitable version "10.0", minimum required is "6.5") 
-- Removing -DNDEBUG from compile flags
-- TH_LIBRARIES: TH
-- MAGMA not found. Compiling without MAGMA support
-- Autodetected CUDA architecture(s):  5.2
-- got cuda version 10.0
-- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor
-- CUDA_NVCC_FLAGS: -D__CUDA_NO_HALF_OPERATORS__;-gencode;arch=compute_52,code=sm_52;-DCUDA_HAS_FP16=1
-- THC_SO_VERSION: 0
-- Performing Test HAS_LUAL_SETFUNCS
-- Performing Test HAS_LUAL_SETFUNCS - Failed
-- Configuring done
-- Generating done
-- Build files have been written to: /var/tmp/cutorch/build
[  2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o
[  2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o
[  3%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCBlas.cu.o
[  4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o
[  5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o
[  6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o
[  8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCSleep.cu.o
[  8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

6 errors detected in the compilation of "/tmp/tmpxft_00006cb1_00000000-6_THCSleep.cpp1.ii".
6 errors detected in the compilation of "/tmp/tmpxft_00006cb9_00000000-6_THCBlas.cpp1.ii".
CMake Error at THC_generated_THCSleep.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCSleep.cu.o


/usr/include/c++/8/type_traits(1049): error: type name is not allowed

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:79: lib/THC/CMakeFiles/THC.dir/THC_generated_THCSleep.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

CMake Error at THC_generated_THCBlas.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCBlas.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:72: lib/THC/CMakeFiles/THC.dir/THC_generated_THCBlas.cu.o] Error 1
/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

6 errors detected in the compilation of "/tmp/tmpxft_00006cca_00000000-6_THCReduceApplyUtils.cpp1.ii".
/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

CMake Error at THC_generated_THCReduceApplyUtils.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCReduceApplyUtils.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:65: lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o] Error 1
6 errors detected in the compilation of "/tmp/tmpxft_00006cd7_00000000-6_THCTensor.cpp1.ii".
CMake Error at THC_generated_THCTensor.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensor.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:100: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o] Error 1
6 errors detected in the compilation of "/tmp/tmpxft_00006ce2_00000000-6_THCStorageCopy.cpp1.ii".
CMake Error at THC_generated_THCStorageCopy.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCStorageCopy.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:93: lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o] Error 1
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

6 errors detected in the compilation of "/tmp/tmpxft_00006ce1_00000000-6_THCTensorCopy.cpp1.ii".
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

CMake Error at THC_generated_THCTensorCopy.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorCopy.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:107: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o] Error 1
/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

6 errors detected in the compilation of "/tmp/tmpxft_00006d00_00000000-6_THCHalf.cpp1.ii".
CMake Error at THC_generated_THCHalf.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCHalf.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:562: lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o] Error 1
6 errors detected in the compilation of "/tmp/tmpxft_00006d08_00000000-6_THCStorage.cpp1.ii".
CMake Error at THC_generated_THCStorage.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCStorage.cu.o


make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:86: lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:177: lib/THC/CMakeFiles/THC.dir/all] Error 2
make: *** [Makefile:130: all] Error 2

Error: Build error: Failed building.

Any ideas?

@nagadomi
Copy link

nagadomi commented Dec 8, 2018

@AlekzNet
gcc versions later than 7 are not supported on CUDA10.
ax3l/CUDA_Compilers.md

@AlekzNet
Copy link

AlekzNet commented Dec 8, 2018

Mmm... I'm getting the same error with gcc-7.

@nagadomi
Copy link

nagadomi commented Dec 8, 2018

Perhaps you need to delete CMakeCache.txt.

$ rm build/CMakeCache.txt                                      
$ CC=gcc-7 CXX=g++-7 luarocks make rocks/cutorch-scm-1.rockspec

@AlekzNet
Copy link

AlekzNet commented Dec 9, 2018

Yes! This is it! I thought I cleaned everything, but obviously not. Thanks much @nagadomi !

@fiandro
Copy link

fiandro commented Dec 27, 2018

Thanks a lot to @nagadomi for the updated repo: Since I had to write a complete howto to installing Torch on Ubuntu 16.04 on RTX equipped machines, I tought to share it with you:

**** Installing (Lua)Torch on Ubuntu 16.04.5 LTS (Updated as 27/12/2018)

This is a tutorial to installing Torch on Ubuntu 16.04 for use with the new NVIDIA RTX20x0 GPUs (older GPUs such as GTX10x0 shall be supported as well). This tutorial addresses the problem of Torch being in mainteinance mode, officially not being able to run on RTX GPUs due to a complex chain of dependencies (look at this thread for more information #834). This tutorial will guide you from installing the NVIDIA kernel drivers to getting a working Torch version on RTX NVIDIA GPUs.
Notice that GCC v 5 or 6 is assumed to be the default compiler; if not, use update-alternatives to set the correct GCC/G++ pair version (see https://askubuntu.com/questions/26498/how-to-choose-the-default-gcc-and-g-version).

  • I recommend disabling automatic package update to prevent NVIDIA kernel module to be updated
    (see also https://www.garron.me/en/linux/turn-off-stop-ubuntu-automatic-update.html)
    Using your favorite editor open the file /etc/apt/apt.conf.d/10periodic and change:
    APT::Periodic::Update-Package-Lists "1";
    To:
    APT::Periodic::Update-Package-Lists "0";

  • Install lates drivers from NVIDIA (410.78 in Nov 2018) supporting RTX GPUs as debs from NVIDIA website (see also http://www.linuxandubuntu.com/home/how-to-install-latest-nvidia-drivers-in-linux)
    sudo apt-get purge nvidia*
    sudo add-apt-repository ppa:graphics-drivers
    sudo apt-get update
    sudo apt-get install nvidia-410

  • Reboot your computer or load the NVIDIA drivers
    sudo modprobe nvidia

  • Download the CUDA metapackage that will add to your APT sources CUDA 8.0 -> 10.0 (repo key addition may be needed) and install CUDA 10.0
    $wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64/https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64/cuda-10-0_10.0.130-1_amd64.deb
    sudo dpkg -i cuda-10-0_10.0.130-1_amd64.deb
    sudo apt update
    sudo apt install cuda-10-0

  • If needed, link the cuda 10.0 version in /usr/local:
    cd /usr/local; sudo ln -s cuda-10.0/ cuda

  • Verify that the CUDA compiler has been installed successfully:
    /usr/local/cuda/bin/nvcc --version
    nvcc: NVIDIA (R) Cuda compiler driver
    Copyright (c) 2005-2018 NVIDIA Corporation
    Built on Sat_Aug_25_21:08:01_CDT_2018
    Cuda compilation tools, release 10.0, V10.0.130

  • If behind a proxy, tell git to use the HTTPS protocol in place of GIT:
    git config --global url."https://github.com/".insteadOf [email protected]:
    git config --global url."https://".insteadOf git://

  • Optionally install libblas3 to avoid torch checking out and building its own copy of the BLAS library
    sudo apt install libblas3

  • Checkout Nagadomi's Torch branch with the patched cunn for cuda 10 support (see also the standard torch building procedure as documented in http://torch.ch/docs/getting-started.html):
    git clone https://github.com/nagadomi/distro.git ~/torch --recursive
    cd ~/torch; bash install-deps;
    ./install.sh

  • Download from the NVIDIA developers FTP server the latest libcudnn for cuda 10 (libcudnn7_7.4.1.5-1+cuda10.0_amd64.deb at the moment of the writing of this document), save it and install it
    sudo dpkg -i libcudnn7_7.4.1.5-1+cuda10.0_amd64.deb

  • Check out in a separate position Soumith's patched cudnn for libcudnn7:
    git clone https://github.com/soumith/cudnn.torch.git -b R7 && cd cudnn.torch && luarocks make cudnn-scm-1.rockspec

  • That's it, now login and logout and test if everything works correctly:
    th
    th> require 'cudnn'

@QROST
Copy link

QROST commented Jun 6, 2019

I encountered an error The imported target “Qt5::Gui” references the file “/usr/lib/x86_64-linux-gnu/libEGL.so” but this file does not exist. while building and installing the new CMake.

I found the solution in the link below.
https://askubuntu.com/questions/616065/the-imported-target-qt5gui-references-the-file-usr-lib-x86-64-linux-gnu-li

@NiklasVJM
Copy link

I've been trying to install "luarocks install cutorch", but it keeps failing at the same spot:

/home/testbench/Downloads/cutorch-master/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/tmp/tmpxft_0000737e_00000000-4_THCTensorScatterGather.cpp4.ii".
CMake Error at THC_generated_THCTensorScatterGather.cu.o.Release.cmake:279 (message):
Error generating file
/home/testbench/Downloads/cutorch-master/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorScatterGather.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:8288: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorScatterGather.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorScatterGather.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
/home/testbench/Downloads/cutorch-master/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/tmp/tmpxft_00007386_00000000-4_THCTensorIndex.cpp4.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
Error generating file
/home/testbench/Downloads/cutorch-master/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:6883: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1
CMakeFiles/Makefile2:168: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed
make[1]: *** [lib/THC/CMakeFiles/THC.dir/all] Error 2
Makefile:129: recipe for target 'all' failed
make: *** [all] Error 2

Error: Build error: Failed building.

I have successfully installed just about everything else, including CUDA 10.0, newer Cmake, "luarocks install cunn" -worked just fine, and I tried Nagadomi's tip:

$ rm build/CMakeCache.txt
$ CC=gcc-7 CXX=g++-7 luarocks make rocks/cutorch-scm-1.rockspec

no difference... Is there any other tips or tricks that you could suggest?

@generated
Copy link

generated commented Jul 12, 2019

sorry for this question,
patch -p1 < atomic.patch
outputs:

can't find file to patch at input line 5

any help appreciated

@nagadomi
Copy link

@generated
The error is probably due to the wrong working directory (should run on ~/torch/extra/cutorch).
I recommend that you simply install torch from my repo.

# rm -r ~/torch # delete old installation
git clone https://github.com/nagadomi/distro.git ~/torch --recursive
cd ~/torch
./install-deps
./install.sh

@HaoDing123
Copy link

Hello,everyone! when I install cutorch,I meet some error. I have installed torch7+cuda10.0+cudnn7.4.1 in ubuntu16.04 with RTX2080( Driver Version: 410.78 )
I tried the command :
dh@dh-MS-7B98:/torch/extra/cutorch$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
dh@dh-MS-7B98:
/torch/extra/cutorch$ luarocks make rocks/cutorch-scm-1.rockspecWarning: unmatched variable LUALIB

jopts=$(getconf _NPROCESSORS_CONF)

echo "Building on $jopts cores"
cmake -E make_directory build && cd build && cmake .. -DLUALIB= -DLUA_INCDIR=/home/dh/torch/install/include -DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS} -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/home/dh/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/home/dh/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$jopts install

Building on 12 cores
-- Found Torch7 in /home/dh/torch/install
-- Removing -DNDEBUG from compile flags
-- TH_LIBRARIES: TH
-- MAGMA not found. Compiling without MAGMA support
-- Autodetected CUDA architecture(s): 7.5
-- got cuda version 10.0
-- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor
-- CUDA_NVCC_FLAGS: -D__CUDA_NO_HALF_OPERATORS__;-gencode;arch=compute_75,code=sm_75;-DCUDA_HAS_FP16=1
-- THC_SO_VERSION: 0
-- Configuring done
-- Generating done
-- Build files have been written to: /home/dh/torch/extra/cutorch/build
[ 1%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o
[ 2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o
[ 3%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o
[ 4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o
[ 5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o
[ 6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o
[ 7%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath2.cu.o
[ 8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathBlas.cu.o
[ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathMagma.cu.o
[ 11%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathPairwise.cu.o
[ 12%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathReduce.cu.o
[ 13%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathScan.cu.o
CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message):
Removing non-existent dependency file: generic/THStorage.h
CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message):
Removing non-existent dependency file: generic/THStorageCopy.h
CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message):
Removing non-existent dependency file: generic/THTensor.h
......
......
[ 18%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorTopK.cu.o
CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message):
Removing non-existent dependency file: generic/THStorage.h
CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message):
Removing non-existent dependency file: generic/THStorageCopy.h
CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message):
Removing non-existent dependency file: generic/THTensor.h
......
......
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply1I19TensorAddConstantOpIhEhjLi2EEv10TensorInfoIT0_T1_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply1I19TensorAddConstantOpIhEhjLi1EEv10TensorInfoIT0_T1_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply1I19TensorAddConstantOpIhEhjLin2EEv10TensorInfoIT0_T1_ES4_T is out of range. .minnctapersm will be ignored
CMakeFiles/Makefile2:172: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed
make[1]: *** [lib/THC/CMakeFiles/THC.dir/all] Error 2
Makefile:127: recipe for target 'all' failed
make: *** [all] Error 2

Error: Build error: Failed building.

Can kindly people help me?

@zhengqibin
Copy link

sorry for this question,
patch -p1 < atomic.patch
outputs:

can't find file to patch at input line 5

any help appreciated
first you should change the directory by:
cd extra/cutorch

@rehanpunjwani
Copy link

I have done all the steps as above but when I rull install.sh
I get the following error:

make[2]: *** No rule to make target 'exe/luajit-rocks/luajit-2.1/src/lib_ffi.c', needed by 'exe/luajit-rocks/luajit-2.1/vmdef.lua'. Stop.
make[1]: *** [CMakeFiles/Makefile2:212: exe/luajit-rocks/luajit-2.1/CMakeFiles/luajit-static.dir/all] Error 2
make: *** [Makefile:130: all] Error 2
./install.sh: line 107: /home/pi/torch/install/bin/luarocks: No such file or directory
Installing common Lua packages
./install.sh: line 111: /home/pi/torch/install/bin/luarocks: No such file or directory
Please help!!

@Yijunmaverick
Copy link

Many thanks with the guidance and I made it work with cuda 10. I have a question about how to install new packages (e.g., matio). It looks luarocks install XXX does not work, as you mentioned on installing cutorch.

@nagadomi
Copy link

@Yijunmaverick
works for me.

% sudo apt-get install libmatio-dev
% luarocks install matio

test

% git clone https://github.com/soumith/matio-ffi.torch.git
% cd matio-ffi.torch/test 
% th test.lua 
(1,.,.) = 
   1   7  13  19
   3   9  15  21
   5  11  17  23

(2,.,.) = 
   2   8  14  20
   4  10  16  22
   6  12  18  24
[torch.DoubleTensor of size 2x3x4]

OK	
OK	
OK	
Could not find variable with name: b in file: test.mat	
OK	
OK	
OK	
OK	
OK	
OK	
Saving is OK!

I've heard that GFW blocks URLs about luarocks.
If git clone can run normally, you can install it from local file.

% git clone https://github.com/soumith/matio-ffi.torch.git
% cd matio-ffi.torch
% luarocks make matio-scm-1.rockspec

rockspec index: https://github.com/torch/rocks

With libmatio4 env(Ubuntu 19.10), Reading string variable from mat file seems to fail (numeric variable is OK).

@labbihi
Copy link

labbihi commented Apr 29, 2020

I have this error after running : CC=gcc-7 CXX=g++-7 luarocks make rocks/cunn-scm-1.rockspec

[ 26%] Building NVCC (Device) object lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTableBag.cu.o
/tmp/cunn/lib/THCUNN/LookupTable.cu(44): error: identifier "FULL_MASK" is undefined

/tmp/cunn/lib/THCUNN/LookupTable.cu(62): error: identifier "FULL_MASK" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(5): error: incomplete type is not allowed

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(5): error: identifier "LookupTable_accGradParameters" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(5): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(37): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: declaration is incompatible with function template "void cunn_LookupTable_accGradParametersKernelByFeature(long *, Dtype *, Dtype *, Dtype, ptrdiff_t, long, int)"
/tmp/cunn/lib/THCUNN/LookupTable.cu(66): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(47): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: declaration is incompatible with "void THCudaHalfTensor_free(THCState *, THCudaHalfTensor *)"
/root/torch/install/include/THC/generic/THCTensor.h(118): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: identifier "state" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: declaration is incompatible with "void __THCudaCheck(cudaError_t, const char *, int)"
/root/torch/install/include/THC/THCGeneral.h(200): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(50): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(51): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: identifier "resize" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: variable "THCIndexTensor_" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: declaration is incompatible with "void THLongStorage_free(THLongStorage *)"
/root/torch/install/include/TH/generic/THStorage.h(67): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: identifier "inputSize" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(61): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: namespace "thrust::cuda" has no actual member "par"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: identifier "thrustAlloc" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(86): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(87): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: expression preceding parentheses of apparent call must have (pointer-to-) function type

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "data" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "origIndices" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(91): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(93): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(127): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(128): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(131): error: variable "block" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(132): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(142): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: variable "THCudaHalfTensor_free" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: variable "__THCudaCheck" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(146): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(175): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: expression preceding parentheses of apparent call must have (pointer-to-) function type

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: identifier "idx" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(177): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(177): error: identifier "numel" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(177): error: no instance of overloaded function "thrust::unique" matches the argument list
argument types are: (thrust::device_ptr<>, )

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(178): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(180): error: identifier "normType" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(183): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(37): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(38): error: variable "block" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: variable "cunn_LookupTable_accGradParametersKernelByFeature" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(47): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: declaration is incompatible with "void THCudaTensor_free(THCState *, THCudaTensor *)"
/root/torch/install/include/THC/generic/THCTensor.h(118): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: variable "__THCudaCheck" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(50): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(51): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: variable "THCIndexTensor_" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: variable "THCIndexTensor_" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: variable "THLongStorage_free" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(61): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: namespace "thrust::cuda" has no actual member "par"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(86): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(87): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: variable "origIndices_data" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: expression preceding parentheses of apparent call must have (pointer-to-) function type

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(91): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(91): error: variable "count_data" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(93): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(127): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(128): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(131): error: variable "block" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(132): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(142): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: variable "THCudaTensor_free" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: variable "__THCudaCheck" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(146): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(175): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: variable "idx_ptr" has already been defined

Error limit reached.
100 errors detected in the compilation of "/tmp/tmpxft_0000590c_00000000-13_LookupTable.compute_75.cpp1.ii".
Compilation terminated.
CMake Error at THCUNN_generated_LookupTable.cu.o.Release.cmake:280 (message):
Error generating file
/tmp/cunn/build/lib/THCUNN/CMakeFiles/THCUNN.dir//./THCUNN_generated_LookupTable.cu.o

lib/THCUNN/CMakeFiles/THCUNN.dir/build.make:8563: recipe for target 'lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTable.cu.o' failed
make[2]: *** [lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTable.cu.o] Error 1
make[2]: *** Attente des tâches non terminées....
/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddmLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddmLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddmLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddmLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffmLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffmLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffmLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffmLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_mLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_mLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin1ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi2ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLi1ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6__halfES1_S1_S1_jLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_mLin1ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_mLin2ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin1ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin1ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin1ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin1ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi2ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi2ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi2ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi2ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi1ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi1ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi1ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLi1ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin2ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin2ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin2ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
ptxas warning : Value of threads per SM for entry Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6__halfES1_S1_jLin2ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5_T is out of range. .minnctapersm will be ignored
/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

CMakeFiles/Makefile2:131: recipe for target 'lib/THCUNN/CMakeFiles/THCUNN.dir/all' failed
make[1]: *** [lib/THCUNN/CMakeFiles/THCUNN.dir/all] Error 2
Makefile:149: recipe for target 'all' failed
make: *** [all] Error 2

Error: Build error: Failed building.

@airdine
Copy link

airdine commented May 11, 2020

same issue with RTX 2080 ti,
using docker nvidia/cuda:10.0-cudnn7-devel-ubuntu16.04
cunn won't install with same output @labbihi

si someone have an idea?

@mikeytag
Copy link

mikeytag commented Aug 27, 2020

Ok, I was having issues installing luarocks install cutorch and kept doing the stuff that @nagadomi recommends at the top of this thread.

However, I kept corrupting ~/torch/extras/cutorch because I kept applying the atomic.patch described, but I was already cloning torch from https://github.com/nagadomi/torch and his repo already has the patches!

If you're going to use the @nagadomi repo, here's how I got everything working:

cd ~
git clone https://github.com/nagadomi/distro.git ~/torch --recursive
cd ~/torch
./install-deps
./clean.sh
./update.sh
~/torch/install/bin/torch-activate
luarocks install nn
luarocks install torch
cd extras/cutorch
luarocks make rocks/cutorch-scm-1.rockspec
cd ~/torch/extra/cunn
luarocks make rocks/cunn-scm-1.rockspec
cd ~/torch/extra/cudnn
luarocks make cudnn-scm-1.rockspec

NOTE: I'm on Ubuntu 18.04 with CUDA 10.2

THANK YOU SO MUCH @nagadomi !

@mherkazandjian
Copy link

in case someone is interested, i put everything in a container

docker run --runtime=nvidia mherkazandjian/char-rnn:latest th train.lua -data_dir data/tinyshakespeare -rnn_size 512 -num_layers 2 -dropout 0.5

the dockerfile is here:

 https://github.com/mherkazandjian/docker-torch-rnn/blob/mher/char-rnn/CUDA/10.0-ubuntu18.04/Dockerfile

that includes a fix based on the dockerfile of https://github.com/crisbal/docker-torch-rnn/tree/master/CUDA/10.0-ubuntu18.04 with some mods to include char-rnn in the container

@en1gma713
Copy link

en1gma713 commented Jan 25, 2021

I keep getting
CMake Error: The following variables are used in this project, but they are set to NOTFOUND. Please set them or make sure they are set and tested correctly in the CMake files: CUDA_cublas_LIBRARY (ADVANCED) linked by target "THC" in directory /home/aiden/distro/extra/cutorch/lib/THC
How do I fix this?

@TingtingLiao
Copy link

I keep getting
CMake Error: The following variables are used in this project, but they are set to NOTFOUND. Please set them or make sure they are set and tested correctly in the CMake files: CUDA_cublas_LIBRARY (ADVANCED) linked by target "THC" in directory /home/aiden/distro/extra/cutorch/lib/THC
How do I fix this?

Have you solved it? I have the same problem.

@TatianaZobnina
Copy link

on google colab after:

!export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
!/content/torch/install/bin/luarocks make /content/torch/extra/cutorch/rocks/cutorch-scm-1.rockspec

got this error:

Warning: unmatched variable LUALIB

jopts=$(getconf _NPROCESSORS_CONF)

echo "Building on $jopts cores"
cmake -E make_directory build && cd build && cmake .. -DLUALIB= -DLUA_INCDIR=/content/torch/install/include -DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS} -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/content/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/content/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$jopts install

Building on 2 cores
-- Found Torch7 in /content/torch/install
-- Removing -DNDEBUG from compile flags
-- TH_LIBRARIES: TH
-- MAGMA not found. Compiling without MAGMA support
-- Automatic GPU detection failed. Building for common architectures.
-- Autodetected CUDA architecture(s): 3.0;3.5;5.0;5.2;6.0;6.1;7.0;7.0+PTX;7.5;7.5+PTX
-- got cuda version 10.0
-- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor
-- CUDA_NVCC_FLAGS: -gencode;arch=compute_30,code=sm_30;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_52,code=sm_52;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_70,code=compute_70;-gencode;arch=compute_75,code=compute_75;-DCUDA_HAS_FP16=1
-- THC_SO_VERSION: 0
-- Configuring done
-- Generating done
-- Build files have been written to: /content/torch/extra/cutorch/build
[ 1%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o
[ 2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathBlas.cu.o
/content/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(393): error: more than one operator "==" matches these operands:
function "operator==(const __half &, const __half &)"
function "operator==(half, half)"
operand types are: half == half

/content/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(414): error: more than one operator "==" matches these operands:
function "operator==(const __half &, const __half &)"
function "operator==(half, half)"
operand types are: half == half

2 errors detected in the compilation of "/tmp/tmpxft_0001830f_00000000-13_THCTensorMath.compute_75.cpp1.ii".
CMake Error at THC_generated_THCTensorMath.cu.o.cmake:267 (message):
Error generating file
/content/torch/extra/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:3113: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
CMakeFiles/Makefile2:177: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed
make[1]: *** [lib/THC/CMakeFiles/THC.dir/all] Error 2
Makefile:129: recipe for target 'all' failed
make: *** [all] Error 2

Error: Build error: Failed building.

all other command works perfectly, except cutorch

@lochotan4794
Copy link

@nagadomi Please help me. I follow your instruction but still get the error.
-- Removing -DNDEBUG from compile flags
-- TH_LIBRARIES: TH
-- MAGMA not found. Compiling without MAGMA support
-- Automatic GPU detection failed. Building for common architectures.
-- Autodetected CUDA architecture(s): 3.5;5.0;5.3;6.0;6.1;7.0;7.5;7.5+PTX
-- got cuda version 10.1
-- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor
-- CUDA_NVCC_FLAGS: -D__CUDA_NO_HALF_OPERATORS__;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_53,code=sm_53;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_75,code=compute_75;-DCUDA_HAS_FP16=1
-- THC_SO_VERSION: 0
-- Performing Test HAS_LUAL_SETFUNCS
-- Performing Test HAS_LUAL_SETFUNCS - Failed
-- Configuring done
-- Generating done
-- Build files have been written to: /tmp/luarocks_cutorch-scm-1-7746/cutorch/build
[ 1%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCBlas.cu.o
[ 2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o
[ 3%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/generated/THC_generated_THCTensorSortShort.cu.o
[ 4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCSleep.cu.o
[ 5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCSortUtils.cu.o
[ 6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o
[ 7%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o
[ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o
[ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorConv.cu.o
[ 11%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o
[ 12%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o
[ 13%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o
/tmp/luarocks_cutorch-scm-1-7746/cutorch/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/tmp/tmpxft_00014089_00000000-12_THCTensorIndex.compute_75.cpp1.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:280 (message):
Error generating file
/tmp/luarocks_cutorch-scm-1-7746/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:175: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
^Cmake[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:182: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorConv.cu.o] Interrupt
make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:119: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o] Interrupt
make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:574: lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o] Interrupt
make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:98: lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o] Interrupt
make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:322: lib/THC/CMakeFiles/THC.dir/generated/THC_generated_THCTensorSortShort.cu.o] Interrupt
make[1]: *** [CMakeFiles/Makefile2:167: lib/THC/CMakeFiles/THC.dir/all] Interrupt
make: *** [Makefile:136: all] Interrupt

Error: Build error: Failed building.

@qitop
Copy link

qitop commented Dec 24, 2021

EDIT: I have merged these changes to my distro repo. https://github.com/nagadomi/distro

Building Torch with CUDA10 has two problems.

  1. cmake/3.6/Modules/FindCUDA.cmake is outdated.
  2. atomicAdd(__half *address, __half val) is defined in /usr/local/cuda/include/cuda_fp16.h (duplicated).

How to build Torch with CUDA10

  1. Install the latest CMake from github repo (the latest FindCUDA.cmake will be installed)
$ sudo apt-get purge cmake
$ git clone https://github.com/Kitware/CMake.git
$ cd CMake
$ ./bootstrap; make; sudo make install
  1. Remove FindCUDA.cmake.
$ cd ~/torch
$ rm -fr cmake/3.6/Modules/FindCUDA*
  1. Apply the following patch to cutorch
diff --git a/lib/THC/THCAtomics.cuh b/lib/THC/THCAtomics.cuh
index 400875c..ccb7a1c 100644
--- a/lib/THC/THCAtomics.cuh
+++ b/lib/THC/THCAtomics.cuh
@@ -94,6 +94,7 @@ static inline __device__ void atomicAdd(long *address, long val) {
 }
 
 #ifdef CUDA_HALF_TENSOR
+#if !(__CUDA_ARCH__ >= 700 || !defined(__CUDA_ARCH__) )
 static inline  __device__ void atomicAdd(half *address, half val) {
   unsigned int * address_as_ui =
       (unsigned int *) ((char *)address - ((size_t)address & 2));
@@ -117,6 +118,7 @@ static inline  __device__ void atomicAdd(half *address, half val) {
    } while (assumed != old);
 }
 #endif
+#endif
$ cd extra/cutorch
$ cat > atomic.patch
<copy and paste the patch>
$ patch -p1 < atomic.patch
  1. Build
$ ./clean.sh
$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
$ ./install.sh

I was successfully build Torch with CUDA10 with the above steps.

I have followed all the steps. My system gets stuck when I type

$cat > atomic.patch
diff
> diff --git a/lib/THC/THCAtomics.cuh b/lib/THC/THCAtomics.cuh
> index 400875c..ccb7a1c 100644
> --- a/lib/THC/THCAtomics.cuh
> +++ b/lib/THC/THCAtomics.cuh
> @@ -94,6 +94,7 @@ static inline __device__ void atomicAdd(long *address, long val) {
>  }
>  
>  #ifdef CUDA_HALF_TENSOR
> +#if !(__CUDA_ARCH__ >= 700 || !defined(__CUDA_ARCH__) )
>  static inline  __device__ void atomicAdd(half *address, half val) {
>    unsigned int * address_as_ui =
>        (unsigned int *) ((char *)address - ((size_t)address & 2));
> @@ -117,6 +118,7 @@ static inline  __device__ void atomicAdd(half *address, half val) {
>     } while (assumed != old);
>  }
>  #endif
> +#endif
>  

@mw66
Copy link

mw66 commented Jan 14, 2022

cd extras/cutorch

it should be:

cd extra/cutorch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests