MTJ with NVBLAS
(The following is based on a post from Eibe Frank on the Weka mailing list.)
Here is an example of running MTJ with NVBLAS (NVIDIA's BLAS wrapper) on Ubuntu:
- Installed https://prdownloads.sourceforge.net/weka/weka-3-8-6-azul-zulu-linux.zip in
- Ran
- To install CPU-based system BLAS/LAPACK, ran
- Downloaded and installed CUDA 11.6 from https://developer.nvidia.com/cuda-downloads
- Copied example `nvblas.conf` from https://docs.nvidia.com/cuda/nvblas/ into local directory using
- Edited `nvblas.conf` to have
- Now, by adapting what's given at https://github.com/fommil/netlib-java/wiki/NVBLAS, issued
- Then,
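For orientation, the configuration and launch steps could look roughly like the sketch below. It is not the set of commands from the original post. The working directory, the CPU BLAS library name (`libblas.so.3`), the `libnvblas.so` path, and launching via `weka.jar` are all assumptions for illustration. The configuration keys and the `NVBLAS_CONFIG_FILE` variable are the ones documented by NVIDIA, and the two system properties are the netlib-java ones for selecting the native system implementations. The sketch also assumes the native system netlib-java bindings are available to Weka.

```shell
#!/bin/sh
# Sketch only: paths and values below are assumptions, not the original post's.

# Hypothetical working directory (defaults to a fresh temporary directory).
WORKDIR="${WORKDIR:-$(mktemp -d)}"

# Assumed location of libnvblas.so for a default CUDA 11.6 install.
NVBLAS_LIB="${NVBLAS_LIB:-/usr/local/cuda-11.6/lib64/libnvblas.so}"

# Write a minimal nvblas.conf using keys documented by NVIDIA.
# NVBLAS_CPU_BLAS_LIB is mandatory; its value here is an assumed library name.
cat > "$WORKDIR/nvblas.conf" <<'EOF'
NVBLAS_LOGFILE nvblas.log
NVBLAS_CPU_BLAS_LIB libblas.so.3
NVBLAS_GPU_LIST ALL
NVBLAS_TRACE_LOG_ENABLED
EOF

# Build the launch command: NVBLAS_CONFIG_FILE points NVBLAS at the config,
# LD_PRELOAD lets NVBLAS intercept BLAS calls, and the netlib-java properties
# select the native system BLAS/LAPACK implementations.
LAUNCH_CMD="NVBLAS_CONFIG_FILE=$WORKDIR/nvblas.conf LD_PRELOAD=$NVBLAS_LIB java \
-Dcom.github.fommil.netlib.BLAS=com.github.fommil.netlib.NativeSystemBLAS \
-Dcom.github.fommil.netlib.LAPACK=com.github.fommil.netlib.NativeSystemLAPACK \
-jar weka.jar"

# Print rather than run, so the sketch works without Weka or CUDA installed.
echo "$LAUNCH_CMD"
```

Printing the command instead of executing it keeps the sketch side-effect free; adjust the library name and paths to the local installation before running the printed command.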
Observation: Memory is being allocated on the GPU. Looking at `nvblas.log`, the GPU is used, but only for some `dgemm` operations. However, according to https://docs.nvidia.com/cuda/nvblas/, the `trmm` operation (which is executed on the CPU) should also be supported by the GPU.
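To see which routines appear in the log at all, a small helper along these lines can tally routine names. This is a minimal sketch: the function name `count_blas_routines` and the `NVBLAS_LOG` variable are hypothetical, and the only assumption about the log is that double-precision routine names such as `dgemm` appear literally in it (for example with trace logging enabled). It does not distinguish GPU from CPU execution.

```shell
#!/bin/sh
# Count how often each double-precision level-3 BLAS routine name
# appears in a log file, most frequent first.
count_blas_routines() {
    grep -oE 'd(gemm|trmm|trsm|syrk|syr2k|symm)' "$1" | sort | uniq -c | sort -rn
}

# Hypothetical default: a log named nvblas.log in the current directory.
NVBLAS_LOG="${NVBLAS_LOG:-nvblas.log}"

if [ -f "$NVBLAS_LOG" ]; then
    count_blas_routines "$NVBLAS_LOG"
else
    echo "no log file at $NVBLAS_LOG" >&2
fi
```

Comparing these counts with the list of routines NVBLAS documents as supported gives a quick first impression of what the application actually calls.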