RKeOps contains the R bindings for the cpp/cuda library KeOps. It provides standard R functions that can be used in any R (>=3) codes.
For a full documentation you may read:
Feel free to contact us for any bug report or feature request, you can also fill an issue report on GitHub.
The KeOps library provides seamless kernel operations on GPU, with auto-differentiation and without memory overflows.
With RKeOps, you can compute generic reductions of very large arrays whose entries are given by a mathematical formula. It combines a tiled reduction scheme with an automatic differentiation engine. It is perfectly suited to the computation of Kernel dot products and the associated gradients, even when the full kernel matrix does not fit into the GPU memory.
For more information (installation, usage), please visit https://www.kernel-operations.io/ (especially the
section dedicated to RKeOps) and read the vignettes available in R with
the command browseVignettes("rkeops")
or on the CRAN.
Note: RKeOps is avaible on CRAN but only for UNIX environment (GNU/Linux and MacOS) and not for Windows.
install.packages("rkeops")
!! In most recent version of devtools, the
args
argument is not available anymore and it is not possible to usedevtools::install_git
. Please check next section to install from sources.
git
)::install_git("https://github.com/getkeops/keops",
devtoolssubdir = "rkeops",
args="--recursive")
# not possible to use `devtools::intall_github()` because of the required submodule
git clone --recurse-submodules="keops/lib/sequences" https://github.com/getkeops/keops
# or
git clone https://github.com/getkeops/keops
cd keops
git submodule update --init -- keops/lib/sequences
# other submodules are not necessary for RKeOps
keops
directory)::install("rkeops") devtools
Here is an example how to define and compute a Gaussian convolution with RKeOps.
# implementation of a convolution with a Gaussian kernel
= "Sum_Reduction(Exp(-s * SqNorm2(x - y)) * b, 0)"
formula
# input arguments
= c("x = Vi(3)", # vector indexed by i (of dim 3)
args "y = Vj(3)", # vector indexed by j (of dim 3)
"b = Vj(6)", # vector indexed by j (of dim 6)
"s = Pm(1)") # parameter (scalar)
# compilation of the corresponding operator
<- keops_kernel(formula, args)
op
# data and parameter values
<- 100
nx <- 150
ny <- matrix(runif(nx*3), nrow=nx) # matrix 100 x 3
X <- matrix(runif(ny*3), nrow=ny) # matrix 150 x 3
Y <- matrix(runif(ny*6), nrow=ny) # matrix 150 x 6
B <- 0.2
s
# to run computation on CPU (default mode)
use_cpu()
# to run computations on GPU (to be used only if relevant)
use_gpu()
# computation (order of the input arguments should be similar to `args`)
<- op(list(X, Y, B, s)) res
Here is an example how to define and compute the gradient of an existing KeOps operators.
# defining an operator (reduction on squared distance)
<- "Sum_Reduction(SqNorm2(x-y), 0)"
formula <- c("x=Vi(0,3)", "y=Vj(1,3)")
args <- keops_kernel(formula, args)
op # defining its gradient regarding x
<- keops_grad(op, var="x")
grad_op
# data
<- 100
nx <- 150
ny <- matrix(runif(nx*3), nrow=nx, ncol=3) # matrix 100 x 3
x <- matrix(runif(ny*3), nrow=ny, ncol=3) # matrix 150 x 3
y <- matrix(runif(nx*1), nrow=nx, ncol=1) # matrix 100 x 1
eta
# computation
<- list(x, y, eta)
input <- grad_op(input) res
Based on your formulae, RKeOps compile on the fly operators that can be used to run the corresponding computations on CPU or GPU, it uses a tiling scheme to decompose the data and avoid (i) useless and costly memory transfers between host and GPU (performance gain) and (ii) memory overflow.
Note: You can use the same code (i.e. define the same operators) for CPU or GPU computing. The only difference will be the compiler used for the compilation of your operators (upon the availability of CUDA on your system).
To use CPU computing mode, you can call use_cpu()
(with
an optional argument ncore
specifying the number of cores
used to run parallel computations).
To use GPU computing mode, you can call use_gpu()
(with
an optional argument device
to choose a specific GPU id to
run computations).