#ifndef STAN_MATH_OPENCL_MULTIPLY_TRANSPOSE_HPP
#define STAN_MATH_OPENCL_MULTIPLY_TRANSPOSE_HPP
// ...
int Mpad = ((A.rows() + local - 1) / local) * local;
// ...
cl::NDRange(local, local / wpt), A, temp,
// ...
} catch (cl::Error& e) {
The matrix_cl class - allocates memory space on the OpenCL device, functions for transferring matrices...
Represents a matrix on the OpenCL device.
checking OpenCL error numbers
const kernel_cl< in_buffer, out_buffer, int, int > multiply_transpose("multiply_transpose", {thread_block_helpers, multiply_transpose_kernel_code}, {{"THREAD_BLOCK_SIZE", 32}, {"WORK_PER_THREAD", 4}})
See the docs for add().
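The kernel options above (THREAD_BLOCK_SIZE of 32, WORK_PER_THREAD of 4) tie in with the padding arithmetic in the listing. The following is a minimal sketch of that arithmetic only, assuming that `local` in the listing corresponds to THREAD_BLOCK_SIZE and `wpt` to WORK_PER_THREAD. The helper names `round_up_to_multiple`, `LocalRange` and `local_range` are hypothetical and are not part of Stan Math.

```cpp
#include <cassert>

// Hypothetical helper (not part of Stan Math): rounds n up to the nearest
// multiple of block, mirroring ((A.rows() + local - 1) / local) * local.
// Assumes n >= 0 and block > 0.
inline int round_up_to_multiple(int n, int block) {
  return ((n + block - 1) / block) * block;
}

// Hypothetical stand-in for the two dimensions passed to
// cl::NDRange(local, local / wpt) in the listing.
struct LocalRange {
  int dim0;
  int dim1;
};

// Each thread handles wpt elements along the second dimension, so that
// dimension of the work-group shrinks by a factor of wpt.
inline LocalRange local_range(int local, int wpt) {
  return {local, local / wpt};
}
```

Under these assumptions, a matrix with 100 rows would be padded to 128, and the work-group would be 32 by 8 threads.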
matrix_cl multiply_transpose(const matrix_cl &A)
Computes the product of a square OpenCL matrix with its transpose.
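For checking results from the device, a plain CPU reference of the same product can be useful. The sketch below is not the Stan Math implementation: `DenseMatrix` and `multiply_transpose_reference` are hypothetical names, the storage is assumed to be row-major, and the only property used is that A * A^T is symmetric, so each entry below the diagonal is computed once and mirrored.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical row-major dense matrix used only for this illustration.
struct DenseMatrix {
  std::size_t rows;
  std::size_t cols;
  std::vector<double> data;

  double at(std::size_t i, std::size_t j) const { return data[i * cols + j]; }
  double& at(std::size_t i, std::size_t j) { return data[i * cols + j]; }
};

// Reference computation of C = A * A^T on the host.
// C(i, j) is the dot product of rows i and j of A; C is symmetric,
// so only the lower triangle is computed and then copied across.
inline DenseMatrix multiply_transpose_reference(const DenseMatrix& A) {
  DenseMatrix C{A.rows, A.rows, std::vector<double>(A.rows * A.rows, 0.0)};
  for (std::size_t i = 0; i < A.rows; ++i) {
    for (std::size_t j = 0; j <= i; ++j) {
      double sum = 0.0;
      for (std::size_t k = 0; k < A.cols; ++k) {
        sum += A.at(i, k) * A.at(j, k);
      }
      C.at(i, j) = sum;
      C.at(j, i) = sum;
    }
  }
  return C;
}
```

A device result could then be copied back to the host and compared entry by entry against this reference, with a tolerance suited to the floating-point accumulation order.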
void check_opencl_error(const char *function, const cl::Error &e)
Throws a domain error specifying the OpenCL error that occurred.