|
bnmf-algs
|
Go to the source code of this file.
Namespaces | |
| bnmf_algs | |
| Main namespace for bnmf-algs library. | |
| bnmf_algs::cuda | |
| cuda namespace contains functions that operate on nvidia GPUs using CUDA routines. | |
| bnmf_algs::cuda::kernel | |
| Namespace containing CUDA kernel and device functions. | |
Functions | |
| template<typename Scalar > | |
| __global__ void | bnmf_algs::cuda::kernel::sum_tensor3D (cudaPitchedPtr tensor, Scalar *out, size_t out_pitch, size_t axis, size_t n_rows, size_t n_cols, size_t n_layers) |
| Sum the given 3D tensor along the given axis and write the results to the corresponding index of the given 2D pitched matrix memory. More... | |
1.8.11