Expand description
Sys-level safe wrappers for the cuBLAS entry points cudarc 0.19 doesn’t expose through its safe layer.
Wrapped today (Phase 1 cuBLAS slice):
cublasGemmEx,cublasGemmStridedBatchedExcublasAxpyEx,cublasScalEx,cublasNrm2Ex,cublasDotExcublasIamaxEx,cublasIaminEx,cublasAsumExcublasCopyEx,cublasSwapEx,cublasRotExcublasGemv_v2/cublasDgemv_v2,cublasSger_v2/cublasDger_v2cublasSgeam/cublasDgeamcublasSsyrk_v2/cublasDsyrk_v2cublasStrsm_v2/cublasDtrsm_v2
All callers must hold the cuBLAS handle’s stream current on the
same OS thread. The atomr-accel-cuda actor pipeline guarantees
that via GpuDispatcher.
Functions§
- asum_ex⚠
- axpy_ex⚠
- copy_ex⚠
- dgeam⚠
- dgemv⚠
- dger⚠
- dot_ex⚠
- dsyrk⚠
- dtrsm⚠
- gemm_ex⚠
cublasGemmEx— typed-erased gemm with a separate compute type.- gemm_
strided_ ⚠batched_ ex cublasGemmStridedBatchedEx— typed-erased strided-batched gemm.- iamax_
ex ⚠ - iamin_
ex ⚠ - nrm2_ex⚠
- rot_ex⚠
- scal_ex⚠
- sgeam⚠
cublasSgeam/cublasDgeam— matrix add/scale:C = α·op(A) + β·op(B).- sgemv⚠
- sger⚠
- ssyrk⚠
- strsm⚠
- swap_ex⚠