atomr_accel_cuda::kernel::blas::gemm

Struct GemmRequest

pub struct GemmRequest<T: GemmSupported> {
    pub a: GpuRef<T>,
    pub b: GpuRef<T>,
    pub c: GpuRef<T>,
    pub m: i32,
    pub n: i32,
    pub k: i32,
    pub alpha: T,
    pub beta: T,
    pub trans_a: cublasOperation_t,
    pub trans_b: cublasOperation_t,
    pub lda: i32,
    pub ldb: i32,
    pub ldc: i32,
    pub reply: Sender<Result<(), GpuError>>,
}

Typed cuBLAS gemm request: C = α·op(A)·op(B) + β·C.

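lda/ldb/ldc follow cuBLAS’s column-major convention (see the cuBLAS docs): each leading dimension is the row count of the matrix as stored in memory. In the no-transpose case with tightly packed operands, A is m×k, B is k×n, and C is m×n, so lda = m, ldb = k, ldc = m.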
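The leading-dimension rule can be checked on the CPU with a small reference implementation. The sketch below is not part of this crate: `Op`, `min_leading_dims`, `at`, and `gemm_ref` are hypothetical names, `Op` stands in for `cublasOperation_t`, and only the N and T operations are modelled. It assumes tightly packed column-major slices.

```rust
/// Stand-in for CUBLAS_OP_N / CUBLAS_OP_T (hypothetical; not the crate's type).
#[derive(Clone, Copy, PartialEq, Debug)]
pub enum Op {
    N,
    T,
}

/// Smallest valid (lda, ldb, ldc) for column-major operands.
pub fn min_leading_dims(
    m: usize,
    n: usize,
    k: usize,
    trans_a: Op,
    trans_b: Op,
) -> (usize, usize, usize) {
    // A is stored m x k when used as-is, k x m when transposed.
    let lda = if trans_a == Op::N { m } else { k };
    // B is stored k x n when used as-is, n x k when transposed.
    let ldb = if trans_b == Op::N { k } else { n };
    // C is always stored m x n. A leading dimension is never below 1.
    (lda.max(1), ldb.max(1), m.max(1))
}

/// Element (row, col) of op(X), where X is column-major with leading dimension `ld`.
fn at(x: &[f32], ld: usize, op: Op, row: usize, col: usize) -> f32 {
    match op {
        Op::N => x[row + col * ld],
        Op::T => x[col + row * ld],
    }
}

/// CPU reference for C = alpha * op(A) * op(B) + beta * C.
#[allow(clippy::too_many_arguments)]
pub fn gemm_ref(
    m: usize,
    n: usize,
    k: usize,
    alpha: f32,
    a: &[f32],
    lda: usize,
    trans_a: Op,
    b: &[f32],
    ldb: usize,
    trans_b: Op,
    beta: f32,
    c: &mut [f32],
    ldc: usize,
) {
    for j in 0..n {
        for i in 0..m {
            let mut acc = 0.0f32;
            for p in 0..k {
                acc += at(a, lda, trans_a, i, p) * at(b, ldb, trans_b, p, j);
            }
            let idx = i + j * ldc;
            // When beta is zero the previous contents of C are ignored.
            c[idx] = if beta == 0.0 {
                alpha * acc
            } else {
                alpha * acc + beta * c[idx]
            };
        }
    }
}
```

For a 2×2 product, `min_leading_dims(2, 2, 2, Op::N, Op::N)` yields `(2, 2, 2)`, matching lda = m, ldb = k, ldc = m. With a transposed A the first value becomes k instead, because the stored matrix then has k rows.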

§Capability marker compile-fail

T: GemmSupported gates the dtype matrix. cuBLAS does not support i64 gemm, so building a GemmRequest::<i64> is rejected at compile time:

// Fails: i64 does not implement `GemmSupported`.
let _req = GemmRequest::<i64> {
    a, b, c,
    m: 1, n: 1, k: 1,
    alpha: 1, beta: 0,
    trans_a: cublasOperation_t::CUBLAS_OP_N,
    trans_b: cublasOperation_t::CUBLAS_OP_N,
    lda: 1, ldb: 1, ldc: 1,
    reply: tx,
};
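The definition of `GemmSupported` is not shown on this page. As a rough illustration of how a capability marker of this kind can gate a generic struct, the sketch below uses a sealed marker trait. Everything in it (`GemmDtype`, `sealed::Sealed`, `Request`, `dtype_of`) is a hypothetical stand-in, and the choice of f32 and f64 as the supported set is an assumption based only on the `GemmDispatch` impls listed further down.

```rust
// Private module: downstream code cannot name `Sealed`, so it cannot add
// new implementors of the marker trait.
mod sealed {
    pub trait Sealed {}
    impl Sealed for f32 {}
    impl Sealed for f64 {}
}

/// Hypothetical marker for element types with a gemm kernel.
pub trait GemmDtype: sealed::Sealed + Copy {
    const NAME: &'static str;
}

impl GemmDtype for f32 {
    const NAME: &'static str = "f32";
}

impl GemmDtype for f64 {
    const NAME: &'static str = "f64";
}

/// Hypothetical request type, reduced to the two scalar fields.
pub struct Request<T: GemmDtype> {
    pub alpha: T,
    pub beta: T,
}

/// Reports the element type selected at compile time.
pub fn dtype_of<T: GemmDtype>(_req: &Request<T>) -> &'static str {
    T::NAME
}

// `Request::<i64> { alpha: 1, beta: 0 }` would not compile here, because
// i64 does not implement `GemmDtype`.
```

Because the bound sits on the struct itself, the rejection happens where the value is constructed rather than later at dispatch time.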

Fields§

a: GpuRef<T>
b: GpuRef<T>
c: GpuRef<T>
m: i32
n: i32
k: i32
alpha: T
beta: T
trans_a: cublasOperation_t
trans_b: cublasOperation_t
lda: i32
ldb: i32
ldc: i32
reply: Sender<Result<(), GpuError>>

Implementations§

impl<T> GemmRequest<T>
where T: GemmSupported, GemmRequest<T>: GemmDispatch,

pub fn into_msg(self) -> BlasMsg

Boxes the request and wraps it in a crate::kernel::BlasMsg::Gemm variant.
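The box-and-wrap step, together with the `reply` channel, can be pictured with the minimal sketch below. It is not the crate's implementation: `Dispatch`, `Msg`, `Request`, `SketchError`, `run`, and `submit_and_wait` are hypothetical names, the `std::sync::mpsc` channel is an assumption (this page does not say which `Sender` type `reply` uses), and the dispatch body only validates dimensions instead of calling cuBLAS.

```rust
use std::sync::mpsc::{channel, Sender};

/// Hypothetical error type standing in for GpuError.
#[derive(Debug, PartialEq)]
pub struct SketchError(pub String);

/// Object-safe dispatch trait, so requests of different element types can
/// travel through one message variant.
pub trait Dispatch: Send {
    fn dtype_name(&self) -> &'static str;
    fn dispatch(self: Box<Self>);
}

/// Hypothetical message enum with a single type-erased variant.
pub enum Msg {
    Gemm(Box<dyn Dispatch>),
}

/// Hypothetical request, reduced to the dimensions and the reply channel.
pub struct Request {
    pub m: i32,
    pub n: i32,
    pub k: i32,
    pub reply: Sender<Result<(), SketchError>>,
}

impl Request {
    /// Box the request and wrap it in the message variant.
    pub fn into_msg(self) -> Msg {
        Msg::Gemm(Box::new(self))
    }
}

impl Dispatch for Request {
    fn dtype_name(&self) -> &'static str {
        "f32"
    }

    fn dispatch(self: Box<Self>) {
        // A real dispatcher would launch the kernel here; this sketch only
        // checks the dimensions and reports the outcome.
        let result = if self.m < 0 || self.n < 0 || self.k < 0 {
            Err(SketchError("negative dimension".to_string()))
        } else {
            Ok(())
        };
        // The caller may have dropped its receiver; ignore that case.
        let _ = self.reply.send(result);
    }
}

/// Receiving side: unwrap the variant and run the boxed request.
pub fn run(msg: Msg) {
    match msg {
        Msg::Gemm(req) => req.dispatch(),
    }
}

/// Build a request, send it through `run`, and wait for the reply.
pub fn submit_and_wait(m: i32, n: i32, k: i32) -> Result<(), SketchError> {
    let (tx, rx) = channel();
    run(Request { m, n, k, reply: tx }.into_msg());
    rx.recv().expect("dispatcher dropped the reply sender")
}
```

Taking `self: Box<Self>` in `dispatch` lets the receiver consume the request, including its reply sender, without knowing the concrete element type.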

Trait Implementations§

impl GemmDispatch for GemmRequest<f32>

fn dtype_name(&self) -> &'static str

fn op_name(&self) -> &'static str

fn dispatch(self: Box<Self>, ctx: &BlasDispatchCtx<'_>)

impl GemmDispatch for GemmRequest<f64>

fn dtype_name(&self) -> &'static str

fn op_name(&self) -> &'static str

fn dispatch(self: Box<Self>, ctx: &BlasDispatchCtx<'_>)

Auto Trait Implementations§

impl<T> Freeze for GemmRequest<T>
where T: Freeze,

impl<T> !RefUnwindSafe for GemmRequest<T>

impl<T> Send for GemmRequest<T>

impl<T> Sync for GemmRequest<T>

impl<T> Unpin for GemmRequest<T>
where T: Unpin,

impl<T> UnsafeUnpin for GemmRequest<T>
where T: UnsafeUnpin,

impl<T> !UnwindSafe for GemmRequest<T>

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.

impl<T> Extension for T
where T: Any + Send + Sync,