Struct MatmulRequest 

pub struct MatmulRequest<T: GemmSupported> {
    pub a: GpuRef<T>,
    pub b: GpuRef<T>,
    pub c: GpuRef<T>,
    pub d: Option<GpuRef<T>>,
    pub m: i32,
    pub n: i32,
    pub k: i32,
    pub alpha: T::Scalar,
    pub beta: T::Scalar,
    pub transa: bool,
    pub transb: bool,
    pub lda: i64,
    pub ldb: i64,
    pub ldc: i64,
    pub ldd: i64,
    pub epilogue: Epilogue,
    pub bias: Option<GpuRef<T>>,
    pub gelu_aux: Option<GpuRef<T>>,
    pub scales: ScaleSet,
    pub workspace_size: usize,
    pub reply: Sender<Result<(), GpuError>>,
}

A typed matmul request. This struct is part of the public API and is constructed directly by callers.

Fields

a: GpuRef<T>
b: GpuRef<T>
c: GpuRef<T>
d: Option<GpuRef<T>>

Optional explicit D output buffer. cuBLASLt allows out-of-place matmul where the result lands in D rather than in-place into C. Required for fp8 (the scale-back step produces a different dtype than the accumulator).
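As an illustration of the out-of-place form, here is a minimal CPU reference for D = alpha * (A * B) + beta * C, the operation cuBLASLt's matmul computes. It is a sketch, not this crate's implementation: the function name `gemm_out_of_place` is hypothetical, the layout is assumed to be column-major with no transposes, and plain `f32` slices stand in for `GpuRef<T>`.

```rust
/// CPU reference for D = alpha * (A * B) + beta * C.
/// Assumptions (not from the crate): column-major storage, no transposes,
/// f32 slices instead of GPU buffers.
/// `c` is only read and `d` is only written, so C survives the call.
#[allow(clippy::too_many_arguments)]
pub fn gemm_out_of_place(
    m: usize,
    n: usize,
    k: usize,
    alpha: f32,
    a: &[f32],
    lda: usize,
    b: &[f32],
    ldb: usize,
    beta: f32,
    c: &[f32],
    ldc: usize,
    d: &mut [f32],
    ldd: usize,
) {
    for col in 0..n {
        for row in 0..m {
            // Dot product of row `row` of A with column `col` of B.
            let mut acc = 0.0f32;
            for p in 0..k {
                acc += a[row + p * lda] * b[p + col * ldb];
            }
            // Result goes to D; C is left untouched.
            d[row + col * ldd] = alpha * acc + beta * c[row + col * ldc];
        }
    }
}
```

Each matrix carries its own leading dimension, which is why the request has a separate `ldd` next to `ldc`.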

m: i32
n: i32
k: i32
alpha: T::Scalar
beta: T::Scalar
transa: bool
transb: bool
lda: i64
ldb: i64
ldc: i64
ldd: i64
epilogue: Epilogue
bias: Option<GpuRef<T>>
gelu_aux: Option<GpuRef<T>>
scales: ScaleSet
workspace_size: usize

Hint for the heuristic: maximum workspace bytes the algorithm search may use. A reasonable default is 4 * 1024 * 1024 (cuBLASLt’s standard 4 MiB minimum).
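A small sketch of how a caller might pick this value. The constant name `DEFAULT_WORKSPACE_BYTES` and the helper `workspace_hint` are hypothetical and not part of this crate; only the 4 MiB figure comes from the description above.

```rust
/// 4 MiB, the default suggested in the field description.
/// The constant name is an assumption for illustration.
pub const DEFAULT_WORKSPACE_BYTES: usize = 4 * 1024 * 1024;

/// Hypothetical helper: use the caller's requested size if given,
/// otherwise the 4 MiB default, and never exceed `budget` bytes.
pub fn workspace_hint(requested: Option<usize>, budget: usize) -> usize {
    requested.unwrap_or(DEFAULT_WORKSPACE_BYTES).min(budget)
}
```

The result would then be assigned to `workspace_size` when building the request.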

reply: Sender<Result<(), GpuError>>
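The `reply` field suggests a request/reply pattern in which the submitter keeps the receiving end of a channel and the worker sends the outcome back. The sketch below shows that pattern with `std::sync::mpsc`. Which channel type the crate actually uses is not stated on this page, so `std::sync::mpsc::Sender` is an assumption, and `Request`, `handle`, `submit`, and the `GpuError` variant are stand-ins.

```rust
use std::sync::mpsc::{channel, Sender};
use std::thread;

/// Stand-in error type; the real GpuError is not shown on this page.
#[derive(Debug, PartialEq)]
pub enum GpuError {
    InvalidShape,
}

/// Reduced stand-in for MatmulRequest: shape plus the reply channel.
pub struct Request {
    pub m: i32,
    pub n: i32,
    pub k: i32,
    pub reply: Sender<Result<(), GpuError>>,
}

/// Hypothetical worker-side handler: validate, then report the outcome.
pub fn handle(req: Request) {
    let outcome = if req.m > 0 && req.n > 0 && req.k > 0 {
        Ok(())
    } else {
        Err(GpuError::InvalidShape)
    };
    // The submitter may have gone away; ignore a closed channel.
    let _ = req.reply.send(outcome);
}

/// Hypothetical caller side: send a request to a worker and block on the reply.
pub fn submit(m: i32, n: i32, k: i32) -> Result<(), GpuError> {
    let (tx, rx) = channel();
    let worker = thread::spawn(move || handle(Request { m, n, k, reply: tx }));
    let outcome = rx.recv().expect("worker dropped the reply sender");
    worker.join().expect("worker panicked");
    outcome
}
```

Because the request owns its `Sender`, it can be moved to another thread and still report back to the original caller.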

Trait Implementations

impl BlasLtDispatch for MatmulRequest<f32>

fn dtype_kind(&self) -> DTypeKind

fn dispatch(self: Box<Self>, ctx: &BlasLtDispatchCtx<'_>)

impl BlasLtDispatch for MatmulRequest<f64>

Dispatch for f64 requests. Note that cudarc 0.19.4 has no Matmul<f64> impl.

fn dtype_kind(&self) -> DTypeKind

fn dispatch(self: Box<Self>, _ctx: &BlasLtDispatchCtx<'_>)
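The two impls above share one trait with a `self: Box<Self>` receiver, which lets requests of different element types sit in one queue as trait objects. The sketch below shows that shape with stand-in types. It assumes the f64 path answers with an "unsupported" error. That is a guess based on the unused `_ctx` parameter and the missing cudarc impl, not something this page confirms. `Dispatch`, `Ctx`, `Request`, `run`, and the `GpuError::Unsupported` variant are all hypothetical.

```rust
use std::sync::mpsc::Sender;

#[derive(Debug, PartialEq)]
pub enum DTypeKind {
    F32,
    F64,
}

/// Stand-in error type with a hypothetical variant.
#[derive(Debug, PartialEq)]
pub enum GpuError {
    Unsupported(&'static str),
}

/// Stand-in for BlasLtDispatchCtx.
pub struct Ctx;

/// Reduced stand-in for MatmulRequest<T>.
pub struct Request<T> {
    pub data: Vec<T>,
    pub reply: Sender<Result<(), GpuError>>,
}

/// Stand-in for BlasLtDispatch; `self: Box<Self>` keeps it object-safe
/// while still consuming the request.
pub trait Dispatch {
    fn dtype_kind(&self) -> DTypeKind;
    fn dispatch(self: Box<Self>, ctx: &Ctx);
}

impl Dispatch for Request<f32> {
    fn dtype_kind(&self) -> DTypeKind {
        DTypeKind::F32
    }
    fn dispatch(self: Box<Self>, _ctx: &Ctx) {
        // A real implementation would launch the matmul through the context.
        let _ = self.reply.send(Ok(()));
    }
}

impl Dispatch for Request<f64> {
    fn dtype_kind(&self) -> DTypeKind {
        DTypeKind::F64
    }
    fn dispatch(self: Box<Self>, _ctx: &Ctx) {
        // Assumed behavior: no backend impl, so report an error to the caller.
        let _ = self.reply.send(Err(GpuError::Unsupported("f64")));
    }
}

/// Drains a mixed-dtype queue and returns the kinds in dispatch order.
pub fn run(queue: Vec<Box<dyn Dispatch>>, ctx: &Ctx) -> Vec<DTypeKind> {
    let mut kinds = Vec::new();
    for req in queue {
        kinds.push(req.dtype_kind());
        req.dispatch(ctx);
    }
    kinds
}
```

With this shape the worker loop never needs to know the element type; each request carries its own dispatch logic and reply channel.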

impl<T: GemmSupported> Debug for MatmulRequest<T>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

Auto Trait Implementations

impl<T> Freeze for MatmulRequest<T>
where <T as AccelDtype>::Scalar: Freeze,

impl<T> !RefUnwindSafe for MatmulRequest<T>

impl<T> Send for MatmulRequest<T>

impl<T> Sync for MatmulRequest<T>

impl<T> Unpin for MatmulRequest<T>
where <T as AccelDtype>::Scalar: Unpin,

impl<T> UnsafeUnpin for MatmulRequest<T>
where <T as AccelDtype>::Scalar: UnsafeUnpin,

impl<T> !UnwindSafe for MatmulRequest<T>

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.

impl<T> Extension for T
where T: Any + Send + Sync,