Spaces:
Sleeping
Sleeping
cff-version: 1.2.0 | |
title: CUTLASS | |
message: >- | |
If you use this software, please cite using the | |
following metadata. | |
type: software | |
authors: | |
- given-names: Vijay | |
family-names: Thakkar | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Pradeep | |
family-names: Ramani | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Cris | |
family-names: Cecka | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Aniket | |
family-names: Shivam | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Honghao | |
family-names: Lu | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Ethan | |
family-names: Yan | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Jack | |
family-names: Kosaian | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Mark | |
family-names: Hoemmen | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Haicheng | |
family-names: Wu | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Andrew | |
family-names: Kerr | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Matt | |
family-names: Nicely | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Duane | |
family-names: Merrill | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Dustyn | |
family-names: Blasig | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Fengqi | |
family-names: Qiao | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Piotr | |
family-names: Majcher | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Paul | |
family-names: Springer | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Markus | |
family-names: Hohnerbach | |
affiliation: NVIDIA | |
email: [email protected] | |
- given-names: Jin | |
family-names: Wang | |
email: [email protected] | |
affiliation: NVIDIA | |
- given-names: Manish | |
family-names: Gupta | |
affiliation: Google | |
email: [email protected] | |
repository-code: 'https://github.com/NVIDIA/cutlass' | |
abstract: >- | |
CUTLASS is a collection of CUDA C++ template | |
abstractions for implementing high-performance | |
matrix-multiplication (GEMM) and related | |
computations at all levels and scales within CUDA. | |
It incorporates strategies for hierarchical | |
decomposition and data movement similar to those | |
used to implement cuBLAS and cuDNN. CUTLASS | |
decomposes these "moving parts" into reusable, | |
modular software components abstracted by C++ | |
template classes. These thread-wide, warp-wide, | |
block-wide, and device-wide primitives can be | |
specialized and tuned via custom tiling sizes, data | |
types, and other algorithmic policy. The resulting | |
flexibility simplifies their use as building blocks | |
within custom kernels and applications. | |
keywords: | |
- 'cutlass, tensor cores, cuda, cute, nvidia, gpu, linear algebra, matrix computations' | |
license: BSD-3-Clause | |
license-url: https://github.com/NVIDIA/cutlass/blob/v3.0.0/LICENSE.txt | |
version: '3.0.0' | |
date-released: '2023-01-23' | |
identifiers: | |
- type: url | |
value: "https://github.com/NVIDIA/cutlass/tree/v3.0.0" | |
description: The GitHub release URL of tag 3.0.0 | |