Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[mlir][sparse] replace "sparse compiler" with "sparsifier" in doc #67082

Merged
merged 3 commits into from
Sep 22, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 1 addition & 2 deletions mlir/include/mlir/Dialect/SparseTensor/IR/Enums.h
Original file line number Diff line number Diff line change
Expand Up @@ -190,8 +190,7 @@ enum class DimLevelType : uint8_t {
TwoOutOfFour = 64, // 0b10000_00
};

/// This enum defines all the storage formats supported by the sparse compiler,
/// without the level properties.
/// This enum defines all supported storage format without the level properties.
enum class LevelFormat : uint8_t {
Dense = 4, // 0b00001_00
Compressed = 8, // 0b00010_00
Expand Down
33 changes: 20 additions & 13 deletions mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorAttrDefs.td
Original file line number Diff line number Diff line change
Expand Up @@ -106,35 +106,36 @@ def SparseTensorEncodingAttr : SparseTensor_Attr<"SparseTensorEncoding",
sparsity-agnostic representation of the computation, i.e., an implicit sparse
representation is converted to an explicit sparse representation where co-iterating
loops operate on sparse storage formats rather than tensors with a sparsity
encoding. Compiler passes that run before this sparse compiler pass need to
be aware of the semantics of tensor types with such a sparsity encoding.
encoding. Compiler passes that run before this sparsifier pass need to be aware
of the semantics of tensor types with such a sparsity encoding.

In this encoding, we use `dimension` to refer to the axes of the semantic tensor,
and `level` to refer to the axes of the actual storage format, i.e., the
In this encoding, we use **dimension** to refer to the axes of the semantic tensor,
and **level** to refer to the axes of the actual storage format, i.e., the
operational representation of the sparse tensor in memory. The number of
dimensions is usually the same as the number of levels (such as CSR storage format).
However, the encoding can also map dimensions to higher-order levels (for example,
to encode a block-sparse BSR storage format) or to lower-order levels
(for example, to linearize dimensions as a single level in the storage).

The encoding contains a `map` that provides the following:
The encoding contains a map that provides the following:

- An ordered sequence of dimension specifications, each of which defines:
- the dimension-size (implicit from the tensor’s dimension-shape)
- a **dimension-expression**
- An ordered sequence of level specifications, each of which includes a required
**level-type**, which defines how the level should be stored. Each level-type
consists of:
- a **level-expression**, which defines what is stored
- a **level-format**
- a collection of **level-properties** that apply to the level-format
- a **level-expression**, which defines what is stored

Each level-expression is an affine expression over dimension-variables. Thus, the
level-expressions collectively define an affine map from dimension-coordinates to
level-coordinates. The dimension-expressions collectively define the inverse map,
which only needs to be provided for elaborate cases where it cannot be inferred
automatically. Within the sparse storage format, we refer to indices that are
stored explicitly as `coordinates` and indices into the storage format as `positions`.
stored explicitly as **coordinates** and offsets into the storage format as
**positions**.

The supported level-formats are the following:

Expand All @@ -145,26 +146,26 @@ def SparseTensorEncodingAttr : SparseTensor_Attr<"SparseTensorEncoding",
Different level-formats may have different collections of level-properties.
By default, each level-type has the property of being unique (no duplicate
coordinates at that level), ordered (coordinates appear sorted at that
level), and, for compression, storing the positions in a compact way where
an interval is defined by a lower bound "pos(i)" and an upper bound "pos(i+1)-1".
level), and, for compression, storing each position interval in a compact
way with a lowerbound `pos(i)`" and an upperbound `pos(i+1) - 1`.
The following properties can be added to a level-format to change this
default behavior:

- **nonunique** : duplicate coordinates may appear at the level
- **nonordered** : coordinates may appear in arbribratry order
- **high** : the upper bound is stored explicitly in a separate array
- **high** : position interval upperbounds are stored explicitly
- **block2_4** : the compression uses a 2:4 encoding per 1x4 block

In addition to the `map`, the following two fields are optional:
In addition to the map, the following two fields are optional:

- The required bitwidth for `position` storage (integral offsets
- The required bitwidth for position storage (integral offsets
into the sparse storage scheme). A narrow width reduces the memory
footprint of overhead storage, as long as the width suffices to
define the total required range (viz. the maximum number of stored
entries over all indirection levels). The choices are `8`, `16`,
`32`, `64`, or, the default, `0` to indicate the native bitwidth.

- The required bitwidth for `coordinate` storage (the coordinates
- The required bitwidth for coordinate storage (the coordinates
of stored entries). A narrow width reduces the memory footprint
of overhead storage, as long as the width suffices to define
the total required range (viz. the maximum value of each tensor
Expand Down Expand Up @@ -231,20 +232,26 @@ def SparseTensorEncodingAttr : SparseTensor_Attr<"SparseTensorEncoding",
```
}];

//
// Data in sparse tensor encoding.
//
let parameters = (
ins
// A level-type for each level of the sparse storage.
ArrayRefParameter<
"::mlir::sparse_tensor::DimLevelType",
"level-types"
>: $lvlTypes,

// A mapping from dimension-coordinates to level-coordinates.
"AffineMap":$dimToLvl,

// The required bitwidth for position storage.
"unsigned":$posWidth,

// The required bitwidth for coordinate storage.
"unsigned":$crdWidth,

// A slice attribute for each dimension of the tensor type.
ArrayRefParameter<
"::mlir::sparse_tensor::SparseTensorDimSliceAttr",
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -25,11 +25,16 @@ def SparseTensor_Dialect : Dialect {
means of a small sparse runtime support library.

The concept of **treating sparsity as a property, not a tedious
implementation detail**, by letting a **sparse compiler** generate
implementation detail**, by letting a **sparsifier** generate
sparse code automatically was pioneered for linear algebra by [Bik96]
in MT1 (see https://www.aartbik.com/sparse.php) and formalized
to tensor algebra by [Kjolstad17,Kjolstad20] in the Sparse Tensor
Algebra Compiler (TACO) project (see http://tensor-compiler.org).
Please note that we started to prefer the term "sparsifier" over
the also commonly used "sparse compiler" terminology to refer to
such a pass to make it clear that the sparsifier pass is not a
separate compiler, but should be an integral part of any compiler
pipeline that is built with the MLIR compiler infrastructure

The MLIR implementation [Biketal22] closely follows the "sparse
iteration theory" that forms the foundation of TACO. A rewriting
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -74,7 +74,7 @@ def SparseTensor_PackOp : SparseTensor_Op<"pack", [Pure]>,
sources; e.g., when passing two numpy arrays from Python.

Disclaimer: This is the user's responsibility to provide input that can be
correctly interpreted by the sparse compiler, which does not perform
correctly interpreted by the sparsifier, which does not perform
any sanity test during runtime to verify data integrity.

TODO: The returned tensor is allowed (in principle) to have non-identity
Expand Down Expand Up @@ -120,7 +120,7 @@ def SparseTensor_UnpackOp : SparseTensor_Op<"unpack", [Pure, SameVariadicResultS
unpacked MLIR sparse tensor to frontend; e.g., returning two numpy arrays to Python.

Disclaimer: This is the user's responsibility to allocate large enough buffers
to hold the sparse tensor. The sparse compiler simply copies each fields
to hold the sparse tensor. The sparsifier simply copies each fields
of the sparse tensor into the user-supplied buffer without bound checking.

TODO: the current implementation does not yet support non-identity mappings.
Expand Down Expand Up @@ -362,7 +362,7 @@ def SparseTensor_ToSliceOffsetOp : SparseTensor_Op<"slice.offset", [Pure]>,
Extracts the offset of the sparse tensor slice at the given dimension.

Currently, sparse tensor slices are still a work in progress, and only
works when runtime library is disabled (i.e., running sparse compiler
works when runtime library is disabled (i.e., running the sparsifier
with `enable-runtime-library=false`).

Example:
Expand All @@ -389,7 +389,7 @@ def SparseTensor_ToSliceStrideOp : SparseTensor_Op<"slice.stride", [Pure]>,
Extracts the stride of the sparse tensor slice at the given dimension.

Currently, sparse tensor slices are still a work in progress, and only
works when runtime library is disabled (i.e., running sparse compiler
works when runtime library is disabled (i.e., running the sparsifier
with `enable-runtime-library=false`).

Example:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -127,8 +127,8 @@ class SparseTensorType {
/// Allow implicit conversion to `RankedTensorType`, `ShapedType`,
/// and `Type`. These are implicit to help alleviate the impedance
/// mismatch for code that has not been converted to use `SparseTensorType`
/// directly. Once more of the sparse compiler has been converted to
/// using `SparseTensorType`, we may want to make these explicit instead.
/// directly. Once more uses have been converted to `SparseTensorType`,
/// we may want to make these explicit instead.
///
/// WARNING: This user-defined-conversion method causes overload
/// ambiguity whenever passing a `SparseTensorType` directly to a
Expand Down
4 changes: 2 additions & 2 deletions mlir/include/mlir/Dialect/SparseTensor/Transforms/Passes.td
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ def PreSparsificationRewrite : Pass<"pre-sparsification-rewrite", "ModuleOp"> {
def SparsificationPass : Pass<"sparsification", "ModuleOp"> {
let summary = "Automatically generate sparse tensor code from sparse tensor types";
let description = [{
A pass that implements the core functionality of a **sparse compiler**.
A pass that implements the core functionality of a **sparsifier**.
Each Linalg operation (MLIR's tensor index notation) that operates on
sparse tensor types is converted into code in which the sparsity is
explicit both in terms of co-iterating looping logic as well as
Expand Down Expand Up @@ -332,7 +332,7 @@ def SparseVectorization : Pass<"sparse-vectorization", "ModuleOp"> {
def SparseGPUCodegen : Pass<"sparse-gpu-codegen", "ModuleOp"> {
let summary = "Generates GPU code during sparsification";
let description = [{
Enables sparse compiler to use GPU acceleration.
Enables the sparsifier to use GPU acceleration.
}];
let constructor = "mlir::createSparseGPUCodegenPass()";
let dependentDialects = [
Expand Down