Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md

Section 2c - Data Layout Transformations

Section 2 - Data Movement (Object FIFOs)
- Section 2a - Introduction
- Section 2b - Key Object FIFO Patterns
- Section 2c - Data Layout Transformations
- Section 2d - Programming for multiple cores
- Section 2e - Practical Examples
- Section 2f - Data Movement Without Object FIFOs
- Section 2g - Runtime Data Movement

While the Object FIFO primitive aims to reduce the complexity tied to data movement configuration on the AI Engine array, it also gives the user control over some advanced features of the underlying architecture. One such feature is the ability to do data layout transformations on the fly using the tile's dedicated hardware: the Data Movement Accelerators (DMAs). This is available on AIE-ML devices.

Tile DMAs interact directly with the memory modules of their tiles and are responsible for pushing and retrieving data to and from the AXI stream interconnect. When data is pushed onto the stream, the user can program the DMA's n-dimensional address generation scheme such that the data's layout when pushed may be different than how it is stored in the tile's local memory. In the same way, a user can also specify in what layout a DMA should store the data retrieved from the AXI stream.

DMA blocks contain buffer descriptor operations that summarize what data is being moved, from what offset, how much of it, and in what layout. These buffer descriptors are the AIE_DMABDOp operations in MLIR and have their own auto-generated Python binding (available under <MLIR_AIE_INSTALL_PATH>/python/aie/dialects/_aie_ops_gen.py after the repository is built):

def dma_bd
    (
        buffer,
        *,
        offset=None,
        len=None,
        dimensions=None,
        bd_id=None,
        next_bd_id=None,
        loc=None,
        ip=None
    )

It is not necessary to understand these low-level operations in order to use the data layout transformations with the Object FIFO primitive.

A data layout transformation is presented as a tuple of pairs, where each pair represents a size and a stride for a particular dimension of the data:

[<size_2, stride_2>, <size_1, stride_1>, <size_0, stride_0>]

Transformations can be expressed in up to three dimensions on each compute and Shim tile, and in up to four dimensions on Mem tiles. The first pair of this array gives the outer-most dimension's stride and size <size_2, stride_2>, while the last pair of the array gives the inner-most dimension's stride and size <size_0, stride_0>. All strides are expressed in multiples of the element width.

NOTE: Only for 4B data types the inner-most dimension's stride must be 1 by design.

Data layout transformations can be viewed as a way to specify to the hardware which location in the data to access next and as such it is possible to model the access pattern using a series of nested loops. For example, the transformation using the strides and sizes from above can be expressed as:

int *buffer;
for(int i = 0; i < size_2; i++)
    for(int j = 0; j < size_1; j++)
        for(int k = 0; k < size_0; k++)
            // access/store element at/to buffer[  i * stride_2
            //                                   + j * stride_1
            //                                   + k * stride_0]

As a practical example, here is an access pattern that corresponds to alternating between even and odd elements every 8 elements in a 128 element buffer/stream:

aie.dma_bd(%buf : memref<128xi32>, 0, 128, [<8, 16>, <2, 1>, <8, 2>])

which translates to:

for(int i = 0; i < 8; i++)          // size_2
    for(int j = 0; j < 2; j++)      // size_1
        for(int k = 0; k < 8; k++)  // size_0
            // access/store element at/to index:
            (
                i * 16  // stride_2 
                + j * 1 // stride_1 
                + k * 2 // stride_0
            )

Data Layout Transformations with the Object FIFO

Remember that the Object FIFO class constructor has two default-valued inputs: dimensionsToStream and dimensionsFromStreamPerConsumer.

class object_fifo:
    def __init__(
        self,
        name,
        producerTile,
        consumerTiles,
        depth,
        datatype,
        dimensionsToStream=None,
        dimensionsFromStreamPerConsumer=None,
    )

Our compiler directly lowers Object FIFOs that make use of the aforementioned data layout transformations to AIE_DMABDOp. You can use the dimensionsToStream input to describe in which order the producerTile's DMA should push the objects onto the stream. Similarly, the dimensionsFromStreamPerConsumer input describes to the DMAs of each individual tile in the consumerTiles in what layout to retrieve the objects from the stream.

As an example, the Object FIFO in the code below contains objects with datatype <4x8xi8>. Using the dimensionsToStream input it performs a data layout transformation on the producer tile side that pushes elements from memory onto the stream as follows: For every even length-8 row, select the first three even-indexed elements.

A = tile(1, 1)
B = tile(1, 3)
of0 = object_fifo
    (
        "objfifo0",
        A,
        B,
        3,
        np.ndarray[(4, 8), np.dtype[np.int8]],
        [
            (2, 16),
            (3, 2),
        ],
    )

The access pattern of the transformation can be written as:

for(int i = 0; i < 2; i++)      // size_1
    for(int j = 0; j < 3; j++)  // size_0
        // access/store element at/to index:
        (
            i * 16  // stride_1 
            + j * 2 // stride_0
        )

and further represented as in the image below:

Other examples containing data layout transformations are available in the programming_examples. A few notable ones are matrix_vector_multiplication and matrix_multiplication_whole_array.

[Prev - Section 2b] [Up] [Next - Section 2d]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

section-2c

section-2c

README.md

Section 2c - Data Layout Transformations

Data Layout Transformations with the Object FIFO

Files

section-2c

Directory actions

More options

Directory actions

More options

Latest commit

History

section-2c

Folders and files

parent directory

README.md

Section 2c - Data Layout Transformations

Data Layout Transformations with the Object FIFO