[Tracking Issue] Abstraction for sub-warp reduction in the IR. #55

Open
yzh119 opened this issue Oct 14, 2022 · 0 comments

In apache/tvm#10207 we introduced sub-warp reduction.

A user can use one warp (32 threads) to perform eight 4-element reductions in parallel:

import tvm
from tvm.script import tir as T


@T.prim_func
def subwarpreduce(a: T.handle, b: T.handle) -> None:
    A = T.match_buffer(a, [8, 4], dtype="float32")
    B = T.match_buffer(b, [8], dtype="float32")
    # Eight independent 4-element reductions: B[i] = sum_j A[i, j].
    for o, i, j in T.grid(1, 8, 4):
        with T.block("red"):
            vi, vj = T.axis.remap("SR", [i, j])
            with T.init():
                B[vi] = 0.0
            B[vi] = B[vi] + A[vi, vj]

def test_subwarp_reduce_flag():
    mod = tvm.IRModule.from_expr(subwarpreduce)
    sch = tvm.tir.Schedule(mod["main"])
    blk = sch.get_block("red")
    o, i, j = sch.get_loops(blk)
    sch.bind(i, "threadIdx.y")  # 8 groups
    sch.bind(j, "threadIdx.x")  # 4 lanes per group, i.e. a sub-warp reduction
    sch.bind(o, "blockIdx.x")

However, in some cases threadIdx.x is already bound with an extent of 32 in other blocks of the same kernel, which leads to a contradiction.
An alternative is to fuse i and j and bind the fused loop to the threads of a warp:

@T.prim_func
def func(A: T.Buffer[(8, 4), "float32"], B: T.Buffer[(8,), "float32"]) -> None:
    # body
    # with T.block("root")
    for o in T.thread_binding(1, thread="blockIdx.x"):
        for i_j_fused in T.thread_binding(32, thread="threadIdx.x"):
            with T.block("red"):
                vi = T.axis.spatial(8, i_j_fused // 4)
                vj = T.axis.reduce(4, i_j_fused % 4)
                T.reads(A[vi, vj])
                T.writes(B[vi])
                with T.init():
                    B[vi] = T.float32(0)
                B[vi] = B[vi] + A[vi, vj]
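
For reference, the fused form above can be reproduced from the original subwarpreduce function with the existing fuse and bind schedule primitives; a minimal sketch:

# Sketch: derive the fused binding from the original subwarpreduce function.
mod = tvm.IRModule.from_expr(subwarpreduce)
sch = tvm.tir.Schedule(mod)
blk = sch.get_block("red")
o, i, j = sch.get_loops(blk)
fused = sch.fuse(i, j)          # extent 8 * 4 = 32, one full warp
sch.bind(fused, "threadIdx.x")
sch.bind(o, "blockIdx.x")
print(sch.mod.script())         # should print the IR shown above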

Nevertheless, in this case, the LowerThreadAllReduce pass cannot recognize the sub-warp reduction structure and will emit code that reduces all threads in a warp together.
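
One way to observe this concretely is to build the fused schedule from the sketch above for CUDA and inspect the generated kernel source (assuming a CUDA-enabled TVM build); the emitted shuffle-based reduction spans all 32 lanes of the warp instead of the four-lane groups:

# Sketch: inspect the CUDA kernel generated from the fused schedule `sch`.
rt_mod = tvm.build(sch.mod, target="cuda")
print(rt_mod.imported_modules[0].get_source())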

There should be an abstraction to denote sub-warp reduction in TensorIR.
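
As a purely hypothetical illustration of what such an abstraction might look like, one option is a block annotation that tells the lowering passes how many consecutive lanes a reduction group spans (the reduce_group_size attribute below does not exist in TVM; it only sketches the kind of hint that is missing):

@T.prim_func
def subwarpreduce_annotated(A: T.Buffer[(8, 4), "float32"], B: T.Buffer[(8,), "float32"]) -> None:
    for o in T.thread_binding(1, thread="blockIdx.x"):
        for i_j_fused in T.thread_binding(32, thread="threadIdx.x"):
            with T.block("red"):
                # Hypothetical hint: the reduction only spans groups of
                # 4 consecutive lanes within the warp.
                T.block_attr({"reduce_group_size": 4})
                vi = T.axis.spatial(8, i_j_fused // 4)
                vj = T.axis.reduce(4, i_j_fused % 4)
                with T.init():
                    B[vi] = T.float32(0)
                B[vi] = B[vi] + A[vi, vj]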
