Skip to content

support grouped query attention(MQA & GQA) for flash_attn (#22) #32

support grouped query attention(MQA & GQA) for flash_attn (#22)

support grouped query attention(MQA & GQA) for flash_attn (#22) #32

build

succeeded May 27, 2024 in 4m 9s