Is the element-wise reduce op functor, specialized for stream_k dispatch.
#include <tile_op_functor.hpp>
This functor performs the following steps for stream_k dispatch:
1. Load the partial sum from scratch space.
2. Reduce in GRF.
3. Store zero to scratch space.
These steps are done with smaller tiles to minimize GRF pressure.
| reduce_kind | Is the reduce type; can be sum, prod, min, or max. |
| dtype_in | Is the memory-side buffer data type. |
| arch_tag | Is the hardware architecture tag. |
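
The load, reduce, and store-zero steps described for this functor can be illustrated with a small host-side model. This is only a sketch under stated assumptions, not the library's implementation: `reduce_from_scratch`, `reduce_scalar`, and the `chunk` parameter are hypothetical names introduced here, the local `reduce_kind` enum merely stands in for the template parameter of the same name, plain `std::vector` buffers stand in for the accumulator held in GRF and for the scratch space in memory, and `chunk` models the "smaller tiles" used to limit register pressure.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the reduce_kind template parameter.
enum class reduce_kind { sum, prod, min, max };

// Combine two scalars according to the selected reduce kind.
template <reduce_kind kind, typename T>
T reduce_scalar(T a, T b) {
    if constexpr (kind == reduce_kind::sum) {
        return a + b;
    } else if constexpr (kind == reduce_kind::prod) {
        return a * b;
    } else if constexpr (kind == reduce_kind::min) {
        return std::min(a, b);
    } else {
        return std::max(a, b);
    }
}

// Host-side model (assumption): `acc` plays the role of the accumulator
// held in registers, `scratch` the scratch space in memory. The buffers
// are walked in pieces of `chunk` elements to mimic processing smaller
// tiles instead of the whole accumulator at once.
template <reduce_kind kind, typename T>
void reduce_from_scratch(std::vector<T>& acc, std::vector<T>& scratch,
                         std::size_t chunk) {
    const std::size_t n = std::min(acc.size(), scratch.size());
    if (chunk == 0) {
        chunk = 1;  // guard against a zero-sized tile
    }
    for (std::size_t base = 0; base < n; base += chunk) {
        const std::size_t end = std::min(base + chunk, n);
        for (std::size_t i = base; i < end; ++i) {
            const T partial = scratch[i];                   // 1. load partial result
            acc[i] = reduce_scalar<kind>(acc[i], partial);  // 2. reduce
            scratch[i] = T(0);                              // 3. store zero back
        }
    }
}
```

In this model, calling `reduce_from_scratch<reduce_kind::sum>(acc, scratch, 2)` folds the scratch contents into `acc` two elements at a time and leaves `scratch` zeroed. The block uses `if constexpr`, so it needs C++17 or later.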