XeTLA v0.3.6
Intel® Xe Templates for Linear Algebra - API Definition Document
 
gpu::xetla::group::global_reduce_t< reduce_kind, tile_shape_acc, tile_shape_cnt, mem_desc_acc_t, mem_desc_cnt_t, num_group_reduction, counter_size, arch_tag, enable > Class Template Reference

Cross group global reduction.

#include <global_reduction.hpp>

Detailed Description

template<reduce_op reduce_kind, typename tile_shape_acc, typename tile_shape_cnt, typename mem_desc_acc_t, typename mem_desc_cnt_t, uint32_t num_group_reduction, uint32_t counter_size, gpu_arch arch_tag, class enable = void>
class gpu::xetla::group::global_reduce_t< reduce_kind, tile_shape_acc, tile_shape_cnt, mem_desc_acc_t, mem_desc_cnt_t, num_group_reduction, counter_size, arch_tag, enable >

Cross group global reduction.

Template Parameters
reduce_kind            Is the reduction type.
tile_shape_acc         Is the group-level tile shape for accumulation tile.
tile_shape_cnt         Is the group-level tile shape for counter tile.
mem_desc_acc_t         Is the memory descriptor of accumulation buffer.
mem_desc_cnt_t         Is the memory descriptor of counter buffer.
num_group_reduction    Is the number of workgroups to do the reduction.
counter_size           Is the stride to the next counter.
arch_tag               Is the HW architecture.
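
The roles of the accumulation buffer, the counter buffer, num_group_reduction and counter_size can be illustrated with a small host-side model. This is a minimal sketch under stated assumptions, not XeTLA code: it assumes a sum reduction, assumes that each workgroup adds its partial tile into the accumulation buffer and then bumps a per-tile counter, and assumes that the group which observes the counter reaching num_group_reduction is the one that reads the final result. The names reduction_buffers, contribute_partial and reduce_across_groups are hypothetical and do not appear in the library.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical host-side model; none of these names come from XeTLA.
struct reduction_buffers {
    std::vector<float> acc;          // plays the role of the accumulation buffer
    std::vector<std::uint32_t> cnt;  // plays the role of the counter buffer
};

// Models one workgroup contributing its partial tile (sum reduction assumed).
// Returns true for the last of num_group_reduction groups to arrive, which is
// the only group that sees the fully reduced tile.
inline bool contribute_partial(reduction_buffers& buf,
                               const std::vector<float>& partial,
                               std::size_t tile_id,
                               std::uint32_t num_group_reduction,
                               std::uint32_t counter_size) {
    assert(counter_size >= 1);  // assumption: a stride of zero is meaningless
    const std::size_t base = tile_id * partial.size();
    // On a GPU these updates would need to be atomic; here groups run in turn.
    for (std::size_t i = 0; i < partial.size(); ++i) {
        buf.acc[base + i] += partial[i];
    }
    // counter_size is used as the stride between the counters of two tiles.
    std::uint32_t& counter = buf.cnt[tile_id * counter_size];
    counter += 1;
    return counter == num_group_reduction;
}

// Runs every group's contribution for a single tile (tile_id 0) and returns
// the tile as seen by the last group to arrive.
inline std::vector<float> reduce_across_groups(
        const std::vector<std::vector<float>>& partials,
        std::uint32_t counter_size) {
    const std::size_t tile_elems = partials.empty() ? 0 : partials[0].size();
    reduction_buffers buf{std::vector<float>(tile_elems, 0.0f),
                          std::vector<std::uint32_t>(counter_size, 0u)};
    const auto num_groups = static_cast<std::uint32_t>(partials.size());
    std::vector<float> result;
    for (const auto& p : partials) {
        if (contribute_partial(buf, p, 0, num_groups, counter_size)) {
            result = buf.acc;  // only the last group reads the final sums
        }
    }
    return result;
}
```

Calling reduce_across_groups with three partial tiles stands in for num_group_reduction = 3. The sequential loop replaces real concurrency, so the model shows only the bookkeeping, not the synchronization a device implementation would need.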