This is the group reduction.
#include <reduction_api.hpp>
The reduction exchanges data between subgroups through shared local memory (SLM).
| Template parameter | Description |
| --- | --- |
| T | Data type of the values being reduced. |
| SZ | Vector size per item. |
| N | Number of independent sets that one subgroup reduces in parallel during the all-reduction. |
| Op | Reduction operation. |
| N_SG | Number of subgroups that participate in this reduction. |
| is_all_reduce | Flag that enables all-reduce. If false, only the subgroup with sg_id 0 holds the updated result; otherwise all N_SG subgroups hold the updated result. |
| arch_ | Hardware generation. |
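
The effect of is_all_reduce can be illustrated with a small host-side emulation. This is only a sketch under stated assumptions, not the interface declared in reduction_api.hpp: the function name emulate_group_reduce, the use of std::array to stand in for per-subgroup registers and for SLM, and the default std::plus operation are all hypothetical, and the N and arch_ parameters are left out for brevity.

```cpp
#include <array>
#include <cstddef>
#include <functional>

// Host-side emulation only. Each entry of `sg_data` plays the role of one
// subgroup's private vector, and the local `slm` buffer stands in for shared
// local memory. None of these names come from reduction_api.hpp.
template <typename T, std::size_t SZ, std::size_t N_SG, bool is_all_reduce,
          typename Op = std::plus<T>>
void emulate_group_reduce(std::array<std::array<T, SZ>, N_SG>& sg_data,
                          Op op = Op{}) {
    static_assert(N_SG > 0, "at least one subgroup must participate");

    // Step 1: every subgroup stores its private vector into the shared buffer.
    std::array<std::array<T, SZ>, N_SG> slm = sg_data;

    // Step 2: combine the stored vectors element by element across subgroups.
    std::array<T, SZ> result = slm[0];
    for (std::size_t sg = 1; sg < N_SG; ++sg) {
        for (std::size_t i = 0; i < SZ; ++i) {
            result[i] = op(result[i], slm[sg][i]);
        }
    }

    // Step 3: subgroup 0 always receives the reduced vector; the remaining
    // subgroups receive it only when is_all_reduce is true.
    sg_data[0] = result;
    if (is_all_reduce) {
        for (std::size_t sg = 1; sg < N_SG; ++sg) {
            sg_data[sg] = result;
        }
    }
}
```

For three subgroups holding {1, 2}, {3, 4}, and {5, 6}, a sum reduction with is_all_reduce set to true leaves {9, 12} in every subgroup. With is_all_reduce set to false, only subgroup 0 holds {9, 12} and the other two keep their original values.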