Is the element-wise gelu training forward op functor.
#include <tile_op_functor.hpp>
Get the gelu input from matAcc, update the gelu output in place, and dump the intermediate buffer_w to memory for use in the backward pass. Used in epilogue::tile_op or chained_tile_op.
| dtype_out | Is the data type of the intermediate buffer_w. |
| arch_tag | Is the hardware architecture tag. |
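The in-place update and the saved buffer_w described above can be illustrated with a scalar reference sketch. This is not the library's tile-level implementation: it assumes the tanh approximation of GELU and assumes that buffer_w stores the derivative of GELU with respect to its input, so that a backward op could multiply the incoming gradient by it. The names `GeluFwdResult`, `gelu_fwd_w_reference`, and `gelu_fwd_w_inplace` are hypothetical and exist only for this illustration.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical scalar reference, NOT the tile_op_functor.hpp implementation.
// Assumption 1: GELU uses the tanh approximation.
// Assumption 2: buffer_w holds d(gelu)/dx for reuse in the backward pass.
struct GeluFwdResult {
    float y;  // gelu(x), the value written back in place
    float w;  // d(gelu)/dx, the value dumped to buffer_w
};

inline GeluFwdResult gelu_fwd_w_reference(float x) {
    constexpr float kC0 = 0.044715f;
    constexpr float kSqrtTwoOverPi = 0.7978845608f;
    const float t = std::tanh(kSqrtTwoOverPi * (x + kC0 * x * x * x));
    const float y = 0.5f * x * (1.0f + t);
    // Product rule on 0.5 * x * (1 + tanh(u)), with
    // du/dx = sqrt(2/pi) * (1 + 3 * C0 * x^2).
    const float w = 0.5f * (1.0f + t) +
                    0.5f * x * (1.0f - t * t) * kSqrtTwoOverPi *
                        (1.0f + 3.0f * kC0 * x * x);
    return {y, w};
}

// Mimics the described data flow on a flat array standing in for matAcc:
// the accumulator is overwritten with gelu(x), buffer_w receives the
// intermediate values.
inline void gelu_fwd_w_inplace(std::vector<float>& acc,
                               std::vector<float>& buffer_w) {
    buffer_w.resize(acc.size());
    for (std::size_t i = 0; i < acc.size(); ++i) {
        const GeluFwdResult r = gelu_fwd_w_reference(acc[i]);
        acc[i] = r.y;
        buffer_w[i] = r.w;
    }
}
```

Under these assumptions, an input of 0 yields an output of 0 and a stored intermediate of 0.5. Saving the intermediate in the forward pass trades extra memory traffic for not having to recompute the tanh term during the backward pass.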