tdpbf16ps
Tile Dot Product BFloat16 Packed Single
TDPBF16PS tmm1, tmm2, tmm3
Matrix multiply (BFloat16) accumulating to Float32.
Details
The Tile Dot Product BFloat16 Packed Single instruction matrix multiply (BFloat16) accumulating to Float32.
Pseudocode Operation
// Matrix multiply (BFloat16) accumulating to Float32
Example
TDPBF16PS tmm1, tmm2, tmm3
Encoding
Binary Layout
VEX
+0
opcode
+3
ModRM
+4
Operands
-
dest
AMX tile register -
src1
AMX tile register -
src2
AMX tile register