tdpfp16ps
Tile Dot Product FP16 Packed Single
TDPFP16PS tmm1, tmm2, tmm3
Matrix multiply (FP16 * FP16) accumulating to Float32.
Details
The Tile Dot Product FP16 Packed Single instruction matrix multiply (FP16 * FP16) accumulating to Float32.
Pseudocode Operation
// Matrix multiply (FP16 * FP16) accumulating to Float32
Example
TDPFP16PS tmm1, tmm2, tmm3
Encoding
Binary Layout
VEX
+0
opcode
+3
ModRM
+4
Operands
-
dest
TMM -
src1
TMM -
src2
TMM