tdpfp16ps

Tile Dot Product FP16 Packed Single

TDPFP16PS tmm1, tmm2, tmm3

Matrix multiply (FP16 * FP16) accumulating to Float32.

Details

The Tile Dot Product FP16 Packed Single instruction matrix multiply (FP16 * FP16) accumulating to Float32.

Pseudocode Operation

// Matrix multiply (FP16 * FP16) accumulating to Float32

Example

TDPFP16PS tmm1, tmm2, tmm3

Encoding

Binary Layout
VEX
+0
opcode
+3
ModRM
+4
 
Format VEX
Opcode C4 ... 5C
Extension AMX-FP16

Operands

  • dest
    TMM
  • src1
    TMM
  • src2
    TMM