tdpbf16ps

Tile Dot Product BFloat16 Packed Single

TDPBF16PS tmm1, tmm2, tmm3

Matrix multiply (BFloat16) accumulating to Float32.

Details

The Tile Dot Product BFloat16 Packed Single instruction matrix multiply (BFloat16) accumulating to Float32.

Pseudocode Operation

// Matrix multiply (BFloat16) accumulating to Float32

Example

TDPBF16PS tmm1, tmm2, tmm3

Encoding

Binary Layout
VEX
+0
opcode
+3
ModRM
+4
 
Format VEX
Opcode C4 ... 5C
Extension AMX-BF16

Operands

  • dest
    AMX tile register
  • src1
    AMX tile register
  • src2
    AMX tile register