_mm256_dp_ps

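Conditionally multiply the packed single-precision (32-bit) floating-point elements in a and b using the high 4 bits of imm8, sum the four products, and conditionally store the sum in the result using the low 4 bits of imm8. The operation is performed independently on each 128-bit lane, with the same imm8 applied to both lanes; result elements whose low mask bit is clear are set to zero.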

nothrow @nogc __m256 _mm256_dp_ps(int imm8)(__m256 a, __m256 b)
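
The masking rules described above can be illustrated with a scalar model. This is only a sketch: `dpPsModel` is a hypothetical helper written for this example, not part of the library, and it works on plain `float[8]` arrays instead of `__m256` so that it runs without AVX support. It assumes that element 0 is the lowest element of the vector and that the selected products are summed pairwise; rounding may therefore not match the hardware bit for bit in every case.

```d
/// Hypothetical scalar model of the _mm256_dp_ps semantics (illustration only).
float[8] dpPsModel(int imm8)(const float[8] a, const float[8] b)
    pure nothrow @nogc @safe
{
    float[8] r = 0.0f;                 // unselected result elements stay zero
    foreach (lane; 0 .. 2)             // two independent 128-bit lanes
    {
        immutable base = lane * 4;
        float[4] prod = 0.0f;
        foreach (i; 0 .. 4)
        {
            // Bits 4..7 of imm8 select which products take part in the sum.
            if (imm8 & (0x10 << i))
                prod[i] = a[base + i] * b[base + i];
        }
        // Pairwise summation of the four (possibly zeroed) products.
        immutable float sum = (prod[3] + prod[2]) + (prod[1] + prod[0]);
        foreach (i; 0 .. 4)
        {
            // Bits 0..3 of imm8 select which result elements receive the sum.
            if (imm8 & (1 << i))
                r[base + i] = sum;
        }
    }
    return r;
}

void main()
{
    float[8] a = [1, 2, 3, 4, 5, 6, 7, 8];
    float[8] b = [1, 1, 1, 1, 2, 2, 2, 2];

    // 0xF1: use all four products, store the sum in element 0 of each lane.
    float[8] expectedF1 = [10, 0, 0, 0, 52, 0, 0, 0];
    assert(dpPsModel!0xF1(a, b) == expectedF1);

    // 0x3F: use products 0 and 1 only, broadcast the sum to all four elements.
    float[8] expected3F = [3, 3, 3, 3, 22, 22, 22, 22];
    assert(dpPsModel!0x3F(a, b) == expected3F);
}
```

As in the real intrinsic, imm8 is a template argument here, so the mask must be known at compile time.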
