_mm_maskstore_pd

Store packed double-precision (64-bit) floating-point elements from a into memory using mask. Note: emulating that instruction isn't efficient, since it needs to perform memory access only when needed. See: "Note about mask load/store" to know why you must address valid memory only.

Meta