@shachaf I think a guarantee (AVX CPUID flag must be set, natural alignment is a requirement, etc) was only very recently (I found an SO answer that says 2021) added to Intel's instruction manual so compilers are probably lagging. In the mean time you can implement it yourself using intrinsics.
Add comment