4 Answered Questions

[SOLVED] Fastest way to do horizontal float vector sum on x86

4 Answered Questions

[SOLVED] AVX2 what is the most efficient way to pack left based on a mask?

2 Answered Questions

[SOLVED] Why is this SSE code 6 times slower without VZEROUPPER on Skylake?

1 Answered Questions

[SOLVED] What are the best instruction sequences to generate vector constants on the fly?

6 Answered Questions

[SOLVED] SSE instructions: which CPUs can do atomic 16B memory operations?

5 Answered Questions

[SOLVED] How to check if a CPU supports the SSE3 instruction set?

2 Answered Questions

4 Answered Questions

[SOLVED] SIMD prefix sum on Intel cpu

1 Answered Questions

[SOLVED] parallel prefix (cumulative) sum with SSE

  • 2013-10-21 12:10:18
  • Z boson
  • 3665 View
  • 10 Score
  • 1 Answer
  • Tags:   c sum openmp sse

6 Answered Questions

[SOLVED] Why is SSE scalar sqrt(x) slower than rsqrt(x) * x?

3 Answered Questions

[SOLVED] What is the meaning of "non temporal" memory accesses in x86

  • 2008-08-31 20:18:34
  • Nathan Fellman
  • 31398 View
  • 113 Score
  • 3 Answer
  • Tags:   x86 sse assembly

2 Answered Questions

[SOLVED] How to implement atoi using SIMD?

  • 2016-02-01 09:33:51
  • the_drow
  • 3645 View
  • 27 Score
  • 2 Answer
  • Tags:   c++ x86 sse simd atoi

4 Answered Questions

[SOLVED] SSE integer division?

  • 2013-05-29 19:58:05
  • fogbit
  • 8837 View
  • 18 Score
  • 4 Answer
  • Tags:   c++ sse

4 Answered Questions

8 Answered Questions

4 Answered Questions

[SOLVED] print a __m128i variable

1 Answered Questions

[SOLVED] Fast vectorized rsqrt and reciprocal with SSE/AVX depending on precision

1 Answered Questions

[SOLVED] Is it possible to use SSE and SSE2 to make a 128-bit wide integer?

  • 2012-08-30 15:45:41
  • Erkling
  • 1422 View
  • 10 Score
  • 1 Answer
  • Tags:   assembly sse sse2

2 Answered Questions

8 Answered Questions

[SOLVED] How is a vector's data aligned?

2 Answered Questions

[SOLVED] How to use Fused Multiply-Add (FMA) instructions with SSE/AVX

1 Answered Questions

[SOLVED] Fastest way to compute absolute value using SSE

2 Answered Questions

[SOLVED] How to efficiently perform double/int64 conversions with SSE/AVX?

3 Answered Questions

3 Answered Questions

[SOLVED] SSE, intrinsics, and alignment

3 Answered Questions

[SOLVED] practical BigNum AVX/SSE possible?