The non programmers might balk at the text, but some of the graphs should be plain enough:
http://www.phoronix.com/scan.php?page=article&item=intel_core_avx2&num=1
AVX2 turns out to be much better than originally expected if properly optimized for.
Intelās latest compiler (which has always been ridiculously strong on numerical) already hinted at that, but it was hard to get past the rumours of Intel making it purposedly so for the sake of making them appear better than they are. When GCC pulls it off though you know itās outside of marketing scheme and well into āit is soā territory.
4.9 of GCC might do an even better job again.
Turns out if the compiler does a good job of autovectorization and is used to specifically target AVX2 (the newest SSE like instruction set on Haswell) there is a very real benefit to be had on the cheap.
Sadly, most of the apps you use day in and day out not only arenāt built to that level of optimization, they are often compiled largely, if not entirely, with ancient compilers that will barely tap SSE2.
