On 2016-01-20 02:05, Gael Guennebaud wrote:
Eigen: 0.342062s

Blitz++: 14.6185s

(compiled with gcc 5 -O3 -DNDEBUG -mfma)

I am not surprised that Eigen is faster -- mostly because it uses SIMD.
But for a fairer comparison, I think you should at least initialize the values in the Blitz++ implementation (working on uninitialized values tends to be slow if the data happens to contain a number of NaNs and Infs). It would also be worth comparing how big the difference is without SIMD (or with just SSE2/3/4 instead of AVX+FMA).
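
To make the initialization point concrete, here is a minimal sketch in plain C++. It is not the benchmark from this thread (that kernel is not shown here), and every name in it (make_initialized, all_finite, axpy, time_axpy) is made up for illustration. The idea is simply to fill the inputs with finite values and check them before timing anything, so that both libraries work on the same well-defined data.

```cpp
#include <cassert>
#include <chrono>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: fill a buffer with finite, well-defined values
// (start, start + step, start + 2*step, ...).
std::vector<double> make_initialized(std::size_t n, double start, double step) {
    std::vector<double> v(n);
    for (std::size_t i = 0; i < n; ++i) {
        v[i] = start + step * static_cast<double>(i);
    }
    return v;
}

// Returns true if no element is NaN or Inf.
bool all_finite(const std::vector<double>& v) {
    for (double x : v) {
        if (!std::isfinite(x)) return false;
    }
    return true;
}

// Stand-in kernel (an assumption, not the kernel from the benchmark):
// y[i] += a * x[i]
void axpy(double a, const std::vector<double>& x, std::vector<double>& y) {
    assert(x.size() == y.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] += a * x[i];
    }
}

// Runs the kernel `repeats` times and returns the elapsed wall time in seconds.
double time_axpy(double a, const std::vector<double>& x,
                 std::vector<double>& y, int repeats) {
    const auto t0 = std::chrono::steady_clock::now();
    for (int r = 0; r < repeats; ++r) {
        axpy(a, x, y);
    }
    const auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(t1 - t0).count();
}
```

One would call make_initialized for each input, verify it with all_finite, and only then start the timer. For the SIMD comparison, the same source could be rebuilt with a different gcc target option (for example -msse4.2 in place of -mfma), and as far as I know defining EIGEN_DONT_VECTORIZE turns off Eigen's explicit vectorization.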


