[eigen] Eigen is slow with complex matrices

[ Thread Index | Date Index | More lists.tuxfamily.org/eigen Archives ]

To: eigen@xxxxxxxxxxxxxxxxxxx
Subject: [eigen] Eigen is slow with complex matrices
From: Robert Lupton the Good <rhl@xxxxxxxxxxxxxxxxxxx>
Date: Mon, 19 Oct 2009 07:22:35 -0400
Cc: Mike Jarvis <michael@xxxxxxxxxx>

We're using eigen on a large astronomical project; it's great -- noneed to find the proper combination du jour of optimised blas andlapack.

Unfortunately, in the context of complex matrices we are having toreconsider the use of BLAS. The reason appears to be in the complexclass itself (see below) Is there any way that Eigen can work aroundthis?

It occurs to me that you might be interested in the reason Eigen isslow for complex calculations compared to BLAS. It isn't reallytheir fault. It's the fault of the implementers of the complexclass in the std library.
One of Scott Meyer's recommendations for writing operator+ andoperator+= is to always implement operator+ _in terms of_ operator+=. (Likewise for the other operators.)
In general, this is a good practice to follow. It helps make surethat the two operations are consistent with each other, and soavoids unexpected behavior.
However, it turns out that multiplying to complex numbers is a casewhere the opposite is true. operator*= should instead beimplemented in terms of operator*. Look at the commonimplementation of the two functions in the standard library:
complex<_Tp>&
complex<_Tp>::operator*=(const complex<_Up>& __z)
{
   const _Tp __r = _M_real * __z.real() - _M_imag * __z.imag();
   _M_imag = _M_real * __z.imag() + _M_imag * __z.real();
   _M_real = __r;
   return *this;
}

inline complex<_Tp>
operator*(const complex<_Tp>& __x, const complex<_Tp>& __y)
{
   complex<_Tp> __r = __x;
   __r *= __y;
   return __r;
}
The implementation for *= is probably fine for itself. But doing *in terms of it requires an extra temporary, and, more importantly,it moves the memory around in a way that no compiler I've tried hasbeen able to pipeline effectively.
A better implementation would be to have op* directly calculate bothcomponents of the result, and return it as a new complex object:
{
   return complex<_Tp>(
       _x.real() * _y.real() - _x.imag() * _y.imag(),
       _x.real() * _y.imag() + _x.imag() * _y.real());
}
If you wanted to follow the spirit of Scott Meyer's rule, you couldthen implement op*= in terms of op*:
{ *this = *this * _z; return *this; }

But I haven't found any stl implementation that does it this way.

Follow-Ups:
- Re: [eigen] Eigen is slow with complex matrices
  - From: Benoit Jacob

Messages sorted by: [ date | thread ]
Prev by Date: [eigen] Skyline matrix
Next by Date: Re: [eigen] Eigen is slow with complex matrices
Previous by thread: Re: [eigen] Skyline matrix
Next by thread: Re: [eigen] Eigen is slow with complex matrices

Mail converted by MHonArc 2.6.19+

http://listengine.tuxfamily.org/