Re: [eigen] Optimization advice for a specific expression



Gael Guennebaud writes:

> Hi,
>
> you can avoid one temp with:
>
> I4x4.noalias() += ...;
>

OK.  That makes no big difference, though.

>
> Then the slowdown might come from the compiler doing a bad job at
> inlining. What is your compiler?

gcc version 5.3.1 20160121 (Debian 5.3.1-7) 

> Eigen version?

libeigen3-dev  3.3~beta1-2

> What are the exact compilation flags?
>

-msse4 -O3 -DNDEBUG   -std=gnu++11

...aside from the -I preprocessor flags.  "-msse4" seems to make no
difference either.

>
> Also, please paste the complete declaration of
> call_dense_assignment_loop.
>

Eigen::internal::call_dense_assignment_loop<Eigen::Matrix<double, 4, 4, 0, 4, 4>, Eigen::Product<Eigen::Product<Eigen::Matrix<double, 4, 4, 0, 4, 4>, Eigen::Matrix<double, 4, 4, 0, 4, 4>, 0>, Eigen::Matrix<double, 4, 4, 0, 4, 4>, 1>, Eigen::internal::assign_op<double> >

Thanks!


