[eigen-commits] commit/eigen: 5 new changesets

5 new commits in eigen:

Changeset:   ea3d21db5a50
User:        rmlarsen
Date:        2016-10-04 21:22:56+00:00
Summary:     Fix a bug in the implementation of Carmack's fast sqrt algorithm in Eigen (enabled by EIGEN_FAST_MATH), which causes the vectorized parts of the computation to return -0.0 instead of NaN for negative arguments.

Benchmark speed in Giga-sqrts/s
Intel(R) Xeon(R) CPU E5-1650 v3 @ 3.50GHz
                    SSE        AVX
Fast=1              2.529G     4.380G
Fast=0              1.944G     1.898G
Fast=1 fixed        2.214G     3.739G

This table illustrates the worst case in terms of speed impact: it was measured by repeatedly computing the sqrt of an n=4096 float vector that fits in L1 cache. For large vectors the operation becomes memory bound and the differences between the versions become almost negligible.
Affected #:  3 files
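
For readers who want to reproduce a measurement of this kind, the following is a minimal sketch of the setup described above (repeated sqrt over a small float vector that stays in L1 cache). It is not the benchmark used for the table: the function name giga_sqrts_per_second is hypothetical, and it calls plain std::sqrt rather than Eigen's packet code, so absolute numbers will differ.

```cpp
#include <chrono>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: times `reps` passes of sqrt over an n-element float
// vector and returns the throughput in giga-sqrts per second.
double giga_sqrts_per_second(std::size_t n, std::size_t reps) {
  std::vector<float> in(n), out(n);
  for (std::size_t i = 0; i < n; ++i) in[i] = static_cast<float>(i + 1);

  volatile float sink = 0.0f;  // written once per pass so the work is not discarded
  const auto start = std::chrono::steady_clock::now();
  for (std::size_t r = 0; r < reps; ++r) {
    float sum = 0.0f;
    for (std::size_t i = 0; i < n; ++i) {
      out[i] = std::sqrt(in[i]);
      sum += out[i];
    }
    sink = sum;
  }
  const auto stop = std::chrono::steady_clock::now();

  const double seconds = std::chrono::duration<double>(stop - start).count();
  return static_cast<double>(n) * static_cast<double>(reps) / seconds / 1e9;
}
```

Calling giga_sqrts_per_second(4096, 100000) corresponds to the n=4096 case; a much larger n would show the memory-bound regime mentioned above.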

Changeset:   2b11d472d9f7
User:        rmlarsen
Date:        2016-10-04 22:08:41+00:00
Summary:     Update comment for fast sqrt.
Affected #:  1 file

Changeset:   24587e3541cf
User:        rmlarsen
Date:        2016-10-04 22:09:11+00:00
Summary:     Update comment for fast sqrt.
Affected #:  1 file

Changeset:   c8a3ef8107c9
User:        rmlarsen
Date:        2016-10-12 15:34:39+00:00
Summary:     Fix copy-paste error: Must use _mm256_cmp_ps for AVX.
Affected #:  1 file

Changeset:   fd9765d4abad
User:        ggael
Date:        2016-10-12 16:30:51+00:00
Summary:     Merged in rmlarsen/eigen (pull request #230)

Fix a bug in psqrt for SSE and AVX when EIGEN_FAST_MATH=1
Affected #:  3 files
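
The -0.0 versus NaN behaviour addressed by this pull request can be illustrated with a scalar sketch. It assumes the fast path computes x * rsqrt(x) with one Newton-Raphson refinement and zeroes the reciprocal-sqrt estimate for inputs below the smallest normal float. The names rsqrt_estimate, fast_sqrt_buggy and fast_sqrt_fixed are hypothetical, and the code is an illustration of the failure mode, not the SSE/AVX code in the changesets.

```cpp
#include <cmath>
#include <limits>

// Hypothetical scalar stand-in for a hardware reciprocal-sqrt estimate.
// For negative x, std::sqrt returns NaN, so the estimate is NaN as well.
inline float rsqrt_estimate(float x) { return 1.0f / std::sqrt(x); }

// Assumed buggy variant: the estimate is kept only where x >= FLT_MIN,
// so negative inputs are zeroed together with zero and denormals.
inline float fast_sqrt_buggy(float x) {
  const float min_normal = std::numeric_limits<float>::min();
  float r = (x >= min_normal) ? rsqrt_estimate(x) : 0.0f;
  r = r * (1.5f - 0.5f * x * r * r);  // one Newton-Raphson step
  return x * r;                       // negative x times 0 gives -0.0
}

// Assumed fixed variant: the estimate is zeroed only for 0 <= x < FLT_MIN,
// so the NaN produced for negative inputs propagates to the result.
inline float fast_sqrt_fixed(float x) {
  const float min_normal = std::numeric_limits<float>::min();
  const bool flush_to_zero = (x >= 0.0f) && (x < min_normal);
  float r = flush_to_zero ? 0.0f : rsqrt_estimate(x);
  r = r * (1.5f - 0.5f * x * r * r);  // one Newton-Raphson step
  return x * r;
}
```

With these definitions, fast_sqrt_buggy(-4.0f) yields a negative zero while fast_sqrt_fixed(-4.0f) yields NaN; both return zero for an input of zero.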

Repository URL: https://bitbucket.org/eigen/eigen/


This is a commit notification from bitbucket.org. You are receiving
this because you have the service enabled, addressing the recipient of
this email.