[eigen-commits] commit/eigen: 5 new changesets
5 new commits in eigen:
https://bitbucket.org/eigen/eigen/commits/bbc5e3174838/
Changeset: bbc5e3174838
User: Benoit Jacob
Date: 2015-01-30 22:27:56+00:00
Summary: Bug 935: Add asm comments in GEBP kernels to work around a bug
in both GCC and Clang on ARM/NEON, whereby they spill registers,
severely harming performance. The reason why the asm comments
make a difference is that they prevent the compiler from
reordering code across these boundaries, which has the effect
of extending the lifetime of local variables and increasing
register pressure on this register-tight code.
Affected #: 2 files
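
To illustrate the technique described in the changeset above, here is a
minimal sketch of an asm comment used as a reordering boundary. The macro
name SKETCH_ASM_COMMENT and the function dot_with_markers are hypothetical,
made up for this illustration; this is not the actual GEBP kernel code, and
it assumes a GCC- or Clang-compatible compiler that accepts inline asm.

```cpp
#include <cstddef>

// Hypothetical macro (an assumption for this sketch, not the name used in the
// patch). The asm statement emits only an assembler comment, so it generates
// no instructions, but the compiler treats it as a fixed point in the
// instruction stream and does not move code across it freely.
#if defined(__GNUC__) || defined(__clang__)
#define SKETCH_ASM_COMMENT(X) __asm__ volatile("#" X)
#else
#define SKETCH_ASM_COMMENT(X)
#endif

// Toy multiply-accumulate loop with a boundary around each step.
// A real register-tight kernel would hold several accumulators in registers;
// the boundaries keep the compiler from interleaving steps in a way that
// lengthens variable lifetimes.
inline float dot_with_markers(const float* a, const float* b, std::size_t n) {
  float acc = 0.0f;
  for (std::size_t i = 0; i < n; ++i) {
    SKETCH_ASM_COMMENT("begin step");
    acc += a[i] * b[i];
    SKETCH_ASM_COMMENT("end step");
  }
  return acc;
}
```

The markers do not change the computed result; any effect is limited to how
the compiler schedules the surrounding code.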
https://bitbucket.org/eigen/eigen/commits/c3ac45922854/
Changeset: c3ac45922854
User: Benoit Jacob
Date: 2015-01-30 22:43:56+00:00
Summary: Bug 936, patch 1/3: some cleanup and renaming for consistency.
Affected #: 3 files
https://bitbucket.org/eigen/eigen/commits/9f01eb284589/
Changeset: 9f01eb284589
User: Benoit Jacob
Date: 2015-01-31 19:15:57+00:00
Summary: Bug 936, patch 1.5/3: rename _FUSED_ macros to _SINGLE_INSTRUCTION_,
because this is what they are about. "Fused" means "no intermediate rounding
between the mul and the add, only one rounding at the end". Instead,
what we are concerned about here is whether a temporary register is needed,
i.e. whether the MUL and ADD are separate instructions.
Concretely, on ARM NEON, a single-instruction mul-add is always available: VMLA.
But a true fused mul-add is only available on VFPv4: VFMA.
Affected #: 5 files
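
The distinction drawn in the changeset above, between a fused mul-add and a
mul followed by an add, can be shown in portable C++ without any NEON code.
The sketch below uses std::fma from <cmath> for the fused case; the function
names unfused_madd and fused_madd are hypothetical, chosen for this
illustration only.

```cpp
#include <cmath>

// Two roundings: the product is rounded to float, then the sum is rounded.
// The volatile temporary is there only to stop the compiler from contracting
// the expression into a fused operation on targets that have one.
inline float unfused_madd(float a, float b, float c) {
  volatile float prod = a * b;  // first rounding happens here
  return prod + c;              // second rounding happens here
}

// One rounding: std::fma computes a * b + c as if to infinite precision and
// rounds once at the end.
inline float fused_madd(float a, float b, float c) {
  return std::fma(a, b, c);
}
```

With x = 2^-13, a = 1 + x, b = 1 - x and c = -1, the exact product is
1 - 2^-26, which rounds to 1 in single precision. unfused_madd therefore
returns 0, while fused_madd returns -2^-26. An instruction that does the
mul and add in one step but still rounds the product behaves like the first
function as far as the result is concerned.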
https://bitbucket.org/eigen/eigen/commits/0130a244c123/
Changeset: 0130a244c123
User: Benoit Jacob
Date: 2015-01-30 22:44:26+00:00
Summary: Bug 936, patch 2/3: Remove EIGEN_VECTORIZE_FMA, was redundant with EIGEN_HAS_SINGLE_INSTRUCTION_MADD
Affected #: 5 files
https://bitbucket.org/eigen/eigen/commits/06f163b5221f/
Changeset: 06f163b5221f
User: Benoit Jacob
Date: 2015-01-30 22:45:03+00:00
Summary: Bug 936, patch 3/3: Properly detect FMA support on ARM (requires VFPv4)
and use it instead of MLA when available, because it's both more accurate,
and faster.
Affected #: 1 file
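
As a rough sketch of what compile-time selection between FMA and MLA can look
like, the code below checks the ACLE feature macro __ARM_FEATURE_FMA and picks
the vfmaq_f32 or vmlaq_f32 intrinsic accordingly. This is an assumption about
one way to do the detection, not a copy of the patch; the helper name madd4
is hypothetical, and a scalar fallback is included so the sketch also builds
on non-ARM targets.

```cpp
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

// Hypothetical helper: out[i] = a[i] * b[i] + c[i] for four floats.
inline void madd4(const float* a, const float* b, const float* c, float* out) {
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
  const float32x4_t va = vld1q_f32(a);
  const float32x4_t vb = vld1q_f32(b);
  const float32x4_t vc = vld1q_f32(c);
#if defined(__ARM_FEATURE_FMA)
  // True fused multiply-add (VFMA): a single rounding at the end.
  const float32x4_t r = vfmaq_f32(vc, va, vb);
#else
  // Single-instruction multiply-accumulate (VMLA): the product is rounded
  // before the add.
  const float32x4_t r = vmlaq_f32(vc, va, vb);
#endif
  vst1q_f32(out, r);
#else
  // Scalar fallback for targets without NEON.
  for (int i = 0; i < 4; ++i) {
    out[i] = a[i] * b[i] + c[i];
  }
#endif
}
```

Both intrinsics take the accumulator as their first argument, so the call
sites differ only in the intrinsic name.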
Repository URL: https://bitbucket.org/eigen/eigen/
--
This is a commit notification from bitbucket.org. You are receiving
this because you have the service enabled, addressing the recipient of
this email.