[eigen-commits] commit/eigen: 2 new changesets |
[ Thread Index |
Date Index
| More lists.tuxfamily.org/eigen-commits Archives
]
- To: eigen-commits@xxxxxxxxxxxxxxxxxxx
- Subject: [eigen-commits] commit/eigen: 2 new changesets
- From: Bitbucket <commits-noreply@xxxxxxxxxxxxx>
- Date: Mon, 30 Jul 2018 11:07:33 +0000 (UTC)
- Dkim-signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=bitbucket.org; h=mime-version:content-type:content-transfer-encoding:subject:from:to; s=s1; bh=MTNoeCUHQdDNH8NZfN+VNXlYBjE=; b=jD3fO4Fqc5+PLMGGxQE0hxe d/2MD6bcTMrL/UcgpRaUEMkYRIlUSn/p1sdQX9j7duzl9gaaV2i87PW1Kmk1yNlK PysaZQsAmt/2C36myVS68vXfOrs8Ip7MhvjIZ1NwPYoGN6LPjq2eJ06xiqZjybAb gEFEEtsHvs8gTTOkKGGo=
2 new commits in eigen:
https://bitbucket.org/eigen/eigen/commits/c4cda242a4b5/
Changeset: c4cda242a4b5
User: markdryan
Date: 2018-06-25 12:05:02+00:00
Summary: Fix AVX512 implementations of psqrt
This commit fixes the AVX512 implementations of psqrt in the same
way that ea3d21d fixed the AVX2 version of this function. The
AVX512 versions of psqrt incorrectly return -0.0 for negative
values, instead of NaN. Fixing the issues requires adding
some additional instructions that slow down the algorithms. A
similar test to the one used in ea3d21d shows that the
corrected Packet16f code runs at 73% of the speed of the existing code,
while the corrected Packed8d function runs at 68% of the original.
Affected #: 1 file
https://bitbucket.org/eigen/eigen/commits/bb8a5963135a/
Changeset: bb8a5963135a
User: markdryan
Date: 2018-07-30 09:19:51+00:00
Summary: Re-enable FMA for fast sqrt functions
This commit re-enables the use of FMA for the FAST sqrt functions.
Doing so improves the performance of both algorithms. The float32
version is now 88% the speed of the original function, while the
double version is 90%.
Affected #: 0 files
Repository URL: https://bitbucket.org/eigen/eigen/
--
This is a commit notification from bitbucket.org. You are receiving
this because you have the service enabled, addressing the recipient of
this email.