[eigen-commits] commit/eigen: ggael: Complete rewrite of column-major-matrix * vector product to deliver higher performance of modern CPU. |
[ Thread Index |
Date Index
| More lists.tuxfamily.org/eigen-commits Archives
]
- To: eigen-commits@xxxxxxxxxxxxxxxxxxx
- Subject: [eigen-commits] commit/eigen: ggael: Complete rewrite of column-major-matrix * vector product to deliver higher performance of modern CPU.
- From: Bitbucket <commits-noreply@xxxxxxxxxxxxx>
- Date: Sat, 03 Dec 2016 20:14:32 -0000
- Dkim-signature: v=1; a=rsa-sha1; c=relaxed; d=bitbucket.org; h=mime-version:content-type:content-transfer-encoding:subject:from:to; s=s1; bh=F3CGKFRCwB3yS2MB81DmC1zaufM=; b=cGQlIUDNBKHC0GeQCxEKmIK Xouu0uy+9o/+8Ch1sM8cf9HmL57pF9VgbZGjyPZzvCv2Herxsek5oqvnh37P75rG 2LR2cHbNNNOpUKbwhWhhWrT79iifApSLx647FcvHYwP8o2zUSL7xt6Qwb+snoOaC YJp2KJZVSS5mVJHmngzw=
1 new commit in eigen:
https://bitbucket.org/eigen/eigen/commits/2213991340ea/
Changeset: 2213991340ea
User: ggael
Date: 2016-12-03 20:14:14+00:00
Summary: Complete rewrite of column-major-matrix * vector product to deliver higher performance of modern CPU.
The previous code has been optimized for Intel core2 for which unaligned loads/stores were prohibitively expensive.
This new version exhibits much higher instruction independence (better pipelining) and explicitly leverage FMA.
According to my benchmark, on Haswell this new kernel is always faster than the previous one, and sometimes even twice as fast.
Even higher performance could be achieved with a better blocking size heuristic and, perhaps, with explicit prefetching.
We should also check triangular product/solve to optimally exploit this new kernel (working on vertical panel of 4 columns is probably not optimal anymore).
Affected #: 2 files
Repository URL: https://bitbucket.org/eigen/eigen/
--
This is a commit notification from bitbucket.org. You are receiving
this because you have the service enabled, addressing the recipient of
this email.