Re: [eigen] heads-up: ARM prefetch fixes
Hi,

Hope that was OK --- I just pushed the following changeset to the default and 3.3 branches:

default branch: https://bitbucket.org/eigen/eigen/commits/0974c5e72c12891855a2e01dd886c21e881fd310
3.3 branch: https://bitbucket.org/eigen/eigen/commits/6ae8b07ef7d4787836bfaf7c599ecd2134f49f30

This does two things:
- actually generate prefetch instructions on ARM64. On a Pixel XL Android device, running on 1 big core (Kryo @ 2.15 GHz), 1024x1024 matrix multiplication speed (rowmajor * colmajor -> colmajor, which is what we tend to use in NN applications) is improved by ~10% by this change.
- on ARM32, the asm statement was needlessly clobbering "cc" (condition code). There is nothing in the ARM assembler reference that suggests that this instruction touches condition codes. http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0068b/Chdjffbi.html

Cheers,
Benoit
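For readers who do not have the changeset at hand, here is a minimal sketch of the kind of prefetch hint discussed in this message. It is not the code from the commits: the names prefetch_hint and sum_with_prefetch and the prefetch distance of 16 elements are made up for illustration, and the generic fallback via __builtin_prefetch is an assumption added so the sketch builds on non-ARM compilers. The point it illustrates is that the ARM32 asm statement carries no "cc" clobber, and that ARM64 gets an explicit PRFM instruction.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not Eigen's actual macro): hint that `addr` will be read soon.
inline void prefetch_hint(const void* addr) {
#if defined(__aarch64__) && defined(__GNUC__)
  // ARM64: PRFM PLDL1KEEP = prefetch for load, into L1, keep in cache.
  __asm__ __volatile__("prfm pldl1keep, [%[addr]]\n" : : [addr] "r"(addr));
#elif defined(__arm__) && defined(__GNUC__)
  // ARM32: PLD. Note the empty clobber list: no "cc", since PLD is not
  // documented as touching the condition flags.
  __asm__ __volatile__("pld [%[addr]]\n" : : [addr] "r"(addr));
#elif defined(__GNUC__)
  // Assumption for other targets: fall back to the compiler builtin.
  __builtin_prefetch(addr);
#else
  (void)addr;  // No prefetch available: do nothing.
#endif
}

// Illustrative use: sum an array while prefetching a fixed distance ahead.
inline float sum_with_prefetch(const float* data, std::size_t n) {
  assert(data != nullptr || n == 0);
  const std::size_t kAhead = 16;  // arbitrary prefetch distance, in elements
  float acc = 0.0f;
  for (std::size_t i = 0; i < n; ++i) {
    if (i + kAhead < n) prefetch_hint(data + i + kAhead);
    acc += data[i];
  }
  return acc;
}
```

A prefetch is only a hint, so the result of sum_with_prefetch is the same whether or not the instruction is emitted; any gain, such as the ~10% reported above for matrix multiplication, has to be measured on the target device.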