[eigen] Use of streaming loads in eigen
- To: eigen@xxxxxxxxxxxxxxxxxxx
- Subject: [eigen] Use of streaming loads in eigen
- From: Rohit Garg <rpg.314@xxxxxxxxx>
- Date: Fri, 27 Mar 2009 19:27:19 +0530
In the PacketMath.h file, the only intrinsics Eigen uses to write the
SSE registers back to memory are the _mm_store_ ones. Has anybody
looked at using the _mm_stream_ intrinsics instead? They should help
at least in BLAS 1 and BLAS 2 code, which is fundamentally
bandwidth bound. For a rough benchmark, look here:
http://humus.name/index.php?page=Comments&ID=244
Even for BLAS 3 it should only help. However, for small sizes it will
definitely not be helpful: data that small doesn't pollute the cache,
so one will want to keep it in the cache as long as possible.
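
To make the suggestion concrete, here is a minimal sketch of the two store
variants on a BLAS 1 style operation (y = a * x). The helper names
scale_store and scale_stream are hypothetical and are not part of Eigen or
PacketMath.h; the sketch assumes 16-byte aligned pointers and a length that
is a multiple of 4 floats. The _mm_stream_ps intrinsic issues a non-temporal
store that is meant to bypass the cache, and _mm_sfence makes those stores
globally visible before later stores.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <xmmintrin.h>  // SSE: _mm_load_ps, _mm_store_ps, _mm_stream_ps, _mm_sfence

// Hypothetical helper (not Eigen API): true if p is 16-byte aligned.
inline bool is_aligned16(const void* p) {
    return reinterpret_cast<std::uintptr_t>(p) % 16 == 0;
}

// y = a * x using regular aligned stores; the written lines go through the cache.
inline void scale_store(float* y, const float* x, float a, std::size_t n) {
    assert(is_aligned16(x) && is_aligned16(y) && n % 4 == 0);
    const __m128 va = _mm_set1_ps(a);
    for (std::size_t i = 0; i < n; i += 4)
        _mm_store_ps(y + i, _mm_mul_ps(va, _mm_load_ps(x + i)));
}

// y = a * x using non-temporal (streaming) stores, which hint that the
// destination will not be read again soon and need not occupy cache.
inline void scale_stream(float* y, const float* x, float a, std::size_t n) {
    assert(is_aligned16(x) && is_aligned16(y) && n % 4 == 0);
    const __m128 va = _mm_set1_ps(a);
    for (std::size_t i = 0; i < n; i += 4)
        _mm_stream_ps(y + i, _mm_mul_ps(va, _mm_load_ps(x + i)));
    _mm_sfence();  // order the streaming stores before any later stores
}
```

Both functions produce the same result; any difference would show up only in
timing, and only for arrays much larger than the cache. Which variant wins for
a given size would have to be measured.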
--
Rohit Garg
http://rpg-314.blogspot.com/
Senior Undergraduate
Department of Physics
Indian Institute of Technology
Bombay