Re: [eigen] Issues regarding Quaternion-alignment and const Maps
- To: eigen@xxxxxxxxxxxxxxxxxxx
- Subject: Re: [eigen] Issues regarding Quaternion-alignment and const Maps
- From: Benoit Jacob <jacob.benoit.1@xxxxxxxxx>
- Date: Fri, 9 Jul 2010 18:03:53 -0400
2010/7/9 Christoph Hertzberg <chtz@xxxxxxxxxxxxxxxxxxxxxxxx>:
> Benoit Jacob wrote:
>>
>> Wow, very good work.
>>
>> I can indeed confirm the 2x speed improvement, and once I moved your
>> benchmarking code to a non-inlinable function called from main(), it
>> got even a bit higher (GCC indeed fails to optimize code in the
>> main() function correctly).
>>
>> Could you make a patch against the development branch? (We're not
>> going to add features to 2.0 at this point).
>
> I think I can do that, but most likely not before Monday/Tuesday.
>
>>
>> http://eigen.tuxfamily.org/index.php?title=Developer%27s_Corner#Generating_a_patch
>>
>> Also, I didn't know about that loaddup instruction in SSE3. It's
>> great! I'll have a look at using it in ei_pset1 when SSE3 is
>> available.
>
> It's actually a pity that there is no complete list with *just* all
> SSE-instructions (not mixed with every other x86-instruction), including a
> short description, maybe a usage example, and intrinsics for some common
> compilers. At least I didn't find any ...
Yes, I've been trying to see if there is a single-precision equivalent
for MOVDDUP and I still don't know...
Benoit
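
As a rough illustration of the idea discussed above (using the SSE3 load-and-duplicate instruction only when SSE3 is available), here is a minimal sketch. The helper name broadcast_double is hypothetical and this is not Eigen's ei_pset1; it only assumes the standard intrinsics _mm_loaddup_pd (SSE3, maps to MOVDDUP) and _mm_set1_pd (SSE2), and the compiler-defined macros __SSE3__ and __SSE2__ as set by GCC and Clang.

```cpp
#if defined(__SSE3__)
  #include <pmmintrin.h>   // SSE3 intrinsics, including _mm_loaddup_pd
#elif defined(__SSE2__)
  #include <emmintrin.h>   // SSE2 intrinsics, including _mm_set1_pd
#endif

// Hypothetical helper, for illustration only (not Eigen code):
// copies the double at *src into both elements of dst[0..1].
inline void broadcast_double(const double* src, double* dst) {
#if defined(__SSE3__)
    // One instruction: load a double from memory and duplicate it
    // into both lanes of the 128-bit register.
    __m128d v = _mm_loaddup_pd(src);
    _mm_storeu_pd(dst, v);            // unaligned store, dst need not be 16-byte aligned
#elif defined(__SSE2__)
    // SSE2 fallback: the compiler picks its own load/shuffle sequence.
    __m128d v = _mm_set1_pd(*src);
    _mm_storeu_pd(dst, v);
#else
    // Scalar fallback when no SSE is enabled at compile time.
    dst[0] = *src;
    dst[1] = *src;
#endif
}
```

Calling broadcast_double(&x, out) with a double x and a two-element array out fills both elements with x on every path; only the instructions used differ. On the single-precision side, SSE3's MOVSLDUP and MOVSHDUP duplicate the even or odd elements within a register, so they are not a direct counterpart of MOVDDUP's load-and-broadcast.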
>
>
> --
> ----------------------------------------------
> Dipl.-Inf. Christoph Hertzberg
> Cartesium 0.051
> Universität Bremen
> Enrique-Schmidt-Straße 5
> 28359 Bremen
>
> Tel: (+49) 421-218-64252
> ----------------------------------------------
>
>
>