Re: [eigen] benchmarks for large matrices? |
[ Thread Index |
Date Index
| More lists.tuxfamily.org/eigen Archives
]
- To: eigen@xxxxxxxxxxxxxxxxxxx
- Subject: Re: [eigen] benchmarks for large matrices?
- From: David Roundy <daveroundy@xxxxxxxxx>
- Date: Wed, 18 Feb 2009 08:19:32 -0800
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=A9h/PIqEQI0j7S4mjCK9OTOmpGdS9rfFgJVA5NhKMGQ=; b=Icq0FnG28RV9Fycm5GNdaKnad5QPpb8P/PLOKlaM2nZEqhqTAXrwkCs5ACogOq2Lge 6IhhU+XwLP+YpjBcmyRiK97qOolm2VxuFGX/n4IFdiFq5bugT9oX6t60WhR0GHHmTWE0 96Cku3oGfpMV0S97I3aNdofleN8wHryYQCX6A=
- Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=kfdm8Jr3qmkxE7CtGtvSB1QfxjkT8EQjex3cd71l7c6s5hHd8Sir6P3aQfR2FB+jjW mlEW1DQWb0hLNP342Qb6rHGVUJrIpOwWNpMJdo9sSNBJXVnMP35b3oUvNDhaZWOQnf50 m60j9e6neZOXRTOQVcJmtgELtkgpeFmJzdk6Q=
Thanks!
David
On Wed, Feb 18, 2009 at 7:56 AM, Gael Guennebaud
<gael.guennebaud@xxxxxxxxx> wrote:
> yep, actually I've just tried to compile the latest ATLAS myself even
> though it seems to be a bit faster than the older one I used for the
> benchmark, Eigen is still faster, especially for non multiple of 4
> matrix sizes.
>
> I attached a small benchmark that you can easily try:
>
> compilation:
>
> g++ -O2 -ffast-math -DNDEBUG gemm.cpp -latlas -lcblas -o gemm
>
> then:
>
>
> time ./gemm
>
> and I get:
>
> eigen: 0.79 s
> ATLAS: 1.28s
> MKL: 0.44
>
> In each case I used a single thread, my CPU is:
> Intel(R) Core(TM)2 Quad CPU Q9400 @ 2.66GHz
>
> so the peak performance is 21 GFLOPS, MKL reach ~18.2GFLOPS, eigen
> ~10.2, and Atlas ~6.25
>
>
>
> On Wed, Feb 18, 2009 at 4:37 PM, Benoit Jacob <jacob.benoit.1@xxxxxxxxx> wrote:
>> 2009/2/18 David Roundy <daveroundy@xxxxxxxxx>:
>>> If you're using an ATLAS tuned for a
>>> machine with a larger cache, it'd be no surprise that you'd get poor
>>> numbers...
>>
>> I wouldn't expect that, because Gael's CPU is a Core 2 duo T7200, and
>> those have 4 MB of cache.
>>
>> Benoit
>>
>>
>>
>
--
David Roundy