Re: [eigen] Slow matrix-matrix multiply
- To: eigen <eigen@xxxxxxxxxxxxxxxxxxx>
- Subject: Re: [eigen] Slow matrix-matrix multiply
- From: Gael Guennebaud <gael.guennebaud@xxxxxxxxx>
- Date: Tue, 2 Apr 2013 13:00:34 +0200
On Tue, Apr 2, 2013 at 11:26 AM, Gael Guennebaud
<gael.guennebaud@xxxxxxxxx> wrote:
> I can also reproduce the performance drop with linux/gcc-4.7. However,
> the generated assembly is extremely similar in both cases (see the
> attached files), with even an advantage to Eigen, which needs only 18
> additions compared to 27 for custom_gemm. Frankly, I cannot explain
> the perf difference.
Oops, actually my system was a bit too loaded and the results were too
random. Here are stable results with gcc 4.7 on an Intel(R) Xeon(R) CPU
X5570 @ 2.93GHz:
-- Eigen --
Time (in seconds):
  Preprocessor           0.050
  Residual Evaluations   0.077
  Jacobian Evaluations   0.695
  Linear Solver          0.945
  Minimizer              1.839
  Postprocessor          0.001
  Total                  1.907
-- Custom GEMM --
Time (in seconds):
  Preprocessor           0.067
  Residual Evaluations   0.085
  Jacobian Evaluations   0.712
  Linear Solver          0.952
  Minimizer              1.901
  Postprocessor          0.001
  Total                  1.990
gael
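
For readers who want to repeat this kind of comparison on a machine that may be loaded, one common way to get less random numbers is to run the workload several times and keep the fastest run. The following is only a minimal sketch of that idea, not the benchmark used in this thread: `best_of_seconds` and `relative_saving` are hypothetical helper names, and the workload passed in is whatever code you want to time.

```cpp
#include <algorithm>
#include <chrono>
#include <limits>

// Hypothetical helper (not part of Eigen or of the benchmark in this thread):
// runs `work` `repeats` times and returns the fastest wall-clock time in
// seconds. Keeping the minimum filters out runs disturbed by other processes.
template <typename Work>
double best_of_seconds(Work&& work, int repeats) {
    double best = std::numeric_limits<double>::infinity();
    for (int i = 0; i < repeats; ++i) {
        const auto start = std::chrono::steady_clock::now();
        work();
        const auto stop = std::chrono::steady_clock::now();
        const double elapsed =
            std::chrono::duration<double>(stop - start).count();
        best = std::min(best, elapsed);
    }
    return best;
}

// Hypothetical helper: time saved by the faster variant, expressed as a
// fraction of the slower variant's time.
inline double relative_saving(double faster, double slower) {
    return (slower - faster) / slower;
}
```

Applied to the two totals reported above, `relative_saving(1.907, 1.990)` is about 0.042, so in this run the Eigen build is roughly 4% faster overall than the custom GEMM build.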