Re: [eigen] SSE square root |
[ Thread Index |
Date Index
| More lists.tuxfamily.org/eigen Archives
]
- To: eigen@xxxxxxxxxxxxxxxxxxx
- Subject: Re: [eigen] SSE square root
- From: Rohit Garg <rpg.314@xxxxxxxxx>
- Date: Fri, 27 Mar 2009 16:08:54 +0530
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=5+bCHuXD81hKrhe9b/Lx0KdJ8XNDMKhwZsDsnAelDks=; b=YOuByDGthHSLDO/9wLjjjU7uQIrgD/T7oAFit1OK5fc0CxAG5x+c3X+HvkKiQFiADk eQXTNik0dOF8lte3+EXzOcCuApQRSP77qYhA3ZvUbLo6K8txtSr70y5v8czLhp8tIzE4 TFuhtLZHutH4H6FS2ph8kR0lbMbu0y4GBMFu4=
- Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=JuJA/1xoG8HN2TwtJec4Zo/+L6GxauD5i2Mb7LKHDswUN5k6Xy/Qo/pZ+1787RgfNR ftl/EvNUOs3sI83IcOVQGZEisyXnpGybuPwpPtIYZk+FJR0upvgUIFjLOU9boRTlDg67 EHxpre0x7g1u7RDSAEE4oqI3zgRh2fJtz4fXY=
On Fri, Mar 27, 2009 at 3:58 PM, Gael Guennebaud
<gael.guennebaud@xxxxxxxxx> wrote:
> On Fri, Mar 27, 2009 at 8:32 AM, Rohit Garg <rpg.314@xxxxxxxxx> wrote:
>> This file has my sse float implementation for square root. The SSE
>> square root instruction has only 12 bits of precision so extra
>
> where did you find sqrtss or sqrtps has only 12 bits of precision ?
This info is from the CUDA classes. The lectures put up there say that
the precision for square root is only 12 bits. Now I need to confirm.
Your idea for a approximate reciprocal square root, a mul, and 1
iteration is a good one. Let me try that.
--
Rohit Garg
http://rpg-314.blogspot.com/
Senior Undergraduate
Department of Physics
Indian Institute of Technology
Bombay