Re: [eigen] about changeset 6eb14e380

2010/8/19 Benoit Jacob <jacob.benoit.1@xxxxxxxxx>

2010/8/19 joel falcou <joel.falcou@xxxxxx>

On 19/08/10 13:03, Gael Guennebaud wrote:

To make things more complicated we could also think about how to
support packet of different sizes for the same scalar type. Indeed,
both the NEON (for ARM) and the future AVX engines support packet of
different sizes. 2 or 4 floats for NEON, and 4 or 8 floats for AVX.
For dynamic sized objects it is pretty clear that supporting largest
packets is enough. However, for small fixed sized objects it be very
welcome to be able to instantiate packet types according to the
context. For instance, if AVX is enabled, we still want to able to use
packets of 4 floats to vectorize Vector4 and Matrix4. This is very
important for what you know...

It is not.

Gael was just saying that it's important to offer vectorization for small fixed-size objects, such as Vector4f, which means that we must be able to use 16 byte packets instead of 32 byte packets in some circumstances.

Earlier today on IRC, we discussed replacing the AlignedBit flag by a new, separate enum value Alignment. Heap allocated arrays would use a generously large alignment, e.g. cache line boundary alignment. Static arrays would use the highest alignment that is useful for SIMD, doesn't increase their sizeof, and is <= some limit that the user could manually set if he really knows what he's doing (setting such a limit would be ABI incompatible). At computation-time, we'd then check the value of Alignment and emit SIMD code accordingly.

Note for the worried: I think that all of that can be done for Eigen 3.1, as it's essentially internal. The only issue is that if at a later time we enable AVX vectorization, then if we run AVX code on a matrix that was created by a shared library compiled against Eigen 3.0, it would crash (the library would have to be re-built against newer Eigen). Given that AVX is still a thing of the future, I think we can ignore that for now, and by the time we add AVX support in Eigen, we document that issue. We've got to put some realistic boundaries on our ABI compatibility guarantees, we can't boast compatibility with every future hardware platform without recompilation.

Benoit

Benoit

In nt2, the simd packet class is native<T,X> where T is the scalar
type and X the extension tag. Then functions knwo which packet they are allowed
to mix and the configuration include w/e tag needed.