If you compile with -mavx, GCC will not generate any legacy SSE code.
The SSE intrinsics are compiled to the corresponding VEX-encoded AVX
instructions, so this penalty should not be much of a bother in code
you compile yourself.
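
To illustrate, here is a minimal sketch (the function name add4 is just
made up for this example) that uses only SSE intrinsics. I would expect
GCC to emit the legacy addps for it without -mavx and the VEX-encoded
vaddps with -mavx, but please check the assembly on your own compiler
version rather than take my word for it:

```c
#include <xmmintrin.h>  /* SSE intrinsics only; no AVX header is used */

/* Hypothetical helper: adds four floats using the SSE intrinsic
 * _mm_add_ps. The source is identical with or without -mavx; only the
 * instruction encoding chosen by the compiler is expected to differ. */
void add4(const float *a, const float *b, float *out)
{
    __m128 va = _mm_loadu_ps(a);             /* unaligned load of 4 floats */
    __m128 vb = _mm_loadu_ps(b);
    _mm_storeu_ps(out, _mm_add_ps(va, vb));  /* element-wise add, then store */
}
```

Comparing the output of "gcc -O2 -S add4.c" with that of
"gcc -O2 -mavx -S add4.c" should show the difference.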

On Wed, Dec 7, 2011 at 11:23 AM, Rhys Ulerich <rhys.ulerich@xxxxxxxxx> wrote:
>> W.r.t porting to AVX: Be aware that there might be some pitfalls with
>> AVX-performance:
> Interesting tidbit from that link "If the programmer inadvertently
> mixes AVX and non-AVX vector instructions in the same code then there
> is a penalty of 70 clock cycles for each transition between the two
> forms."
