Re: [AD] Color convertors
At 19:30 06/09/2001 +0200, you wrote:
Here are some results for the MMX code:
[Warning: tons of numbers !!!]
[tons of numbers snipped]
step0: original Isaac's code
step0->step1: added _align_ at jump points
step1->step2: register permutation
step2->step3: replaced incl %ebp        by  decl %ecx
                       cmpl %ebp, %ecx      jnz
                       jb ...
step3->step4: removed the three 'nop'
step4->step5: replaced '%ebp' by '%mm7'
It seems there are improvements only in step 1 and step 5, so why don't we 
apply only these modifications? Also, we should test it on other CPUs (yours 
is a K6-3, isn't it?); after all, I wrote the code with the AMD optimization 
guide in my hands and a K6-2 (now I own a Duron).
Also, I think we should mostly target "slow" CPUs (Pentium MMX, 
K6...) rather than P4s or Athlons, because those won't have any problem 
maintaining a playable fps.
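
For anyone following along, the step2->step3 change quoted above swaps a count-up loop (increment, compare, branch) for a count-down loop (decrement, branch on non-zero). Here is a rough C sketch of the two loop shapes. The function names and the byte-summing body are made up for illustration and are not the actual converter code, and a modern compiler may well emit the same instructions for both.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the per-pixel work of a color converter:
 * it only sums the source bytes so the two loop shapes can be compared. */

/* Count-up shape: increment a counter, compare it with the limit, branch
 * while below (roughly the incl / cmpl / jb sequence). */
static uint32_t sum_count_up(const uint8_t *src, size_t n)
{
    uint32_t acc = 0;
    size_t i = 0;

    if (n == 0)
        return 0;
    do {
        acc += src[i];
        i++;                /* increment */
    } while (i < n);        /* compare + branch if below */
    return acc;
}

/* Count-down shape: decrement the remaining count and branch while it is
 * non-zero (roughly decl / jnz); the decrement itself sets the zero flag,
 * so no separate compare is needed. */
static uint32_t sum_count_down(const uint8_t *src, size_t n)
{
    uint32_t acc = 0;

    if (n == 0)
        return 0;           /* a do/while with n == 0 would wrap around */
    do {
        acc += *src++;
        n--;                /* decrement */
    } while (n != 0);       /* branch if not zero */
    return acc;
}
```

Both functions visit the same n elements. The count-down form just saves one instruction per iteration and frees the register that held the up-counter, which fits with step 3 showing no measurable gain by itself.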