Hi,
I can check the version of VASM I was using - probably 1.6c or 1.7. It should assemble without hangs though :)
The test really just hashes the binary results as the final calculations are written back to RAM, so that it is sensitive to small errors/differences. Lots of calculations are repeated across different kinds of value ranges, including special values such as NaN, denormals etc. It won't show you the magnitude of the error - but it will give clues about which instructions may misbehave.
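As a rough illustration of the idea (not the actual test code, just a minimal Python sketch): the input list, the function names `hash_results` and `run_case`, and the use of CRC-32 as the hash are all assumptions made for the example. It hashes the raw bit patterns of the results, so a difference in even the last bit of one result changes the hash.

```python
import struct
import zlib

# Hypothetical input set: ordinary values plus special cases
# (signed zero, the smallest denormal, infinity, NaN).
INPUTS = [1.0, -2.5, 0.0, -0.0, 5e-324, float("inf"), float("nan")]


def hash_results(results):
    """Return a CRC-32 over the raw 64-bit patterns of the results."""
    crc = 0
    for value in results:
        # Pack each double as 8 big-endian bytes and fold it into the CRC.
        crc = zlib.crc32(struct.pack(">d", value), crc)
    return crc


def run_case(op):
    """Apply one operation to every input pair and hash what comes back."""
    return hash_results(op(a, b) for a in INPUTS for b in INPUTS)


# Hash recorded from a trusted run; later runs are compared against it.
reference = run_case(lambda a, b: a + b)
```

Comparing a fresh `run_case(...)` hash against `reference` flags a mismatch for that operation, but as noted above it says nothing about how large the error is.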
I can help explain a specific case if that would be useful.
Doug.