While working on a profiler tool, I captured some performance differences between Hatari and a real Falcon.
The profile captures differences on just two instructions - a trivial ALU instruction (add.l Dn,Dn) and a word move (read) operation. The same tests are repeated with different permutations of I/D caches - and PMMU is used to inhibit caching for some tests via custom page descriptors.
The VIDEL was deliberately shut off to limit bus contention on the F30 while the tests were running, to get memory profile results as close to ideal as possible.
The results are quite varied. It is probably most interesting that the F30 results indicate single operations using fractions of a cycle for *anything* involving memory accesses - while pure cached operations round exactly to 16MHz cycles as expected. This is despite the VIDEL bus fetching being disabled. That one is going to be fun to figure out :)
Hatari/WinUAE:
F030:
The profiler tests are still very limited but I'll be adding interesting cases as time goes on, particularly around different addressing modes, overlapping fetches, head/tail cycle cases and so on.
D