OK, so it does work, then! If the behavior changed with the movem -> move change, that implies that:
- movem indeed somehow slowed down the AGU, so with movem it works as it should (i.e. no pipeline effect)
- when the instruction is quick enough, the pipeline effect takes place, and therefore it works exactly as I described in the previous email (note that the final values in r1 are always half the number of iterations)