Another optimization question - can I speed up this 32 bit multiply?

Plus it's another reason not to use defines

What do you mean by that anyway? I shouldn't be using defines? How else can I use inline assembly?