Assembly or GCC?

you can even write assembly in Arduino/GCC:

for example a very tiny delay that will NOT be optimized away:

void verysmalldelay(unsigned char loop)
{
for(unsigned char i=0;i<loop;i++)
{
asm("nop");
}
}