Amblone running via TLC5940

If you can avoid having to multiply your 8-bit values and use 12-bit values directly, you'll probably get nicer looking results.

If you have to translate 8-bit values into 12-bit values, you should be able to bit-shift them (by 4) instead of multiplying them, which is faster if I'm not mistaken.