Small 8x8 matrix driver?

a 168/328 cannot receive that fast tho due to the slave having to sample (the CS pin?)

True, when implemented poorly.