you have to use tuned speeds that fit nicely with the 16MHz clock.
got succesfull transfers at 230400, 345600 and 500000
see older thread - Fast communication to PC with low Arduino footprint - #12 by robtillaart - Networking, Protocols, and Devices - Arduino Forum