Sortest pulse

Probably, you have to replace digitalWrite with direct port manipulation. DigitalWrite / Read functions include some overhead to make safety check, and it takes some time of course, I'd expect ~5 / 7 usec that is pretty much in agreement what you have.