It is hard to calculate because it depends on too many variables. I think trying it is the only viable way.
The clock needs buffering for sure. The data pin does not: when daisy-chained, each output drives only one input. I guess the latch pin does not need buffering if you wait a bit longer, so the signal has time to settle. But buffering it too would be "cleaner".
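
To get a rough feel for why the shared lines are the critical ones, here is a first-order RC estimate in Python. It is only a sketch: the function name `rise_time_ns` is made up for illustration, and the per-input capacitance, wiring capacitance and driver resistance are assumed placeholder values, not datasheet figures. It also ignores reflections and ringing, so it does not replace actually trying it.

```python
# First-order estimate of the 10-90% rise time on a line shared by several inputs.
# All numeric defaults are assumed placeholders; take real values from the datasheets.

def rise_time_ns(n_inputs, c_in_pf=5.0, c_wiring_pf=20.0, r_driver_ohm=50.0):
    """10-90% rise time of a single-pole RC, t = 2.2 * R * C, in nanoseconds."""
    # Every input on the line adds its capacitance to the wiring capacitance.
    c_total_pf = n_inputs * c_in_pf + c_wiring_pf
    # ohm * pF gives 1e-12 s, which is 1e-3 ns.
    return 2.2 * r_driver_ohm * c_total_pf * 1e-3

if __name__ == "__main__":
    for n in (1, 8, 32):
        print(f"{n:2d} inputs on the line: ~{rise_time_ns(n):.1f} ns")
```

The data line corresponds to `n_inputs=1`, since each output drives a single input, while the clock and latch lines see one input per chip in the chain, so their edges get slower as the chain grows.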