I have had a chance to look at the data sheets now.
The 4021 has a parallel load input that is level sensitive (asynchronous), so you put it high to load the parallel data in, with no clock pulse needed, and low to clock the data out.
The 4014 loads synchronously: the parallel data only goes in on a rising edge of the clock while the parallel load line is high. The line cannot stay high while you clock the data out, because every clock pulse would then reload the parallel inputs instead of shifting. It has to be low for the register to shift.
I assume the code in the tutorial just takes the line high and then low again with no clock pulse in between, which is all the 4021 needs. You need to change the code so that, while the line is high, the clock gets one low-to-high pulse to load the data, and then the line goes low before you clock the data out. Then it should work.
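
To make the sequence concrete, here is a rough Python model of the 4014 as I read the data sheet truth table: with the parallel load line high, a rising clock edge copies the parallel inputs in, and with it low, a rising clock edge shifts. It is a simulation to check the logic, not Arduino code, and the names (`ShiftRegister4014`, `read_4014`, `read_4021_style`) are made up for illustration.

```python
class ShiftRegister4014:
    """Behavioural sketch of a 4014 (synchronous parallel load)."""

    def __init__(self):
        # Real chips power up in an unknown state; zeros are assumed here.
        self.stages = [0] * 8       # stages[7] drives the Q8 serial output
        self.parallel_in = [0] * 8  # PI-1 .. PI-8
        self.load = 0               # parallel load (parallel/serial control) line
        self.serial_in = 0

    def clock_rising_edge(self):
        if self.load:
            # Load line high: the clock edge copies the parallel inputs in.
            self.stages = list(self.parallel_in)
        else:
            # Load line low: the clock edge shifts one place towards Q8.
            self.stages = [self.serial_in] + self.stages[:-1]

    @property
    def q8(self):
        return self.stages[7]


def _clock_out(chip):
    """Read Q8, then clock, eight times (load line must already be low)."""
    bits = []
    for _ in range(8):
        bits.append(chip.q8)
        chip.clock_rising_edge()
    return bits


def read_4021_style(chip):
    """Sequence that suits a 4021: pulse the load line, then clock out."""
    chip.load = 1  # no clock edge while high, so a 4014 loads nothing
    chip.load = 0
    return _clock_out(chip)


def read_4014(chip):
    """Sequence for a 4014: one clock pulse while the load line is high."""
    chip.load = 1
    chip.clock_rising_edge()  # this edge is what actually loads the data
    chip.load = 0
    return _clock_out(chip)


if __name__ == "__main__":
    chip = ShiftRegister4014()
    chip.parallel_in = [1, 0, 1, 1, 0, 0, 1, 0]
    print(read_4021_style(chip))  # prints [0, 0, 0, 0, 0, 0, 0, 0]
    print(read_4014(chip))        # prints [0, 1, 0, 0, 1, 1, 0, 1]
```

The bits come out Q8 first, so the result is the parallel inputs in reverse order. On the Arduino the equivalent change would be one extra clock pulse (`digitalWrite` the clock pin low and then high) while the latch pin is high, before taking the latch pin low and reading the eight bits as before.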