SK6812 LED strips with a total of 1118 LEDs using STM32F103C8

You cannot bit-bang it any faster; you have to follow the precise timing that the WS2812 protocol requires. A faster CPU just means faster busy-waiting.
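To put a number on that limit, here is a rough sketch of the arithmetic. It assumes the usual 800 kbit/s data rate (1.25 µs per bit) and a reset/latch gap of 80 µs; the exact reset time differs between datasheets, so treat that value as an assumption. The function name `frame_time_us` is made up for this example.

```c
#include <stdint.h>

/* Assumed protocol constants; check them against your LED's datasheet. */
#define BIT_TIME_NS   1250u  /* 800 kbit/s -> 1.25 us per bit */
#define RESET_TIME_US 80u    /* assumption: latch gap after the last bit */

/* Minimum time in microseconds to push one full frame down a single
 * data line. bits_per_led is 24 for RGB parts and 32 for RGBW parts. */
uint32_t frame_time_us(uint32_t num_leds, uint32_t bits_per_led)
{
    uint64_t data_ns = (uint64_t)num_leds * bits_per_led * BIT_TIME_NS;
    return (uint32_t)(data_ns / 1000u) + RESET_TIME_US;
}
```

Under these assumptions `frame_time_us(1118, 24)` gives 33620 µs, so a single chain of 1118 RGB LEDs cannot be refreshed more than about 29 times per second, whatever the CPU clock. With 32-bit RGBW LEDs it is 44800 µs per frame.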

You can split the matrix into separate segments and connect these to different pins. Then you don't have to update everything when you change only one LED, which saves time.
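One way to organise this is a dirty flag per segment, so only segments that actually changed get re-sent. The sketch below assumes four segments and 3-byte GRB data; `send_segment` is a hypothetical placeholder for whatever timing-critical output routine you use (bit-banging, timer plus DMA, SPI), and all other names are made up for illustration.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define NUM_LEDS      1118u
#define NUM_SEGMENTS  4u   /* assumption: four data pins */
#define BYTES_PER_LED 3u   /* assumption: 3-byte GRB parts; RGBW needs 4 */

typedef struct {
    uint16_t first;  /* index of the segment's first LED */
    uint16_t count;  /* number of LEDs in the segment */
    uint8_t  pin;    /* hypothetical data pin identifier */
    bool     dirty;  /* true if any LED changed since the last send */
} segment_t;

static uint8_t   framebuffer[NUM_LEDS * BYTES_PER_LED];
static segment_t segments[NUM_SEGMENTS];

/* Placeholder: the real version would clock the bytes out on `pin`. */
void send_segment(uint8_t pin, const uint8_t *data, size_t len)
{
    (void)pin; (void)data; (void)len;
}

/* Split the LEDs as evenly as possible across the segments. */
void segments_init(void)
{
    uint16_t first = 0;
    for (uint32_t i = 0; i < NUM_SEGMENTS; i++) {
        uint32_t extra = (i < NUM_LEDS % NUM_SEGMENTS) ? 1u : 0u;
        uint16_t count = (uint16_t)(NUM_LEDS / NUM_SEGMENTS + extra);
        segments[i] = (segment_t){ first, count, (uint8_t)i, false };
        first = (uint16_t)(first + count);
    }
}

/* Store a colour and mark the owning segment as needing a refresh. */
void set_pixel(uint16_t index, uint8_t r, uint8_t g, uint8_t b)
{
    if (index >= NUM_LEDS) {
        return;
    }
    uint8_t *p = &framebuffer[index * BYTES_PER_LED];
    p[0] = g;  /* GRB byte order on the wire */
    p[1] = r;
    p[2] = b;
    for (uint32_t i = 0; i < NUM_SEGMENTS; i++) {
        if (index < (uint32_t)segments[i].first + segments[i].count) {
            segments[i].dirty = true;
            break;
        }
    }
}

/* Send only the changed segments; returns how many LEDs were sent. */
uint32_t flush_dirty(void)
{
    uint32_t sent = 0;
    for (uint32_t i = 0; i < NUM_SEGMENTS; i++) {
        segment_t *s = &segments[i];
        if (s->dirty) {
            send_segment(s->pin, &framebuffer[s->first * BYTES_PER_LED],
                         (size_t)s->count * BYTES_PER_LED);
            s->dirty = false;
            sent += s->count;
        }
    }
    return sent;
}
```

With this split, changing a single LED re-sends at most 280 LEDs (about 8.4 ms of data at 1.25 µs per bit) instead of all 1118 (about 33.5 ms). If every segment is dirty and they are sent one after another, the total time is the same as for one long chain.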