what kind of language is it, it does not look like the regular C

A memcpy() should do the job, copy 256 samples by 256 samples. A more sophisticated solution would require an AHB DMA to copy, without core clock cycles, a buffer into another one: