The code is interesting, but I have one more question on the topic. The chip (ESP32-S2) has a built-in USB OTG controller, which lets it recognize compatible USB devices and connect to them (the same OTG found on cell phones). My question is: how do I acquire an image, or configure a camera, over an OTG/USB connection through pins 19 and 20 (USB data)? I've looked everywhere for a sketch I could use as a model or example, but found none that does this.
The video shown at the top actually shows, at some point, a USB camera connected directly to these pins to acquire an image.
The ESP32-S2 USB OTG port does work in USB host mode with appropriate cabling. See the links below for various ways to make a cable. Some Arduino USB host driver work for the ESP32-S2 is under way, but none of it applies to USB cameras. Some of the examples are very minimal and need more work to be useful. Still, this work might be a useful starting point for writing a camera driver. The video in the other thread shows that Espressif apparently got a USB camera working, but there is no sign they ever released the source code.
ESP32 S2 USB device and host examples.
ESP32 S2 USB host minimal examples.
The ESP-IDF docs also include API info for the USB host stack.
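If you do try writing a camera driver, one of the first steps would probably be checking whether the attached device is a USB Video Class (UVC) camera at all. Here is a rough sketch of that check in plain C. It assumes you already have the raw configuration descriptor bytes from whatever host stack you use, and the function name `find_uvc_streaming_interface` is made up for illustration, not part of any library. The class and subclass codes come from the USB specifications (Video class is 0x0E, VideoStreaming subclass is 0x02). This only identifies the interface; it does not negotiate a format or read any frames.

```c
#include <stddef.h>
#include <stdint.h>

/* Constants from the USB 2.0 and USB Video Class specifications. */
#define USB_DESC_TYPE_INTERFACE 0x04
#define USB_CLASS_VIDEO         0x0E
#define UVC_SC_VIDEOSTREAMING   0x02

/*
 * Hypothetical helper (not from any library): walk a raw USB
 * configuration descriptor and return the bInterfaceNumber of the
 * first Video/VideoStreaming interface, or -1 if none is found.
 */
int find_uvc_streaming_interface(const uint8_t *cfg, size_t len)
{
    size_t off = 0;

    while (off + 2 <= len) {
        uint8_t b_length = cfg[off];
        uint8_t b_type = cfg[off + 1];

        /* Stop on a malformed or truncated descriptor. */
        if (b_length < 2 || off + b_length > len) {
            break;
        }

        /* A standard interface descriptor is 9 bytes long. */
        if (b_type == USB_DESC_TYPE_INTERFACE && b_length >= 9) {
            uint8_t if_num = cfg[off + 2];   /* bInterfaceNumber   */
            uint8_t if_class = cfg[off + 5]; /* bInterfaceClass    */
            uint8_t if_sub = cfg[off + 6];   /* bInterfaceSubClass */

            if (if_class == USB_CLASS_VIDEO && if_sub == UVC_SC_VIDEOSTREAMING) {
                return if_num;
            }
        }

        off += b_length;
    }

    return -1;
}
```

If this returns -1, the device does not advertise a standard UVC streaming interface and a generic UVC driver would not help. If it returns an interface number, that is the interface a driver would need to claim before doing the actual UVC negotiation, which is the hard part and is not shown here.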