How did you measure it? You probably added delay() to an empty sketch and saw the difference. This is an incorrect comparison. Operator uses system libraries, which are used in any large program without it.
By itself, the overhead from using the delay() does not exceed a few dozen bytes