If you can tap into the video signal electrically, you can feed it through a video capture card and use OpenCV or a similar library to detect the objects on the screen. If you can't make an electrical connection, you could point a webcam at the screen instead. That is just as easy to implement, but the results would be less precise. None of this would involve an Arduino.