Use the GPU with your OpenCV code, get performance gains… Not!

Several times I’ve been drawn to the siren call of running key OpenCV calls on the GPU: most recently `matchTemplate`, and before that calls related to visual odometry. Ah, disappointment! The situation now is that I have 31 candidate images (templates) to match against the image from the sensor. I want to find out which one fits best, anywhere in the big image. Calling OpenCV's `matchTemplate` once per template gives me a best-match score for each of the 31; the template with the highest score is the winner. I plan on running this on an NVIDIA single-board computer, whether a Nano, a Xavier, or whatever is appropriate for cost and performance.

cuda opencv gpu image-processing

