Learn how to build a face recognition system using Python and OpenCV. This comprehensive guide covers everything you need to know, from the basics of face detection and recognition to advanced techniques such as deep learning. With the right knowledge and skills, you can build powerful face recognition systems for a variety of applications, such as security, surveillance, and marketing.
OpenCV is the most popular library for computer vision. Originally written in C/C++, it now provides bindings for Python.
OpenCV uses machine learning algorithms to search for faces within a picture. Because faces are so complicated, there isn’t one simple test that will tell you if it found a face or not. Instead, there are thousands of small patterns and features that must be matched. The algorithms break the task of identifying the face into thousands of smaller, bite-sized tasks, each of which is easy to solve. These tasks are also called classifiers.
For something like a face, you might have 6,000 or more classifiers, all of which must match for a face to be detected (within error limits, of course). But therein lies the problem: for face detection, the algorithm starts at the top left of a picture and moves down across small blocks of data, looking at each block, constantly asking, “Is this a face? … Is this a face? … Is this a face?” Since there are 6,000 or more tests per block, you might have millions of calculations to do, which will grind your computer to a halt.
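To get a feel for the scale of the problem, here is a back-of-the-envelope sketch. The 640x480 image, the 24x24 block, and the one-pixel step are assumptions picked for illustration, not values taken from OpenCV; the 6,000 figure is the one quoted above.

```python
def count_windows(img_w, img_h, win=24, step=1):
    """Number of win x win blocks that fit in the image at a given step."""
    if img_w < win or img_h < win:
        return 0
    cols = (img_w - win) // step + 1
    rows = (img_h - win) // step + 1
    return cols * rows


# Assumed image size for illustration only.
windows = count_windows(640, 480)
tests = windows * 6000  # every block checked against every classifier
print("{:,} blocks -> {:,} classifier tests at a single scale".format(windows, tests))
```

Even at a single scale the count is far too large to evaluate every classifier on every block, which is the problem the next section addresses.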
To get around this, OpenCV uses cascades. What’s a cascade? The best answer can be found in the dictionary: “a waterfall or series of waterfalls.”
Like a series of waterfalls, the OpenCV cascade breaks the problem of detecting faces into multiple stages. For each block, it does a very rough and quick test. If that passes, it does a slightly more detailed test, and so on. The algorithm may have 30 to 50 of these stages or cascades, and it will only detect a face if all stages pass.
The advantage is that the majority of the picture will return a negative during the first few stages, which means the algorithm won’t waste time testing all 6,000 features on it. Instead of taking hours, face detection can now be done in real time.
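The early-exit idea can be sketched in a few lines of plain Python. This is a toy model, not OpenCV's implementation: each "block" is reduced to a single made-up score, and each hypothetical stage is a simple threshold, whereas real stages combine many Haar features.

```python
def run_cascade(block, stages):
    """Return (is_face, stages_evaluated), stopping at the first failing stage."""
    for index, stage in enumerate(stages, start=1):
        if not stage(block):
            return False, index
    return True, len(stages)


# Hypothetical stages: cheap, loose tests first, stricter ones later.
stages = [lambda score, t=t: score >= t for t in (0.1, 0.3, 0.5, 0.7, 0.9)]

# Made-up block scores; most blocks in a real photo look nothing like a face.
blocks = [0.05, 0.2, 0.95, 0.0, 0.6]

results = [run_cascade(block, stages) for block in blocks]
evaluated = sum(count for _, count in results)
print("Stages evaluated:", evaluated, "of", len(blocks) * len(stages))
```

Most blocks are rejected after one or two stages, so the total work is well below what running every stage on every block would cost.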
Though the theory may sound complicated, in practice it is quite easy. The cascades themselves are just a bunch of XML files that contain OpenCV data used to detect objects. You initialize your code with the cascade you want, and then it does the work for you.
Since face detection is such a common case, OpenCV comes with a number of built-in cascades for detecting everything from faces to eyes to hands to legs. There are even cascades for non-human things. For example, if you run a banana shop and want to track people stealing bananas, someone has already built a cascade for that!
First, you need to find the correct setup file for your operating system.
I found that installing OpenCV was the hardest part of the task. If you get strange unexplainable errors, it could be due to library clashes, 32/64 bit differences, and so on. I found it easiest to just use a Linux virtual machine and install OpenCV from scratch.
Once you have completed the installation, you can test whether or not it works by firing up a Python session and typing:
>>> import cv2
>>>
If you don’t get any errors, you can move on to the next part.
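If you would rather check from a script than from the interactive prompt, something like the following should work. The helper name opencv_version is my own; cv2.__version__ is the version string the package exposes.

```python
import importlib


def opencv_version():
    """Return the installed OpenCV version string, or None if cv2 is missing."""
    try:
        cv2 = importlib.import_module("cv2")
    except ImportError:
        return None
    return cv2.__version__


version = opencv_version()
print("OpenCV is not installed" if version is None else "OpenCV " + version)
```

Knowing the version matters later on, because some constant names changed between OpenCV 2.x and newer releases.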
Let’s break down the actual code, which you can download from the repo. Grab the face_detect.py script, the abba.png pic, and the haarcascade_frontalface_default.xml.
import sys
import cv2

# Get user supplied values
imagePath = sys.argv[1]
cascPath = sys.argv[2]
You first pass in the image and cascade names as command-line arguments. We’ll use the ABBA image as well as the default cascade for detecting faces provided by OpenCV.
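Reading sys.argv directly fails with an IndexError if an argument is missing. As an alternative (not what the original script does), a small argparse wrapper gives a usage message instead. The function and argument names here are my own choices.

```python
import argparse


def parse_args(argv=None):
    """Parse the image and cascade paths from the command line."""
    parser = argparse.ArgumentParser(description="Detect faces in an image.")
    parser.add_argument("image_path", help="path to the input image")
    parser.add_argument("cascade_path", help="path to the Haar cascade XML file")
    return parser.parse_args(argv)


# Passing a list makes the sketch runnable without a real command line.
args = parse_args(["abba.png", "haarcascade_frontalface_default.xml"])
print(args.image_path, args.cascade_path)
```

In a real script you would call parse_args() with no arguments so that it reads from the command line.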
# Create the haar cascade
faceCascade = cv2.CascadeClassifier(cascPath)
Now we create the cascade and initialize it with our face cascade. This loads the face cascade into memory so it’s ready for use. Remember, the cascade is just an XML file that contains the data to detect faces.
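One thing to watch for: as far as I can tell, CascadeClassifier does not necessarily raise an error when the XML path is wrong; you can end up with an empty classifier that only fails later. A defensive sketch, assuming you want a hard failure up front (the load_cascade helper is hypothetical; empty() is a CascadeClassifier method):

```python
import os


def load_cascade(cascade_path):
    """Load a Haar cascade, raising an error instead of failing silently."""
    if not os.path.isfile(cascade_path):
        raise IOError("Cascade file not found: {}".format(cascade_path))

    # Imported here so the path check works even before OpenCV is installed.
    import cv2

    cascade = cv2.CascadeClassifier(cascade_path)
    if cascade.empty():
        raise IOError("Could not load cascade from: {}".format(cascade_path))
    return cascade
```

With this in place, the line above could become faceCascade = load_cascade(cascPath).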
# Read the image
image = cv2.imread(imagePath)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
Here we read the image and convert it to grayscale. Many operations in OpenCV are done in grayscale.
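For the curious, the grayscale conversion is a weighted sum of the three color channels. OpenCV documents the weights as 0.299 for red, 0.587 for green, and 0.114 for blue. The sketch below applies that formula to a single pixel in plain Python; it is only an illustration, and OpenCV's own arithmetic may round slightly differently. Note that OpenCV stores pixels in blue, green, red order.

```python
def bgr_to_gray(pixel):
    """Weighted sum of a (blue, green, red) pixel, rounded to an integer."""
    blue, green, red = pixel
    return int(round(0.299 * red + 0.587 * green + 0.114 * blue))


# Pure blue, green, and red pixels end up with very different brightness.
for name, pixel in [("blue", (255, 0, 0)), ("green", (0, 255, 0)), ("red", (0, 0, 255))]:
    print(name, bgr_to_gray(pixel))
```

Green contributes the most to perceived brightness, which is why it carries the largest weight.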
# Detect faces in the image
faces = faceCascade.detectMultiScale(
    gray,
    scaleFactor=1.1,
    minNeighbors=5,
    minSize=(30, 30),
    flags=cv2.CASCADE_SCALE_IMAGE  # was cv2.cv.CV_HAAR_SCALE_IMAGE in OpenCV 2.x
)
This function detects the actual face and is the key part of our code, so let’s go over the options:
The detectMultiScale function is a general function that detects objects. Since we are calling it on the face cascade, that’s what it detects.
The first option is the grayscale image.
The second is the scaleFactor. Since some faces may be closer to the camera, they would appear bigger than the faces in the back. The scale factor compensates for this by controlling how much the image is scaled down at each step, so the detector searches at several sizes.
The detection algorithm uses a moving window to detect objects. minNeighbors defines how many overlapping detections must be found near the current one before it declares the face found. minSize, meanwhile, gives the smallest window size to consider; objects smaller than this are ignored.
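To make minNeighbors more concrete, here is a simplified sketch of the idea: raw detections that overlap heavily are grouped, and a group is kept only if it has enough members. This is my own toy version using an overlap ratio (intersection over union) with an assumed 0.5 threshold; OpenCV's actual grouping differs in its details.

```python
def iou(a, b):
    """Intersection over union of two (x, y, w, h) rectangles."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / float(union) if union else 0.0


def filter_by_neighbors(raw, min_neighbors, min_iou=0.5):
    """Group overlapping rectangles; keep one per group with enough members."""
    groups = []
    for rect in raw:
        for group in groups:
            if iou(rect, group[0]) >= min_iou:
                group.append(rect)
                break
        else:
            groups.append([rect])
    return [group[0] for group in groups if len(group) >= min_neighbors]


# Made-up raw detections: four around one spot, one isolated hit.
raw = [(100, 100, 50, 50), (102, 101, 50, 50), (98, 99, 52, 52),
       (101, 100, 49, 49), (300, 40, 30, 30)]
print(filter_by_neighbors(raw, min_neighbors=3))
```

The isolated rectangle is dropped because nothing nearby backs it up, which is how a higher minNeighbors trades missed faces for fewer false positives.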
Note: I took commonly used values for these fields. In real life, you would experiment with different values for the window size, scale factor, and so on until you found one that works best for you.
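A rough way to see what changing scaleFactor does is to list the sizes the search would step through between minSize and the image size. The sketch below is a simplified picture with an assumed 480-pixel image dimension, not a description of OpenCV's internals.

```python
def window_scales(min_size, image_size, scale_factor):
    """Sizes tried, growing from min_size by scale_factor while they still fit."""
    if scale_factor <= 1.0:
        raise ValueError("scale_factor must be greater than 1")
    sizes = []
    size = float(min_size)
    while size <= image_size:
        sizes.append(int(round(size)))
        size *= scale_factor
    return sizes


for factor in (1.1, 1.2):
    sizes = window_scales(30, 480, factor)
    print("scaleFactor={}: {} sizes, starting {}".format(factor, len(sizes), sizes[:5]))
```

A smaller factor means more steps, so it is slower but less likely to skip over a face whose size falls between two steps.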
The function returns a list of rectangles in which it believes it found a face. Next, we will loop over where it thinks it found something.
print("Found {0} faces!".format(len(faces)))
# Draw a rectangle around the faces
for (x, y, w, h) in faces:
    cv2.rectangle(image, (x, y), (x+w, y+h), (0, 255, 0), 2)
Each entry in faces holds 4 values: the x and y location of the rectangle’s top-left corner, and the rectangle’s width and height (w, h).
We use these values to draw a rectangle using the built-in rectangle() function.
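Since rectangle() wants two corner points while the detector gives a corner plus a size, the conversion is worth spelling out. The helpers and the sample rectangles below are hypothetical, written only to show the arithmetic.

```python
def rect_to_corners(rect):
    """Convert (x, y, w, h) to ((left, top), (right, bottom))."""
    x, y, w, h = rect
    return (x, y), (x + w, y + h)


def largest_face(rects):
    """Return the rectangle with the biggest area, or None if there are none."""
    return max(rects, key=lambda r: r[2] * r[3], default=None)


# Made-up detections in (x, y, w, h) form.
sample = [(40, 60, 80, 80), (200, 50, 120, 120)]
for rect in sample:
    print(rect, "->", rect_to_corners(rect))
print("Largest:", largest_face(sample))
```

Picking the largest rectangle is one simple way to focus on the person closest to the camera.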
cv2.imshow("Faces found", image)
cv2.waitKey(0)
In the end, we display the image and wait for the user to press a key.
Let’s test against the ABBA photo:
$ python face_detect.py abba.png haarcascade_frontalface_default.xml
That worked. How about another photo:
That … is not a face. Let’s try again. I changed the parameters and found that setting the scaleFactor to 1.2 got rid of the wrong face.
Well, the first photo was taken fairly close up with a high quality camera. The second one seems to have been taken from afar and possibly with a mobile phone. This is why the scaleFactor had to be modified. As I said, you’ll have to set up the algorithm on a case-by-case basis to avoid false positives.
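Rather than changing one value at a time by hand, you could loop over a few candidates and compare the counts. The sketch below takes the detector as a function so it runs without an image; fake_detect is a stand-in with made-up behavior, not real output.

```python
def sweep_scale_factors(detect, scale_factors):
    """Map each scale factor to the number of detections `detect` reports."""
    return {factor: len(detect(factor)) for factor in scale_factors}


def fake_detect(scale_factor):
    """Stand-in detector: pretends one extra false positive appears below 1.2."""
    faces = [(10, 10, 50, 50)] * 4
    if scale_factor < 1.2:
        faces = faces + [(300, 300, 40, 40)]
    return faces


counts = sweep_scale_factors(fake_detect, [1.05, 1.1, 1.2, 1.3])
for factor in sorted(counts):
    print("scaleFactor={}: {} detections".format(factor, counts[factor]))
```

With the real cascade, detect could be a small function that calls faceCascade.detectMultiScale on gray with the given scaleFactor and the other parameters held fixed. You still need to look at the output images, since a count alone cannot tell a real face from a false positive.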
Be warned though that since this is based on machine learning, the results will never be 100% accurate. You will get good enough results in most cases, but occasionally the algorithm will identify incorrect objects as faces.
The final code can be found here.
Thanks!