Trying to Count Faces Using OpenCV, Haar Cascades and Raspberry PI

Question

So I have a project for a trail camera to count people entering the trail by detecting their faces. There isn't any power available so I am stuck using a Raspberry Pi 3 B+ for power reasons. I am currently using opencv and Haar cascades to detect faces. My problem is two-fold. The first one is that my counter behaves

Accepted Answer

First, you can use Dlib as you said but you have to use &#8220;HOG&#8221; method (Histogram of oriented gradients) instead of &#8220;CNN&#8221; for performance.locations = face_recognition.face_locations(frame, model="hog")But, if you really wanna get the faster performance I will recommend you to use Mediapipe for that purpose.Download Mediapipe on your rpi3:sudo pip3 install mediapipe-rpi3Here is an example code from Mediapipe documentation for faces detector:import cv2import mediapipe as mpmp_face_detection = mp.solutions.face_detectionmp_drawing = mp.solutions.drawing_utils# For webcam input:cap = cv2.VideoCapture(0)with mp_face_detection.FaceDetection(    model_selection=0, min_detection_confidence=0.5) as face_detection:  while cap.isOpened():    success, image = cap.read()    if not success:      print("Ignoring empty camera frame.")      # If loading a video, use 'break' instead of 'continue'.      continue    # To improve performance, optionally mark the image as not writeable to    # pass by reference.    image.flags.writeable = False    image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)    results = face_detection.process(image)    # Draw the face detection annotations on the image.    image.flags.writeable = True    image = cv2.cvtColor(image, cv2.COLOR_RGB2BGR)    if results.detections:      for detection in results.detections:        mp_drawing.draw_detection(image, detection)    # Flip the image horizontally for a selfie-view display.    cv2.imshow('MediaPipe Face Detection', cv2.flip(image, 1))    if cv2.waitKey(5) & 0xFF == 27:      breakcap.release()I am not sure about how much FPS you will get (but surely better than Dlib and very accurately), but you can speed up the performance by detecting faces on every third frame instead of on all of them.Secondly, you can do a naive way that probably will work fine.You can extract the centers of the bounding boxes in the last detection and if a center from the previous frame is inside a bounding box of a face in the current frame, it&#8217;s probably the same person.You can do it more accurately by determining if the new center of the face in the current frame (the center of his bounding box) is close enough to one of the last centers in the last frame by an offset that you choose. If it does, it&#8217;s probably the same face so just don&#8217;t count it.Visualization:Hope it will work fine for you!

Advertisement

Answer