Get the bounding box coordinates in the TensorFlow object detection API tutorial

Question

I am new to both Python and Tensorflow. I am trying to run the object detection tutorial file from the Tensorflow Object Detection API, but I cannot find where I can get the coordinates of the bounding boxes when objects are detected. Relevant code: The place where I assume bounding boxes are drawn is like this: I tried printing output_dict['detection_boxes']

Accepted Answer

I tried printing output_dict[&#8216;detection_boxes&#8217;] but I am not sure what  the numbers meanYou can check out the code for yourself.  visualize_boxes_and_labels_on_image_array is defined here.Note that you are passing use_normalized_coordinates=True.  If you trace the function calls, you will see your numbers [ 0.56213236,  0.2780568 ,  0.91445708,  0.69120586] etc. are the values [ymin, xmin, ymax, xmax] where the image coordinates: (left, right, top, bottom) = (xmin * im_width, xmax * im_width,                               ymin * im_height, ymax * im_height)are computed by the function:def draw_bounding_box_on_image(image,                           ymin,                           xmin,                           ymax,                           xmax,                           color='red',                           thickness=4,                           display_str_list=(),                           use_normalized_coordinates=True):  """Adds a bounding box to an image.  Bounding box coordinates can be specified in either absolute (pixel) or  normalized coordinates by setting the use_normalized_coordinates argument.  Each string in display_str_list is displayed on a separate line above the  bounding box in black text on a rectangle filled with the input 'color'.  If the top of the bounding box extends to the edge of the image, the strings  are displayed below the bounding box.  Args:    image: a PIL.Image object.    ymin: ymin of bounding box.    xmin: xmin of bounding box.    ymax: ymax of bounding box.    xmax: xmax of bounding box.    color: color to draw bounding box. Default is red.    thickness: line thickness. Default value is 4.    display_str_list: list of strings to display in box                      (each to be shown on its own line).    use_normalized_coordinates: If True (default), treat coordinates      ymin, xmin, ymax, xmax as relative to the image.  Otherwise treat      coordinates as absolute.  """  draw = ImageDraw.Draw(image)  im_width, im_height = image.size  if use_normalized_coordinates:    (left, right, top, bottom) = (xmin * im_width, xmax * im_width,                                  ymin * im_height, ymax * im_height)

Advertisement

Answer