bbox

Bounding boxes are geometric entities that enclose the shape of an object in Cartesian coordinates. Bounding boxes define minimum and maximum limits at each dimension so the entire object lies within the specified limits. A 2D bounding box is defined as a 4-dimensional vector [x, y, w, h], where [x, y] is the center of the bounding box and [w, h] represents the width (horizontal, x-coordinate dimension) and height (vertical, y-coordinate dimension), respectively.

Example

"bbox": [{
        "name": "bbox_shape",
        "val": [400, 200, 100, 120],
        "stream": "camera1",
        "confidence_score":0.8,
        "attributes" : {
            "boolean" : [{
                    "name" : "visible",
                    "val" : false
                }, {
                    "name" : "occluded",
                    "val" : false
                }
            ],
            "text" : [{
                    "name" : "brand",
                    "val" : "toyota"
                }, {
                    "name" : "color",
                    "val" : "red"
                }
            ]
        }
    ]
}

Schema

A 2D bounding box is defined as a 4-dimensional vector [x, y, w, h], where [x, y] is the center of the bounding box and [w, h] represents the width (horizontal, x-coordinate dimension) and height (vertical, y-coordinate dimension), respectively.

namedescriptiontypeunitrequired

${OBJECT_TYPE}

The name of this type. This case is “bbox”.

object

-

true

name

The name of this bounding box. Usually to be "bbox_shape"

string

-

true

val

Meanings of each element in order as a 4-dimensional vector [x, y, w, h]: ➤ x-coordinate of the center ➤ y-coordinate of the center ➤ w-the width of the rectangle ➤ h-the height of the rectangle

4 elements array of int

px

true

stream

Represents which stream this shape is on.

string

-

true

confidence_score

The confidence score of model prediction of this object. Ground truth does not have this attribute.

number

-

false

attributes

attributes this bounding box has

object

-

false


Use Case

bbox

To describe a bbox dataset with one camera sensor:

  • sensor: camera (#camera1)

  • ontology:

    • people

      • ischild - boolean (static info)

      • direction - front, left, right, back (dynamic info)

      • age - number (static info)

    • car

      • color - white, silver, blue, red, black (static info)

    • truck

    • bus

Example Code

pagebbox

Last updated