matrix_nms

paddle.vision.ops. matrix_nms ( bboxes: Tensor, scores: Tensor, score_threshold: float, post_threshold: float, nms_top_k: int, keep_top_k: int, use_gaussian: bool = False, gaussian_sigma: float = 2.0, background_label: int = 0, normalized: bool = True, return_index: Literal[True] = False, return_rois_num: Literal[False] = True, name: str | None = None ) → tuple[Tensor, None, Tensor] [source]

paddle.vision.ops. matrix_nms ( bboxes: Tensor, scores: Tensor, score_threshold: float, post_threshold: float, nms_top_k: int, keep_top_k: int, use_gaussian: bool = False, gaussian_sigma: float = 2.0, background_label: int = 0, normalized: bool = True, return_index: Literal[True] = False, return_rois_num: Literal[True] = True, name: str | None = None ) → tuple[Tensor, Tensor, Tensor]

paddle.vision.ops. matrix_nms ( bboxes: Tensor, scores: Tensor, score_threshold: float, post_threshold: float, nms_top_k: int, keep_top_k: int, use_gaussian: bool = False, gaussian_sigma: float = 2.0, background_label: int = 0, normalized: bool = True, return_index: Literal[False] = False, return_rois_num: Literal[True] = True, name: str | None = None ) → tuple[Tensor, Tensor, None]

paddle.vision.ops. matrix_nms ( bboxes: Tensor, scores: Tensor, score_threshold: float, post_threshold: float, nms_top_k: int, keep_top_k: int, use_gaussian: bool = False, gaussian_sigma: float = 2.0, background_label: int = 0, normalized: bool = True, return_index: Literal[False] = False, return_rois_num: Literal[False] = True, name: str | None = None ) → tuple[Tensor, None, None]

paddle.vision.ops. matrix_nms ( bboxes: Tensor, scores: Tensor, score_threshold: float, post_threshold: float, nms_top_k: int, keep_top_k: int, use_gaussian: bool = False, gaussian_sigma: float = 2.0, background_label: int = 0, normalized: bool = True, return_index: bool = False, return_rois_num: bool = True, name: str | None = None ) → tuple[Tensor, Tensor | None, Tensor | None]

This operator does matrix non maximum suppression (NMS). First selects a subset of candidate bounding boxes that have higher scores than score_threshold (if provided), then the top k candidate is selected if nms_top_k is larger than -1. Score of the remaining candidate are then decayed according to the Matrix NMS scheme. After NMS step, at most keep_top_k number of total bboxes are to be kept per image if keep_top_k is larger than -1.

Parameters

bboxes (Tensor) – A 3-D Tensor with shape [N, M, 4] represents the predicted locations of M bounding bboxes, N is the batch size. Each bounding box has four coordinate values and the layout is [xmin, ymin, xmax, ymax], when box size equals to 4. The data type is float32 or float64.
scores (Tensor) – A 3-D Tensor with shape [N, C, M] represents the predicted confidence predictions. N is the batch size, C is the class number, M is number of bounding boxes. For each category there are total M scores which corresponding M bounding boxes. Please note, M is equal to the 2nd dimension of BBoxes. The data type is float32 or float64.
score_threshold (float) – Threshold to filter out bounding boxes with low confidence score.
post_threshold (float) – Threshold to filter out bounding boxes with low confidence score AFTER decaying.
nms_top_k (int) – Maximum number of detections to be kept according to the confidences after the filtering detections based on score_threshold.
keep_top_k (int) – Number of total bboxes to be kept per image after NMS step. -1 means keeping all bboxes after NMS step.
use_gaussian (bool, optional) – Use Gaussian as the decay function. Default: False
gaussian_sigma (float, optional) – Sigma for Gaussian decay function. Default: 2.0
background_label (int, optional) – The index of background label, the background label will be ignored. If set to -1, then all categories will be considered. Default: 0
normalized (bool, optional) – Whether detections are normalized. Default: True
return_index (bool, optional) – Whether return selected index. Default: False
return_rois_num (bool, optional) – whether return rois_num. Default: True
name (str|None, optional) – Name of the matrix nms op. Default: None.

Returns

A tuple with three Tensor, (Out, Index, RoisNum) if return_index is True, otherwise, a tuple with two Tensor (Out, RoisNum) is returned.
Out (Tensor), A 2-D Tensor with shape [No, 6] containing the detection results. Each row has 6 values, [label, confidence, xmin, ymin, xmax, ymax]
Index (Tensor), A 2-D Tensor with shape [No, 1] containing the selected indices, which are absolute values cross batches.
rois_num (Tensor), A 1-D Tensor with shape [N] containing the number of detected boxes in each image.

Examples

           >>> import paddle
>>> from paddle.vision.ops import matrix_nms

>>> boxes = paddle.rand([4, 1, 4])
>>> boxes[..., 2] = boxes[..., 0] + boxes[..., 2]
>>> boxes[..., 3] = boxes[..., 1] + boxes[..., 3]
>>> scores = paddle.rand([4, 80, 1])
>>> out = matrix_nms(bboxes=boxes, scores=scores, background_label=0,
...                         score_threshold=0.5, post_threshold=0.1,
...                         nms_top_k=400, keep_top_k=200, normalized=False)