RoIAlign

class paddle.vision.ops.RoIAlign(output_size, spatial_scale=1.0)

Constructs a callable object of the RoIAlign class. For details of the underlying operation, refer to roi_align.

Parameters
  • output_size (int or tuple[int, int]) – The pooled output size (h, w); data type is int32. If an int is given, both h and w equal output_size.

  • spatial_scale (float32, optional) – Multiplicative spatial scale factor to translate ROI coords from their input scale to the scale used when pooling. Default: 1.0.
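The two constructor parameters can be illustrated with a minimal pure-Python sketch. The helpers normalize_output_size and scale_box are hypothetical names introduced here for illustration only; they are not part of Paddle and do not reproduce its internal implementation. They only show how an int output_size expands to (h, w) and how spatial_scale multiplies box coordinates, assuming boxes are given as (x1, y1, x2, y2).

```python
from typing import Sequence, Tuple, Union


def normalize_output_size(output_size: Union[int, Tuple[int, int]]) -> Tuple[int, int]:
    """Hypothetical helper: expand an int into (h, w); keep a 2-tuple as is."""
    if isinstance(output_size, int):
        return (output_size, output_size)
    h, w = output_size
    return (int(h), int(w))


def scale_box(box: Sequence[float], spatial_scale: float = 1.0) -> Tuple[float, ...]:
    """Hypothetical helper: multiply (x1, y1, x2, y2) by spatial_scale."""
    return tuple(coord * spatial_scale for coord in box)


# A box in input-image pixels, mapped onto a feature map that is
# 16 times smaller than the image (an assumed stride, for illustration).
print(normalize_output_size(7))
print(scale_box((64.0, 32.0, 256.0, 128.0), spatial_scale=1 / 16))
```

With the default spatial_scale=1.0, box coordinates are taken to be already expressed in the coordinate system of the input feature map.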

Returns

The output of the RoIAlign operator is a 4-D tensor with shape (num_boxes, channels, pooled_h, pooled_w).
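As a rough check of the shape rule above, the following sketch computes the expected output shape without running Paddle. The function expected_output_shape is a hypothetical helper, and it assumes that boxes_num holds one box count per image in the batch, so that num_boxes is their sum.

```python
from typing import List, Sequence, Tuple, Union


def expected_output_shape(
    x_shape: Sequence[int],
    boxes_num: Sequence[int],
    output_size: Union[int, Tuple[int, int]],
) -> List[int]:
    """Hypothetical helper: shape (num_boxes, channels, pooled_h, pooled_w)."""
    batch, channels, _, _ = x_shape
    # Assumption: one entry in boxes_num per image in the batch.
    if len(boxes_num) != batch:
        raise ValueError("boxes_num must have one entry per image in the batch")
    if isinstance(output_size, int):
        pooled_h = pooled_w = output_size
    else:
        pooled_h, pooled_w = output_size
    return [sum(boxes_num), channels, pooled_h, pooled_w]


# One image with 256 channels, three boxes, pooled to 4 x 3.
print(expected_output_shape([1, 256, 32, 32], [3], (4, 3)))
```

The channel count comes from the input, while the pooled height and width come only from output_size; the spatial size of the input does not affect the output shape.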

Examples

>>> import paddle
>>> from paddle.vision.ops import RoIAlign

>>> data = paddle.rand([1, 256, 32, 32])
>>> boxes = paddle.rand([3, 4])
>>> boxes[:, 2] += boxes[:, 0] + 3
>>> boxes[:, 3] += boxes[:, 1] + 4
>>> boxes_num = paddle.to_tensor([3]).astype('int32')
>>> roi_align = RoIAlign(output_size=(4, 3))
>>> align_out = roi_align(data, boxes, boxes_num)
>>> print(align_out.shape)
[3, 256, 4, 3]

forward(x, boxes, boxes_num, aligned=True)

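Defines the computation performed at every call: applies RoIAlign to x for the given boxes. For the full semantics of each argument, refer to roi_align.

Parameters
  • x (Tensor) – Input feature map, a 4-D tensor with shape (N, C, H, W).

  • boxes (Tensor) – Regions of interest, a 2-D tensor with shape (num_boxes, 4), each row given as (x1, y1, x2, y2).

  • boxes_num (Tensor) – Number of boxes for each image in the batch, data type is int32.

  • aligned (bool, optional) – If True, box coordinates are shifted by -0.5 pixel for better alignment with neighboring pixel indices; if False, the legacy behavior is used. Default: True.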