XavierNormal

class paddle.nn.initializer.XavierNormal(fan_in=None, fan_out=None, name=None)

This class implements the Xavier weight initializer from the paper Understanding the difficulty of training deep feedforward neural networks by Xavier Glorot and Yoshua Bengio. Weights are sampled from a normal distribution with mean \(0\) and standard deviation

\[\sqrt{\frac{2.0}{fan\_in + fan\_out}}.\]
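
For example, the paddle.nn.Linear(2, 2) layer used in the code below has \(fan\_in = fan\_out = 2\), so the sampling standard deviation works out to

\[\sqrt{\frac{2.0}{2 + 2}} = \sqrt{0.5} \approx 0.7071.\]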
Parameters
  • fan_in (float, optional) – fan_in for Xavier initialization. If set to None, it is inferred from the Tensor. Default is None.

  • fan_out (float, optional) – fan_out for Xavier initialization. If set to None, it is inferred from the Tensor. Default is None. See the sketch after this parameter list for passing both values explicitly.

  • name (str, optional) – For details, please refer to Name. Generally, no setting is required. Default: None.
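
Below is a minimal sketch of passing fan_in and fan_out explicitly rather than letting them be inferred; the layer sizes (8 and 4) and the supplied values are illustrative only.

>>> import paddle
>>> # Explicitly supplied fan_in/fan_out are used in place of the values
>>> # inferred from the parameter's shape (illustrative values).
>>> init = paddle.nn.initializer.XavierNormal(fan_in=8.0, fan_out=4.0)
>>> weight_attr = paddle.framework.ParamAttr(initializer=init)
>>> linear = paddle.nn.Linear(8, 4, weight_attr=weight_attr)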

Returns

A parameter initialized by Xavier weight initialization using a normal distribution.

Examples

>>> import paddle
>>> paddle.seed(1)
>>> data = paddle.ones(shape=[3, 1, 2], dtype='float32')
>>> weight_attr = paddle.framework.ParamAttr(
...     name="linear_weight",
...     initializer=paddle.nn.initializer.XavierNormal())
>>> bias_attr = paddle.framework.ParamAttr(
...     name="linear_bias",
...     initializer=paddle.nn.initializer.XavierNormal())
>>> linear = paddle.nn.Linear(2, 2, weight_attr=weight_attr, bias_attr=bias_attr)
>>> print(linear.weight)
Parameter containing:
Tensor(shape=[2, 2], dtype=float32, place=Place(cpu), stop_gradient=False,
[[-0.21607460,  0.08382989],
 [ 0.29147008, -0.07049121]])

>>> print(linear.bias)
Parameter containing:
Tensor(shape=[2], dtype=float32, place=Place(cpu), stop_gradient=False,
[1.06076419, 0.87684733])

>>> res = linear(data)
>>> print(res)
Tensor(shape=[3, 1, 2], dtype=float32, place=Place(cpu), stop_gradient=False,
[[[1.13615966, 0.89018601]],
 [[1.13615966, 0.89018601]],
 [[1.13615966, 0.89018601]]])