[Study Notes] What the keepdim parameter does, as seen in LayerNorm

Posted on 2023-8-8 11:42:03


The LayerNorm layer of the Transformer uses the keepdim parameter when it computes the mean and the standard deviation:

import torch
import torch.nn as nn

class LayerNorm(nn.Module):
    "Construct a layernorm module (see citation for details)."
    def __init__(self, features, eps=1e-6):
        super(LayerNorm, self).__init__()
        self.a_2 = nn.Parameter(torch.ones(features))   # learnable scale
        self.b_2 = nn.Parameter(torch.zeros(features))  # learnable shift
        self.eps = eps
    def forward(self, x):
        # keepdim=True keeps the last dimension with size 1, so mean and std
        # broadcast against x in the expression below.
        mean = x.mean(-1, keepdim=True)
        std = x.std(-1, keepdim=True)
        return self.a_2 * (x - mean) / (std + self.eps) + self.b_2


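With keepdim=True, a reduction such as sum, mean or std keeps the same number of dimensions as its input: the reduced dimension stays in the result with size 1 instead of being removed. In LayerNorm this is what lets mean and std, of shape (..., 1), broadcast against x, of shape (..., features).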
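To make the broadcasting point concrete, here is a minimal sketch. The input shape (2, 3, 4) is an assumption chosen for illustration (2 sequences, 3 tokens, 4 features) and does not come from the original post. It compares the mean over the last dimension with and without keepdim, then tries to subtract each result from x.

```python
import torch

# Hypothetical input: 2 sequences, 3 tokens each, 4 features per token.
x = torch.arange(24, dtype=torch.float32).reshape(2, 3, 4)

mean_keep = x.mean(-1, keepdim=True)  # shape (2, 3, 1)
mean_drop = x.mean(-1)                # shape (2, 3)
print(mean_keep.shape, mean_drop.shape)

# (2, 3, 4) - (2, 3, 1): the size-1 dimension is broadcast over the features.
centered = x - mean_keep
print(centered.shape)

# (2, 3, 4) - (2, 3): the trailing dimensions (4 vs 3) do not match,
# so the shapes cannot be broadcast together.
try:
    x - mean_drop
    broadcast_failed = False
except RuntimeError as err:
    broadcast_failed = True
    print("broadcast error:", err)
```

Without keepdim=True, the subtraction in forward would need an explicit mean.unsqueeze(-1) to restore the missing dimension.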

Example:

>>> import torch
>>> a = torch.arange(12).reshape(3, 4)
>>> print(a)
tensor([[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, 11]])
>>> print(torch.sum(a, dim=0, keepdim=True))
tensor([[12, 15, 18, 21]])
>>> print(torch.sum(a, dim=0, keepdim=True).shape)
torch.Size([1, 4])
>>> print(torch.sum(a, dim=0, keepdim=False))
tensor([12, 15, 18, 21])
>>> print(torch.sum(a, dim=0, keepdim=False).shape)
torch.Size([4])


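With keepdim=True, the output has the same number of dimensions as the input: the summed dimension (dim 0) is kept with size 1, so the shape is [1, 4].
With keepdim=False (the default), the summed dimension is dropped from the output, so the shape is [4].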
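A dropped dimension does not always produce an error. The sketch below uses a square matrix invented for illustration (it is not from the original post) and tries to normalise each row by its row sum. With keepdim=False the shapes still broadcast, but along the wrong axis, so the result is silently different.

```python
import torch

# Hypothetical 3x3 matrix; square on purpose, so both divisions are valid.
a = torch.tensor([[1.0, 1.0, 2.0],
                  [2.0, 2.0, 4.0],
                  [1.0, 3.0, 4.0]])

row_sum_keep = a.sum(dim=1, keepdim=True)  # shape (3, 1)
row_sum_drop = a.sum(dim=1)                # shape (3,)

# (3, 3) / (3, 1): every row is divided by its own sum.
right = a / row_sum_keep

# (3, 3) / (3,): the vector is treated as shape (1, 3), so column j is
# divided by the sum of row j, which is not the intended normalisation.
wrong = a / row_sum_drop

print(right.sum(dim=1))  # rows of `right` each sum to 1
print(wrong.sum(dim=1))  # rows of `wrong` do not

# unsqueeze restores the dropped dimension and recovers the keepdim result.
restored = row_sum_drop.unsqueeze(1)
```

Because no exception is raised here, checking the shapes of intermediate results is a reasonable habit whenever a reduction feeds a broadcasted operation.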

