torch.flatten vs torch.nn.Flatten 的區別

原創

LabVIEW_Python

2023-03-12 07:34

torch.flatten 和 torch.nn.Flatten 都用於把多維Tensor展平(flatten), 區別是：

torch.flatten是函數，使用前無需先實例化，默認從第0維開始展平，通用化好

torch.flatten(input, start_dim=0, end_dim=- 1)

torch.nn.Flatten是類，使用前需要先實例化，由於其在torch.nn模塊中，默認專門處理神經網數據的展平，而神經網絡數據通常第0維是Batch_Size, Batch_Size無需展平，所以其默認從第1維開始展平。

Class torch.nn.Flatten(start_dim=1, end_dim=- 1)

測試範例程序如下：

import torch

input_tensor = torch.randn(32, 4, 5, 5)
m = torch.nn.Flatten() #實例化Flatten
output1 = m(input_tensor)
print(output1.shape)
output2 = torch.flatten(input_tensor)
print(output2.shape)

運行結果如下：

torch.Size([32, 100])
torch.Size([3200])

另外，torch.nn.Flatten適合作爲一個“神經網絡層”，加入神經網絡中，範例：

def _create_fcs(self, split_size, num_boxes, num_classes):
        S, B, C = split_size, num_boxes, num_classes
        return nn.Sequential(
            nn.Flatten(),
            nn.Linear(1024 * S * S, 4096), 
            # Usually, dropout is placed on the fully connected layers only
            # A rule of thumb is to set the keep probability (1 - drop probability) to 0.5 when dropout is applied to fully connected layers
            # https://stackoverflow.com/questions/46841362/where-dropout-should-be-inserted-fully-connected-layer-convolutional-layer
            nn.Dropout(0.5),
            nn.LeakyReLU(0.1),
            # The predictions are encoded as an S × S × (B ∗ 5 + C) tensor
            nn.Linear(4096, S * S * (B * 5 + C)), # 7*7*(2*5+20)=1470
        )

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

torch.flatten vs torch.nn.Flatten 的區別

AI模型 Llama 3體驗筆記

【面試準備】又一次失敗的面試經歷，題目離譜～資深軟件測試工程師

dotnet 8 版本與銀河麒麟V10和UOS系統的 glibc 兼容性

如何解決報錯:unable to get local issuer certificate

torch.flatten vs torch.nn.Flatten 的區別

fetch_california_housing報錯：urllib.error.HTTPError: HTTP Error 403: Forbidden

在ImageNet 1k數據集上訓練yolov5m-cls分類模型

OpenCV vs Pillow 誰讀取文件速度更快？

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結