PyTorch の Automatic Differentiation の詳細：torch.autograd.Function.backward() の仕組み

2024-04-02

PyTorchの自動微分とtorch.autograd.Function.backward()

torch.autograd.Function は、自動微分における計算グラフのノードを表すクラスです。このクラスには、backward() メソッドが定義されています。このメソッドは、出力テンソルの勾配を計算するために呼び出されます。

backward() メソッドは、以下の引数を受け取ります:

grad_output: 出力テンソルの勾配。
input_tensors: 入力テンソルのリスト。
kwargs: オプションの引数。

backward() メソッドは、入力テンソルの勾配を計算し、grad_output 変数に格納する必要があります。

例

以下のコードは、torch.autograd.Function を継承した自作関数の例です。

class MyFunction(torch.autograd.Function):

    @staticmethod
    def forward(ctx, input):
        output = torch.sin(input)
        ctx.save_for_backward(input)
        return output

    @staticmethod
    def backward(ctx, grad_output):
        input, = ctx.saved_tensors
        grad_input = grad_output * torch.cos(input)
        return grad_input

この関数は、入力テンソルの正弦波を計算します。backward() メソッドは、入力テンソルの勾配を計算します。

torch.autograd.Function.backward() メソッドは、PyTorchの自動微分における重要な機能です。このメソッドを理解することで、複雑な計算グラフの勾配を効率的に計算することができます。

PyTorch の Automatic Differentiation に関するサンプルコード

単純な関数

import torch

def f(x):
    return torch.sin(x)

x = torch.tensor(1.0, requires_grad=True)
y = f(x)

# 勾配計算
y.backward()

# 出力テンソルの勾配
print(x.grad)

複合関数

import torch

def g(x):
    return torch.exp(x) + torch.log(x)

x = torch.tensor(1.0, requires_grad=True)
y = g(x)

# 勾配計算
y.backward()

# 出力テンソルの勾配
print(x.grad)

このコードは、torch.exp() と torch.log() 関数を組み合わせた複合関数の勾配を計算します。

ニューラルネットワーク

import torch

class Net(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = torch.nn.Linear(1, 1)

    def forward(self, x):
        x = self.fc1(x)
        return x

net = Net()
x = torch.tensor(1.0, requires_grad=True)
y = net(x)

# 損失関数の定義
loss = torch.nn.MSELoss()

# 損失計算
loss_value = loss(y, torch.tensor(2.0))

# 勾配計算
loss_value.backward()

# パラメータの勾配
print(net.fc1.weight.grad)

このコードは、単層のニューラルネットワークの勾配を計算します。

ベクトルヤコビアン

import torch

def f(x):
    return torch.stack([torch.sin(x), torch.cos(x)])

x = torch.tensor(1.0, requires_grad=True)
y = f(x)

# ベクトルヤコビアン計算
jacobian = torch.autograd.jacobian(y, x)

# 出力
print(jacobian)

このコードは、torch.sin() と torch.cos() 関数を組み合わせた関数のベクトルヤコビアンを計算します。

高階微分

import torch

def f(x):
    return torch.sin(x)

x = torch.tensor(1.0, requires_grad=True)
y = f(x)

# 二階微分計算
y.backward(create_graph=True)

# 出力
print(x.grad.grad)

このコードは、torch.sin() 関数の二階微分を計算します。

これらのサンプルコードは、PyTorch の Automatic Differentiation の基本的な使い方を理解するのに役立ちます。

PyTorch の Automatic Differentiation のその他の方法

torch.autograd.grad()

outputs: 勾配を計算したいテンソルのリスト。
inputs: 入力テンソルのリスト。
gradients: 出力テンソルの勾配 (オプション)。
create_graph: 計算グラフを保持するかどうか (オプション)。
only_inputs: 勾配を計算したい入力テンソルのインデックス (オプション)。

import torch

def f(x):
    return torch.sin(x)

x = torch.tensor(1.0, requires_grad=True)
y = f(x)

# 勾配計算
grad, = torch.autograd.grad(y, x)

# 出力
print(grad)

このコードは、torch.autograd.grad() 関数を使って torch.sin() 関数の勾配を計算します。

手動による微分

PyTorch では、手動で微分を行うことも可能です。これは、計算グラフが複雑な場合や、勾配計算の速度を最適化したい場合に役立ちます。

import torch

def f(x):
    return torch.sin(x)

x = torch.tensor(1.0)
y = f(x)

# 手動による微分
grad = torch.cos(x)

# 出力
print(grad)

このコードは、torch.sin() 関数の勾配を手動で計算します。

その他のライブラリ

PyTorch 以外にも、自動微分を行うためのライブラリがいくつかあります。

これらのライブラリは、それぞれ異なる機能と利点を持っています。

PyTorch の Automatic Differentiation は、ニューラルネットワークなどの計算グラフにおける勾配を効率的に計算するための強力なツールです。torch.autograd.Function.backward() メソッド以外にも、torch.autograd.grad() 関数や手動による微分など、さまざまな方法があります。

どの方法を選択するかは、計算グラフの複雑さや、勾配計算の速度などの要件によって異なります。

PyTorch の Automatic Differentiation の詳細：torch.autograd.Function.backward() の仕組み

PyTorchの自動微分とtorch.autograd.Function.backward()

例

PyTorch の Automatic Differentiation に関するサンプルコード

単純な関数

複合関数

ニューラルネットワーク

ベクトルヤコビアン

高階微分

PyTorch の Automatic Differentiation のその他の方法

torch.autograd.grad()

手動による微分

その他のライブラリ

PyTorch「torch.autograd.functional.vjp()」の解説とサンプルコード集

【初心者向け】PyTorchで自動微分を使いこなす！「torch.autograd.function.FunctionCtx.mark_dirty()」の役割と使い方

GradScaler.state_dict() を使って、PyTorch Automatic Mixed Precision の訓練を中断して後で再開する方法

Tensor Core を活用してさらに高速化！ torch.backends.cuda.matmul.allow_fp16_reduced_precision_reduction の活用方法

PyTorch Backends: torch.backends.cuda.flash_sdp_enabled()のサンプルコード集

PyTorchにおけるTensorの要素ごとの除算: 詳細解説とサンプルコード集

画像処理、機械学習で威力を発揮！PyTorch「torch.Tensor.addcdiv」の使い方をマスターしよう

PyTorch NN 関数における torch.nn.functional.celu(): 網羅的な解説

Tensor.sort() の代替方法

PyTorch の Storage と torch.UntypedStorage.share_memory_() に関する完全ガイド