PyTorch Basics 2: Model
First things first: the `forward` method of `nn.Module` is the single most important part of the entire implementation. It may affect:
- other parts of the `nn.Module`
- the customized dataloader
- the optimizer
- and the loss function
Components
Your DIY model must inherit from `nn.Module` and must contain two functions:
`__init__`
- define all trainable layers and other parameters
- attributes assigned to `self` that are `nn` modules or `nn.Parameter` objects will be included in `model.parameters()` by default
```python
# e.g.:
self.d1 = nn.Linear(16, 32)                   # registered: an nn.Module attribute
self.x = nn.Parameter(torch.zeros(8))         # registered: an nn.Parameter attribute
self.y = 2                                    # NOT included in model.parameters()
self.z = [nn.Linear(4, 4), nn.Linear(4, 4)]   # a plain Python list is NOT registered either
```
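To see what actually got registered, print `named_parameters()`. A minimal sketch (the layer sizes are made up for illustration); wrapping the list in `nn.ModuleList` (covered below) is the standard fix for the unregistered-list case:

```python
import torch
from torch import nn

class Demo(nn.Module):
    def __init__(self):
        super().__init__()
        self.d1 = nn.Linear(16, 32)             # registered
        self.x = nn.Parameter(torch.zeros(8))   # registered
        self.y = 2                              # not registered
        self.z = [nn.Linear(4, 4)]              # not registered (plain list)

for name, p in Demo().named_parameters():
    print(name, tuple(p.shape))
# prints x, d1.weight, and d1.bias -- nothing from y or z
```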
`forward`
- input is a batch of data
- describes how data passes through the network
- and generates some output (logits); see the sketch just below
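One detail worth knowing: you invoke the model as `model(x)`, not `model.forward(x)`, because `nn.Module.__call__` wraps `forward` and runs any registered hooks. A minimal sketch with a made-up tiny model:

```python
import torch
from torch import nn

class Tiny(nn.Module):   # hypothetical model, just for illustration
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

    def forward(self, x):        # x: a batch of shape (batch, 4)
        return self.fc(x)        # logits of shape (batch, 2)

batch = torch.randn(8, 4)
logits = Tiny()(batch)           # call the model itself, not .forward()
print(logits.shape)              # torch.Size([8, 2])
```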
GPU
Move the model to the GPU if available:
- forward operations then run on the GPU
- both the model and each batch of data must be loaded onto the GPU
```python
import torch
import numpy as np

# put a tensor on the GPU
ta = torch.tensor([1.0, 2.0]).to('cuda:0')

# if you need to call a NumPy-based method (e.g. sklearn metrics),
# move the tensor back to the CPU first; add .detach() if it requires grad
l = ta.detach().cpu().numpy()
l2 = np.mean(l)   # np.mean stands in for any NumPy function
```
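Putting it together, a hedged sketch of the typical pattern (the model and loop here are stand-ins, not a real training setup): move the model to the device once, and move every batch as it comes off the dataloader.

```python
import torch
from torch import nn

device = 'cuda:0' if torch.cuda.is_available() else 'cpu'
model = nn.Linear(4, 2).to(device)     # move the parameters once

for _ in range(3):                     # stand-in for iterating a DataLoader
    xb = torch.randn(8, 4).to(device)  # move each batch as it arrives
    out = model(xb)                    # forward now runs on `device`
    print(out.device)
```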
Commonly used `nn` modules
`nn.ModuleList` vs `nn.Sequential`
- `nn.ModuleList` is a module list, literally: a list that stores modules. Treat it like a Python list whose contents are registered as submodules; you decide in `forward` how each one is called.
- `nn.Sequential` must run sequentially: each module's output is fed as the next module's input. (A comparison sketch follows below.)
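A minimal comparison sketch (layer sizes are illustrative): with `nn.ModuleList` the call order lives in your `forward`; with `nn.Sequential` it is fixed at construction.

```python
import torch
from torch import nn

class WithList(nn.Module):
    def __init__(self):
        super().__init__()
        # registered like a Python list of modules; you control the call order
        self.layers = nn.ModuleList([nn.Linear(8, 8) for _ in range(3)])

    def forward(self, x):
        for layer in self.layers:      # you decide how/when each one runs
            x = torch.relu(layer(x))
        return x

# nn.Sequential always runs in order, feeding each output into the next module
seq = nn.Sequential(nn.Linear(8, 8), nn.ReLU(), nn.Linear(8, 2))

x = torch.randn(4, 8)
print(WithList()(x).shape, seq(x).shape)   # torch.Size([4, 8]) torch.Size([4, 2])
```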
Dropout
When using the functional form, you must pass the `training` flag yourself, e.g.:

```python
x = F.dropout(x, p=self.dropout, training=self.training)  # test time: pass everything through; training time: drop out
```
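Alternatively, the module form `nn.Dropout` tracks the flag for you through `model.train()` / `model.eval()`; a quick sketch:

```python
import torch
from torch import nn

drop = nn.Dropout(p=0.5)
x = torch.ones(1, 8)

drop.train()       # training mode: elements are zeroed at random,
print(drop(x))     # survivors are scaled by 1/(1-p)

drop.eval()        # eval mode: dropout is a no-op
print(drop(x))     # identical to x
```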
Examples
```python
import torch
from torch import nn

# Get cpu or gpu device for training.
device = "cuda" if torch.cuda.is_available() else "cpu"
print("Using {} device".format(device))

# Define model
class NeuralNetwork(nn.Module):
    def __init__(self):
        super(NeuralNetwork, self).__init__()
        self.flatten = nn.Flatten()
        self.linear_relu_stack = nn.Sequential(
            nn.Linear(28*28, 512),
            nn.ReLU(),
            nn.Linear(512, 512),
            nn.ReLU(),
            nn.Linear(512, 10)
        )

    def forward(self, x):
        x = self.flatten(x)
        logits = self.linear_relu_stack(x)
        return logits

model = NeuralNetwork().to(device)
print(model)
```
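And a quick smoke test of the model above, using a fake MNIST-shaped batch:

```python
X = torch.rand(64, 28, 28, device=device)  # a fake batch of 64 28x28 images
logits = model(X)                          # forward pass runs on `device`
print(logits.shape)                        # torch.Size([64, 10])
```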