Optuna on Pytorch CNN

500 Views Asked by At
class ConvolutionalNetwork(nn.Module):
    def __init__(self, in_features, trial):
        super().__init__()
        self.in_features = in_features
        self.trial = trial
        # this computes no of features outputted by 2 conv layers
        c1 = int(((self.in_features - 2)) / 64)  # this is to account for the loss due to conversion to int type
        c2 = int((c1 - 2) / 64)
        self.n_conv = int(c2 * 16)
        # self.n_conv = int((( ( (self.in_features - 2)/4 ) - 2 )/4 ) * 16)
        num_filters1 = trial.suggest_int("num_filters1",16,64,step=16)
        num_filters2 = trial.suggest_int("num_filters2",16,64,step=16)
        #num_filters = 16        
        kernel_size = trial.suggest_int('kernel_size', 2, 7)
        self.conv1 = nn.Conv1d(1, num_filters1, kernel_size, 1)
        self.conv1_bn = nn.BatchNorm1d(num_filters1)
        self.conv2 = nn.Conv1d(num_filters1, num_filters2, kernel_size, 1)
        self.conv2_bn = nn.BatchNorm1d(num_filters2)
        #Add in trial range for dropout to determine optimal dropout value
        self.dp = nn.Dropout(trial.suggest_uniform('dropout_rate',0,1.0))
        self.fc3 = nn.Linear(self.n_conv, 2)

I tried to add in the Optuna parameter tuning trial for number of filters 1&2 as per below, but the following error occurred. Trial 0 managed to go through but Trial 1 no. I made adjustments to the batch size but the same error format occurred with a different recommended batch size.

num_filters1 = trial.suggest_int("num_filters1",16,64,step=16)
            num_filters2 = trial.suggest_int("num_filters2",16,64,step=16)

[I 2020-12-07 09:22:53,512] Trial 0 finished with value: 0.6597743630409241 and parameters: {'num_filters1': 16, 'num_filters2': 16, 'kernel_size': 7, 'dropout_rate': 0.5225509182876455, 'optimizer': 'SGD', 'lr': 0.00020958259416674875, 'Weight_decay': 3.8111213220887506e-08}. Best is trial 0 with value: 0.6597743630409241.
[W 2020-12-07 09:22:55,502] Trial 1 failed because of the following error: ValueError('Expected input batch_size (8000) to match target batch_size (2000).')
1

There are 1 best solutions below

0
On

Removed Optuna autotuning for num_filter2 as there are more chain effect afterwards following the last filter layer.

Retained num_filter1 for Optuna autotuning and the model is able to run smoothly.

thanks!