1. Inception Block (v1)

Extract information from 4 different paths, and then concatenate them in output channel. It has less parameters and time complexity. The assignment of channels are based on the significance.

  • 1x1 Conv (64)
  • 1x1 Conv (96); 3x3 Conv, pad 1 (128)
  • 1x1 Conv (16); 5x5 Conv, pad 2 (32)
  • 3x3 MaxPool, pad 1; 1x1 Conv (32)

2. Architecture

GoogLeNet

You can see that there are so many hyper-parameters (num of channels). So it has limited use.

3. Q&A

3x3 => 1x3, 3x1 less computation channels ‘power 2’ make the best of gpu.