Convolutional Neural Network
See also: this one assignment on CNN
how can we exploit sparsity and locality?
think of sparse connectivity rather than full connectivity
where we exploiting invariance, it might be useful in other parts of the image as well
convolution
accept volume of size W1×H1×D1 with four hyperparameters
- filters K
- spatial extent F
- stride S
- amount of zero padding P
produces a volume of size W2×H2×D2 where:
- W2=SW1−F+2P+1
- H2=SH1−F+2P+1
- D2=K
1D convolution:
yy(i)=(x∗w)=t∑x(t)w(i−t)
2D convolution:
yy(i,j)=(x∗w)=t1∑t2∑x(t1,t2)w(i−t1,j−t2)
max pooling
idea to reduce number of parameters
batchnorm
xj=[x1j,…,xdj]
Batch X=[(x1)T…(xb)T]T