Standard vs Latin

Access code on GitHub

Can you distinguish between Standard and Latin-American dancing at the Blackpool Dance Festival? Because my model can! 

It's a binary classification task using a simple convolutional neural network. Its architecture comprises 3 sets of convolution and max pooling, along with 1 fully connected hidden layer. The training set consists of 800 images of dance couples (sized 133x200), divided into 2 classes: standard and Latin. The validation set also includes 200 images with the same classes. 

After just 10 epochs of training, it achieved around 98% accuracy on both the training and validation sets. Although the architecture was tested using dropout, data augmentation, and extra convolutions, the accuracy actually decreased and the training became significantly slower. It seems that in this case, simplicity works best!