Multi-Layer Perceptrons (MLPs) and Stochastic Gradient Descent (SGD)

Slides

Andrey Kurenkov's blog post on the history of deep learning.

See Goodfellow et al., sections 6.1 through 6.3.